Overview
This EmacsConf 2023 talk demonstrates a collaborative workflow for research data processing and documentation in Emacs. The speakers use Org mode as the central document format and combine it with companion packages for knowledge graph visualization, literate programming, and collaborative editing.
Starting from the National Research Data Infrastructure Germany (NFDI), the talk shows how to retrieve information from Wikidata, clean and process it with different programming languages, visualize relationships, and preserve the work through exportable documentation.
Topics covered
- Org mode as a plain-text environment for scientific writing, organization, and publishing
- org-roam and org-roam-ui for linking notes and visualizing a knowledge graph
- org-babel for literate programming and self-documenting code
- SPARQL queries against Wikidata
- Data cleaning and processing with shell, Python, awk, and R
- Collaborative editing in Emacs using CRDT
- Exporting Org documents to formats such as PDF, HTML, and plain text
Speakers
Jonathan Hartman is a trained data scientist and works at the IT Center of RWTH Aachen University, Germany.
Lukas C. Bossert is a trained classical archaeologist and deputy head of the department “Research Process and Data Management” at the IT Center of RWTH Aachen University.
Chapter markers
- 00:00 — Introduction
- 01:16 — Org Mode
- 02:18 — Working together
- 06:27 — Data cleaning
- 08:04 — Processing
- 12:36 — Visualization
- 14:01 — Preserve