Overview

This EmacsConf 2023 talk demonstrates a collaborative workflow for research data processing and documentation in Emacs. The speakers use Org mode as the central document format and combine it with companion packages for knowledge graph visualization, literate programming, and collaborative editing.

Starting from the National Research Data Infrastructure Germany (NFDI), the talk shows how to retrieve information from Wikidata, clean and process it with different programming languages, visualize relationships, and preserve the work through exportable documentation.

Topics covered

  • Org mode as a plain-text environment for scientific writing, organization, and publishing
  • org-roam and org-roam-ui for linking notes and visualizing a knowledge graph
  • org-babel for literate programming and self-documenting code
  • SPARQL queries against Wikidata
  • Data cleaning and processing with shell, Python, awk, and R
  • Collaborative editing in Emacs using CRDT
  • Exporting Org documents to formats such as PDF, HTML, and plain text

Speakers

Jonathan Hartman is a trained data scientist and works at the IT Center of RWTH Aachen University, Germany.

Lukas C. Bossert is a trained classical archaeologist and deputy head of the department “Research Process and Data Management” at the IT Center of RWTH Aachen University.

Chapter markers

  • 00:00 — Introduction
  • 01:16 — Org Mode
  • 02:18 — Working together
  • 06:27 — Data cleaning
  • 08:04 — Processing
  • 12:36 — Visualization
  • 14:01 — Preserve

Resources

Related Posts

How To: Open Science

How To: Open Science

Tired of Recreating someone else’s work? - How Open Science can accelerate research and overcome reinvention

Have you ever found papers on algorithms but their implementation is missing? Found an interesting analysis but there is no way to check the results, as you don’t have access to the data they were derived from? Ever thought you had a great idea for a project, just to find out a year later that you are not the only research group following that specific idea? Not having access to other people’s code, data, metrics or even their plans for research projects often leads to unnecessary delays and scientific redundancies. There is an easy solution to overcome (almost) all of these issues. It’s called Open Science! What is Open Science? The UNESCO defines Open Science as a construct of “movements and practices aiming to make multilingual scientific knowledge openly available, accessible and reusable for everyone, to increase scientific collaborations and sharing of information for the benefits of science and society, and to open the processes of scientific knowledge creation, evaluation and communication to societal actors […]”. To ensure that everyone has access to scientific knowledge and infrastructure, Open Science focuses on four main concepts.

Read More
Comments on Collaboration - My Experience with the DKZ.2R Rent-an-Expert Program

Comments on Collaboration - My Experience with the DKZ.2R Rent-an-Expert Program

Comments on Collaboration - My Experience with the DKZ.2R Rent-an-Expert Program

At the beginning of this year (2025), I received an email regarding the DKZ.2R “Rent an expert” program. I was very interested in this initiative and therefore applied for support from the scientific consulting team at the Rhine-Ruhr Center for Scientific Data Literacy (DKZ.2R) for assistance with my data analysis.

I obtained my master’s degree in Plant Nutrition from the China Agricultural University and pursued my PhD study at the University of Hohenheim. I am currently a postdoctoral researcher in the Institute of Crop Science and Resource Conservation, Crop Functional Genomics, at the University of Bonn.
My research expertise includes plant culturing, molecule cloning, biochemical analysis and limited data analysis experience on large-scale NGS datasets.
Since the beginning of April 2025, two DKZ.2R consultants were assigned to me: Tarek Iraki, who is proficient in programming languages such as Python, and Lennard Maßmann, who specializes in working with R. Together, we collaboratively worked on my Postdoctoral project, which focuses on the molecular and genomic dissection of lateral root development in maize.

Read More
Documentation From User Experience

Documentation From User Experience

This post is a condensed version of a talk at our Data Compentcy College

If you regularly use scientific software written by others, or tried to replicate interesting research that relies on software, you have probably also invested weeks of work to solve a software problem or even given up on a software because of missing documentation. Finding a project that might be the solution to your problem and then failing to run the code is frustrating. Being unable to run a project you have built yourself years ago is even worse. Having experienced all those setbacks myself in the past I want to use this post to channel that frustration to fuel solutions for better documentation for our current and future projects.

Read More