Archaeoanalysis of Metadata Processing – Hackathon

Date: June 30 (14:00) – July 3, 2026 (12:00) Location: Aachen, Seffenterweg 23 / Kopernikusstr. 6 (IT Center) Catering provided | Bring your own laptop

Registration

To help us plan the event please sign up using this form

Event Poster

When Metadata Becomes Data

What can you really do with bibliographic metadata — and how much knowledge is hiding in plain sight? This hackathon brings together researchers, developers, and students from all disciplines to explore the boundary between metadata and research data.

Our shared data source is iDAI.bibliography, a curated bibliographic database maintained by the German Archaeological Institute, comprising around 1.4 million entries. Working with data from outside your own domain is the best way to appreciate just how rich and powerful metadata can be.

All tools and technologies are welcome — the one requirement is that every step is documented and reproducible.

Tracks

(still under development)

Data Retrieval and Cleaning

Retrieve data from iDAI.bibliography, clean it, and transform it into a consistent, reusable format — the foundation for all other tracks. Tasks include normalizing author names, standardizing dates, and extracting a minimal schema across entries.

Data Analysis and Visualization

Take the cleaned metadata further and turn it into insight. Explore co-authorship networks, trace how research topics rise and fall over decades, or uncover publication patterns by language, place, or document type. Produce interactive dashboards, network graphs, or timelines that make the data speak for itself.

Data Transformation (Wikidata & Semantic Linking)

Map bibliographic entries to Wikidata and contribute to an open, global knowledge graph. Link authors, institutions, journals, and topics to existing Wikidata items — or create new ones where they are missing. Tools like OpenRefine, QuickStatements, or custom scripts are all fair game.

Hackathon Focus

Projects will be evaluated on:

  • Data quality – cleanliness, correctness, and robustness
  • Reproducibility – can others re-run your workflow?
  • Novelty and insight – how much new understanding does your work create?
  • Reuse potential – can the community build on it after the hackathon?
  • Communication – clarity of documentation and quality of visualizations

Satellite Event

On June 30, 4:30–6:30 PM, join us for the Battle of the Editors — a playful warm-up competition kicking off the hackathon weekend. → See separate announcement

Organized in collaboration with DKZ.2R, DKZ WiNoDa and NFDI4Objects.

Related Posts

Comments on Collaboration - My Experience with the DKZ.2R Rent-an-Expert Program

Comments on Collaboration - My Experience with the DKZ.2R Rent-an-Expert Program

Comments on Collaboration - My Experience with the DKZ.2R Rent-an-Expert Program

At the beginning of this year (2025), I received an email regarding the DKZ.2R “Rent an expert” program. I was very interested in this initiative and therefore applied for support from the scientific consulting team at the Rhine-Ruhr Center for Scientific Data Literacy (DKZ.2R) for assistance with my data analysis.

I obtained my master’s degree in Plant Nutrition from the China Agricultural University and pursued my PhD study at the University of Hohenheim. I am currently a postdoctoral researcher in the Institute of Crop Science and Resource Conservation, Crop Functional Genomics, at the University of Bonn.
My research expertise includes plant culturing, molecule cloning, biochemical analysis and limited data analysis experience on large-scale NGS datasets.
Since the beginning of April 2025, two DKZ.2R consultants were assigned to me: Tarek Iraki, who is proficient in programming languages such as Python, and Lennard Maßmann, who specializes in working with R. Together, we collaboratively worked on my Postdoctoral project, which focuses on the molecular and genomic dissection of lateral root development in maize.

Read More
Call for participation

Call for participation

Call for participation!

The Data Literacy Center Rhine-Ruhr (DKZ.2R) issues a call for participation in its “rent-an-expert” project! We offer support for ambitious research projects of PhD students and early postdocs dealing with Data Science and Artificial Intelligence, High Performance Computing and Simulation, and Research Data Management. As the DKZ.2R is funded by the German Federal Ministry of Education and Research (BMBF) as well as the EU, this offer is free of charge!

Read More
Documentation From User Experience

Documentation From User Experience

This post is a condensed version of a talk at our Data Compentcy College

If you regularly use scientific software written by others, or tried to replicate interesting research that relies on software, you have probably also invested weeks of work to solve a software problem or even given up on a software because of missing documentation. Finding a project that might be the solution to your problem and then failing to run the code is frustrating. Being unable to run a project you have built yourself years ago is even worse. Having experienced all those setbacks myself in the past I want to use this post to channel that frustration to fuel solutions for better documentation for our current and future projects.

Read More