Archaeoanalysis of Metadata Processing – Hackathon
Archaeoanalysis of Metadata Processing – Hackathon
Date: June 30 (12:00) – July 3, 2025 (12:00)
Location: Aachen, Seffenterweg 23 / Kopernikusstr. 6 (IT Center)
Catering provided | Bring your own laptop
Registration To help us plan the event please sign up using this form
When Metadata Becomes Data
What can you really do with bibliographic metadata — and how much knowledge is hiding in plain sight? This hackathon brings together researchers, developers, and students from all disciplines to explore the boundary between metadata and research data.
Our shared data source is iDAI.bibliography, a curated bibliographic database maintained by the German Archaeological Institute, comprising around 1.4 million entries. Working with data from outside your own domain is the best way to appreciate just how rich and powerful metadata can be.
All tools and technologies are welcome — the one requirement is that every step is documented and reproducible.
Tracks
(still under development)
Data Retrieval and Cleaning
Retrieve data from iDAI.bibliography, clean it, and transform it into a consistent, reusable format — the foundation for all other tracks. Tasks include normalizing author names, standardizing dates, and extracting a minimal schema across entries.
Data Analysis and Visualization
Take the cleaned metadata further and turn it into insight. Explore co-authorship networks, trace how research topics rise and fall over decades, or uncover publication patterns by language, place, or document type. Produce interactive dashboards, network graphs, or timelines that make the data speak for itself.
Data Transformation (Wikidata & Semantic Linking)
Map bibliographic entries to Wikidata and contribute to an open, global knowledge graph. Link authors, institutions, journals, and topics to existing Wikidata items — or create new ones where they are missing. Tools like OpenRefine, QuickStatements, or custom scripts are all fair game.
Judging Criteria
Projects will be evaluated on:
- Data quality – cleanliness, correctness, and robustness
- Reproducibility – can others re-run your workflow?
- Novelty and insight – how much new understanding does your work create?
- Reuse potential – can the community build on it after the hackathon?
- Communication – clarity of documentation and quality of visualizations
Satellite Event
On June 30, 4:00–6:00 PM, join us for the Battle of the Editors — a playful warm-up competition kicking off the hackathon weekend. → See separate announcement
Organized in collaboration with DKZ.2R, DKZ WiNoDa and NFDI4Objects.