Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
From CegeSoma spreadsheets to Linked Data:a Wikibase journey
Anne Chardonnens
@annechardo
Linking the Past, KBR, 22.11.2019
Andrée de JonghAndrée De JonghAndrée DeJonghDe Jongh, Andrée Eugénie AdrienneCountess Andree ‘Dedee’ de JonghDédéeCyclone DDPetit CycloneThe Postman
1 PERSON = 1 UNIQUE IDENTIFIER
http
://lo
d-cl
oud.
net/
(201
7-08
-22)
}
Two processes run in parallel…0. The data
Working on a sample Working on the whole dataset
1. Cleaning and deduplicating the data
Manual process based on deductionsOpenRefine &
The Python Record Linkage toolkit
2. Reconciling names with external resources
Custom research OpenRefine reconciliation services (Wikidata, Viaf, ULAN etc.)
.
3. Modeling the data
Data modeling based on a representative sample From strings to things (URIs)
4. Running a Wikibase
Locally AGR server
5. Importing data
MediaWiki Graphical User Interface Several solutions for (Semi-)automatic data ingestion
Wikibase-edit
QuickStatements
Wikidata Integrator
Pywikibot
Heard Library Python script
6. Querying data
Simple search box SPARQL Query Service