44
D14.2: Pilot deployment experiments Author: Philipp Gerth, DAI Ariadne is funded by the European Commission’s 7th Framework Programme.

ARIADNE: Pilot deployment experiments

Embed Size (px)

Citation preview

Page 1: ARIADNE: Pilot deployment experiments

D14.2: Pilot deployment experiments

Author: Philipp Gerth, DAI

Ariadne is funded by the European Commission’s 7th Framework Programme.

Page 2: ARIADNE: Pilot deployment experiments

Theviewsandopinionsexpressedinthisreportarethesoleresponsibilityoftheauthor(s)anddonotnecessarilyreflecttheviewsoftheEuropeanCommission.

ARIADNED14.2:Pilotdeploymentexperiments(Public)

Version:1.0(final) 11thNovember2016

Author: PhilippGerth,DAIContributingPartners

PaolaRonzino,AchilleFelicetti,PIN

MariaTheodoridou,MartinDoerr,EleniChristaki,FORTH

EdeltraudAspöck,AnjaMasur,OEAW

WolfgangSchmidle,SebastianCuy,DAI

DouglasTudhope,CeriBinding,USW

Qualityreview: HollyWright,ADS

Page 3: ARIADNE: Pilot deployment experiments

3

TableofContents

ExecutiveSummary...........................................................................................................4

1. IntroductionandObjectives........................................................................................5

2.ARIADNEReferenceModel............................................................................................7

3.LinkedDataEnrichment...............................................................................................103.1Places.............................................................................................................................................................................103.2Time................................................................................................................................................................................113.3Literature.....................................................................................................................................................................113.4ArchaeologicalTerminology................................................................................................................................123.5Actors.............................................................................................................................................................................13

4.MethodologicalApproach............................................................................................14

5.TechnicalEnvironment................................................................................................17X3MLmappingframework..........................................................................................................................................17Blazegraph..........................................................................................................................................................................17Metaphactory.....................................................................................................................................................................17

6.Scenario1:Coins..........................................................................................................186.1.DescriptionofDatasets.........................................................................................................................................18ThedFMROarchive.........................................................................................................................................................18NumismaticarchivesfromtheCOINSproject.....................................................................................................19Arachne.................................................................................................................................................................................19iDAI.field-PergamonProject.....................................................................................................................................20MuseiD-Italiacollections..............................................................................................................................................20NaturalLanguageProcessingdata..........................................................................................................................20ACDMdata..........................................................................................................................................................................21

6.2.KnowledgeGraph....................................................................................................................................................216.3.QueryPossibilities...................................................................................................................................................22

7.Scenario2:Sculptures..................................................................................................247.1.DescriptionofDatasets.........................................................................................................................................24AthenianAgora.................................................................................................................................................................24iDAI.objectsArachne.......................................................................................................................................................24BritishMuseumCollectionOnline.............................................................................................................................25iDAI.fieldproject“Chimtou”........................................................................................................................................25GazetteerofStoneQuarriesintheRomanWorld..............................................................................................25

7.2.KnowledgeGraph....................................................................................................................................................267.3.QueryPossibilities...................................................................................................................................................27Objectcentricqueryforsimilarobjects..................................................................................................................27Enhancingliteratureresearch...................................................................................................................................30

8.Scenario3:Buildings....................................................................................................338.1.DescriptionofDatasets...........................................................................................................................................338.2.KnowledgeGraph......................................................................................................................................................34ROOM37...............................................................................................................................................................................35

Page 4: ARIADNE: Pilot deployment experiments

4

9.Scenario4:FaunalRemains..........................................................................................379.1.DescriptionofDatasets.........................................................................................................................................37AnimalBoneEvidencefromSouthernEngland..................................................................................................37HolocenehistoryofwildlifeinEurope....................................................................................................................37

9.2.KnowledgeGraph....................................................................................................................................................389.3.QueryPossibilities...................................................................................................................................................39

10.Conclusions................................................................................................................40Bibliography.......................................................................................................................................................................41AppendixA..........................................................................................................................................................................43Scenario1:Coins..............................................................................................................................................................43Scenario2:Sculptures....................................................................................................................................................43Scenario4:AnimalRemains........................................................................................................................................43AppendixB..........................................................................................................................................................................44

ExecutiveSummaryTheprimarygoalof theEUprojectARIADNE (AdvancedResearch Infrastructure forArchaeologicalDataset Networking in Europe) is to bring together and integrate existing archaeological datainfrastructures to offer researchers a unified search and discovery on awide range of distributeddatasets, which is very much needed in archaeology. In addition to the aggregation efforts thatresulted in a general catalogue of archaeological resources in the ARIADNE portal, several casestudiesexaminedatighterandsemanticallyricherintegrationusingSemanticWebtechnologies.Thepurposeof this document (deliverableD14.2) is todescribe the activities carriedoutwithinWorkPackage14-AddressingComplexity(WP14),whichresultsinthepresentationoffourdatasets.

Ingeneral,theintegrationofvariousdatasetsfacessomechallengesintermsofdifferentlanguages,terminologyanddatamodelsused.Theintegrateddatasetshavedifferentdatamodelstodescribethe content according to historical reasons and intents of the project, and where the data wascreated. After a short introduction (chapter 1), the ARIADNE Reference Model is described inchapter2.Itoffersacommonschematodescribethesedifferentdatamodels.TheglobalontologyusedinthesescenarioswasCIDOC-CRM,whichwasadaptedandextendedforthedomain-specificuseinarchaeologybytheextensionCRM-ARCHAEO.

Chapter3 includesaclassificationanddescriptionofexisting,referenceablesystemsforcontrolledvocabulary,thesauriandreferencesthatareavailableonline.Thesecontaincommontermsfortime,space and domains that are need to be harmonized and brought together to overcome thedifferencesinlanguageandterminology.

The universal workflow to import the data, transform and export it to a format, which uses theARIADNE Reference Model and common standards, is described in chapter 4. The tools andsoftware,whichhelp theusersmaptheirdata,performdata transformation, fordatastorageandforfinalpresentation,areoutlinedinchapter5.

Page 5: ARIADNE: Pilot deployment experiments

5

Chapters 6-9describe the scenariosused to test and confirm theARIADNEReferenceModelwithreal world data. The scenarios were selected to cover a wide range of different archaeologicalresearchperspectiveswithdifferentrecordedclasses.

Finally,chapter10givesanoverviewofthedataintegrationactivities.

1. IntroductionandObjectivesThe archaeological domain produces vast amounts of very heterogeneous research data likeexcavation data, object descriptions, texts, archives, cadastral plans, historicalmaps coming fromvarious disciplines with different methodologies. This implies a big challenge for organizing,aggregating and publishing the data. The EU project ARIADNE aims at bringing together andintegrating existing archaeological data infrastructures. In addition to the aggregation efforts thatresulted in a general catalogue of archaeological resources in the ARIADNE portal(http://portal.ARIADNE-infrastructure.eu/),severalcasestudiesexaminedatighterandsemanticallyricher integrationusingSemanticWebtechnologies.Toallowan integrationofthisheterogeneousinformationwithoutthelossofmeaning,aglobal,extensibleschema,definedinD14.1,wasused.Ituses CIDOC-CRM as a backbone ontology, together with several extensions bound together anddescribedintheARIADNEReferenceModel.

The selected scenarios diverge from the proposed scenario described in the DOW, as dataavailability was a limiting factor to most of the proposals. The deployed scenarios focussed ondifferentaspectsofarchaeologicalresearch:

Coins: Numismatics is a very traditional science with a lot of experience and early initiatives instandardization. As such, itwas chosen as a very good starting point for item-level integration ofarchaeologicaldatasets.Thelinkagebetweenrepositorymetadataanditem-leveldata,aswellastheintegrationofnaturallanguageprocessingdata,weretested.

Sculptures: This scenario concerns data integration of sources from various disciplines, includingsculpture information and its archaeological context. It focuses on the provenanceof informationaccording to bibliographic references, leading to advanced literature research by using theknowledgegraphfordiscovery.

Page 6: ARIADNE: Pilot deployment experiments

6

Buildings:ThisscenarioisabouttheapplicationoftwoextensionoftheCIDOCCRM,namelyCRMbaandCRMarchaeo,tomodelunstructuredinformationcollectedduringthearchaeologicalexcavationoftheprehistoricZominthos’PalaceinCrete.Thisisanimportantachievementasitrepresentsoneofthefirstattemptsofintegratingarchaeologicallayersandbuiltstructurewithaknowledge-basedmodel.

AnimalRemains: Inthisscenariodifferentzooarchaeologydatasetsaboutboneassemblagesfoundin diverse archaeological sites were integrated and different statistical queries were run on thecommonrepository.

Wood/Dendrochronologycasestudy: An additional case study is described in the ARIADNEdeliverable D15.3 - Semantic Annotation and Linking. This case study investigated the semanticintegration of archaeological datasets with grey literature in different languages, as a combinedeffort between ARIADNEWP15 andWP16. The case study is based on archaeological interest indifferent types of wooden material, samples taken, wooden objects and dating viadendrochronological techniques. Information from Dutch, English, Swedish archaeological reportswasextractedbyNaturalLanguageProcessingpipelines.TheworkwasundertakenbyUniversityofSouthWalesonthetechnicalside,incollaborationwithDANSandSND.

Like the case studies described below in D14.2, thewood/dendrochronology case study uses the

CIDOC-CRM as a backbone ontology, together with the Getty Art and Architecture Thesaurus for

commonvocabulary,expressingthesemantic integrationinRDF.ThecasestudyDemonstratorisa

SPARQLquerybuilderWebapplicationthatseekstohidethecomplexityoftheunderlyingontology.

Astheuserselectsfromthe interface,anunderlyingSPARQLquery isautomaticallyconstructed in

terms of the semantic framework.Queries concernwooden objects (e.g. samples of beechwood

keels),optionallyfromagivendaterange,withautomaticexpansionoverhierarchiesofwoodtypes.

Page 7: ARIADNE: Pilot deployment experiments

7

2.ARIADNEReferenceModel

The ARIADNE Reference Model (http://www.ARIADNE-infrastructure.eu/Resources/ARIADNE-Reference-Model)wasdevelopedbyresearchactivitiesinWP14,leadbyFORTH,andisextensivelydocumented in deliverable D14.1 – Extended CRM. The ontology CIDOC-CRM (http://www.cidoc-crm.org/) is used as a backbone for the ARIADNE Reference Model. CIDOC CRM is in use fordescribing information in the context of cultural heritage and museums. In 2006, CIDOC CRMbecame an ISO standard, and nowadays is widely used in galleries, libraries, museums, researchinstitutesandarchives. In theARIADNEReferenceModel itworksasa top-levelontology,used toenablesemanticinteroperabilitytothemoredomainspecificontologies,extendingCIDOCCRM.

Figure1ARIADNEReferenceModel

● CRMinf(CRMinf,2015):theArgumentationModelisaformalontologyproducedbyStephenStead,PaveprimeLtdandcollaborators,and is intendedtobeusedasaglobalschemaforintegrating metadata about argumentation and inference making in descriptive andempirical sciences, such as biodiversity, geography, archaeology, cultural heritageconservation,researchITenvironmentsandresearchdatalibraries.Itsprimarypurposeistofacilitate the management, integration, mediation, interchange and access to data aboutreasoningbyadescriptionofthesemanticrelationshipsbetweenthepremises,conclusionsandactivitiesofreasoning.Besidesapplication-specificextensions,thismodelisintendedtobecomplementedbyCRMsci,amoredetailedmodelandextensionof theCIDOCCRMformetadataaboutscientificobservation,measurementsandprocesseddataindescriptiveandempiricalsciences,alsocurrentlyavailableinafirststableversion.ReferencedocumentandRDFSencodingareavailableinversion0.7.

Page 8: ARIADNE: Pilot deployment experiments

8

● CRMsci(CRMsci,2016):theScientificObservationModelisaformalontologyintendedtobeused as a global schema for integrating metadata about scientific observation,measurements and processed data in descriptive and empirical sciences, such asbiodiversity, geography, archaeology, cultural heritage conservation, research ITenvironments and research data libraries. Its primary purpose is to facilitate themanagement, integration, mediation, interchange and access to research data bydescriptionofsemantic relationships, inparticularcausalones.CRMsci is in theprocessofbeingvalidated inthecontextoftheARIADNEproject.Themodel isnot“finished”,andallconstructsandscopenotesareopentofurtherelaboration.ReferencedocumentandRDFSencodingareavailableinversion1.2.2

● CRMgeo (CRMgeo, 2013): the Spatiotemporal Model provides a linkage between thestandards of the geospatial and the Cultural Heritage community, in particular betweenGeoSPARQLandCIDOCCRM.Thismodelsaimsatbeingacomprehensivetheoryfromwhichallkindofplacescouldbedescribed,suchas-thistheplaceoftheVarusBattleoristhistheplacewhereLordNelsondied,includinggeometricspecifications.ReferencedocumentandRDFSencodingareavailableinversion1.2.

● CRMdig(CRMdig,2014)isanontologyandRDFSchematoencodemetadataaboutthestepsand methods of production ("provenance") of digitization products and synthetic digitalrepresentations,suchas2D,3DorevenanimatedModelscreatedbyvarioustechnologies.Itsdistinctfeaturesarethecompleteinclusionoftheinitialphysicalmeasurementprocessesandtheirparameters.CRMdighasbeendevelopedasacompatibleextensionofCIDOCCRM(ISO21127), which allows for querying the most relevant facts and returning completedescriptionsencodedinthismodelbygenericCIDOCCRMtermswithouttheneedtorefertoitsspecificproperties.Theapplicationssofarperfectlyconfirmthewideapplicabilityandpotentialofthismodelforallkindsofscientificdataandotherdigitalobjectsanditssuperiormaturityintermsofcoverage,genericity,expressivepowerandlevelofdetail.ReferencedocumentandRDFSencodingareavailableinversion3.2.1.

● CRMba (CRMba,2016;Ronzino, 2015): theBuildingsArchaeologymodelwas conceived tosupport the process of recording the evidences and the discontinuities of matter inarchaeologicalbuildings, inorder to identify theevolutionof thestructure throughout thecenturiesandtorecordtherelationshipsbetweeneachofthebuildingcomponentsamongthem and with the building as a whole. It aims to express the semantic relations of thestratigraphic units of a standing building, taking into account the stratigraphic analysistheoryofthestandingbuildings.CRMbaisintheprocessofbeingvalidatedinthecontextoftheARIADNEproject.Themodelisnot“finished”,allconstructsandscopenotesareopentofurtherelaboration.ReferencedocumentandRDFSencodingareavailableinversion1.4.

● CRMarchaeo (CRMarchaeo,2016):CRMarchaeo isanontologyandRDFSchematoencodemetadataabout thearchaeologicalexcavationprocess.Themodelwas created to supportthisprocessandall thevariousentitiesandactivities related to it.Themodelwascreatedstarting with standards and models already in use by national and international cultural

Page 9: ARIADNE: Pilot deployment experiments

9

heritage institutions, and has been enriched by continuous collaboration with variouscommunities of archaeologists from different countries and schools. CRMarchaeo isintendedtoprovideallnecessarytoolstomanageand integrateexistingdocumentation inorder to formalise knowledge extracted from observations made by archaeologists,recorded in variousways and adopting different standards. Its purpose is to facilitate thesemantic encoding, exchange, interoperability and access of existing archaeologicaldocumentation. The model documents, in a transparent way, the various aspects of thearchaeological excavation process, including the technical details concerning differentmethods of excavation, the reasons for their application and the observations made byarchaeologists during their activities in the field. This approach allows the creation of anobjective documentation that can guarantee the scientific validity of the results, makingthemrevisablefollowingfurtherinvestigations,andreusableindifferentresearchcontexts,inordertoanswerfurther(andpotentiallydifferent)researchquestions.ReferencedocumentandRDFSencodingareavailableinversion1.4.

Page 10: ARIADNE: Pilot deployment experiments

10

3.LinkedDataEnrichment

Theuseofanontologyalonedoesnotautomaticallyleadtodataintegrationofdifferentdatasets,astheontology justdescribes thesemanticsof the informationmodel.Thevaluesand terminologiesusedwill still be diverse in terms of different spellings, errors and languages. For instance, a sitename can differ from dataset to dataset. In the example below (Figure 2 Example for Linked Data

enrichmentforplacetermsusingthe iDAI.gazetteer.),twodifferentobjects,onesculpturedocumentedinArachne(Figure2ExampleforLinkedDataenrichmentforplacetermsusingtheiDAI.gazetteer.left)andone sarcophagus documented in the British Museum collection (Figure 2 Example for Linked Data

enrichmentforplacetermsusingtheiDAI.gazetteer.right),werefoundinthesameplace.Asthevalueforthesameplacediffers,itcouldnotbeidentifiedasthesamesite,thereforeitisultimatelynecessarytorefertoonlineresources,i.e.agazetteer,thatactasstandardizationtoolsinsteadoftextstringswhereverpossible.

Differentcases,wherestandardscouldbeapplied,were identifiedandwillbeexplained infurtherdetail.

Figure2ExampleforLinkedDataenrichmentforplacetermsusingtheiDAI.gazetteer.

3.1Places

The location of a find or the different spellings and the position of an archaeological site arefundamental pieces of information usedbymany systems. Therefore, it is of great importance toprovide systems, which offer a referenceable URI to the desired locationwith different languagerepresentations,spellingsandlocation.TheconceptofGazetteersasareferencesystemisquiteoldandwasadoptedearlyonbyarchaeology(e.g.Clark1932). It isprettystraightforwardtoenhance

Page 11: ARIADNE: Pilot deployment experiments

11

them semantically. In terms of standardization it is themost advanced class and there are somepossibilities:

● Geonames(http://www.geonames.org/)isagazetteerofworldwideplacenamesandoffers

accesstoover10millionplacenames,whicharecategorizedin9differentclasses.Ithasageneric approach and offers very dense data of recent populated places. However,archaeological places are quite sparse. For example, Geonames provides only 117archaeological sites in Greece(http://api.geonames.org/search?featureCode=ANS&maxRows=1000&country=GR).

● Pleiades (http://pleiades.stoa.org/home)was started in2006asadigital representationofthe Barrington Atlas of the Roman and GreekWorld, but was soon adopted for a widerscope.With35.000ancientsites,itisthelargestgazetteerfortheancientworld.

● iDAI.gazetteer is geographical databasedevelopedat theGermanArchaeological Institute,whichprovidesmorethan1millionentriestodescribemodernandancientplacesthatareof interest to archaeologists and also acts as a hub by linking other Gazetteers likeGeonamesandPleiades.Itsscopealsoincludesgeographicdatarecordedatsiteorbuildinglevel,which is particularly usefulwhen connecting different archaeological databases thatoverlapingeographiccoverage.

3.2Time

Temporal information isas importantas spatial information. For theuse casesdescribedwehavenotusedatimegazetteer,butitishighlyrecommendedtodosointhefuture,oncethesystemsaremoretechnicallyevolvedandfilledwithdata.Thereareafewavailabletimegazetteers.

● PeriodO (http://perio.do/) is a large time gazetteer with a pragmatic data model for the

SemanticWeb.ItissupportedbytheARIADNEproject.● iDAI.chronontology (http://chronontology.dainst.org/) is a time gazetteer currently in

development at the German Archaeological Institute. It has amore complex datamodel,whichdistinguishesbetweenthedefinitionofaperiodandthedatinginformationbasedonthisdefinition.

3.3Literature

Thestandardizationinthecontextoflibrarieshasalonghistory,beginningwithin-houserulesforaspecificlibraryandleadingtotheclosecooperationbetweenlibrariesnowadays(Thacker,2000).Tothis end,metadata standards like theMARC21 and international identifiers like the InternationalStandardBookNumber (ISBN)wereestablished. ISBNnumbers,with their introduction in1972asthe ISO standard 2108, are commonly accepted and used. Unfortunately older publications don’thaveISBNnumbers,andastheregistrationisfee-based,smallerpublicationsandspecialisedbooksareoftenpublishedwithoutISBNnumbers,thereforeregistrationisn’tpossible.

Page 12: ARIADNE: Pilot deployment experiments

12

Therearealotofnormdataservicesforbooks,forexampleWorldCat(http://www.worldcat.org/).However,Zenon,withitsfocusonclassicalstudiesandentriesfor individualarticles,wasthemostobvious choice. Zenon is the bibliographical catalogue of the German Archaeological InstituteincludingthedataofallDAIlibraries,theDEIAmmanandtheWinckelmannInstitute.Theresourcewas used for a use case since it is one of the biggest databases for classics with more than 1.2datasetsandcovers,besidebooks,e-resources,mapsandarchivalmaterial.ThemetadatastandardMARC21isusedandURIsareprovided.

3.4ArchaeologicalTerminology

Ingeneral,therearetwodifferentapproachesforarchaeologicalterminology.Onecouldclassifyanddefinetheterminologyforthewholearchaeologyandclassicsinaverygenericway,oronecouldtrytodefineaterminologyonlyforaverysubject-specificsubset.Thereisalwaysatradeoffbetweenscopeandprecision.Acombinationofbothprocedures,wherethespecialisedterminologylinkstothemoregenericterminologywithsomeoverlappingterms,couldbeahybridsolution.

Eventhoughthereisa lotofspecializedvocabularyinusebybiggerinstitutions,theyoftendonotprovide their vocabulary or it is not available in an appropriate format. The British Museum’svocabulary is already provided in SemanticWeb formats, but unfortunately, asGruber and Smith(2014,207)havestated, theyarenot linkedtoexternalvocabularies,whichmakesthe integrationwithotherdatasetsmore complicated, and thedatasets and the vocabulary anopenbut isolatedisland.

● SpecialistOntologies:theterminologyofNomisma.org(http://nomisma.org/)fordescribing

coins and the evolving terminology of Kerameikos.org (http://kerameikos.org/) fordescribingGreek ceramics is readily available in LinkedOpenData formats. They are alsoproviding links to more generic terminology, which allow an easy integration of thespecializeddatasets.

● The Getty Arts & Architecture Thesaurus, AAT in short(http://www.getty.edu/research/tools/vocabularies/aat/), is a highly evolved thesaurus forthehumanities.ItoffersalotofdifferentbranchesfollowingISOstandards,butasitfollowsagenericapproach,itlacksspecializationatsomepoint.Otherlanguagesaresupported,butrarelyexistbesidesDutchandSpanish.

● The iDAI.vocab (http://archwort.dainst.org/de/vocab/) is a thesaurus of archaeologicalterminology with currently 14 different languages. Its aim is to collect and organize theterminologyusedintheservicesoftheDAI.TheGermanthesaurusoperatesasthecentralhub: a German term, for instance, is linked to translations to all the other availablelanguages. The concepts are classified according to the relations specified by the SKOSstandard. Thewords are connected to the equivalent concepts in other referenceworks,suchastheGetty'sAATandDbpedia.

Page 13: ARIADNE: Pilot deployment experiments

13

3.5Actors

Actors are of importance in archaeology for specific find categories, e.g. the issuer of a coin,generally named on the object, or actorsmentioned in inscriptions or iconographic items. Thoseactorsarenotnecessarilyrealhistoricalpersons,theycouldalsobemythologicalcharacterslikethedeitiesofGreekmythology.

● TheVirtual International Authority File (VIAF) (http://viaf.org/) is the biggest international

authoritysystemforpersons,whichcombinesseveralreferencedatasetsandmapsthemtothesameperson.Itsdatamostlyderivesfromthedifferentlibrarysystemsandthusworkswell for realhistoricalpersonsandwriters,which canbe referencedbyURIs, for examplehttp://viaf.org/viaf/60280417.However, the information aboutmythological actors is verysparseandcontainsalotofduplicates.

● Wikidata(https://www.wikidata.org/)isthedatabackboneofWikipedia.Itcontainsalotofmythologicalcharactersandcurrently,togetherwithDBpedia,isthebestresourceavailableformythological characters. It’s also5-star LinkedOpenData, as it contains links tootherresources,e.g.https://www.wikidata.org/wiki/Q163709.

Page 14: ARIADNE: Pilot deployment experiments

14

4.MethodologicalApproach

In Archaeology, relational database management systems are used almost exclusively for storinginformation indatabases,where it isverycommontousedesktopapplications likeFileMakerandMicrosoftAccess.Olderdigitalresearchdataisoftenjuststructuredasspreadsheets.Therefore,thedata integration approach has to be very flexible to allow an integration of all kinds of datasets,databasesand informationsystems.Variousdifferentkindsof structureddatacouldbe integratedbyusingXMLasacommonminimalstandard,whichcouldthenbeprocessedfurther.Thisappliestodesktopdatabases,whichallowexportingtoXML, likeFileMaker,OnlineDatabaseswithinterfacesforexportingdataintoXMLandevenallotherkindsofexportedCSVdatasets,whichcouldthenbeconvertedtoXMLusingappropriatetools.

Figure3Anoverviewofthedataflowfordataintegrationofdifferentdatasets

TomakethisdataavailablefortheSemanticWeb,oneneedstotransformtheXMLdataintotriplesinRDF/XML.Therefore,asufficientmappingspecificationisneededtosupportthetransformationofeach distinct schema of an archaeological dataset (source schema) into the ARIADNE RM (targetschema).Itiscrucialthatduringthetransformation,theinformationencodedinthesourcedatabaseisnotlostandtheinitial“meaning”ispreservedasmuchaspossible.Inseveralcases,informationinthesourcedatabaseis implicit,hiddeninforms,user interfacefieldsor- intheworstcase- intheheadofacurator,thedomainexpert.Thisimpliesthatthetransformationprocesscannotbecarriedout solely by technical people; it requires close collaboration between the domain experts, whopossess the domain knowledge, and the IT experts, who possess the semantic world technicalknowledge.

Thiscomplexprocessofmappingtheschemaofanarchaeologicaldatabasetoacommoncoherentglobalontologyisguidedbyusingamappingandtransformationtoollikethe3MMappingMemoryManager (http://www.ics.forth.gr/isl/3M/) maintained by FORTH. This online open source tool isusedtodescribeandeditschemamappingsinahumanandmachine-readableway.AfterimportingtheXMLdatainto3M,threemorestepsarenecessary:

Page 15: ARIADNE: Pilot deployment experiments

15

● Schema mapping: in this first step mappings are produced from attributes in the sourceschema (dataset) to classes in the target schema (CIDOC-CRM). Therefore, domainknowledgeabouttheexplicitandimplicitsemanticsofthesourceinformationmodel,aswellastechnicalunderstandingofthetargetontologyandthesystem,isneeded.

● URIspecification:thenextsteprequiresassigninganappropriateURItoeachresourceinthedataset.Inanidealworld,youwouldbeabletolinkdirectlytostableURIsoftheintegrateddatasource,ifthedataisavailableonline.Inthecaseofdataavailableonlyoffline,liketheiDAI.fielddata,itisonlypossibletocreatepseudo-URIs.

● Transformation:inthisfinalstepthetransformationofeverysourcedatasetrecordtoasetofappropriateRDF(https://www.w3.org/RDF/)triplesiscarriedout.

Afterthesestepsareapplied,theRDFtriplescanbeimportedintoatriplestore.Wetrieddifferenttriplestores, choosing Blazegraph (https://www.blazegraph.com/) in the end, as it provides verygood performance and usability even with more than several million triples. The data could beimportedusingconsolescripts,whichshouldbeappliedforbiggerdatasets,andawebbasedGUI-based import.Moreelaborate informationsystemsprovidean interface(i.e.OAI-PMH)mappedtoCIDOC-CRM, likeArachne, that allowsexporting thequerieddata inRDF/XML. Thesederived filescouldthenalsobeimportedintothetriplestore,tounifythedatasetsintooneresource,whichcouldbeaccessedbyaSPARQLendpointasanopeninterface.

Oncethedata is importedintothetriplestore,thiscouldbeenrichedusingLinkedOpenDatawebresources,asdescribedinchapter3.LinkedDataEnrichmentTechnically,theenrichmentworksbymappingthesourceterminologytothetargetterminology,whichshouldbereferencedbyURIsofwebresources.ThetriplesthemselvescouldbechangeddirectlywithinthetriplestoreusingSPARQLUpdate (https://www.w3.org/TR/sparql11-update/). Another approach is the manipulation of thetriples in RDF/XML before importing by using XSLT, or another programming language or by justusingatexteditorwithREGEXfunctionality.

In the final user interface, which builds upon themetaphacts platform, the user could access allintegrateddatasetswithintheBlazegraphtriplestoreviatheSPARQLendpoint.ExternaldatacouldalsobeaccessedbyutilizingFederalSPARQLqueriesonthesedistributedtriplestores,aslongastheontologyandstandardsusedoverlap.

Overall, many different datasets with various preconditions were integrated with the proceduredescribedabove,showingtheflexibilityandtheusefulnessoftheutilizedtools.

Page 16: ARIADNE: Pilot deployment experiments

16

Figure4Exampleoftransformeddatafromsourceschema(left)totargetschema(right)

Page 17: ARIADNE: Pilot deployment experiments

17

5.TechnicalEnvironment

X3MLmappingframework

TheX3MLmappingframework(Marketakisetal.,2016,Minadakisetal.,2015)includesamappingeditor,amappingspecificationandamappingmemorytoaccumulateknowledgeandexperience.Itsupports and guides the user during themapping process. Themapping specification is based onX3ML(X3ML,2014),anXMLbaseddeclarativelanguageaimingtosupportthecognitiveprocessofamappingdefinition. Itdescribesschemamappings inbothhumanandmachine-readableform,andsupports the close collaboration of domain and IT experts. X3ML separates schema matching –performedmainlybydomainexperts–fromURIgenerationandterminologymapping–performedmainlybyITexperts.

Blazegraph

Blazegraph is a graph database, which works as an RDF triplestore that natively supports thestandardizedquerylanguageSPARQL1.1,RDFSandtheWebOntologyLanguage(OWL).Itishighlyscalable,asitcouldberunwithastandardCPUor,forenhancedspeed,withasingleGPUandevenaGPUcluster.Itisusedasabackend,tostoreallthedataastriplesinatriplestore.

Metaphactory

The visualization of the scenarios are realized by the Metaphactory platform(http://metaphacts.com/).Metaphactory provides a basic semantic data integration by offering awiki system with widgets, which access the triplestore Blazegraph in the backend. It is an openplatform, which can be used for visualization of SPARQL queries, interactionwith the knowledgegraphandtoconstructuserinterfaces.

Page 18: ARIADNE: Pilot deployment experiments

18

6.Scenario1:Coins

Numismatics is a very traditional science with a lot of experience and early initiatives instandardizationoftheexistingdata(i.e.Bödefeld,Vacano1978).Inrecentyears,numismaticsexcelsintermsofLinkedOpenDataintheDigitalHumanitieswithahighgradeofaccessibledatasetsandstandardized vocabulary. One major collaborative project is Nomisma.org (2016), supported bymanyinstitutions.

Nomisma.orgservesasanauthoritativeresourceinthefieldofnumismatics.ItcollectsandprovidesURIs to common numismatic concepts and terms. Furthermore, a whole ontology (NomismaOntology 2016)was created and is used to integrate the open available databases. The ontologyprovidesaneasyandunderstandablewayfornumismatiststodescribetheirdatasets,butasitisjustlimited to numismatics, it’s very domain specific, if compared to the generic approach of CIDOC-CRM. Overall, numismatics provides a very good starting point for the item-level integration ofarchaeologicaldatasets,asitishighlystandardizedanddataiswidelyavailabletodemonstratetheusefulnessofusingontologies.

6.1.DescriptionofDatasets

ThedFMROarchive

Digitale Fundmünzen der Römischen Zeit in Österreich (dFMRO, digital Coin-finds of the RomanPeriod inAustria) isanonlineMySQLdatabaseof theNumismaticResearchGroupof theAustrianAcademyofSciences(Vondrovec2007).Sincethe1990sitdocumentscoin-findsfromtheCelticandRomanPeriodthathavebeenpublishedinvariousprintedvolumesoftheFMRO(FundmünzenderRömischenZeitinOsterreich/Coin-findsoftheRomanPeriodinAustria)fromthe1970supto2007.

StartingwithaMicrosoftAccessdatabase,itwassetupinitscurrentformin2007andhostsabout76.000 finds.All coins in thedatabasewere found inAustriaanddate fromtheCelticandRomanperiod(actuallytheentireAntiquity);theywereproperlyregisteredsonoillegalfindsareincluded,andmosthavealreadybeenpublishedby thevariousprojectsof theFMRO.Since2007,due toaformer project collaboration, the database also lists coins found in Romania. These coins werepublishedin“ColoniaUlpiaTraianaSarmizegetusa”,thefirstvolumeof“CoinsfromRomansitesandcollectionsofRomancoinsfromRomania”,andrepresentanimportantpartoftheAustrianculturalheritage.ThedFMROarchivewaschosenasthefirsthands-onexercisetomaparelationaldatabaseschematoCIDOCCRM,sinceitrepresentsalargeclassofwell-definedtraditionaldatabases.

The CIDOC-CRM Mapping is available online in the 3M Mapping Memory Manager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping209

Page 19: ARIADNE: Pilot deployment experiments

19

NumismaticarchivesfromtheCOINSproject

Another source of informationwe have taken into account comes from two numismatic archivesalreadyusedwithintheCOINSproject.Theyincludeasetof1670numismaticrecordscomingfromthe Cambridge Fitzwilliam Museum archive (FWM) and a set of 630 records coming from thedatabaseoftheSoprintendenzaArchaeologicadiRoma(SAR).

The COINS project (Combat On-line Illegal Numismatic Sales) aimed at providing a substantialcontribution to the fight against illegal trade and theft of coins by using state-of-art InformationTechnology.Theprojectdevelopedstandardizedinventoriesbyintegratinglegacyarchivesencodedindifferentformatsandusingdifferentlanguages.ThecreationofareferencecollectionofRomanandGreekcoinswasalsooneofthemostrelevantoutcomesoftheproject.

The FWM archive: The FWM subset comes from the Department of Coins and Medals of theFitzwilliamMuseumDatabase,whichrecordsinformationonmedalsandcoinsofdifferenttypesandage,discoveredduringexcavationsorcomingfromvariousacquisitionsordonations.It iscurrentlykept by the FW museum. Relevant fields used by FWM archive include: coin maker, productionlocation,mint, coin type, category, coinname, inscription, dimensions, production technique, andreferences to images. Databases also include notes concerning record creation andmodification,dateandtime,museumacquisitioninformation.

The CIDOC-CRM Mapping is online available in the 3M Mapping Memory Manager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping484

TheSARarchive:TheSARdatabase(originallyaMicrosoftAccessDB)wascreatedtocataloguethearchaeologicalfindsofamonetarytypemanagedbytheArchaeologicalSuperintendenceofRome,comingfrompublicandprivatecollections,andfromarchaeologicalexcavationsmadeinthecityofRomeanditsimmediatesurroundings.Themainpurposeofthearchiveistorecordandprovidethedate, the accurate descriptions (by indicating the precise origin or place of issue) and physicalcharacteristicsof the various coins. Further, it also shows the conditionsofdiscovery (excavation,auction, seizure, donation, etc.), the state of preservation and the current location (museum,superintendencies,collectionsandsoon).

The SAR database, in addition to the FWM fields reported above, also provides informationconcerningthephysicalfeaturesandconditionsofthecoins,theregioninwhichaspecificcoinwasminted (apart the exact location), specific informationon the chronology (i.e. the age, century orperiodduringwhichcoinminting tookplace),obverse/reverse inscriptionsof iconographyandthecurrentlocationofthespecificexemplartherecordrefersto.

The CIDOC-CRM Mapping is online available in the 3M Mapping Memory Manager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping168

Arachne

Arachne16 isthecentralobjectdatabaseoftheGermanArchaeological Institute(DAI).Currently itcontainsmorethan3,700,000digitalobjectswithcorrespondingmetadataandover300,000highlystructureddescriptionsofartefactsofarchaeologicalinterest.Arachnealsoallowsresearchprojects

Page 20: ARIADNE: Pilot deployment experiments

20

tostore,manageandpublishtheirdata intheavailableonlinecatalogues.Therearecurrently485coinsfromthedigitizedmuseuminventoryandresearchprojectdatathatpresentvaryinglevelsofmetadata quality. Some are of excellent quality, such as the 107 coins with figures related toharbours found in the DFG funded ”SPP-Häfen”. Besides a detailed description, these provideextensive informationabout thebibliographic referencesanddatingopinionsofdifferentauthors.Arachne’s data is mapped to CIDOC-CRM and is provided as RDF/XML by an OAI PMH interface(http://arachne.uni-koeln.de/OAI-PMH/oai-pmh.xml?verb=Identify), which was used to harvest theabove-mentioneddata.

iDAI.field-PergamonProject

Sinceitsfirstusagein2005,theiDAI.fielddatabasehasbeenadoptedbyaround35archaeologicalprojects.Itsmainfocusistoprovideafieldresearchdatabaseforarchaeologicalresearchactivities.The modular system also contains a find module with specific attributes for coins found duringexcavationsorsurveys.Inafirstintegrationtest,517coinsofthePergamonprojectwereusedwithdetailedinformationaboutthearchaeologicalcontext.

The CIDOC-CRM Mapping is online available in the 3M Mapping Memory Manager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping9

MuseiD-Italiacollections

We also investigated the possibility to integrate the collections ofMuseiDItalia, the digital libraryintegratedinCulturaItalia.ThedataareinCIDOCCRMformandcanbeextractedviatheOAI-PMHoftherepository.MuseiDItaliaincludesseveralcollectionsofcoinsfromItalianmuseumssuchas:

● MuseoarcheologiconazionalediVenezia○ Ilmedagliere:serieromana-imitazioniofalsificazionimoderne,86coins○ Ilmedagliere:seriegrecaebizantina,758coins○ Ilmedagliere:serieromanaebarbarica,2307coins

● MuseoarcheologiconazionalediCrotone,RepertiarcheologicieNumismatica,31coins● CollezioneMuseoArcheologicoNazionale-ReggiodiCalabria,136coins● CollezionenumismaticaMuseoArcheologicoNazionalediAltamura,99coins● 3008coinsfromRegioneUmbria

The major problem with these data is that they don’t contain appropriate labels in order to bedisplayed.However, if they undergo a pre-processing stage, they canbe integratedperfectlywellwithotherdatasets.

NaturalLanguageProcessingdata

Asafurtherexperimentinthecoinintegrationusecase,weinvestigatedtheintegrationofnaturallanguage processing data (described in D16.4). 37 records of NLP data from Heslington EastExcavationArchivewereintegratedwiththerestofthecoindatasets.

Page 21: ARIADNE: Pilot deployment experiments

21

ACDMdata

TheARIADNERegistryisthemainresourceoftheproject,wherealltheavailablecollectionsand/ordatasetsaredescribed.So,inordertohaveacompletedescriptionofthecoindatasets,weincludedtheinformationcontainedintheregistry.

6.2.KnowledgeGraph

The datasets that we adopted in this use case have several differences concerning the origin,language,purposeofcreationanduse.Havingbeencreatedbyvariousinstitutionsandfordifferentpurposes,theyhavequitedifferentdatastructures,despitethesimilarityincontent.Thedatabasesof the SAR and the dFMRO, for instance, were created with the purpose of documentingarchaeological discoveries,whichoccurredduringexcavationsor surveys, and containmany fieldsreporting information on provenance and discovery conditions. FWM, on the other side, is amuseum database whose sole purpose is to catalogue acquisition and inventory data of objectsownedandstoredbythemuseumitself,regardlessofthearchaeologicalprovenanceconditions.ThemappingofthecoindatasetsstartedwiththedFMROarchive,whichwaschosenasthefirsthands-onexercisetomaparelationaldatabaseschematoCIDOCCRM,sinceitrepresentsalargeclassofwell-definedtraditionaldatabases.Inclosecooperationwiththedomainexpertswetriedtoidentifyinformationthatwasimplicit,hiddeninforms,inuserinterfacefieldsorwasknownonlybythem.AdetaileddescriptionofthemappingofthedFMROarchivetoCIDOCCRMwaspresentedduringtheCAA2015conference(DoerretAl.2015).ThedFMROmappingwasusedasaguideforthemappingof the SAR and FWM datasets. The records of the FWM archive contain fields with condensedinformationthatneedstobepre-processedandnormalizedbeforeitcanbemappedtoCIDOCCRM.For example, all the information concerning the dimensions of a coin (height, width, weight) isencodedinonefield:

<Dimension>image(height),22,mmimage(width),20,mmweight,3.74</Dimension>

and needs to be normalized before the actual transformation takes place. It isworthmentioningthatthemappingofaschematoCIDOCCRMisnotnecessarilyunique.Theremaybedifferentwaysof approaching theproblem, all correct.However,what is individually correctmay turnout tobeproblematic if considered in the context of a largerprocess. For example, in thedFMROand SARmappings,thecoindenominationwasmappedasanE55Type,whileintheiDAIfielditwasmappedasanE54Dimension.Conceptuallybothapproachesarecorrect,buttheircoexistenceinthesameprocess is clearly problematic. Rather than imposing a unique style,we have chosen to reconcilesuch differences at the query level. The dates are also a crucial point in the integration of thedatasets.Differentformatsandapproachesmayhavebeenusedtoencodetemporalinformationinthesourcedatabases.Tomentionjustasimpleissue,thevaluezeroisusedasadateinsomeofthedatasets, possibly with different meanings: for instance, such a value might indicate the year inwhich Jesus was born, or the fact that the year is unknown, or not recorded. This poses severalproblems.First,zeroisnotavaliddateinRDF(orintheunderlyingXMLtypesystem),sothevaluehastobetransformedintoavaliddate.Butinordertocarryoutthistransformation,itisimportanttoclarifythesemanticsofthezerovalueineachdataset.

Page 22: ARIADNE: Pilot deployment experiments

22

6.3.QueryPossibilities

Theultimategoaloftheintegrationofthediversecoindatasetsistocreateanenvironmentwhere

userswill be able to specifyqueries thatwill beevaluatedon the commonaggregated repository

and will be able to combine results coming from the different datasets. We have identified the

followingresearchquestions:

● Origin-Wheredoesthiscoincomefrom?

● Tracking-Howdiditarrivehere?

● Chronology-First/lastappearance

● Practical/symbolicvalue,incidents-Whyisitdepositedhere?

● Politicalmessage-Whywasitproduced(i.e.”minted”)?

● Economicstability,power-Whywasitwidelyused/notused?

● Statistics-Materialversusnominalvalue

Thereexistseveralqueriesthataretrivialtobeansweredbyeachdatasetseparately,howeverthey

becomeimportantiftheycanbeansweredbytheaggregatedrepository:

● Findcoinsmintedinthesameplace/areaorbythesameauthority

● Find coins produced in the same period or time span (typically the same century or

half/quartercentury)

● Findcoinshavingcommonshape/iconography/inscriptions

● Findcoinsmadebyaspecificmaterial.

Combinations of the above queries can be found useful by the researchers of the numismatic

community and our first experiments with such queries on the aggregated repository are quite

promising. Our experimental aggregated repository contains 72 records (all Roman coins) of the

dFMROarchive,627 records (allRomancoins)of theSARarchive,517 records (12Romancoins1

empty record)of thePergamonarchive (iDAIfield)and1 record fromMuseiD-Italia.The resultsof

somesimplequeriescanbeseeninthefollowingtable:

Page 23: ARIADNE: Pilot deployment experiments

23

Query Total dFMRO SAR Pergamon MuseiD

Findallcoins1216 72 627 516 1

FindRomancoins711 72 627 12

Findbronzeassarius82 29 52 1

Findbronzecoins676 50 270 355 1

Findbronzesextans47 46

Findcoinsproducedintheyear-32

22 4 18

Table1ResultsfromexamplequeriesintheCoinsscenario

The query possibilities described above use the item specific information that is contained in thedatasets. However there is also the catalogue (registry) information, which is of significantimportance.

Theactualquerypowerliesincombiningitemlevelwithcatalogueinfo:

• Find all bronze Antoninianus coins (item level info, retrieves datasets from multipleproviders)

• Findthepublishersofallcollectionsthatcontaincoins(catalogueinfo)• Find the publishers of all collections that contain bronze Antoninianus (item level and

catalogueinfo)

ThecoinexperimentaldemonstratorprovesthattheintegrationofitemlevelinformationwithNLPdataandcatalogueinformationunderacommonconceptualschemaprovidesapowerfultoolthatcananswercomplexqueriesonheterogeneousdataatdifferentlevelsofdetail.

Page 24: ARIADNE: Pilot deployment experiments

24

7.Scenario2:Sculptures

Theresearchonsculpturesisverytraditionalandhighlyembeddedinthecontextofmuseums.Eventhough the museum community under the hood of ICOM’s “International Committee forDocumentation”(http://network.icom.museum/cidoc/) has developed a lot of different metadatastandards likeLIDOorCIDOC-CRMfordataexchange,therearen’tmanysubject-specificstandardsestablishedforthedescriptionofsculptureobjects.Inthefollowing,theselecteddatasetsarebrieflydescribed. Only databases with permissive licenses and where the data could be extracted withreasonableeffortweretakenintoaccount.

7.1.DescriptionofDatasets

AthenianAgora

The American School of Classical Studies at Athens started in 1931 systematically excavating theAthenian Agora. The database of the project is a single integrated online accessible archive(http://agora.ascsa.net/research?v=default) to all the different resource materials: the currentdigital-bornexcavationdocumentation,finds,architecturalplans,drawings,reportsandpublication.Priortothatthedocumentationreliedonfourdifferentanaloguenotebooks,whichgotdigitizedandannotated.All the objects are documented andmapped to theDublin Core standard and allow asearchandopenaccesstoover355.000objectscreatedin85yearsofresearchactivity.

TheSolr(http://lucene.apache.org/solr/)poweredonlinedatabaseoffersanXMLexport,whichwasused toextractmore than1000sculptureobjects.Theseobjectsarehighlycontextualisedwithallother available sources, from which 1700 archaeological contexts and 56.000 bibliographicreferencestomorethan240documentswereselected.

TheCIDOC-CRMMappingof theAgoraDatabaseareonlineavailable in the3MMappingMemoryManager:

• Sculptures:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping487

• Publications:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping511

iDAI.objectsArachne

iDAI.objects Arachne (https://arachne.dainst.org/), as the central object database of the GermanArchaeological Institute, provides researchers a free searching tool available online. Originallystartedin1995asaFilemakerdatabase,itwasintendedasanimagedatabaseforsculpturesoftheresearch archive for antique sculptures (Forschungsarchiv für Antike Plastik) of the University ofCologne(Scheding2014,38).In2004thedatabasemigratedtoMySQLandextendeditsfocustoallkinds of archaeological objects. Currently in its fourth version, the database holdsmore than 3.6

Page 25: ARIADNE: Pilot deployment experiments

25

million objects with associated metadata. All 83.085 sculptures in Arachne are from Roman andGreek times, obtainedmainly through research project data, digitization of archives andmuseumcollections,likethecompleteCatalogueofSculpturesintheAntiquitiesCollectionoftheBerlinStateMuseums(AntikensammlungderStaatlichenMuseenzuBerlin).Themajorityisofexcellentquality,as the sculptures contained a detailed description and are contextualised by more than 1000archaeologicaltypesand88.000bibliographicreferenceswithlinkstotheliteraturecatalogueZenon(http://zenon.dainst.org/).Arachne’sdata ismapped toCIDOC-CRMandprovidedbyanOAIPMHinterface, which was used to harvest the above mentioned data in RDF/XML format:http://arachne.uni-koeln.de/OAI-PMH/oai-pmh.xml?verb=Identify

BritishMuseumCollectionOnline

The British Museum Collection Online database currently provides access to around 2.28 millionrecords. Digitization of the collection is still on going andwill cover every object in theMuseumcollectionwithcorrespondingscientificandconservationmetadata.ThedataisalsoexposedfortheSemantic Web as a SPARQL service: http://collection.britishmuseum.org/, providing access to allobjects in theCollectionOnline,withall object informationmapped toCIDOC-CRMandanonlinepublished thesaurus. For this scenario, a total numberof 52.000openly available sculptureswereused.

iDAI.fieldproject“Chimtou”

Since 2005, the field research documentation system iDAI.field (Schäfer, 2011), based on thecommercial software Filemaker, was developed and maintained by the German ArchaeologicalInstitute (DAI). It provides a very complex information model and supports modules fordocumentation of excavations, surveys, building studies, iconographic studies, material studies,restorationworkandscientificsampling,whichcanbeadaptedforproject-specificneeds.CurrentlyiDAI.field isusedbyaround35fieldresearchprojects fromdifferentuniversitiesandtheDAI foratotalofover1milliondatasets.For thecurrentapproachwe included thedatasetof theChimtouproject’sdatabase(https://www.dainst.org/projekt/-/project-display/33904),witharound500stoneobjects,includingsculptures,whichwerefoundinroughly120archaeologicalcontexts.ThedatawasexportedbyusingFileMaker’sexportfunctionality,whichsupportstheXMLformat.

TheCIDOC-CRMMappingoftheiDAI.fieldisonlineavailableinthe3MMappingMemoryManager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping303

GazetteerofStoneQuarriesintheRomanWorld

For our second use casewe added a list ofmarble quarries to our data. Sincewe also extractedadditionalinformation,thisdataisahybridbetweenthenormdatasourcesdescribedinchapter3.LinkedDataEnrichmentandthesourcesof“pure”contenthighlightedinthischapter.

ThestudypublishedbyBenRussellin2013iscurrentlythemostcomprehensivecollectionofancientmarbles, and is available either online(http://www.romaneconomy.ox.ac.uk/databases/stone_quarries_database/ or as a PDF:

Page 26: ARIADNE: Pilot deployment experiments

26

http://oxrep.classics.ox.ac.uk/docs/Stone_Quarries_Database.pdf). It isoperatedandhostedbytheOxford Roman Economy Project. All 794 quarries present in the datasets were included via thetransformation of the PDF Document to structured data in the form of CSV, which was in turntransformedtoXML.Theinformationsubmittedconsistsofsite,coordinates,location,country,andformer affiliation to Roman Imperial provinces, a short declaration of the material and itscharacteristics, and themost important literary sources.Thisoverview is suitable foran inventoryand commonbasis ifwealsopointout someminorproblems in the terminology,e.goccasionallymoderntermsarefavouredinsteadofmissingancientterms,astheyareprobablymorewell-known.

TheCIDOC-CRMMappingisonlineavailableinthe3MMappingMemoryManager:

http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping647

7.2.KnowledgeGraph

Once all thenecessary standards (chapter 3. LinkedData Enrichment) are applied and thedata isimportedintothetriplestore,thefullknowledgegraphbecomesaccessibleforfurtherinvestigationsonthedata.

CIDOC-CRMwaschosenasacommonontology,asitisawidelyacceptedandusedontologyintherealm of museums and is gaining more importance in digital humanities as well. There are alsoextensions,beside theCIDOC-CRMCore,whichcanbeused if it is requiredby thedata.WehaveusedCRMsciandCRMarchaeotodescribescientificdataacquisitionandarchaeologicalexcavationprocesses. Furthermore, we used the Functional Requirements for Bibliographic Records (FRBR)ontology to describe the bibliographical records, and the W3C Basic Geo vocabulary(https://www.w3.org/2003/01/geo/)forsimplegeometrydescription.

Beforeapplyingaunifiedqueryoverallintegrateddatasets,themappingsneededtobeharmonized.Smaller differing mappings, like the assignment of measured object dimensions, needed to beharmonized. For the linked open data enrichment (chapter 3. Linked Data Enrichment) we choseWikidata for actors, Getty AAT andWikidata for archaeological terminology, Zenon for literature,andiDAI.gazetteerforplaces.

Page 27: ARIADNE: Pilot deployment experiments

27

Figure5GraphicaloverviewoftheCIDOC-CRMmappedsculpturedatasets.

7.3.QueryPossibilities

Objectcentricqueryforsimilarobjects

Theobject-centricqueryisaboutafragmentaryheadofasatyrthatwasfoundduringexcavationatChimtou in November 2010 (Scheding, 2013). First the object was registered in idai.field andafterwardsinidai.objectsArachne(http://arachne.dainst.org/entity/2295540).Theextentofthehead,thedesignationasasatyr,thestylisticandiconographiccreationandthematerialnamedaswhitemarblearefactsthatcouldbeusefulinfindingcomparableobjects.Theaimistolearnmoreabout

Page 28: ARIADNE: Pilot deployment experiments

28

theexemplarathand,togetinformationaboutotherexemplarsandtogetanoverviewofquarrieswherethesamekindofmarble(whitemarble)wasproduced.If,forexample,ourresearchquestionistosearchforcomparablesatyrstatuesandthe“type”thatisshownbythefragmentaryhead,theparametersforaquerycanbedefinedas:

● Extent:under1.15mheight● Material:whitemarble● Term:satyrhead

Forthetechnicalfulfilmentofthequeryitwasnecessarytoenrichthetripleswithinthetriplestorewith Linked Open Data standards using SPARQL Update. All terms for the material were linkedaccordinglytoappropriateGettyAATtermsbycreatingnewtriples,whiletheactorsrepresentedonthesculpturewerenormalizedbyusingWikidataasacommonstandard.Forexample:

<http://arachne.uni-koeln.de/entity/1092353>crm:P45_consists_of<aat:300011599>

<http://arachne.uni-koeln.de/entity/1092353>crm:P62_depicts<wd:Q163709>

Page 29: ARIADNE: Pilot deployment experiments

29

Figure6Userinterfaceforobjectcentricqueriesoverseveraldatabases

Page 30: ARIADNE: Pilot deployment experiments

30

After these standards were applied, it was possible to apply the same query for all integrateddatabases: thiswas thebasis of a prototypical user interface, see Figure 6User interface for objectcentricqueriesoverseveraldatabases

Theresultsprovideuswithseveralcomparableobtained,reconstructedorfragmentarystatuesthatalreadypointtothefactthatthesatyrheadfromChimtoucanbeassignedthetype“Ludovisi”.Thenext stepwouldbeanevaluationof themost suitableobjectsby further research filters inanewquery, or the matching of “pairs, copies and correspondences” to evaluate the most likely type,which in the context of the satyr found at Chimtou is the Ludovisi type, with many comparableexemplarsfromnumerouscountries-especiallyItaly,butalsofromtheGreekislandofRhodes.

Enhancingliteratureresearch

Literature references are essential for research purposes. Normal bibliographical systems offer atbestathesaurustotagthebibliographicaldataset.Thisapproachislimitedbytheextentoftheusedthesaurus,andforthespecificusecaseonecouldonlysearchforliteraturecontaininginformationabout sculptures. The use of the knowledge graph could enhance literature queries and furtherinvestigation,asitallowsamoreprecisequery.

Toexemplifythisapproach,acloserlookatthemarbleisneeded.Inthiscasethereisanimportantopportunitythatisofferedbyqueriesrelatingtoquarries.Ifthequerydealswiththequestionofanoverall spectrumofone singlemarble,newparameters couldbe formulated. ThePentelicmarble(http://archwort.dainst.org/de/term/6273)waschosen,asitisoneofthemostpopularancientmarbleswithawhitesurface.Togetanoverviewofstatuesmadeofthismarble,thefollowingparameterscouldbeformulated:

● Material:Pentelicmarble● Extent:biggerthan0,70m● Term:statue

In the quarry use case the knowledge graph is used to show all literature containing informationaboutsculpturesmadeofPentelicmarble,with393hits.Aggregate functions, like thecountingofdocumentedsculpturesmadeoutofPentelicmarbleforthespecific literature,offera furtherhintaboutwhichliteraturemightdeliverthebestresults.

ThequerydeliversmanyobjectsinanoverviewfromeverydatabasethatincludesPentelicmarble.ItbecomesobviousthatPentelicmarblehasbeenusedforalongtimeinAntiquitytoproducestatuesand sculpture of every kind, mostly by the Athenian workshops. This can be a starting point forfurtherqueries.

Page 31: ARIADNE: Pilot deployment experiments

31

Figure7SearchingforstatuesmadeofPentelicmarble

Bringing the data together faces a few challenges. For example, there are some remarkabledifferences in the terminology of the different datasets.Wewant to point out two problemswehaveobserved. For ancientmarbles there are notmanydifferences that cause problems, but thehuge quantity of marbles in the database are described using the modern and ancient termsalternatively.Also,while theGettyAATand iDai.vocabprovideusefulcategorizedandmultilingualoverviews ofmanymarbles, the BritishMuseum Collection notablymentions only the parameter“marble”withoutfurtherdetails.Forexample,aquerywiththeparameters“sculpture”and“gialloantico”willdelivermatchesineverydataset,butifweareusing“marmornumidicum”-theliterary-basedancientname-insteadof“gialloantico”,theArachnecancopewithitbutwewon’tgetthedata from the British Museum Collection. The same heterogeneous approach of the datasets is

Page 32: ARIADNE: Pilot deployment experiments

32

alreadypresent in thearchaeological research literature.Analreadywidely accepted collectionofancientmarbleswas recently published by Ben Russell in his “Gazetteer of StoneQuarries in theRoman World”. However, a problematic aspect of Russell’s study is the occasional absence ofancientnamesforsomemarblesthatareknownfromtheliterarysources,whileotherstudiesthatwerefundamentalforthedatasetsinthelastdecadesconsequentlyusedthistechnicalterminology(For instance Gnoli, 1971; Schneider, 1986; Maischberger, 1997). Thus we have a seriousinconsistencyinolderandyoungerdatasets.

These differences can cause huge problems in a query that will sometimes deliver faulty andincomplete results. It becomes apparent that there is a need for common standards and/oragreements about the archaeological terminology and appropriate tools to assist the mappingbetweendifferentterminologies.

Page 33: ARIADNE: Pilot deployment experiments

33

8.Scenario3:Buildings

ThescenariodescribestheapplicationoftheontologicalmodelsCRMbaandCRMarchaeotomodel

anunstructureddatasetabouttheexcavationofaprehistoricarchaeologicalsite,inCrete.Thetwo

CIDOC CRM extensions were developed within ARIADNE to cover the specific needs of the

archaeological investigationand thedocumentationofbuilt structures, and tohelparchaeologists

examineandunderstandthecomplexrelationshipsbetweenallentitiesandactivitiesrelatedto it.

Indeed,archaeologicalinvestigationsarenotonlyfocusedonexploringthesubsurfacedepositsofan

area, but they also include the study of buildings and other built structures,which can be found

standingonthesoilsurfaceorasfragmentsunderneath.

8.1.DescriptionofDatasets

Zominthos is a prehistoric archaeological site located on the north slope of Psiloritis, the highest

mountainontheislandofCrete,Greece-atanaltitudeof1187mabovesealevel.It liesbetween

themodern village of Anogeia and the Idaean Cave, one of themost important sacred caves of

Crete.ItisthelargestknownresidentialcenterofMinoantimesonthemountains.Itwasdiscovered

in 1982 by professor Yiannis Sakellarakis. The excavations revealed a building (Figure 8 Central

Buildingoftheminoansettlement"Zominthos"

(http://www.efsyn.gr/arthro/galazia-kokkini-kai-leyki-zominthos)),conventionallyreferredasthe"Central

Building"andanextendedMinoansettlement.ZominthoswasoccupiedfromtheMinoanperiodto

moderntimes.

Figure8CentralBuildingoftheminoansettlement"Zominthos"(http://www.efsyn.gr/arthro/galazia-kokkini-kai-leyki-zominthos)

Page 34: ARIADNE: Pilot deployment experiments

34

The "CentralBuilding" coversanareaof1,600 sqm. It isoneof themostwell preservedMinoan

buildings,withwallsstandingataheightover2,50m,withat least60roomsonthegroundfloor,

upperstorey,staircases,pavedfloors,corridors,courtyards,windows,entrances,lightwell,storages,

workshops,decoratedprivateroomswithfrescoes,ritualroomswithcolumns,drainagesystem,etc.

It was built of large blocks of local limestone. During the excavations, a large number of

masterpieceswere revealed: bronze figurines, bronze cups, daggers, elaborate pins and pendants

madeofbone,ivoryandbronze,jewelrywithsemi-preciousstonesorbone,seals,rhytaintheform

of animals, chaliceswith floralmotifs, cups, jars, pithoi, etc. Themain phase is dated at the Late

Minoan IA period (1700-1600 BC). It was destroyed by an earthquake around 1600 B.C., which

resultedinafirethatdevastatedthevastmajorityofbuilding’sconstructionelementsandmaterials,

like collapsed parts of the upper storey fallen paved floors, large amounts of stone slabs, burnt

woodenbeam,canefragments,mortar,frescoes,earthclayetc.(Sapouna-Sakellaraki,2013).

TheexcavationisongoingunderthedirectionofDr.Sapouna-SakellarakiEfiandundertheauspices

oftheArchaeologicalSocietyatAthens.

Thedocumentationproducedsince2005consistsofthearchaeologist'sexcavationdiaries,reports,

photos andmaps. All information is locally digitally stored and structured in folders. Someof the

contentisavailableonline,publishedonthe“interactivearchaeology”websiteandinotherarticles

andproceedings.Sincetheexcavationisongoing,thefinalpublicationoftheexcavationisnotready

yet.

In the framework of a study about the construction methods and the materials of the Central

Building, thepossibilityofusingSYNTHESIS,a cultural information systemdevelopedbyFORTH, is

underdiscussion.The information systemsupports scientific andadministrativedocumentationof

museum objects and monuments, and can be extended to enable the documentation of

archaeologicalsitesandbuiltstructures.

8.2.KnowledgeGraph

Fortheaimofthiscasestudyweselectedasubsetofinformation,whichwemodelledusingCIDOC

CRMcoreand itsextensions,mainlyCRMba,CRMarchaeo,andCRMsci.Weusedthe3Meditorto

edittheschemamapping.Thegraphrepresentingthemappingoftheselectedsubset(Room37)to

CIDOCCRMcore,CRMbaandCRMarchaeoisshownbelow(Fig.10).

Page 35: ARIADNE: Pilot deployment experiments

35

Figure9Room37,CentralBuilding,Zominthos

(http://interactive.archaeology.org/zominthos/2016/07/field-notes-2016-week-1/)

ROOM37

Room37(Figure9Room37,CentralBuilding,Zominthos

(http://interactive.archaeology.org/zominthos/2016/07/field-notes-2016-week-1/) ) is located in thesouthwestpartoftheCentralBuilding.Theexcavationofoneoftheupperfloorswascompleted.Itcontainedmostly lepidha-earth and fallen stones. In the northeast corner, the removal of a claystructure from the upper storey was also continued. A concentrated mass of burnt clay-earthconstructionwasuncovered.Inthesoutheastcorneragroupofstonesmaybelongtothesouthernwalloftheroom,whichwasn’tfoundduringthepreviousexcavationperiod.ThesouthernedgeoftheEasternWallwasalsorevealedandtheworksextendedtotheopeningbetweentheRooms37and39.

Several animal bones, some conical cups, twowhetstones and a sealstone, from steatitewith ancarvediconographictheme,smallpiecesofburntwoodandsomesmallgroupsofceramicpots,twoofwhichwerefoundlayingonschistslabsoftheupperstorey’spavedfloor,werecollected.

Page 36: ARIADNE: Pilot deployment experiments

36

Figure10GraphofthemappingofRoom37informationtoCIDOCCRM

A9 Archaeological Excavation Zominthos’excavation

P9 consists of

A1 Archaeological Excavation Process UnitExcavation of Room 37

O19 has found object

B2 Morphological Building SectionRoom 37

B1 Built WorkCentral Building

AP5 removed part or all of

A2 Stratigraphic Volume UnitLayer B

BP2 is constituted by

B5 Stratigraphic Building UnitClay-earth construction

AP5 removed part or all of

A2 Stratigraphic Volume UnitLayer A

AP15 is or contains remains of

E18 Physical ThingFallen stones

AP15 is or contains remains of

BP2 is constituted by

B5 Stratigraphic Building UnitGroup of stones

B5 Stratigraphic Building UnitGroup of stones

AP5 removed part or all of

A2 Stratigraphic Volume UnitLayer C

AP15 is or contains remains of

O19 has found object

AP5 removed part or all of

A2 Stratigraphic Volume UnitLayer D

AP15 is or contains remains of BP1 has section

BP2 is constituted by

AP16 assigned attribute toAP16 assigned attribute to

A6 Group Declaration EventInterpretation of Group of stones

P141 Assigned

B5 Stratigraphic Building UnitSouthern wall of the room

E13 Attribute AssignementClay-earth construction

AP16 assigned attribute to

E13 Attribute AssignementSouthern wall of the room

AP16 assigned attribute to

P141 Assigned

P141 Assigned

Page 37: ARIADNE: Pilot deployment experiments

37

9.Scenario4:FaunalRemains

This scenario evaluates the potential of integrating different scientific datasets in the domain ofarchaeometry. Two different zooarchaeology datasets, which are published by the data centres“ArchaeologyDataService” inYork,and“IANUS”attheDAI inBerlin,werechosenfor integration.Theybothfocusonanimalbonesinarchaeologicalcontexts.

ZooarchaeologyhasalongexperienceinsharingtheirdatasetsandarticlesincommunityportalslikeBone Commons (2016), which launched initially in 2006, the Zooarchaeology social network, theZOOARCHemaillist,andsoon(KansaetAl2001,185ff).Thedisciplineservesasausefulcasestudy,astheterminologyusedishighlystandardized,itsmaterialsandmethodologiesareglobalinscopeandalotofresearchquestionscanoftenbeansweredonlybytakingmultipledatasetsintoaccount.

9.1.DescriptionofDatasets

AnimalBoneEvidencefromSouthernEngland

The“AReviewofAnimalBoneEvidencefromSouthernEngland”project,fundedbyEnglishHeritage,

aimedtoreviewtheanimalboneevidencefromLateBronzeAge-LateIronAgesitesfromsouthern

England.TheRegionalReviewreport (Hambleton2008), forwhich thisdatabaseservesasa freely

available online appendix, provides a synthetic review of published faunal assemblages.

Consequently, analyses (e.g. ageing, butchery, biometric data) focus on the exploitation and

deposition of sheep, cattle, pigs, horses and dogs.Other taxa (e.g.wildmammals, birds, fish and

amphibians)arealsodiscussed.

A total of 108 site reportswere recorded in thedatabase. These 108 'site' records correspond to

reports fromexcavationsat101 separatemonument locations. These sitesgenerated154distinct

'assemblage' records for faunal assemblages. Bibliographic references for all zooarchaeological

reportsreviewedarelistedinthedatabase.

This database was reviewed and published by the Archaeological Data Service:

https://dx.doi.org/10.5284/1000102

The CIDOC-CRM Mapping is online available in the 3M Mapping Memory Manager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping764

HolocenehistoryofwildlifeinEurope

From 1994 - 1998 the project “Holozängeschichte der Tierwelt Europas” researched the

development of the animalworld in Europe from the Late Pleistocene to theMiddleAges. Three

different institutes have collected and examined published data on the occurrence of wild and

domestic animals (mammals, birds, amphibians, reptiles, fish) and pets from over 8000

Page 38: ARIADNE: Pilot deployment experiments

38

archaeologicalcomplexes.Theevaluationcoveredissuessuchastheinfluenceoftheclimateonthe

animalsinthetransitionfromthePleistocenetotheHolocene,spatialdistributionshiftsofspecies

andtheinfluenceofhumaninterventioninnature.4500publicationswereexaminedandintegrated

inadatabase,whichdocumentsthefaunalremainsinmorethan8500archaeologicalsites.

This database was reviewed and published by the research data centre “IANUS”:https://dx.doi.org/10.13149/001.mcus7z-2

The CIDOC-CRM Mapping is online available in the 3M Mapping Memory Manager:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping765

9.2.KnowledgeGraph

Theknowledgegraphderivedfromthemappingandalignmentofthetwodatasetsdescribedabove.

ToovercomethelanguagebarrierbetweentheGermanandEnglishdatasets,thecommonstandard

Encyclopedia of Life (2016) was used, which provided biological definitions of species in a

classificationtree.Asthethesaurus isnotavailable insemanticwebformats,thetermsusedwere

describedwithRDF/XML.

Figure11GraphrepresentationoftheCIDOC-CRMmappeddatasetsinthe"AnimalRemains"scenario

Page 39: ARIADNE: Pilot deployment experiments

39

9.3.QueryPossibilities

The integration of the zooarchaeology datasets leads to an environmentwhere users are able to

specify queries thatwill be run on a common aggregated repository andwill combine the results

comingfromthedifferentdatasets.

The first presented example is a quite species-centric query, where all sites which do containinghorseremainsareshown.Foraresearcher,whoisfocussedonthedistributiononaspecificspecies,theliteraturereferencescouldbeadesiredstartingpoint.

The second example shows the possibilities on performing statistics with the help of the querylanguageSPARQLoveracommonmappeddataset.Thebarchartshowsthefrequencydistributionofselectedspeciesinthearchaeologicalcontexts.

Figure12Visualizationofexamplequeriesofthe"AnimalRemains"scenario.

Page 40: ARIADNE: Pilot deployment experiments

40

10.Conclusions

The four pilot experiments show a successful integration of diverse datasets, where themethodologyandthetoolsusedhaveshowntheirpracticabilityandusefulnessevenforverydiverseprerequisiteconditionsof thedatasets in termsof language,datastructureand formats.With thedata flow that we have described, the origin of the source data does notmatter as long as it ispossibletoexportthedatainastructureddatasetusinganon-proprietaryopenformat.

Fromanarchaeologicalviewpoint, itbecomesapparentthat it isnecessarytoworkwithstandardsanddeclarationsthatareassimilaraspossibletoeachother,oratleasttomaptheirownvaluelistsand thesauri to these international standards. Common standards and archaeological terminologyarefundamentalfordataintegrationandcomparability.

Combiningopenstructuredandsemanticallydescribeddataandtransparentterminologygivesriseto powerful tools in the daily work of an archaeologist working on excavations, survey data ormuseumsandcollections.Thescenarioshaveillustratedthepotentialforaneasyaccesstoahugeamount on comparable data, which could be searchedwith one single query, which offers greatopportunitiestoallarchaeologicalandinterdisciplinaryareasofexpertise.

Page 41: ARIADNE: Pilot deployment experiments

41

Bibliography

Benecke,N.,vondenDriesch,A.andHeinrich,D., (2016)HolozängeschichtederTierweltEuropas.ForschungsdatenzentrumIANUS,Berlin.(doi:10.13149/001.mcus7z-2)

BoneCommons (2016)BoneCommons–Anonline communitybuildingand sharing resources forarchaeozoology,Availableat:http://alexandriaarchive.org/bonecommons/ (Accessed: 08.11.16)

Bödefeld,H.andVacano,O.(1978)ElektronischeDatenverarbeitunginderantikenNumismatik.EinProjektzurErfassunggriechischerMünztypenamAlthistorischenInstitutderUniversitätDüsseldorf.Chiron,8:587–604,1978.

Clark,M.K.(1935)AgazetteerofRomanremainsinEastYorkshire,Leeds

Doerr, M. and Aspöck, E. (2015) Mapping archaeological databases to CIDOC-CRM, Available at:http://www.ics.forth.gr/_publications/DOERR_MappingCIDOC_1_wFig.pdf(Accessed:03.11.2016)

Doerr,M.,Theodoridou,M.,Aspöck,E.andMasur,A. (2015)Mappingarchaeologicaldatabasestocidoc-crm. In: Proc. 43rd Computer Applications and Quantitative Methods in ArchaeologyConference(CAA2015SIENA),Siena,Italy,April2015.Archaeopress.

Encyclopediaof Life (2016)Encyclopediaof Life–Global access toknowledgeabout lifeonEarth,Availableat:http://eol.org/(Accessed:08.11.2016)

Gnoli,R.(1971)MarmoraRomana,Roma

Gruber, E. andSmith, T. J. (2015) LinkedOpenGreekPottery, In: Proceedingsof the42ndAnnualConferenceonComputerApplicationsandQuantitativeMethods inArchaeology,Oxford,205-214,Availableat:https://caa2014.sciencesconf.org/46119/document(Accessed:03.11.2016)

Hambleton, E. (2009)A Review of Animal Bone Evidence from Southern England[data-set]. York:ArchaeologyDataService[distributor](doi:10.5284/1000102)

Kansa, E., Kansa, S.,Watrall, E. (2011) Archaeology 2.0: New Approaches to Communication andCollaboration.LosAngeles.

Maischberger, M. (1997) Marmor in Rom. Anlieferung, Lager- und Werkplätze in der Kaiserzeit,Wiesbaden,Dr.LudwigReichertVerlag

Marketakis,Y.,Minadakis,N.,Kondylakis,H.,Konsolaki,K.,Samaritakis,G.,Theodoridou,M.,Flouris,G.,&Doerr,M.(2016).X3MLMappingFramework for Information Integration inCulturalHeritageandBeyond.International Journal onDigital Libraries (IJDL), Special Issue on “Extending,MappingandFocusingtheCIDOCCRM”,1-19.

Minadakis, N., Marketakis, Y., Kondylakis, H., Flouris, G., Theodoridou, M., de Jong, G., & Doerr,M.(2015).X3ML Framework: An effective suite for supporting data mappings,Extending.Proceedingsof theWorkshoponExtending,MappingandFocusing theCRMco-locatedwith 19th International Conference on Theory and Practice of Digital Libraries (2015), Poznań,Poland,September17,2015.CEURWorkshopProceedings1656,CEUR-WS.org2016,1-12.

Page 42: ARIADNE: Pilot deployment experiments

42

Nomisma.org(2016)Nomisma.org.Availableat:http://nomisma.org/(Accessed:03.11.2016)

Nomisma Ontology (2016) Nomisma Ontology. Available at: http://nomisma.org/ontology(Accessed:03.11.2016)

Oldman,D.,Doerr,M.,deJong,G.,Norton,B.andWikman(2014)T..Realizinglessonsofthelast20years: A manifesto for data provisioning & aggregation services for the digital humanities. D-LibMagazine,20(7/8).doi:10.1045/july2014-oldman.

Ore, C., Doerr, M., Le Bœuf, P. and S. Stead (2015) Definition of the cidoc conceptual referencemodel version 6.1. Available at: http://www.cidoc-crm.org/Version/version-6.1 (Accessed:03.11.2016)

Ronzino, P., (2015). CIDOC CRMba A CRM Extension for buildings archaeology informationmodelling.(Unpublisheddoctoralthesis).TheCyprusInstitute,Nicosia,Cyprus

Ronzino, P., Niccolucci, F., Felicetti, A., & Doerr, M. (2016). CRMba a CRM extension for thedocumentationofstandingbuildings.InternationalJournalonDigitalLibraries.Vol. 17, n.1,March2016,pp.71-78

Ronzino, P., (2016). Harmonizing the CRMba and CRMarchaeo models. International Journal onDigitalLibraries,SpecialIssueonExtending,MappingandFocusingtheCIDOCCRM.

Russell, B. (2013) Gazetteer of Stone Quarries in the Roman World, Available at:http://oxrep.classics.ox.ac.uk/databases/stone_quarries_database/0to20/(Accessed:03.11.2016)

Sapouna-Sakellaraki, E. 2013. The Wealth of Psiloritis. Zominthos. Archaeology & Arts.http://www.archaeology.wiki/blog/2013/11/18/the-wealth-of-psiloritis/,Accessed25October2016.

Schäfer, F. (2011) “iDAI.field - Ein modulares Dokumentationssystem für archäologischeFeldforschungsprojekte.”KölnundBonnerArchaeologica1:201-208

Scheding,P.(2013)DerKopfeinesSatyrsTypus„Ludovisi“ausSimitthus(Chimtou,Tunesien),KölnerJahrbuchfürVor-undFrühgeschichte46,Köln,133-140

Schneider,R.-M.(1986)BunteBarbaren,Worms,WernerscheVerlagsgesellschaft

Thacker, J. (2000) Standardization and Libraries, In: Study on International Sharing of JapaneseScholarlyInformation

K. Tzompanaki, K. and Doerr, M. (2012) Fundamental categories and relationships for intuitivequeryingCIDOC-CRMbasedrepositories.TechnicalReportTR-429,ICS-FORTH.

Tzompanaki,K.,Doerr,M.,Theodoridou,M.andFundulaki, I. (2013)Reasoningbasedonpropertypropagation on CIDOC-CRM and CRMdig based repositories. Online Proceedings for ScientificWorkshops,2013.

Vondrovec, K. (2007) digitale Fundmünzen der Römischen Zeit in Österreich. Available at:http://www.oeaw.ac.at/antike/index.php?id=358(Accessed:03.11.2016)

Page 43: ARIADNE: Pilot deployment experiments

43

AppendixA

Link collection to all CIDOC-CRM mappings in the 3M Mapping Memory Manager sorted byscenarios.

Scenario1:Coins

TheFitzwilliamMuseum,CoinsandMedals:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping484

ArchaeologicalSuperintendenceofRomeCoinsDatabase:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping168

MappingofiDAIfield_MuenzentoCIDOC-CRM:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping9

OEAWCoinsDB:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping209

Scenario2:Sculptures

AthenianAgoraDB:

• Sculptures:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping487

• Publications:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping511

iDAI.fieldChimtou:

http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping303

GazetteerofStoneQuarriesintheRomanWorld:

http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping647

Scenario4:AnimalRemains

HolozängeschichtederTierweltEuropas:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping765

AnimalBoneEvidencefromSouthernEngland:http://139.91.183.3/3MEditor/Index?type=Mapping&action=view&lang=en&id=Mapping764

Page 44: ARIADNE: Pilot deployment experiments

44

AppendixB

TheARIADNEReferenceModelversion1.0indetailcanbefoundattheARIADNEwebsite:

http://www.ARIADNE-infrastructure.eu/Resources/ARIADNE-Reference-Model

CRMinfversion0.7

Referencedocument:http://www.ics.forth.gr/isl/CRMext/CRMinf/docs/CRMinf-0.7.pdf

RDFSencoding:http://www.ics.forth.gr/isl/CRMext/CRMinf_v0.7.rdfs

CRMsciversion1.2.3

Referencedocument:http://www.ics.forth.gr/isl/CRMext/CRMsci/docs/CRMsci1.2.3.pdf

RDFSencoding:http://www.ics.forth.gr/isl/CRMext/CRMsci_v1.2.3.rdfs

CRMbaversion1.4

Referencedocument:http://www.ics.forth.gr/isl/CRMext/CRMba/docs/CRMba_v1.4.pdf

RDFSencoding:http://www.ics.forth.gr/isl/CRMext/CRMba_v1.4.rdfs

CRMarchaeoversion1.4

Referencedocument:http://www.ics.forth.gr/isl/CRMext/CRMarchaeo/docs/CRMarchaeo_v1.4.pdf

RDFSencoding:http://www.ics.forth.gr/isl/CRMext/CRMarchaeo_v1.4.rdfs

CRMdigversion3.2

Referencedocument:http://www.ics.forth.gr/isl/CRMext/CRMdig/docs/CRMdig3.2.pdf

RDFSencoding:http://www.ics.forth.gr/isl/CRMext/CRMdig_v3.2.rdfs

CRMgeoversion1.2

Referencedocument:http://www.ics.forth.gr/isl/CRMext/CRMgeo/docs/TR435-CRMgeo.pdf

RDFSencoding:http://www.ics.forth.gr/isl/CRMext/CRMgeo_v1.2.rdfs