Interoperability, metadata and data exchange guidelines...Regional metadata guidelines “In...

Preview:

Citation preview

Interoperability,metadataanddataexchangeguidelinesIrynaKuchma,OpenAccessProgrammeManagerLIBSENSEWorkshopII,March11,2019,Accra,Ghana

Attribution4.0International

Interoperability“isthetechnical“glue”connectingcontentandsystemsinthenetworkofrepositoriesandothertools,enablingvalueaddedservicestobebuiltonthisintegratedinfrastructure.

Therealvalueofrepositoriesliesinthepotentialtointerconnectthemtocreateanetworkofrepositories,anetworkthatcanprovideunifiedaccesstoresearchoutputsandbe(re-)usedbymachinesandresearchers.Inordertoachievethispotential,weneedinteroperability.”https://coartraining.gitbook.io/coar-repository-toolkit/interoperability

Metadataandvocabularies“Interoperabilityacrossrepositoriesrequiresstandardizedapproachestometadataandvocabularies…

Metadatais"dataaboutdata“–descriptiveinformationrelatedtoeachresourceintherepository.

Ideally,repositorieswillexposetheirmetadatausingcommonschemaandvocabulariessothattherecordscanbestandardized,andaggregatedbyrepositorynetworks.Inturn,thesenetworkscandevelopmoreusefulserviceswiththemetadata,suchastrackingopenaccess,discoveryofcontent,andanalytics.”https://coartraining.gitbook.io/coar-repository-toolkit/interoperability

https://coartraining.gitbook.io/coar-repository-toolkit/interoperability/metadata-and-vocabularies

Metadataandvocabularies(2)“CurrentlymostrepositoriesexposetheirmetadatathroughtheOpenArchivesInitiative-ProtocolforMetadataHarvesting(OAI-PMH).Thisprotocolallowstherepositorytouseavarietyofmetadataprofiles,inadditiontothesimpleOAI-DCmetadataformatbasedonDublinCore.Forgenericdatarepositories,theDataCitemetadataschemaisthemostwidelyused.Domain-basedmetadataschemasmayalsobeusedbyrepositoriesthatspecializeincollectingcontentfromaspecificdiscipline.”https://coartraining.gitbook.io/coar-repository-toolkit/interoperability/metadata-and-vocabularies

http://repository.costech.or.tz

Regionalmetadataguidelines“Inaddition,thereareregionalguidelinesforrepositoriesdefinedbycertainrepositorynetworks,suchasLAReferencia(LatinAmerica)andOpenAIRE(Europe)requiretheadoptionofcertainspecificmetadataelementsandvocabulariesinordertoprovideservicesbasedonthemetadatatheyaggregate.”https://coartraining.gitbook.io/coar-repository-toolkit/interoperability/metadata-and-vocabularies

WhatshallweincludeinthemetadataguidelinesforAfricanrepositories?

Draftforcomments:https://docs.google.com/document/d/1n9O7tXbaXLcqp8da-9XtymZSMmFDu78GCLBjMp_SRXU/edit#

Comprehensivemetadata

Aimforascomprehensivemetadataaspossible

Trytoincludealldescriptiveinformationprovidedintheresourcethatyouaregoingtouploadinyourrepository

OptimalmetadataTitle(dc.title)-theoriginalwording,orderandspellingoftheresourcetitle.Capitalizepropernounsonly.[Punctuationneednotreflecttheusageoftheoriginal.Subtitlesshouldbeseparatedfromthetitlebyacolon.ThisinstructionwouldresultinTitle:Subtitle(i.e.nospace).https://guidelines.openaire.eu/en/latest/literature/field_title.html]

TitleinEnglish,ifdifferent,inaseparatefield.

Optimalmetadata(2)Author(s)(dc.contributor.author)-eachauthorinaseparatefield.Useinvertedname,sothesyntaxwillbethefollowing:“surname”,“initials”(“firstname”)“prefix”.ForexampleJanHubertdeSmitbecomes<dc:creator>Smit,J.H.(John)de</dc:creator>.Useastandardisedwritingstylefornames,e.g.thewritingstyleusedbythepublisherwhenthisisavailable.Generationalsuffixes(Jr.,Sr.,etc.)shouldfollowthesurname.Omittitles(like“Dr”).Forexample:“Dr.JohnH.deSmitJr.”becomes<dc:creator>SmitJr.,J.H.(John)de</dc:creator>

https://guidelines.openaire.eu/en/latest/literature/field_creator.html

Optimalmetadata(3)

Abstract(dc.description.abstract).

AbstractinEnglish,ifdifferent,inaseparatefield.

Date(dc.date.issued)-recommendedbestpracticeforencodingthedatevalueisdefinedinaprofileofISO8601[W3CDTF]andfollowstheYYYY-MM-DDformat.InDSpaceyoucouldmentiontheyearonlyforjournalarticles.

Optimalmetadata(4)

DigitalObjectIdentifier(dc.identifierordc.identifier.doiordc.identifier.other),e.g.10.1186/s13027-017-0170-5orhttp://doi.org/10.1007/s12374-017-0088-x

Keywords(dc.subject)-eachkeywordinaseparatefield.

Language(dc.language.iso)inISO639standard(2or3lettercode,e.g.enorengforEnglish).

https://uyr.uy.edu.mm/handle/123456789/182

https://www.altmetric.com/products/free-tools/bookmarklet/

https://www.altmetric.com/products/free-tools/institutional-repository-badges/

https://help.altmetric.com/support/solutions/articles/6000086842-getting-started-with-altmetric-on-your-journal-or-institutional-repository

Optimalmetadata(5)

Journaltitle/Conferencetitle(dc.publisher)forjournalarticles/conferenceproceedings.

Journalvolumeandnumber(dc.relation.ispartofseriesordc.citation.issue,dc.citation.spage,dc.citation.epage).

JournalISSN(dc.identifier.issn)/BookISBN

Optimalmetadata(6)

Type(dc.type)-publicationtype.IndicatethetypeofpublicationbasedonalocalrepositoryvocabularyoruseCOARResourcetypevocabularytoindicatethetypeofyourresource

https://guidelines.openaire.eu/en/latest/literature/field_publicationtype.html

http://vocabularies.coar-repositories.org/documentation/resource_types

Optimalmetadata(7)Access(dc.rights)-provideaccessinformation(e.g.OpenAccess).UseCOARAccessRightsVocabularytoindicateaccessrightstoyourresourcehttp://vocabularies.coar-repositories.org/documentation/access_rights--openaccess

--embargoedaccess

--restrictedaccess

--metadataonlyaccessorrestrictedAccessasrecommendedinOpenAIREGuidelinesforLiteratureRepositoriesv3

Optimalmetadata(8)Informationaboutre-use-formaterialspublishedunderCreativeCommonslicenceinthedc.rightsordc.rights.licensefieldmentionthelicense,forexampleCreativeCommonsAttribution4.0International,andindc.rights.uri-thelicenceURL,e.g.http://creativecommons.org/licenses/by/4.0/

Optimalmetadata(9)

Citation(dc.identifier.citation)-suggestedcitationofanitem(e.g.journal'sname,volumeandissueforajournalarticle);thesedetailsallowabetterretrievalofyourdocuments.

Additionalinformation&metadata

ORCID-addanORCIDiDtoauthornames.PromotetheadoptionofORCIDiDstouniquelyidentifyauthors(evenincaseofnameambiguity).EncourageauthorstoregisterwithORCIDinordertoobtainanORCIDiD.InDublinCoreORCIDiDsshouldbeprovideddirectlyasapartoftheauthor'sname(e.g.<dc:author>Summan,Friedrich(ORCID-ID0000-0002-6297-3348)</dc:author>).

Additionalinformation&metadata(2)Description-addadditionaldescription,ifneeded,indc.description.Forexample,providemoredetailsaboutathesis/dissertation:“AResearchdissertationsubmittedtotheSchoolofPublicAdministrationandManagementfortherequirementtoundertakethefieldstudy(inSemester3)forthefulfillmentoftheMasterDegreeinPublicAdministration(MPA)ofMzumbeUniversity”(fromhttp://scholar.mzumbe.ac.tz/handle/11192.1/2408).

Additionalinformation&metadata(3)

Projectinformation-addgrant/projectinformation,whenapplicableindc.relationifaresourcewassupportedbyaproject/grant.

Additionalinformation&metadata(4)AnauthoritativelistofprojectsisexposedbyOpenAIREthroughOAI-PMH,andavailableforallrepositorymanagers.ValuesincludetheprojectnameandprojectID.TheprojectIDequalstheGrantAgreementidentifier,andisdefinedbytheinfo:eu-reponamespacetermgrantAgreement.Thethree-partnamespaceismandatorywhenapplicable(info:eu-repo/grantAgreement/Funder/FundingProgram/ProjectID),whilethesix-partsnamespaceisrecommended.https://guidelines.openaire.eu/en/latest/literature/field_projectid.html

Additionalinformation&metadata(5)

PublicationVersion-whenapplicable,indicatethestatusoftheresourceinthepublicationprocess/theversionofthearticleindc.type.version-forexample,publishedVersion.

Additionalinformation&metadata(6)UsethefollowingcontrolledvocabularyfortheversionofthescientificoutputbasedontheDRIVER-versioninfo:eu-repoversionterms.info:eu-repo/semantics/draft

info:eu-repo/semantics/submittedVersion

info:eu-repo/semantics/acceptedVersion

info:eu-repo/semantics/publishedVersion

info:eu-repo/semantics/updatedVersion

https://guidelines.openaire.eu/en/latest/literature/field_publicationversion.html

Additionalinformation&metadata(7)Format(dc.format)-thephysicalordigitalmanifestationoftheresource.Typically,formatmayincludethemedia-typeordimensionsoftheresource.Formatmaybeusedtodeterminethesoftware,hardwareorotherequipmentneededtodisplayoroperatetheresource.Examplesofdimensionsincludesizeandduration.Recommendedbestpracticeistoselectavaluefromacontrolledvocabulary(forexample,thelistofInternetMediaTypes[MIME]definingcomputermediaformats).Basedonbestpractice,theIANAregisteredlistofInternetMediaTypes(MIMEtypes)isusedtoselectatermfrom.Forthefulllistseehttp://www.iana.org/assignments/media-types.

Additionalinformation&metadata(8)Ifaspecificresourcehasmorethanonephysicalformats(e.g.postscriptandpdf)storedasdifferentobjectfiles,allformatsarementionedintheDCelementformat,forexample:

<dc:format>application/pdf</dc:format>

<dc:format>application/postscript</dc:format>

<dc:format>application/vnd.oasis.opendocument.text</dc:format>

Donotconfusewithpublicationtypeandresourceidentifier.

Additionalinformation&metadata(9)Someexamples:

<dc:format>video/quicktime</dc:format><dc:format>application/pdf</dc:format><dc:format>application/xml</dc:format><dc:format>application/xhtml+xml</dc:format><dc:format>application/html</dc:format><dc:format>application/vnd.oasis.opendocument.text</dc:format>

https://guidelines.openaire.eu/en/latest/literature/field_format.html

Additionalinformation&metadata(10)Embargoenddate(dc.date)-whenaccessissettoembargoedAccesstheenddateoftheembargoperiodmustbeprovided.Thecorrespondingtermisdefinedbyinfo:eu-repo/date/embargoEnd/<YYYY-MM-DD>.EncodingofthisdateshouldbeintheformYYYY-MM-DDconformingtoISO8601.

https://guidelines.openaire.eu/en/latest/literature/field_embargoenddate.html

https://www.base-search.net

https://www.base-search.net/about/en/faq_oai.php

https://guidelines.openaire.eu/en/latest/literature/index_guidelines-lit_v3.html

DataexchangemodelagreementDraftforcomments:https://docs.google.com/document/d/1mfuYnZCMtP43wllvsJ-aYSQxK3LS9RbwG1ZhyZp4-vc/edit#heading=h.xju3bh76qxj

Dataacquisition&datausagepoliciesFornational/regionalrepository/aggregator:howthedataisretrieved,howoften,whatprocessesitgoesthrough[e.g.aggregating,cleaning,transforming,inferring,de-duplicating],whatthequalitychecksarealongalldataprocessingstages;anddatausagepolicy:whoisabletoretrieveaggregateddataandwhatthelicensesare.

Dataacquisitionpolicy

TheOAI-PMH(OpenArchivesInitiativeProtocolforMetadataHarvesting)interoperabilityprotocolisused,whichconsistsofasetofrulesandmethodsthatstandardizetheaccesstocontentofrepositories.Repositoriesareharvested[onceaweek-adjusttoyourworkflows].

Dataacquisitionpolicy(2)

Aggregationpoliciesforpublications,datasetsandotherresearchoutputs:National/regionalaggregatoracceptsthemetadatarecordsofallscientificoutput.Thismeansthatbothopenaccessandnon-openaccessmaterialwillbeincluded.

Dataacquisitionpolicy(3)

Full-textpublications:Anational/regionalaggregatorcollectsbibliographicmetadatarecords[openaccesspublicationsfileswhenevertheseareaccessiblefromtheURLprovidedinthemetadatarecord/bibliographicmetadatarecordsonly-checkandkeepifthisisthecase].End-userswillingtoaccess,download,andreadtheactualfiles[will/willnot-selectone]beabletodosofromanationalaggregator,butwillbeforwardedtotheoriginalsourceofdeposition.

TermsofAgreements(ToU)forContentProvidersAgreementforContentExchangebetweenanational/regionalaggregatorandexternalcontentprovider,inthefollowingreferredtoas[ORGANIZATION]

ObjectivesoftheToU

Anational/regionalaggregatorharvestsbibliographicmetadatarecords[andOpenAccessarticlesfull-textfromcontentproviders-checkandkeepifthisisthecase].

The[ORGANIZATION]mayrequestanational/regionalaggregatornottocollectthefulltextofopenaccesspublications.

Benefitsforcontentproviders

Anational/regionalaggregatorincreasesthevisibilityofthe[ORGANIZATIONs]contentprovideranditspublicationsbyexposingmetadataandURLsleadingtotheprovider’swebsite(provenanceinformation).

TermsofUse:Consentforre-useofmetadataByregisteringthe[ORGANIZATON]'scontentproviderwithanational/regionalaggregator,the[ORGANIZATION]:

Providesmetadatarecordscomplianttothenational/regionalaggregatorguidelines.

Allowsanational/regionalaggregatortoBULKDOWNLOADmetadatarecordsviaatleastoneofthefollowingprotocols:OAI-PMH,FTP(andRESTAPIsifagreedwithanational/regionalaggregator).

TermsofUse:Consentforre-useofmetadata(2)Allowsanational/regionalaggregatortoTRANSFORMmetadatarecords,ifnecessary,tomakeituniformtothenational/regionalaggregatordatamodel.

Allowsanational/regionalaggregatortoENRICHthemetadata,usingnational/regionalaggregatorbesteffortsofdeduplication,text-mining,andend-userfeedback.

TermsofUse:Consentforre-useofmetadata(3)Allowsanational/regionalaggregatortoPUBLISHtheharvestedandtransformedrecords,thustoprovidepublicaccesstothemasCC-BYInternational4.0orsubsequentwithoutanyrestrictionsonreuseinoriginalandderivativeforms.

TermsofUse:Consentforre-useofmetadata&fulltextMetadata:Allowsanational/regionalaggregatortoPUBLISHtheharvestedandtransformedrecords,thustoprovidepublicaccesstothemasCC-BYInternational4.0orsubsequentwithoutanyrestrictionsonreuseinoriginalandderivativeforms.

[Consentforre-useoffulltextsisdescribedhere:https://docs.google.com/document/d/1mfuYnZCMtP43wllvsJ-aYSQxK3LS9RbwG1ZhyZp4-vc/edit#]The[ORGANIZATION]mayrequestanational/regionalaggregatornottocollectthefulltextofopenaccesspublications.

AdditionalprovisionsensuringqualityofserviceThe[ORGANIZATION]willensurethefollowinggoodpracticesarerespected:

Whitelistinganational/regionalaggregatorharvestingservices:agreesnottoblocktheIPaddressrangeusedbytheanational/regionalaggregatorcrawlingand/ordownloadservice;

Dataintegrity:informsanational/regionalaggregatoraboutchangesofexistingrecordidentifiers(e.g.duetoplatformmigrationsorupdates)

[ORGANIZATION]'srepresentationsandwarranties

Additionalprovisionsensuringqualityofservice(2)Anational/regionalaggregatorpublishedmetadataunderCC-BYInternational4.0orsubsequentwithoutanyrestrictionsonreuseinoriginalandderivativeforms.

Theagreementwillterminatewhenanational/regionalaggregatoror[ORGANIZATION]givesnoticeofterminationtotheotherParty(includingendofprojectorservice),inwhichcaseaminimumnoticeofthreemonthswillbegiven.Inthiscaseanational/regionalaggregatorwilltakedownallcopiesmadeof[ORGANIZATION]'sdata.DownloadeddatathatarelicensedunderCC-BYInternational4.0orsubsequentarenotaffectedbytheterminationoftheagreement.

https://www.openaire.eu/data-aquisition-policy

https://www.openaire.eu/terms-of-use-for-content-providers

Thank you! Questions? iryna.kuchma@eifl.net Twitter: @irynakuchma

www.eifl.net

Recommended