Upload
kelda
View
40
Download
0
Tags:
Embed Size (px)
DESCRIPTION
CLARIN Technical Infrastructure PIDs - How far are we?. Usage I. assume that we have a recording of an extinct language and some annotations that tell us what someone said about medicine etc researchers create relations that need to be preserved . Video Recording. from Repository A. - PowerPoint PPT Presentation
Citation preview
CLARIN Technical Infrastructure
PIDs - How far are we?
Usage I
• assume that we have a recording of an extinct language and some
annotations that tell us what someone said about medicine etc• researchers create relations that need to be preserved Video Recording
Sound Recording
Annotations
Recording Session
fromReposit
oryA
fromReposit
oryB
fromReposit
oryC
How long?
Usage IIBiological and cultural processes have evolved together, in a symbiotic spiral; they are now indissolubly linked, with human survival unlikely without such culturally produced aids as clothing, cooked food, and tools. The twelve original essays collected in this volume take an evolutionary perspective on human culture, examining the emergence of culture in evolution and the underlying role of brain and cognition. The essay authors, all internationally prominent researchers in their fields, draw on the cognitive sciences -- including linguistics, developmental psychology, and cognition -- to develop conceptual and methodological tools for understanding the interaction of culture and genome. They go beyond the "how" -- the questions of behavioral mechanisms -- to address the "why" -- the evolutionary origin of our psychological functioning. What was the "X-factor," the magic ingredient of culture -- the element that took humans out of the general run of mammals and other highly social organisms?
Several essays identify specific behavioral and functional factors that could account for human culture, including the capacity for "mind reading" that underlies social and cultural learning and the nature of morality and inhibitions, while others emphasize multiple partially independent factors -- planning, technology, learning, and language. The X-factor, these essays suggest, is a set of cognitive adaptations for culture.
ePublicationRepository 1
eResource Repository 2
How long?
Usage III
eResource2 Repository 2
Ontologyopen registryHow long?
eResource1Repository 1
Usage IV
eResource eResource How long?RDF Assertion
Currently almost 1 Mio PIDs
<?xml version="1.0" encoding="UTF-8"?><METATRANSCRIPT ArchiveHandle="hdl:1839/00-0000-0000-0005-82B0-2" Date="2006-07-18" FormatId="IMDI 3.0" Originator="Editor - Profile:SESSION.Profile.xml" Type="SESSION" Version="1"
xmlns="http://www.mpi.nl/IMDI/Schema/IMDI" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.mpi.nl/IMDI/Schema/IMDI ./IMDI_3.0.xsd">
<Session> <Name>DBD_RIF_14_12_01_064</Name> <Title>Dutch Bilingualism Database, Ethnic Dutch, Session 64</Title>
……….
<MediaFile><ResourceLink ArchiveHandle="hdl:1839/00-0000-0000-0004-DC6B-0">
http://corpus1.mpi.nl/qfs1/media-archive/dbd_data/boumans/T-Cult/Metadata/../Media/dbd_rif_14_12_01_064.wav</ResourceLink>……….
The Problem• could use Cool URIs as the W3C TAG suggests to do• addresses change too often and we cannot influence that • perhaps some exceptions such as
http://www.isocat.org/datcat/DC-1708
???
???
???
you just change one entry in a database
but there is a price of course
Many Suggestions
• URLs: http:/www.mpi.nl/imdi/doc/white-paper all
• HTTP URIs: http://www.isocat.org/isodcr#12345 W3C
• URNs: urn:nbn:nl:ui:13-54321 EU Libs etc
• Handles: hdl:1839/00-0000-0000-0005-82B0-2 many
• ARKs: http://ark.cdlib.org/ark:/13030/ft4w10060w few
• XRIs: xri://broadview.library.example.com/ ?(urn:isbn:0-395-36341-1)
• PURLs: http://purl.oclc.org/OCLC/PURL/FAQ many• DOI: Handles + Business Model
Publisher• OpenURLs: parameterized http-get requests ?• InfoURIs integrate legacy material into Web ?
• etc
EvaluationStandard Robust
SoftwareResolution
SystemResolution
TypeSecurity Admin
Assoc Info
Cost
URL RFC2616 no yes (DNS) single no no no
URN:ISSN ISO2397 no no ? no no no
URN:ISBN ISO2108 no no ? no no no
URN:NBN RFC3188 no no ? no no ?
PURL no no yes single no no no
Handle RFC3650 yes yes multiple yes yes little
DOI Z39.84… yes yes (Handle) multiple yes yes large
ARK no no (yes) multiple (no) yes ?
info URI RFC3668 no no ? no no no
XRI no no no ? no ? ?
simple decision: need to have something robust now without expensive business model and dependencies
How to do
• you need to be registered at the PID service as accepted and trusted partner(trusted partners are only those who can demonstrate that they have a proper repository system)
• you have a set of resources which have URLs • these resources have registered metadata descriptions • you request for these resources PIDs by submitting the
requested information such as URLs, MD5, minimal MD etc• you can do this either manually or via an API
• you get back the PIDs from the service• you enter these PIDs in the metadata description field
(now everyone can use it for reference purposes)
• whenever you change the URLs you need to adapt the entry (probably use a ready-made mover)
Associated information
• want to check authenticity before copying etc (MD5 field)• want to add citation data info extracted from metadata
records• want to solve the problem of having several centers
manipulating the Handle record without interference• want to add a pointer to access permission information • proper monitoring services
• in MPG and CLARIN some money to create robust services
Short Overview
• GWDG Servicehttp://handle.gwdg.de:8080/pidservice
• Java-Documentationhttp://handle.gwdg.de/javadocs/
• 11858/00-ZZZZ-0000-0000-000C-7 -> 'View Handle'• http://www.gwdg.de/aktuell/index.html -> 'Find Handle'
• service will be given also to CLARIN and probably for other research Infrastructure initiatives in Europe