DESCRIPTION
JCDL 2010 presentation about using Memento to reconstruct the state for web resources involved in annotation.
Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel
JCDL 2010, June 21-25, Surfers Paradise, Australia
[email protected]@gmail.com
[email protected]@gmail.com
Digital Library Research and Prototyping Team
Los Alamos National Laboratory, USA
Slide:2
Overview
• Background and Research Question
• Open Annotation Collaboration
• Memento
• Experiments
• Conclusions
Slide:3
10+ Years of Annotation Research
Slide:4
Our 2020 Vision
web
Slide:5
Scholarship is Moving to the Web
• From silos of highly curated content to open world models
• Publishing of scholarly communication online:
  • First Journals (DLib, PLoS, …)
  • Then Users (Vivo, ORCID, …)
  • Now Data (NVO, OREChem, …)
• Next the Scientific Process? (MyExperiment, Annotation, …)
• With the web come new challenges…
• Persistence
Slide:6
Web Resources Change…
Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14
Slide:7
Archived Copy, But No Annotations
Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html
Slide:8
Can We Fix This? Automatically?
The desired outcome: Display the correct representation of the Web Resource with the Annotation.
Slide:9
Previous Annotation Persistence Methods
• Migrate annotations from one version to the next:
  • Seek to discover new location of old target segment
  • Otherwise discard the annotation as no longer relevant
  • Treats the Annotation as of secondary importance
• Focused on heuristics:
  • Cross format, cross location
  • Edited text in same document
  • Dynamically scaling target areas, marks of annotation
  • …
Slide:10
Persistent and Web-Centric?
• OAC: Describe Annotations in a Web-centric Model
+
• Memento: Make Navigating the Past Web Easy
=
• Given an Annotation, display appropriate archived Web Resource?
• Given an archived Web Resource, display appropriate Annotations?
Slide:11
Open Annotation Collaboration
• Focus: Interoperability between systems to enable sharing
• Foundation: Architecture of the World Wide Web
• Framework: Linked Data Guidelines
• Funding: Mellon Foundation for 18 months
Slide:12
OAC Data Model: Basics
• An oac:Annotation is an ore:Aggregation of two or more resources, such that one (oac:Body) annotates at least one other (oac:Target)
• We get OAI-ORE entities for free (ore:ResourceMap, ore:Proxy)
• All resources are regular web resources
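
As a rough illustration of the aggregation idea above, the following sketch lists the statements for one annotation as plain subject/predicate/object tuples. It is a simplification, not the OAC serialization: the example.org URIs are hypothetical, the OAC namespace URI is an assumption made for this sketch, and only the class names from the slide plus ore:aggregates and rdf:type are used.

```python
# Namespace prefixes. ORE and RDF are the published namespaces; the OAC
# namespace URI below is an assumption made for this sketch.
ORE = "http://www.openarchives.org/ore/terms/"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
OAC = "http://www.openannotation.org/ns/"  # assumed


def make_annotation(anno_uri, body_uri, target_uris):
    """Return triples describing one annotation as an aggregation of
    a body and at least one target."""
    if not target_uris:
        raise ValueError("an annotation needs at least one target")
    triples = [
        (anno_uri, RDF + "type", OAC + "Annotation"),
        (anno_uri, RDF + "type", ORE + "Aggregation"),
        (anno_uri, ORE + "aggregates", body_uri),
        (body_uri, RDF + "type", OAC + "Body"),
    ]
    for target in target_uris:
        triples.append((anno_uri, ORE + "aggregates", target))
        triples.append((target, RDF + "type", OAC + "Target"))
    return triples


triples = make_annotation(
    "http://example.org/anno/1",    # hypothetical annotation URI
    "http://example.org/notes/1",   # hypothetical body URI
    ["http://news.bbc.co.uk/"],     # a target page used in the slides
)
for s, p, o in triples:
    print(s, p, o)
```

Because body and targets are ordinary URIs, nothing in the sketch distinguishes them from any other web resource, which is the point of the last bullet.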
Slide:13
OAC Data Model: Basics
Slide:14
OAC Data Model: Segments
• Most Annotations are about part of a resource
• Resources are atomic, in terms of identification (by a URI)
• Segments of the resource apply in the context of the Annotation
• Solution: attach a Description of the Segment of interest to an ORE Proxy for the resource
Slide:15
OAC Data Model: Segments
Slide:16
OAC Data Model: Time
• As regular web resources, ResourceMap, Body and Target have representations that can change over time
• The ResourceMap, Body and Target can change independently of each other
• If an Annotation involves resources as they existed at a particular point in time, this needs to be recorded
• Three different Time models are possible…
Slide:17
Timeless Annotations
• The Annotation is always applicable, regardless of the representation served from the URIs of the Body and Targets.
• Example: "This is the home page of CNN"
• Timeless Annotations do not need a special timestamp.
Slide:18
Uniform Time Annotations
• The Annotation is not always applicable, but pertains to the state of the Body and Target at a single moment in time.
• Example: Tweet is about contemporary state of a web page.
• Add mem:when property to Annotation
Slide:19
Varied Time Annotations
• The Annotation is not always applicable, but pertains to the state of the Body and Target at different moments in time.
• Example: Blog post is about previous day's state of a web page
• Add mem:when property to Proxies for resources
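
The three time models can be compared with a small sketch that works out which moment applies to a given resource in an annotation. The dictionary layout (a "when" key on the annotation, a "proxy_when" mapping per resource) is a hypothetical stand-in for where the mem:when property would be attached, and the example.org URIs and dates are invented for illustration.

```python
from datetime import datetime, timezone


def effective_time(annotation, resource_uri):
    """Return the moment at which resource_uri should be viewed, or None.

    Varied Time: a per-resource time (on the Proxy) wins if present.
    Uniform Time: otherwise fall back to the annotation-wide time.
    Timeless: neither is present, so no time is needed (None).
    """
    proxy_when = annotation.get("proxy_when", {})
    if resource_uri in proxy_when:
        return proxy_when[resource_uri]
    return annotation.get("when")


PAGE = "http://www.cnn.com/"
BLOG = "http://example.org/blog/1"  # hypothetical body URI

timeless = {"body": "This is the home page of CNN", "target": PAGE}
uniform = {
    "body": "http://example.org/tweet/1",  # hypothetical body URI
    "target": PAGE,
    "when": datetime(2010, 6, 14, 12, 0, tzinfo=timezone.utc),
}
varied = {
    "body": BLOG,
    "target": PAGE,
    "proxy_when": {
        BLOG: datetime(2010, 6, 15, 9, 0, tzinfo=timezone.utc),
        PAGE: datetime(2010, 6, 14, 9, 0, tzinfo=timezone.utc),
    },
}

for name, anno in [("timeless", timeless), ("uniform", uniform), ("varied", varied)]:
    print(name, effective_time(anno, PAGE))
```

Checking the per-resource time first is a design choice of this sketch; it lets a single function serve all three models.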
Slide:20
Memento
• We know: the URI of the Target and the time it was annotated
• We don't know: the URI where it is archived
• We need: Memento
• Memento wants to make it Easy to Navigate the Web’s Past
• Basic idea: Use HTTP to get from the current resource to the archived copy of that resource at a given time, rather than searching
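
A minimal sketch of the basic idea, assuming the desired time is conveyed in an Accept-Datetime request header carrying an HTTP date: the code only builds the request and does not send it. The TimeGate URI is hypothetical, and memento_request is a helper name invented for this example.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
from urllib.request import Request


def memento_request(uri, when):
    """Build (but do not send) a GET request asking for `uri` as it was
    at `when`. `when` must be a timezone-aware datetime."""
    # HTTP dates are expressed in GMT, so normalise to UTC first.
    http_date = format_datetime(when.astimezone(timezone.utc), usegmt=True)
    return Request(uri, headers={"Accept-Datetime": http_date})


when = datetime(2010, 6, 14, 12, 0, 0, tzinfo=timezone.utc)
req = memento_request(
    "http://timegate.example.org/http://news.bbc.co.uk/",  # hypothetical TimeGate
    when,
)
print(req.get_method(), req.full_url)
# urllib stores header names in capitalised form ("Accept-datetime");
# HTTP header names are case-insensitive, so this is equivalent.
print(req.header_items())
```

The annotation supplies both inputs: the Target URI and the time it was annotated.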
Slide:21
Issues with the Past Web
• New names for archived resources
  • What was cnn.com, becomes archive.org/web/20010120…
  • …And lots of other names
• With no way to discover them without searching by hand
• People do not like to search
  • Especially when a computer could do it.
• Navigation is inconsistent
  • Stuck in web archive content silo (URIs rewritten)
  • Or end up back in present (URIs not rewritten)
Slide:22
The Web without Time Dimension
eg: http://www.cnn.com/
eg: http://web.archive.org/web/20020209001709rn_1/www.cnn.com/?
Slide:23
The Web with Time Dimension from Memento
eg: http://www.cnn.com/
eg: http://web.archive.org/web/20020209001709rn_1/www.cnn.com/?
Slide:24
[Figure: www.cnn.com (current) alongside web.archive.org copies from Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; and Aug 15 2007, 19:21:58 UTC]
Slide:25
[Figure: www.cnn.com (current) labelled "Original Resource"; web.archive.org copies from Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; and Aug 15 2007, 19:21:58 UTC labelled "Mementos"; a "?" between them]
Slide:26
[Figure: www.cnn.com (current) labelled "Original Resource"; web.archive.org copies from Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; and Aug 15 2007, 19:21:58 UTC labelled "Mementos"; a "TimeGate" added between them]
Slide:27
[Figure: www.cnn.com (current) labelled "Original Resource"; web.archive.org copies from Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; and Aug 15 2007, 19:21:58 UTC labelled "Mementos"; a "TimeGate" between them, with arrows labelled "Conneg with TimeGate to Mementos" and "Link Headers"]
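
The TimeGate's job in the figure above can be sketched as choosing one Memento for a requested time. The closest-in-time rule used here is an assumption for illustration only; the slides do not state which selection policy a TimeGate applies. The three datetimes are the ones shown in the figure.

```python
from datetime import datetime, timezone

UTC = timezone.utc

# Capture times of the three archived copies shown in the figure.
MEMENTOS = [
    datetime(2001, 4, 10, 21, 39, 30, tzinfo=UTC),
    datetime(2004, 8, 15, 8, 45, 27, tzinfo=UTC),
    datetime(2007, 8, 15, 19, 21, 58, tzinfo=UTC),
]


def select_memento(mementos, requested):
    """Pick the capture time closest to the requested time.

    Assumed policy for this sketch; a real TimeGate may instead prefer,
    for example, the latest capture before the requested time.
    """
    return min(mementos, key=lambda m: abs(m - requested))


print(select_memento(MEMENTOS, datetime(2005, 1, 1, tzinfo=UTC)))
print(select_memento(MEMENTOS, datetime(2010, 6, 14, tzinfo=UTC)))
```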
Slide:28
Experiments
1. Given an annotation, find the appropriate representations
  • Create annotation on resource known to change
  • Can we use the information from the annotation to faithfully recreate the environment through Memento?
2. Given an archived resource, find the appropriate annotations
  • Create annotations at different times on resource known to be archived
  • Can we use the information from Memento to find the appropriate annotations?
Slide:29
Experiment 1
Retrieve
Slide:30
Experiment 1
Reconstruct
Slide:31
Experiment 1: Create Annotation
Slide:32
Experiment 1: Test without Memento
Slide:33
Experiment 1: Test with Memento
Slide:34
Experiment 2
• Need to find Original URI, start, end time of representation
• Need searchable collection of annotations
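
The two needs above can be sketched as a filter over an in-memory collection: given the Original URI and the start and end time of an archived representation, keep the annotations whose target and time match. The record layout, the half-open interval, and the decision to always include timeless annotations are assumptions of this sketch; the sample records are invented.

```python
from datetime import datetime, timezone

UTC = timezone.utc


def annotations_for(annotations, original_uri, start, end):
    """Return annotations on original_uri that apply to the representation
    valid from `start` (inclusive) to `end` (exclusive)."""
    matches = []
    for anno in annotations:
        if anno["target"] != original_uri:
            continue
        when = anno.get("when")
        # No time recorded: treat as timeless, so it always applies.
        if when is None or start <= when < end:
            matches.append(anno)
    return matches


CNN = "http://www.cnn.com/"
collection = [  # hypothetical annotation records
    {"id": "a1", "target": CNN, "when": datetime(2003, 5, 1, tzinfo=UTC)},
    {"id": "a2", "target": CNN, "when": datetime(2005, 5, 1, tzinfo=UTC)},
    {"id": "a3", "target": CNN, "when": datetime(2008, 5, 1, tzinfo=UTC)},
    {"id": "a4", "target": CNN},  # timeless
    {"id": "a5", "target": "http://news.bbc.co.uk/",
     "when": datetime(2005, 5, 1, tzinfo=UTC)},
]

found = annotations_for(
    collection,
    CNN,
    start=datetime(2004, 8, 15, 8, 45, 27, tzinfo=UTC),
    end=datetime(2007, 8, 15, 19, 21, 58, tzinfo=UTC),
)
print([anno["id"] for anno in found])
```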
Slide:35
Experiment 2: Create Annotations
Slide:36
Experiment 2: Create Annotations
Slide:37
Experiment 2: Create Annotations
Slide:38
Experiment 2: Test without Memento
Slide:39
Experiment 2: Test with Memento
Slide:40
Conclusions
• Annotation, as a core scholarly practice, is increasingly web-based
• We propose using OAC and Memento to provide a solution for persistence of annotations, by displaying annotations in their original context and displaying relevant annotations for archived resources
• Archiving of annotated and annotating resources is important
• Interesting research question of whether annotation spans multiple archived resources
Slide:41
Thank You
• Authors:
  • [email protected]/[email protected]
  • [email protected]/[email protected]
• OAC:
  • http://[email protected]/
  • http://groups.google.com/group/oac-discuss
• Memento:
  • http://www.mementoweb.org/
  • http://groups.google.com/group/memento-dev
• Thanks To:
  • Scott Ainsworth, Luda Balakireva, Tim Cole, Anna Gerber, Bernhard Haslhofer, Eric Hetzner, Jane Hunter, Cliff Lynch, Michael Nelson, Doug Reside, Harihar Shankar
Slide:42
Memento HTTP Flow
HEAD R, (Accept-Datetime)
Link G
302 M, Vary, TCN, Link R,B,M
200, Content-Datetime, Link R,B,M
GET G, Accept-Datetime
GET M, (Accept-Datetime)
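
Several steps in the flow above hand URIs to the client through Link headers. The sketch below parses a Link header value and looks up URIs by relation type, for example to find the TimeGate G after the first request on R. The rel values ("original", "timegate", "memento"), the datetime attribute, and all example.org URIs are assumptions for illustration; the slide only names the headers. The parser handles the simple `<uri>; name="value"` form and is not a complete implementation of the Link header grammar.

```python
import re

# One link entry: <uri> followed by zero or more ;name=value parameters.
# Quoted values may contain commas (as HTTP dates do).
ENTRY = re.compile(r'<([^>]*)>((?:\s*;\s*[\w-]+\s*=\s*(?:"[^"]*"|[^\s;,"]+))*)')
PARAM = re.compile(r'([\w-]+)\s*=\s*(?:"([^"]*)"|([^\s;,"]+))')


def parse_link_header(value):
    """Return a list of dicts, one per link, with 'uri' plus its parameters."""
    links = []
    for uri, raw_params in ENTRY.findall(value):
        link = {"uri": uri}
        for name, quoted, bare in PARAM.findall(raw_params):
            link[name.lower()] = quoted or bare
        links.append(link)
    return links


def uris_with_rel(links, rel):
    """Return the URIs whose rel parameter includes the given relation."""
    return [l["uri"] for l in links if rel in l.get("rel", "").split()]


# Hypothetical header, shaped like the "Link R,B,M" labels in the flow.
header = (
    '<http://www.cnn.com/>; rel="original", '
    '<http://timegate.example.org/http://www.cnn.com/>; rel="timegate", '
    '<http://archive.example.org/20010410213930/http://www.cnn.com/>; '
    'rel="memento"; datetime="Tue, 10 Apr 2001 21:39:30 GMT"'
)

links = parse_link_header(header)
print(uris_with_rel(links, "timegate"))
print(uris_with_rel(links, "memento"), links[2]["datetime"])
```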