Persistent Web Annotations
Rob Sanderson, Herbert Van de Sompel
JCDL 2010, June 21-25, Surfers Paradise, Australia

Robert Sanderson [email protected] [email protected]
Herbert Van de Sompel [email protected] [email protected]
Digital Library Research and Prototyping Team
Los Alamos National Laboratory, USA

Making Web Annotations Persistent over Time

DESCRIPTION

JCDL 2010 presentation about using Memento to reconstruct the state for web resources involved in annotation.

Slide:2

Overview

•  Background and Research Question

•  Open Annotation Collaboration

•  Memento

•  Experiments

•  Conclusions

Slide:3

10+ Years of Annotation Research

Slide:4

Our 2020 Vision

web

Slide:5

Scholarship is Moving to the Web

•  From silos of highly curated content to open world models

•  Publishing of scholarly communication online:
    •  First Journals (DLib, PLoS, …)
    •  Then Users (Vivo, ORCID, …)
    •  Now Data (NVO, OREChem, …)

•  Next the Scientific Process? (MyExperiment, Annotation, …)

•  With the web come new challenges…

•  Persistence

Slide:6

Web Resources Change…

Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14

Slide:7

Archived Copy, But No Annotations

Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html

Slide:8

Can We Fix This? Automatically?

The desired outcome: Display the correct representation of the Web Resource with the Annotation.

Slide:9

Previous Annotation Persistence Methods

•  Migrate annotations from one version to the next:
    •  Seek to discover new location of old target segment
    •  Otherwise discard the annotation as no longer relevant
    •  Treats the Annotation as of secondary importance

•  Focused on heuristics:
    •  Cross format, cross location
    •  Edited text in same document
    •  Dynamically scaling target areas, marks of annotation
    •  …

Slide:10

Persistent and Web-Centric?

•  OAC: Describe Annotations in a Web-centric Model

+

•  Memento: Make Navigating the Past Web Easy

=

•  Given an Annotation, display appropriate archived Web Resource?
•  Given an archived Web Resource, display appropriate Annotations?

Slide:11

Open Annotation Collaboration

•  Focus: Interoperability between systems to enable sharing

•  Foundation: Architecture of the World Wide Web

•  Framework: Linked Data Guidelines

•  Funding: Mellon Foundation for 18 months

Slide:12

OAC Data Model: Basics

•  An oac:Annotation is an ore:Aggregation of two or more resources, such that one (oac:Body) annotates at least one other (oac:Target)

•  We get OAI-ORE entities for free (ore:ResourceMap, ore:Proxy)

•  All resources are regular web resources
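The basic model above can be sketched as a small set of triples. This is a minimal illustration in plain Python, not the project's own serialization: the example URIs are made up, the OAC namespace URI is an assumption, and only the class names from the slide plus the standard ore:aggregates relation are used.

```python
# Published OAI-ORE terms namespace.
ORE = "http://www.openarchives.org/ore/terms/"
# Assumption: OAC namespace URI, used here for illustration only.
OAC = "http://www.openannotation.org/ns/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def make_annotation(annotation_uri, body_uri, target_uris):
    """Return (subject, predicate, object) triples for one annotation."""
    if not target_uris:
        raise ValueError("an annotation needs at least one target")
    triples = {
        # The annotation is both an oac:Annotation and an ore:Aggregation.
        (annotation_uri, RDF_TYPE, OAC + "Annotation"),
        (annotation_uri, RDF_TYPE, ORE + "Aggregation"),
        # The body is one of the aggregated resources.
        (annotation_uri, ORE + "aggregates", body_uri),
        (body_uri, RDF_TYPE, OAC + "Body"),
    }
    for target in target_uris:
        # Each target is aggregated alongside the body.
        triples.add((annotation_uri, ORE + "aggregates", target))
        triples.add((target, RDF_TYPE, OAC + "Target"))
    return triples


# Hypothetical example URIs.
example = make_annotation(
    "http://example.org/anno/1",
    "http://example.org/notes/1",
    ["http://news.bbc.co.uk/"],
)
```

Because body and target are ordinary URIs, nothing in the sketch distinguishes them from any other web resource except the type statements.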

Slide:13

OAC Data Model: Basics

Slide:14

OAC Data Model: Segments

•  Most Annotations are about part of a resource

•  Resources are atomic, in terms of identification (by a URI)

•  Segments of the resource apply in the context of the Annotation

•  Solution: attach a Description of the Segment of interest to an ORE Proxy for the resource
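A rough sketch of the Proxy idea, under stated assumptions: ore:proxyFor and ore:proxyIn are the standard OAI-ORE relations, while the segmentDescription property, its vocabulary URI, and the free-text segment value are hypothetical placeholders rather than anything defined by OAC.

```python
ORE = "http://www.openarchives.org/ore/terms/"
# Hypothetical vocabulary for the segment description (not an OAC term).
EX = "http://example.org/vocab/"


def make_segment_proxy(proxy_uri, annotation_uri, target_uri, segment):
    """Describe a segment of target_uri that holds only within one annotation."""
    return {
        # The proxy stands for the whole target resource...
        (proxy_uri, ORE + "proxyFor", target_uri),
        # ...but only in the context of this particular annotation.
        (proxy_uri, ORE + "proxyIn", annotation_uri),
        # The segment description hangs off the proxy, not the target itself.
        (proxy_uri, EX + "segmentDescription", segment),
    }


proxy_triples = make_segment_proxy(
    "http://example.org/anno/1/proxy/1",
    "http://example.org/anno/1",
    "http://news.bbc.co.uk/",
    "second headline in the main column",
)
```

The target URI never appears as the subject of the segment statement, which keeps the resource atomic and leaves other annotations free to describe different segments of it.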

Slide:15

OAC Data Model: Segments

Slide:16

OAC Data Model: Time

•  As regular web resources, ResourceMap, Body and Target have representations that can change over time

•  The ResourceMap, Body and Target can change independently of each other

•  If an Annotation involves resources as they existed at a particular point in time, this needs to be recorded

•  Three different Time models are possible…

Slide:17

Timeless Annotations

•  The Annotation is always applicable, regardless of the representation served from the URIs of the Body and Targets.

•  Example: "This is the homepage of CNN"

•  Timeless Annotations do not need a special timestamp.

Slide:18

Uniform Time Annotations

•  The Annotation is not always applicable, but pertains to the state of the Body and Target at a single moment in time.

•  Example: Tweet is about contemporary state of a web page.

•  Add mem:when property to Annotation

Slide:19

Varied Time Annotations

•  The Annotation is not always applicable, but pertains to the state of the Body and Target at different moments in time.

•  Example: Blog post is about previous day's state of a web page
•  Add mem:when property to Proxies for resources
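The three time models can be told apart by where, if anywhere, a mem:when value is recorded. The sketch below assumes a hypothetical dictionary layout for an annotation ("when" for the annotation-level value, "proxy_when" for per-resource values); only the mem:when idea itself comes from the slides.

```python
from datetime import datetime, timezone


def resolve_when(annotation, resource_uri):
    """Return the datetime at which resource_uri should be viewed, or None."""
    proxy_when = annotation.get("proxy_when", {})
    if resource_uri in proxy_when:
        # Varied time: the timestamp is recorded on the resource's Proxy.
        return proxy_when[resource_uri]
    # Uniform time: one timestamp on the Annotation; timeless: no timestamp.
    return annotation.get("when")


page = "http://www.cnn.com/"
post = "http://example.org/blog/post"  # hypothetical body URI

timeless = {"body": post, "target": page}
uniform = {"body": post, "target": page,
           "when": datetime(2010, 6, 14, 12, 0, tzinfo=timezone.utc)}
varied = {"body": post, "target": page,
          "proxy_when": {
              post: datetime(2010, 6, 15, 9, 0, tzinfo=timezone.utc),
              page: datetime(2010, 6, 14, 9, 0, tzinfo=timezone.utc),
          }}
```

A viewer could call resolve_when once per Body and Target and only ask for an archived copy when the result is not None.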

Slide:20

Memento

•  We know: the URI of the Target and the time it was annotated

•  We don't know: the URI where it is archived
•  We need: Memento

•  Memento wants to make it Easy to Navigate the Web's Past

•  Basic idea: Use HTTP to get from the current resource to the archived copy of that resource at a given time, rather than searching
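As a small illustration of the basic idea, the two things we know (the Target URI and the annotation time) are enough to build a datetime-negotiation request. The sketch below only prepares the Accept-Datetime header using the standard HTTP date format; the function name is hypothetical and no request is actually sent.

```python
from datetime import datetime, timezone
from email.utils import format_datetime


def memento_request_headers(annotated_at):
    """Build the request header asking for a resource as of annotated_at."""
    if annotated_at.tzinfo is None:
        raise ValueError("annotated_at must be timezone-aware")
    # HTTP dates are expressed in GMT, so normalize to UTC first.
    utc = annotated_at.astimezone(timezone.utc)
    return {"Accept-Datetime": format_datetime(utc, usegmt=True)}


headers = memento_request_headers(
    datetime(2001, 4, 10, 21, 39, 30, tzinfo=timezone.utc)
)
# headers == {"Accept-Datetime": "Tue, 10 Apr 2001 21:39:30 GMT"}
```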

Slide:21

Issues with the Past Web

•  New names for archived resources
    •  What was cnn.com, becomes archive.org/web/20010120…
    •  …And lots of other names
    •  With no way to discover them without searching by hand

•  People do not like to search
    •  Especially when a computer could do it.

•  Navigation is inconsistent
    •  Stuck in web archive content silo (URIs rewritten)
    •  Or end up back in present (URIs not rewritten)

Slide:22

The Web without Time Dimension

eg: http://www.cnn.com/
eg: http://web.archive.org/web/20020209001709rn_1/www.cnn.com/?

Slide:23

The Web with Time Dimension from Memento

eg: http://www.cnn.com/
eg: http://web.archive.org/web/20020209001709rn_1/www.cnn.com/?

Slide:24

[Figure: www.cnn.com (current) alongside web.archive.org copies dated Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; Aug 15 2007, 19:21:58 UTC]

Slide:25

[Figure: www.cnn.com (current), labeled Original Resource, and web.archive.org copies dated Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; Aug 15 2007, 19:21:58 UTC, labeled Mementos, with a question mark between them]

Slide:26

[Figure: www.cnn.com (current), labeled Original Resource, and web.archive.org copies dated Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; Aug 15 2007, 19:21:58 UTC, labeled Mementos, with a question mark between them and a TimeGate added]

Slide:27

[Figure: www.cnn.com (current), labeled Original Resource, and web.archive.org copies dated Apr 10 2001, 21:39:30 UTC; Aug 15 2004, 08:45:27 UTC; Aug 15 2007, 19:21:58 UTC, labeled Mementos, with a TimeGate between them. Arrows labeled: Conneg with TimeGate to Mementos; Link Headers]

Slide:28

Experiments

1.  Given an annotation, find the appropriate representations
    •  Create annotation on resource known to change
    •  Can we use the information from the annotation to faithfully recreate the environment through Memento?

2.  Given an archived resource, find the appropriate annotations
    •  Create annotations at different times on resource known to be archived
    •  Can we use the information from Memento to find the appropriate annotations?

Slide:29

Experiment 1

Retrieve

Slide:30

Experiment 1

Reconstruct

Slide:31

Experiment 1: Create Annotation

Slide:32

Experiment 1: Test without Memento

Slide:33

Experiment 1: Test with Memento

Slide:34

Experiment 2

•  Need to find Original URI, start, end time of representation
•  Need searchable collection of annotations
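The lookup in Experiment 2 can be sketched as a filter over a collection of annotations. Everything concrete here is an assumption for illustration: the in-memory list stands in for the searchable collection, the dictionary keys are hypothetical, and the half-open interval and the inclusion of timeless annotations (no recorded time) are design choices made for the sketch.

```python
from datetime import datetime, timezone


def annotations_for_memento(annotations, original_uri, start, end):
    """Return annotations applicable to the representation valid in [start, end)."""
    return [
        a for a in annotations
        if a["target"] == original_uri
        # Timeless annotations always apply; timed ones must fall in the window.
        and (a["when"] is None or start <= a["when"] < end)
    ]


def utc(*args):
    return datetime(*args, tzinfo=timezone.utc)


# Hypothetical stand-in for a searchable annotation store.
store = [
    {"id": "a1", "target": "http://www.cnn.com/", "when": utc(2004, 8, 20)},
    {"id": "a2", "target": "http://www.cnn.com/", "when": utc(2007, 9, 1)},
    {"id": "a3", "target": "http://www.cnn.com/", "when": None},
    {"id": "a4", "target": "http://news.bbc.co.uk/", "when": utc(2004, 8, 20)},
]

hits = annotations_for_memento(
    store, "http://www.cnn.com/", utc(2004, 8, 15), utc(2007, 8, 15)
)
```

With these sample records, the annotation from 2004 and the timeless one are selected, while the later annotation and the one on a different resource are not.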

Slide:35

Experiment 2: Create Annotations

Slide:36

Experiment 2: Create Annotations

Slide:37

Experiment 2: Create Annotations

Slide:38

Experiment 2: Test without Memento

Slide:39

Experiment 2: Test with Memento

Slide:40

Conclusions

•  Annotation, as a core scholarly practice, is increasingly web-based

•  We propose using OAC and Memento to provide a solution for persistence of annotations, by displaying annotations in their original context and displaying relevant annotations for archived resources

•  Archiving of annotated and annotating resources important

•  Interesting research question of whether annotation spans multiple archived resources

Slide:41

Thank You

•  Authors:
    •  [email protected]/[email protected]
    •  [email protected]/[email protected]

•  OAC:
    •  http://[email protected]/
    •  http://groups.google.com/group/oac-discuss

•  Memento:
    •  http://www.mementoweb.org/
    •  http://groups.google.com/group/memento-dev

•  Thanks To:
    •  Scott Ainsworth, Luda Balakireva, Tim Cole, Anna Gerber, Bernhard Haslhofer, Eric Hetzner, Jane Hunter, Cliff Lynch, Michael Nelson, Doug Reside, Harihar Shankar

Slide:42

Memento HTTP Flow

HEAD R, (Accept-Datetime) → Link G

GET G, Accept-Datetime → 302 M, Vary, TCN, Link R, B, M

GET M, (Accept-Datetime) → 200, Content-Datetime, Link R, B, M
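The first two steps of this flow can be imitated without any network access. In the sketch below R is read as the original resource, G as its TimeGate and M as a Memento; the TimeGate and archive URIs are made up, the responses are plain dictionaries rather than real HTTP messages, the nearest-capture rule is an assumed selection policy, and the Vary, TCN and Content-Datetime headers from the slide are left out for brevity.

```python
import re
from datetime import datetime, timezone

R = "http://www.cnn.com/"
G = "http://timegate.example.org/" + R   # hypothetical TimeGate URI
ARCHIVE = "http://archive.example.org/"  # hypothetical archive prefix

# Capture times of the archived copies (dates taken from the figure slides).
CAPTURES = [
    datetime(2001, 4, 10, 21, 39, 30, tzinfo=timezone.utc),
    datetime(2004, 8, 15, 8, 45, 27, tzinfo=timezone.utc),
    datetime(2007, 8, 15, 19, 21, 58, tzinfo=timezone.utc),
]


def memento_uri(captured):
    """Build a made-up Memento URI for one capture time."""
    return ARCHIVE + captured.strftime("%Y%m%d%H%M%S") + "/" + R


def parse_link(header):
    """Map each rel value in a simple Link header to its URI."""
    pairs = re.findall(r'<([^>]+)>;\s*rel="([^"]+)"', header)
    return {rel: uri for uri, rel in pairs}


def head_original(uri):
    """Step 1: the original resource points to its TimeGate via Link."""
    return {"status": 200, "Link": f'<{G}>; rel="timegate"'}


def get_timegate(uri, accept_datetime):
    """Step 2: the TimeGate redirects to the capture nearest the requested time."""
    nearest = min(CAPTURES, key=lambda t: abs(t - accept_datetime))
    return {"status": 302, "Location": memento_uri(nearest)}


def find_memento(uri, when):
    """Follow Link from R to G, then the redirect from G to M."""
    timegate = parse_link(head_original(uri)["Link"])["timegate"]
    return get_timegate(timegate, when)["Location"]


found = find_memento(R, datetime(2004, 9, 1, tzinfo=timezone.utc))
```

The final GET on the returned URI would then retrieve the archived representation itself; a client never needs to know the archive's URI scheme in advance.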