42
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6 th October 2011 1 Memento, Open Annotation, SharedCanvas Robert Sanderson [email protected] Los Alamos National Laboratory Herbert Van de Sompel [email protected] Los Alamos National Laboratory With thanks to our collaborators: • Benjamin Albritton, at Stanford • Tim Cole, at UIUC • Jane Hunter, at Queensland • Michael L Nelson, at Old Dominion • Many others! This research is funded in part by the Andrew W. Mellon Foundation and the Library of Congress.

NLLC 2011: Memento, Open Annotation, SharedCanvas

Embed Size (px)

DESCRIPTION

Presentation at NLLC 2011, at Los Alamos National Laboratory, about the current research projects of the Prototyping team.

Citation preview

Page 1: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

1

Memento, Open Annotation, SharedCanvas

Robert Sanderson [email protected] Los Alamos National Laboratory

Herbert Van de Sompel [email protected] Los Alamos National Laboratory

With thanks to our collaborators: •  Benjamin Albritton, at Stanford •  Tim Cole, at UIUC •  Jane Hunter, at Queensland •  Michael L Nelson, at Old Dominion •  Many others!

This research is funded in part by the Andrew W. Mellon Foundation and the Library of Congress.

Page 2: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

2

Overview … for the Web

•  Memento •  Time Travel for the Web •  Uses

•  Open Annotation •  Interoperable Annotation for the Web •  Use Case: SharedCanvas

•  SharedCanvas •  Distributed Digital Facsimiles for the Web •  Uses

Page 3: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

3

Memento: Time Travel for the Web

Memento wants to make it Easy

to Navigate the Web of the Past

Website: http://www.mementoweb.org/

Page 4: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

4

Tate Online Today

Select Date March 16 2008

Tate Online March 16 2008

From UK National Archives

4

Page 5: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

5

Old Versions of Web Pages Exist …

http://www.cnn.com/ at 2001-09-11: From the Internet Archive

http://en.wikipedia.org/wiki/September_11_attacks at 2001-12-20: From the Wikipedia history

Page 6: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

6

… But are not Integrated with the Web

•  We note three issues:

•  Cannot talk about a resource as it used to exist

•  Cannot navigate to a prior version of a resource, knowing its current version

•  Cannot navigate to the current version of a resource, knowing a prior version

Page 7: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

7

Memento Framework

•  Memento:

•  Introduces a uniform version mechanism to the Web: Time

•  Does not build new archives, but leverages all existing ones

•  Is based on the Architecture of the World Wide Web: resources, representations and links

Page 8: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

8 8

Original Resources and Mementos

Page 9: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

9 9

TimeGate: A Bridge from the Present to the Past

Page 10: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

10 10

Memento Framework in Practice

Page 11: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

11 11

Memento Uses: Browsing with MementoFox

Get it from: http://bit.ly/memfox

Firefox 3.6+ 4000 downloads 400 daily users

Page 12: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

12 12

Memento Uses: Time Series Analysis

Historical data retrieved via Memento from DBPedia archive

Page 13: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

13 13

Memento Uses: Archiving Extent for Scholarship

•  72% in archives and/or still exist

•  High proportion of archived URLs, possibly due to academic level and general disciplines

•  78% in archives and/or still exist

•  45% still exist, but not archived! Possibly due to high value, but very discipline specific references

UNT

arXiv

Page 14: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

14

Web-Centric Annotation: No Persistence

Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14

1

Page 15: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

15

Web-Centric Annotation: No Annotations

Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html

1

Page 16: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

16

Web-Centric Annotation: Cross-Linking via Memento

1

Page 17: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

17

Scholarly Communication and Annotation

Scholarly Communication is increasingly: •  Online •  Open •  Distributed •  Collaborative •  Data-Oriented

Annotation is a scholarly primitive, spanning discipline and level. Need to ensure that Digital Annotations fall under these headings!

•  Apply the standards and architecture of the World Wide Web to the Annotation use case. •  Even if scholar doesn’t share annotations with others, she will want to access them from different tools and environments.

Page 18: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

18

Open Annotation

•  Website: http://www.openannotation.org/

•  Focus on interoperable sharing of annotations •  Web-centric and open, not locked down silos •  Create, consume and interact in different environments •  Build from a simple model for simple cases, to more detailed for complex scholarly annotation requirements

•  Status: Beta, with 9 ongoing funded experiments to inform 1.0

Page 19: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

19

Basic Model

The basic model has three resources: •  Annotation (an RDF document) •  Body (the ‘comment’ of the annotation) •  Target (the resource the Body is ‘about’)

Page 20: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

20

Basic Model Example

Page 21: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

21

Segments of Resources

Most annotations are about part of a resource

Different segments for different media types:

•  Text: paragraph, arbitrary span of words •  Image: rectangular or arbitrary shaped area •  Audio: start and end time points, track name/number •  Video: area and time points •  Other: slice of a data set, volume in a 3d object, …

Page 22: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

22

Constraints

•  Introduce a Constraint that describes the segment of interest •  And a ConstrainedTarget that identifies the segment of interest •  Constraints are resources, so can be expressive and detailed

Page 23: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

23

Constraint Example

Page 24: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

24

Annotation Protocols

Protocol: publish, subscribe, consume tied together

Unlike previous systems, Open Annotation does not mandate a protocol.

No reliance on a client/server combination gives the client autonomy.

Instead we promote a publish/subscribe methodology, where annotations may be stored and consumed from anywhere.

Page 25: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

25

Publish/Subscribe Method

publish

We don’t specify how this transfer should occur

Page 26: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

26

publish subscribe

Publish/Subscribe Method

Nor this.

Page 27: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

27

publish subscribe consume

Publish/Subscribe Method

Nor this.

Page 28: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

28

Open Annotation Usage: SharedCanvas

http://www.shared-canvas.org/

Digital surrogates for digitized documents: •  Initial use case: Medieval Manuscripts

A collaborative future: •  Rich landscape of interconnected

repositories, with seamless user interfaces

We need: •  Shared description to act as input to

shared viewing environments

BNF f.fr 113, folio 1 recto

Page 29: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

29

Naïve Approach: Transcribe Images Directly

Align multiple images, pages without images, fragments of pages… ?!

Page 30: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

30

Canvas Paradigm

A Canvas is an empty space in which to build up a display •  HTML5, SVG, PDF, … even Powerpoint! •  Can "paint" many different resources, including text, images and audio, on to a Canvas

We can use a Canvas to represent a page of a manuscript.

Distributed nature is fundamental in the requirements •  Idea: Use Annotations to paint resources and comments •  Annotations target the Canvas instead of the Images

Page 31: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

31

Canvas to Page Relationship

The Canvas's top left and bottom right corners correspond to the corners of the page

Page 32: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

32

Annotate Image to Canvas

We can paint the canvas by annotating it with resources.

Page 33: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

33

Annotate Text to Canvas Segment

Page 34: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

34

Annotate Choice of Image

Page 35: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

35

Annotate Many Texts

Page 36: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

36

SharedCanvas: Implementation

Page 37: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

37

Annotate Audio Resources

Page 38: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

38

SharedCanvas: Implementation

Page 39: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

39

SharedCanvas: Distributed Transcription …

Page 40: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

40

… of Lab Notebooks and Dissertations?

Page 41: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

41

Summary

SharedCanvas brings primary research objects to the desktop in a powerful, extensible and interoperable fashion

Memento wants to make it Easy to Navigate the Web of the Past

Open Annotation enables sharing Annotations between users, tools and environments.

Page 42: NLLC 2011: Memento, Open Annotation, SharedCanvas

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

42

Thank You!

Me: Robert Sanderson [email protected] [email protected] @azaroth42

Memento: Web: http://www.mementoweb.org/

Open Annotation: Web: http://www.openannotation.org/

SharedCanvas: Web: http://www.shared-canvas.org/

These Slides: http://slidesha.re/…