NLLC 2011: Memento, Open Annotation, SharedCanvas

Preview:

DESCRIPTION

Presentation at NLLC 2011, at Los Alamos National Laboratory, about the current research projects of the Prototyping team.

Citation preview

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

1

Memento, Open Annotation, SharedCanvas

Robert Sanderson rsanderson@lanl.gov Los Alamos National Laboratory

Herbert Van de Sompel herbertv@lanl.gov Los Alamos National Laboratory

With thanks to our collaborators: •  Benjamin Albritton, at Stanford •  Tim Cole, at UIUC •  Jane Hunter, at Queensland •  Michael L Nelson, at Old Dominion •  Many others!

This research is funded in part by the Andrew W. Mellon Foundation and the Library of Congress.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

2

Overview … for the Web

•  Memento •  Time Travel for the Web •  Uses

•  Open Annotation •  Interoperable Annotation for the Web •  Use Case: SharedCanvas

•  SharedCanvas •  Distributed Digital Facsimiles for the Web •  Uses

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

3

Memento: Time Travel for the Web

Memento wants to make it Easy

to Navigate the Web of the Past

Website: http://www.mementoweb.org/

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

4

Tate Online Today

Select Date March 16 2008

Tate Online March 16 2008

From UK National Archives

4

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

5

Old Versions of Web Pages Exist …

http://www.cnn.com/ at 2001-09-11: From the Internet Archive

http://en.wikipedia.org/wiki/September_11_attacks at 2001-12-20: From the Wikipedia history

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

6

… But are not Integrated with the Web

•  We note three issues:

•  Cannot talk about a resource as it used to exist

•  Cannot navigate to a prior version of a resource, knowing its current version

•  Cannot navigate to the current version of a resource, knowing a prior version

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

7

Memento Framework

•  Memento:

•  Introduces a uniform version mechanism to the Web: Time

•  Does not build new archives, but leverages all existing ones

•  Is based on the Architecture of the World Wide Web: resources, representations and links

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

8 8

Original Resources and Mementos

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

9 9

TimeGate: A Bridge from the Present to the Past

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

10 10

Memento Framework in Practice

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

11 11

Memento Uses: Browsing with MementoFox

Get it from: http://bit.ly/memfox

Firefox 3.6+ 4000 downloads 400 daily users

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

12 12

Memento Uses: Time Series Analysis

Historical data retrieved via Memento from DBPedia archive

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

13 13

Memento Uses: Archiving Extent for Scholarship

•  72% in archives and/or still exist

•  High proportion of archived URLs, possibly due to academic level and general disciplines

•  78% in archives and/or still exist

•  45% still exist, but not archived! Possibly due to high value, but very discipline specific references

UNT

arXiv

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

14

Web-Centric Annotation: No Persistence

Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14

1

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

15

Web-Centric Annotation: No Annotations

Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html

1

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

16

Web-Centric Annotation: Cross-Linking via Memento

1

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

17

Scholarly Communication and Annotation

Scholarly Communication is increasingly: •  Online •  Open •  Distributed •  Collaborative •  Data-Oriented

Annotation is a scholarly primitive, spanning discipline and level. Need to ensure that Digital Annotations fall under these headings!

•  Apply the standards and architecture of the World Wide Web to the Annotation use case. •  Even if scholar doesn’t share annotations with others, she will want to access them from different tools and environments.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

18

Open Annotation

•  Website: http://www.openannotation.org/

•  Focus on interoperable sharing of annotations •  Web-centric and open, not locked down silos •  Create, consume and interact in different environments •  Build from a simple model for simple cases, to more detailed for complex scholarly annotation requirements

•  Status: Beta, with 9 ongoing funded experiments to inform 1.0

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

19

Basic Model

The basic model has three resources: •  Annotation (an RDF document) •  Body (the ‘comment’ of the annotation) •  Target (the resource the Body is ‘about’)

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

20

Basic Model Example

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

21

Segments of Resources

Most annotations are about part of a resource

Different segments for different media types:

•  Text: paragraph, arbitrary span of words •  Image: rectangular or arbitrary shaped area •  Audio: start and end time points, track name/number •  Video: area and time points •  Other: slice of a data set, volume in a 3d object, …

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

22

Constraints

•  Introduce a Constraint that describes the segment of interest •  And a ConstrainedTarget that identifies the segment of interest •  Constraints are resources, so can be expressive and detailed

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

23

Constraint Example

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

24

Annotation Protocols

Protocol: publish, subscribe, consume tied together

Unlike previous systems, Open Annotation does not mandate a protocol.

No reliance on a client/server combination gives the client autonomy.

Instead we promote a publish/subscribe methodology, where annotations may be stored and consumed from anywhere.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

25

Publish/Subscribe Method

publish

We don’t specify how this transfer should occur

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

26

publish subscribe

Publish/Subscribe Method

Nor this.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

27

publish subscribe consume

Publish/Subscribe Method

Nor this.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

28

Open Annotation Usage: SharedCanvas

http://www.shared-canvas.org/

Digital surrogates for digitized documents: •  Initial use case: Medieval Manuscripts

A collaborative future: •  Rich landscape of interconnected

repositories, with seamless user interfaces

We need: •  Shared description to act as input to

shared viewing environments

BNF f.fr 113, folio 1 recto

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

29

Naïve Approach: Transcribe Images Directly

Align multiple images, pages without images, fragments of pages… ?!

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

30

Canvas Paradigm

A Canvas is an empty space in which to build up a display •  HTML5, SVG, PDF, … even Powerpoint! •  Can "paint" many different resources, including text, images and audio, on to a Canvas

We can use a Canvas to represent a page of a manuscript.

Distributed nature is fundamental in the requirements •  Idea: Use Annotations to paint resources and comments •  Annotations target the Canvas instead of the Images

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

31

Canvas to Page Relationship

The Canvas's top left and bottom right corners correspond to the corners of the page

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

32

Annotate Image to Canvas

We can paint the canvas by annotating it with resources.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

33

Annotate Text to Canvas Segment

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

34

Annotate Choice of Image

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

35

Annotate Many Texts

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

36

SharedCanvas: Implementation

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

37

Annotate Audio Resources

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

38

SharedCanvas: Implementation

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

39

SharedCanvas: Distributed Transcription …

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

40

… of Lab Notebooks and Dissertations?

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

41

Summary

SharedCanvas brings primary research objects to the desktop in a powerful, extensible and interoperable fashion

Memento wants to make it Easy to Navigate the Web of the Past

Open Annotation enables sharing Annotations between users, tools and environments.

LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011

42

Thank You!

Me: Robert Sanderson rsanderson@lanl.gov azaroth42@gmail.com @azaroth42

Memento: Web: http://www.mementoweb.org/

Open Annotation: Web: http://www.openannotation.org/

SharedCanvas: Web: http://www.shared-canvas.org/

These Slides: http://slidesha.re/…

Recommended