111
@maxkaiser Cooperating with Google The Austrian National Library’s large-scale digitisation partnership with Google Max Kaiser Head R&D, Austrian National Library EVA / MINERVA 2011 Jerusalem 14–16 November, 2011

Cooperating with Google

Embed Size (px)

DESCRIPTION

Presentation at the EVA / MINERVA 2011 Conference in Jerusalem, 15 November 2011

Citation preview

Page 1: Cooperating with Google

@maxkaiser

Cooperating with GoogleThe Austrian National Library’s

large-scale digitisation partnership with Google

Max KaiserHead R&D, Austrian National Library

EVA / MINERVA 2011 Jerusalem 14–16 November, 2011

Page 2: Cooperating with Google

@maxkaiser

Austrian Books Online www.onb.ac.at/ev/austrianbooksonline/

Page 3: Cooperating with Google

@maxkaiser@maxkaiser

www.slideshare.net/maxkaiser

Page 4: Cooperating with Google

@maxkaiser

Digitisation of the entire historical book holdings of the Austrian National Library

Page 5: Cooperating with Google

@maxkaiser

Largest Austrian Public Private Partnership in the cultural sector

Page 6: Cooperating with Google

@maxkaiser

Page 7: Cooperating with Google

@maxkaiser

History back to the 14th century

Page 8: Cooperating with Google

@maxkaiser

One of the world‘s most significant

historical collections

Page 9: Cooperating with Google

@maxkaiser

Quelle: http://commons.wikimedia.org/wiki/File:A

ustria_Hungary_ethnic_de.svg

„Legal Deposit“

Page 10: Cooperating with Google

@maxkaiser

State Hall

Page 11: Cooperating with Google

@maxkaiser http://www.onb.ac.at/sammlungen/siawd/siawd_halev.htm

Page 12: Cooperating with Google

@maxkaiser http://www.onb.ac.at/sammlungen/siawd/100hebraica.htm

Page 13: Cooperating with Google

@maxkaiser

Page 14: Cooperating with Google

@maxkaiser

Page 15: Cooperating with Google

@maxkaiser

Page 16: Cooperating with Google

@maxkaiser

Page 17: Cooperating with Google

@maxkaiser

→Digitisation

Page 18: Cooperating with Google
Page 19: Cooperating with Google

@maxkaiser

→Digitisation→Online Publications→Web Archiving

Page 20: Cooperating with Google

@maxkaiser

600,000 volumes

200 Mio pages

Page 21: Cooperating with Google

@maxkaiser

16th century

2nd half of19th century _

Page 22: Cooperating with Google

@maxkaiser

Google Books

Digital Library Austrian National Library

Page 23: Cooperating with Google

@maxkaiser

Partner Program

Library Program

Google Books

Page 24: Cooperating with Google

@maxkaiser

13 Libraries in Europe

5 National Libraries

Italy

Austria

The Netherlands

Czech Republic

Great Britain

Page 25: Cooperating with Google

@maxkaiser@maxkaiser

>15 Mio. books

>3 Mio. public

domain

Page 27: Cooperating with Google

@maxkaiser

„Stimulating the flow of private funds for the digitisation of cultural assets through equitable public private partnerships

appears as a viable and sustainable way of tackling the pressing question of making Europe’s cultural wealth accessible online and preserving it for future generations.“

Page 28: Cooperating with Google

@maxkaiser

„The key question is not whether public-private partnerships for digitisation should be encouraged, but how‚

and under which conditions.“

Page 29: Cooperating with Google

27 October 2011

Page 30: Cooperating with Google

@maxkaiser

„(...) recommends that Member States (...) encourage partnerships

between cultural

institutions and the private sector

in order to create new ways of funding digitisation

of cultural material and to

stimulate innovative uses of the material, while ensuring that public private partnerships for digitisation

are fair and

balanced

(…).“

Page 31: Cooperating with Google

@maxkaiser

Key principles:1.

Respect for intellectual property rights→

Only public-domain works digitised

2.

Non-exclusivity→

Free to digitise material with other partners

3.

Transparency of the process→

Public Tender

Page 32: Cooperating with Google

@maxkaiser

Key principles:4.

Transparency of agreements→

Very detailed FAQs

online

5.

Accessibility through Europeana→

All files available for non-commercial use

Access via platforms like Europeana→

Provision to research partners

6.

Key criteria→

[Next slide]

Page 33: Cooperating with Google

@maxkaiser

Key criteria for assessing PPPs (selection):

→ (Free) access to material for general public

→Cross-border access→Quality of digital copies for public partner→Usage conditions for public partner

Page 34: Cooperating with Google

@maxkaiser

Additional key elements:→Selection of books by library→ Institute for Restoration involved→Termination

Page 35: Cooperating with Google

Who is paying for what?

http://www.bildarchivaustria.at/downl/1148453/layout/CE%2043_3.jpg

Page 36: Cooperating with Google

@maxkaiser

Costs→Full text-digitisation:

very expensive→Report by

Collections Trust for Comité

des Sages

http://ec.europa.eu/information_society/activities/digital_libraries/ doc/refgroup/annexes/digiti_report.pdf

Page 37: Cooperating with Google

@maxkaiser

Google:→Transport→ Insurance→Scanning→OCR→ Image processing→Quality control→Google Books

Page 38: Cooperating with Google

@maxkaiser

Austrian National Library:→ Provision of Metadata → Selection→ Internal logistics→ Conservational assessment→ Barcoding→ Metadata adjustments→ Data download and control→ Data storage & digital preservation→ Digital Library

Page 40: Cooperating with Google

@maxkaiser

Which books?

Page 41: Cooperating with Google

Entire historical book holdings 16th –19th century

Page 42: Cooperating with Google

@maxkaiser

200.000 volumes State Hall

Page 43: Cooperating with Google

Quelle: http://deu.archinform.net/projekte/107

Department of Manuscripts and Rare Books

Map Department

Page 44: Cooperating with Google

Department of Music

Page 45: Cooperating with Google

Quelle: http://commons.wikimedia.org/wiki/File:Palais_Lobkowitz_Vienna_Oct._2006_006.jpg

Theatre Museum

Page 46: Cooperating with Google

Fidei Commiss Library

Page 47: Cooperating with Google

@maxkaiser@maxkaiser

Page 48: Cooperating with Google

@maxkaiser

7 Work Packages

Book logistics

Metadata / Catalogues

Conservation / Restoration

Data download / Quality control

Access

IT infrastructure

Project management

Page 49: Cooperating with Google

@maxkaiser

Preparatory projectMid -

end 2010

→ Integration with organisational processes→Personnel resources→Logistics workflows

Page 50: Cooperating with Google

@maxkaiser

Internal communication→Change processes→Re-evaluation of workflows→Availability of internal resources

Page 51: Cooperating with Google

Consultation with

other Google partners

Quelle: http://commons.wikimedia.org/wiki/File:M%C3%BCnchen_Bayerische_Staatsbibliothek_001.JPG

Page 52: Cooperating with Google

@maxkaiser

70+ staff members

20+ exclusively for project→Book logistics→Metadata adaptation→Cataloguing→Conservation / restoration→Quality control→Software implementation→Project management

Page 53: Cooperating with Google

@maxkaiser

End of 2010Test shipment & Start operational project

Spring 2011Start of digitisation

Page 54: Cooperating with Google

No individual selection …

Page 55: Cooperating with Google

Format

Page 56: Cooperating with Google

Format

Page 57: Cooperating with Google

Condition

Page 58: Cooperating with Google

Preparation

Page 59: Cooperating with Google

Conservational evaluation

Page 60: Cooperating with Google

Value

Page 61: Cooperating with Google

Logistics in the State Hall

Page 62: Cooperating with Google

Logistics in the State Hall

Page 63: Cooperating with Google

Logistics in the State Hall

Page 64: Cooperating with Google

Challenges…

Page 65: Cooperating with Google

Challenges…

Page 66: Cooperating with Google

Challenges…

Page 67: Cooperating with Google

Challenges…

Page 68: Cooperating with Google

Challenges…

Page 69: Cooperating with Google

Logistics in the „Aurum“ Depot

Page 70: Cooperating with Google

Logistics in the „Aurum“ Depot

Page 71: Cooperating with Google

Preparation for Digitisation

Page 72: Cooperating with Google

Manipulation area …

Page 73: Cooperating with Google

Barcoding

Page 74: Cooperating with Google

Adaptation of metadata

Page 75: Cooperating with Google

@maxkaiser

Page 76: Cooperating with Google

8 minutes / volume

Page 77: Cooperating with Google

@maxkaiser

books

Page 78: Cooperating with Google

@maxkaiser

hours

Page 79: Cooperating with Google

@maxkaiser

working days

Page 80: Cooperating with Google

@maxkaiser

person years

Page 81: Cooperating with Google

Complex cases …

Page 82: Cooperating with Google

Bound-Togethers …

Page 83: Cooperating with Google

Bound-Togethers …

Page 84: Cooperating with Google

Bound-Togethers …

Page 85: Cooperating with Google

„Slim“ volumes …

Page 86: Cooperating with Google

Special collections …

Page 87: Cooperating with Google

Conservational protection

Page 88: Cooperating with Google

Conservational protection

Page 89: Cooperating with Google

Conservational protection

Page 90: Cooperating with Google

@maxkaiserConservational protection

Page 91: Cooperating with Google

Cataloguing of the Fidei Commiss Library

Page 92: Cooperating with Google

Cataloguing of the Fidei Commiss Library

Page 93: Cooperating with Google

Ready for Digitisation …

Page 94: Cooperating with Google

@maxkaiser

Digitisation→ Scanning Center

in Germany

→ Procedures agreed→ Austrian Federal Office for Monuments involved→ Each volume checked after return→ Books unavailable to users for ~ 3 months

Page 95: Cooperating with Google

@maxkaiser

Page 96: Cooperating with Google

@maxkaiser

Digitisation

Data Download

Book Logistics

Quality Control

Storage

Access

ADOCO (Austrian Books Online Download & Control)

Page 97: Cooperating with Google

@maxkaiser

digitised items / day

Page 98: Cooperating with Google

@maxkaiser

Quality control→Automated jobs→Representative samples→ IT assisted discovery of error clusters→Error candidates checked manually

Page 99: Cooperating with Google

@maxkaiser

Page 100: Cooperating with Google

@maxkaiser

Page 101: Cooperating with Google

@maxkaiser

Technologies and Workflows from EC co-funded FP7 projects:

→SCAPE (Scalable Preservation Environments)

→http://www.scape-project.eu/

→ IMPACT (Improving Access to Text)

→http://www.impact-project.eu/

Page 102: Cooperating with Google

http://www.digitisation.eu/

Page 103: Cooperating with Google

@maxkaiser

Data Management & Access

→Data storage: inhouse→ JPEG-2000 master files stored redundantly→ Access copies generated on-the-fly→ URN resolver for permanent identification

Page 104: Cooperating with Google

@maxkaiser

Book Viewer

Catalogue / “Quick Search”

Full-Text Search

[Mobile Apps]

Page 105: Cooperating with Google

@maxkaiser

Image Server

Catalogue

URN Resolver

Master Images

Digital Repository

Quick Search

Fulltext Index Server

Book Viewer

ADOCOGoogle

USER

Page 106: Cooperating with Google

@maxkaiser

Outlook→ Full-Text: new possibilities for research→Data enrichment→Named Entity Recognition→ Linked Data→New data centric research in the Humanities

& Social Sciences→ http://www.diggingintodata.org/

Page 107: Cooperating with Google

@maxkaiser

Page 108: Cooperating with Google

@maxkaiserhttp://books.google.com/books?vid=ONB%2BZ119586207

Page 109: Cooperating with Google

@maxkaiserhttp://books.google.com/books?vid=ONB%2BZ119586207

Page 110: Cooperating with Google

@maxkaiser

More informationwww.onb.ac.at/ev/austrianbooksonlinewww.onb.ac.at/ev/austrianbooksonline/faq.htmtwitter.com/abooksonline

Page 111: Cooperating with Google

@maxkaiser

Thank [email protected]

www.onb.ac.at

www.slideshare.net/maxkaiser

www.linkedin.com/in/maxkaisergplus.to/maxkaiser

twitter.com/maxkaiser