36
Europeana Newspapers WP 4 Aggregation & Indexing Plan Markus Muhr

Europeana Newspapers Aggregation Plan

Embed Size (px)

DESCRIPTION

A presentation by Markus Muhr at the Europeana Newspapers workshop in Amsterdam.

Citation preview

Page 1: Europeana Newspapers Aggregation Plan

Europeana Newspapers WP 4

Aggregation & Indexing Plan

Markus Muhr

Page 2: Europeana Newspapers Aggregation Plan

2

Agenda

● Customer Relationship Management● Aggregation Workflow - Metadata

• Aggregation Workflow - Full-text and Images

• Newspaper Content Browser Options

• Viewing Images

• Delivery to Europeana / Zeitschriftendatenbank

• Aggregation and Indexing Plan

• Questions

Page 3: Europeana Newspapers Aggregation Plan

3

Customer Relationship Management

• SugarCRM

• Management of all administrative information• Organisations, contacts, datasets, projects, etc.

• Important features for project handling• Newspaper collections• Cases per specific collection• Aggregation and Indexing Plan• Automatic reporting

Page 4: Europeana Newspapers Aggregation Plan

4

Customer Relationship Management

Page 5: Europeana Newspapers Aggregation Plan

5

Customer Relationship Management

Page 6: Europeana Newspapers Aggregation Plan

6

Customer Relationship Management

Page 7: Europeana Newspapers Aggregation Plan

7

Aggregation Workflow – Metadata

● Scheduling of ingestion● Datasets ready for harvesting● Create case in CRM: case # to provider● Harvesting metadata (OAI-PMH, FTP, ...)● Enhance metadata (VIAF, Geonames, MACS,...)● Indexing in acceptance portal ● E-mail to provider to accept dataset● Live index = live portal● Delivery to Europeana● Enhancing and publishing in Europeana

Page 8: Europeana Newspapers Aggregation Plan

8

Aggregation Workflow – Metadata

Page 9: Europeana Newspapers Aggregation Plan

9

Aggregation Workflow - Full-text and Images

● Hard-disk delivery by UIBK/CSS● Hard-disk delivery to ULCC● Ingestion and alignment of fulltext and images with

harvested metadata● JPEG 2000 generation for hosted IIP image server● Enrichment with named entities from KB● Indexing into content browser● Adaptations of image viewer for external image servers

• E-mail to partner

Page 10: Europeana Newspapers Aggregation Plan

10

Aggregation Workflow - Full-text and Images

Page 11: Europeana Newspapers Aggregation Plan

11

Aggregation Workflow - Full-text and Images

Page 12: Europeana Newspapers Aggregation Plan

12

Aggregation Workflow - Full-text and Images

Page 13: Europeana Newspapers Aggregation Plan

13

Newspaper Content Browser Options

• Questionnaire to content providers determined how the content would appear in newspaper content browser

• Option 1 - Images and full-text• Option 2 - Snippets of images and full-text• Option 3 - Full-text only• Option 4 - Metadata only• Option 5 - Option 1 via external image server• Option 6 - Option 2 via external image server

Page 14: Europeana Newspapers Aggregation Plan

14

Viewing Images

● The European Library hosts images for Option 1 and 2 ● IIP Image Server with JPEG 2000● Viewing images transformed into JPEG 2000● Ingestion workflow includes transformation step for tifs and

jpgs● Time-demanding operation● Image viewer is IIPMooViewer● Open source projects ● Europeana Regia

http://www.theeuropeanlibrary.org/tel4/virtual/regia

Page 15: Europeana Newspapers Aggregation Plan

15

Viewing Images

● External image servers for Option 5 and 6 ● Current support of external viewers via iframe

● Alignment and highlighting not available● Improved usage of content browser via integrated image

viewer● Adaptations for each different kind of image server● Time-demanding task● Existing viewer that can be easily embedded in the

Newspaper Content Browser are preferable● Technical support at partner libraries is necessary

Page 16: Europeana Newspapers Aggregation Plan

16

Delivery to Europeana / Zeitschriftendatenbank

● Metadata from Full and Associate Partners should go into Newspapers content browser, Europeana portal and Zeitschriftendatenbank / Union Catalogue of Serials

● EDM to Europeana● Duplin Core to Zeitschriftendatenbank

● Europeana Data Model delivery should be finalised soon

Page 17: Europeana Newspapers Aggregation Plan

17

Europeana Data Model

Page 18: Europeana Newspapers Aggregation Plan

18

Dublin Core

Page 19: Europeana Newspapers Aggregation Plan

19

Aggregation and Indexing Plan

● Plan includes aggregation of partners and 11 associated partners

● Q3 first quarter with indexing work● Aggregation and indexing is aligned with deliveries from

UIBK/CCS● Deliveries to Europeana & Zeitschriftendatenbank from Q4

onwards● Aggregation and indexing is split over multiple quarters for

some partners

Page 20: Europeana Newspapers Aggregation Plan

20

Aggregation and Indexing Plan – Q3 2013

● Österreichische Nationalbibliothek / Austrian National Library – Option 5● Currently working on first batch of 1.090k full-text pages

● Kansalliskirjasto / National Library of Finland – Option 1 (new)● Currently working on first batch of 132k full-text pages

and images

Page 21: Europeana Newspapers Aggregation Plan

21

Aggregation and Indexing Plan – Q4 2013

● Landesbibliothek Dr. Friedrich Teßmann / Teßmann Library – Option 2● 857k full-text pages and thumbnail images

● Österreichische Nationalbibliothek / Austrian National Library – Option 5 and 4● Remaining batches of 1.090k full-text pages● Metadata for 5.691k pages

Page 22: Europeana Newspapers Aggregation Plan

22

Aggregation and Indexing Plan – Q4 2014

● Bibliotheque Nationale de France / National Library France – Option 5● First batch of 2.388k full-text pages

● Latvijas Nacionala Biblitoteka / National Library of Latvia – Option 1● 450k full-text pages and images

Page 23: Europeana Newspapers Aggregation Plan

23

Aggregation and Indexing Plan – Q4 2013

● Landsbókasafn Íslands - Háskólabókasafn / National and Univeristy Library of Iceland – Associated Partner ● Metadata for 4.112k pages

● National Library of Spain – Associated Partner● Metadata for 5.831k pages

● Bibliothèque nationale de Luxembourg / National Library of Luxembourg – Associated Partner● Metadata for 620k pages

Page 24: Europeana Newspapers Aggregation Plan

24

Aggregation and Indexing Plan – Q1 2014

● Bibliotheque Nationale de France / National Library France – Option 5● Next batch of 2.388k full-text pages

● Eesti Rahvusraamatukogu / Estonian National Library – Option 1● First batch of 594k full-text pages and images

● Milli Kutuphane Baskanligi / National Library of Turkey – Option 4● Metadata for 9k pages

Page 25: Europeana Newspapers Aggregation Plan

25

Aggregation and Indexing Plan – Q1 2014

● Staatsbibliothek zu Berlin / Berlin State Library – Option 1● First batch of 248k full-text pages and images

● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1● First batch of 1707k full-text pages and images

● Univerzitet u Beogradu / University Library of Belgrade – Option 1● First batch of 408k full-text pages and images

Page 26: Europeana Newspapers Aggregation Plan

26

Aggregation and Indexing Plan – Q1 2014

● National Library of Wales – Associated Partner ● Metadata for 1.100k pages

● National Library and University Library in Zagreb – Associated Partner● Metadata for 300k pages

Page 27: Europeana Newspapers Aggregation Plan

27

Aggregation and Indexing Plan – Q1 2014

● St. Cyril and Methodius National Library / The National Library of Bulgaria – Associated Partner● Metadata for 12k pages

● National Library of Czech Republic – Associated Partner● Metadata for 5.760k pages

Page 28: Europeana Newspapers Aggregation Plan

28

Aggregation and Indexing Plan – Q2 2014

● Bibliotheque Nationale de France / National Library France – Option 5● Next batch of 2.388k full-text pages

● Eesti Rahvusraamatukogu / Estonian National Library – Option 1● Next batch of 594k full-text pages and images

Page 29: Europeana Newspapers Aggregation Plan

29

Aggregation and Indexing Plan – Q2 2014

● Biblioteka Narodowa / National Library of Poland – Option 2● 83k full-text pages and images

● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1● Next batch of 1707k full-text pages and images

● Koninklijke Bibliotheek / National Library of the Netherlands – Option 5● 1.900k full-text pages

Page 30: Europeana Newspapers Aggregation Plan

30

Aggregation and Indexing Plan – Q2 2014

● Narodna in univerzitetna knjižnica / National and University Library of Slovenia – Associated Partner● Metadata for ?k pages

● National Library of Portugal – Associated Partner● Metadata for 400k pages

● National Library of Romania – Associated Partner● Metadata for 442k pages

Page 31: Europeana Newspapers Aggregation Plan

31

Aggregation and Indexing Plan – Q3 2014

● Bibliotheque Nationale de France / National Library France – Option 5● Next batch of 2.388k full-text pages

● Eesti Rahvusraamatukogu / Estonian National Library – Option 1● Next batch of 594k full-text pages and images

Page 32: Europeana Newspapers Aggregation Plan

32

Aggregation and Indexing Plan – Q3 2014

● Staatsbibliothek zu Berlin / Berlin State Library – Option 1● Next batch of 248k full-text pages and images

● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1● Next batch of 1707k full-text pages and images

Page 33: Europeana Newspapers Aggregation Plan

33

Aggregation and Indexing Plan – Q4 2014

● Bibliotheque Nationale de France / National Library France – Option 5● Final batch of 2.388k full-text pages

● Eesti Rahvusraamatukogu / Estonian National Library – Option 1● Final batch of 594k full-text pages and images

Page 34: Europeana Newspapers Aggregation Plan

34

Aggregation and Indexing Plan – Q4 2014

● Staatsbibliothek zu Berlin / Berlin State Library – Option 1● Final batch of 248k full-text pages and images

● Staats- und Universitätsbibliothek Hamburg / State and University Library Hamburg – Option 1● Final batch of 1707k full-text pages and images

● Kansalliskirjasto / National Library of Finland – Option 1● Final batch of 132k full-text pages and images

Page 35: Europeana Newspapers Aggregation Plan

35

Operations Officers

Anastasia Gasia

Junior Operations Officer

[email protected]

Chiara Latronico

Operations Officer

[email protected]

Operations Mailbox: [email protected]

Page 36: Europeana Newspapers Aggregation Plan

Thank you for your attention!

Markus Muhr ([email protected])

www.europeana-newspapers.eu