22
Alastair Dunning Europeana Newspapers September 2013 Challenges and Solutions in Creating a European Historic Newspapers Browser

Challenges and solutions in creating a european historic newspapers browser

Embed Size (px)

DESCRIPTION

Alastair Dunning of The European Library presents the challenges and solutions in creating a web browser for Europe's historic newspapers.

Citation preview

Page 1: Challenges and solutions in creating a european historic newspapers browser

Alastair DunningEuropeana Newspapers

September 2013

Challenges and Solutions in Creating a European Historic

Newspapers Browser

Page 2: Challenges and solutions in creating a european historic newspapers browser

Task $“...

Creation of a full-text index of newspaper content

Development of a newspaper content browser

… ”

Work Package 4 - Aggregation and presentation of digitized newspapers

Page 3: Challenges and solutions in creating a european historic newspapers browser

The European Library is building an interface to allow cross-searching of historic newspapers digitised by project partners

Title-level metadata exported to Europeana.

In reality ...

Page 4: Challenges and solutions in creating a european historic newspapers browser

Timetable

Sep 2013 - Beta version with limited content and functionality made available

2014 - Ongoing inclusion of more content and functionality

Spring 2014 - Usability testing I (subject to project funding)

Winter 2014 - Usability testing II (subject to project funding)

Jan 2015 - All scheduled content and functionality completed

Post-project - Interface sustained as part of The European Library

Page 5: Challenges and solutions in creating a european historic newspapers browser

What content will be included ?

Full Images, Full Text, Metadata

Latvia, Belgrade, Hamburg, Berlin, Estonia, Finland, Netherlands *, Austria *

Snippets of Images, Full Text, Metadata

Frederich Tessman, France *, Poland

Page 6: Challenges and solutions in creating a european historic newspapers browser

Complete Newspaper image can be shownEesti Potimees ehk Naddaleleht , 2 November 1866

(National Library of Estonia)

Page 7: Challenges and solutions in creating a european historic newspapers browser

What content will be included ?

Full Images, Full Text, Metadata

Latvia, Belgrade, Hamburg, Berlin, Estonia, Finland, Netherlands *, Austria *

Snippets of Images, Full Text, Metadata

Frederich Tessman, France *, Poland

Page 8: Challenges and solutions in creating a european historic newspapers browser

Fragment of Newspaper image can be shownDziennik Slaskui, 10 June 1915(National Library of Poland)

Page 9: Challenges and solutions in creating a european historic newspapers browser

Just Metadata

Turkey(Partners with copyright issues)All Associate Partners (for now)

The available content in influenced by what restrictions in copyright and business model from each of the contributing libraries.

What content will be included ?

Page 10: Challenges and solutions in creating a european historic newspapers browser

Just title level metadata can be shown:“Kleine Blatt, 15 November 1932”(National Library of Austria)

(Although can we have dark index of full text ?)

Page 11: Challenges and solutions in creating a european historic newspapers browser

Creating a newspapers interface that ...

• Provides unique value to users• Reflects relationship to original

physical newspaper collections• Is sustainable• Offers contributors added value• Defines relationship to

Europeana• Respects library wishes

Page 12: Challenges and solutions in creating a european historic newspapers browser

Users can cross-searchEuropean Newspapers

18m pages, 10m with full text

Users can see what was published on a particular day across Europe

Users can see information on individual newspapers

Provides unique value to users

Page 13: Challenges and solutions in creating a european historic newspapers browser

Local historiansResearchersUndergraduatesGenealogistsTeachers and / school pupils‘Interested public’….

(According to the project Description of Work it is for the ‘researcher’)

But who are the users ?

Page 14: Challenges and solutions in creating a european historic newspapers browser

Respects library wishes

The available content in influenced by what restrictions in copyright and business model from each of the contributing libraries.

●Location of digital image ●Size of image●Format of image

Page 15: Challenges and solutions in creating a european historic newspapers browser

Reflects relationship to original physical newspaper collections

Not all issues in a newspaper title will be available to TEL, or even digitised

Documents hosted by TEL will be different quality than those

Contextual information vital to ensure user confidence

Page 16: Challenges and solutions in creating a european historic newspapers browser

Embedded in The European Library (TEL) portal

TEL membership fees willhelp with ongoing costs

TEL members can add content to newspaper browser over time

Stable URLs

Is sustainable

Page 17: Challenges and solutions in creating a european historic newspapers browser

Logos and links back to source of original content

But also evidence of usage of library content via TEL / what statistics are needed ?

Offers contributors added value

Page 18: Challenges and solutions in creating a european historic newspapers browser

Interface will respond to usability testing

Harvesting of different material will affect interface

Changing requests from libraries

Uneven quality, especially in OCR will also affect interface

Is developed iteratively

Page 19: Challenges and solutions in creating a european historic newspapers browser

First Iteration

Basic text searchFiltering of results by●date●country●newspaper●language●library

Page 20: Challenges and solutions in creating a european historic newspapers browser

● OCR shown● Zoomable version of full

image ● Clickable links between

full text and image (sometimes)

● Link to newspaper source library (where we have been provided with links)

First Iteration

Page 21: Challenges and solutions in creating a european historic newspapers browser

SecondIteration

● Fragments (where requested by library)

● See information on particular title

● See what was published on a particular day

● Search over titles (not just text)● Other browseable visualisations

of publication and library source

● Search / browse via entities

Page 22: Challenges and solutions in creating a european historic newspapers browser

Newspapers from national libraries of Finland and Austria are available for searching (Sample search terms: Linz, Graz, Salzburg, Turku, Oulu, Tampere)

Testing the Site