28
From web archiving to web collecting The development of the KB’s web archive Anna Rademakers, May 21st 2014

From web archiving to web collecting

Embed Size (px)

DESCRIPTION

From web archiving to web collecting. The development of the KB’s web archive Anna Rademakers, May 21st 2014. Introduction. Collection policy of the KB in general The history of web archiving in the KB From web archiving to web collecting. Mission statement. - PowerPoint PPT Presentation

Citation preview

Page 1: From web archiving to web collecting

From web archiving to web collectingThe development of the KB’s web archive

Anna Rademakers, May 21st 2014

Page 2: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Introduction

• Collection policy of the KB in general

• The history of web archiving in the KB

• From web archiving to web collecting

Page 3: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Mission statement

“We bring people and information together”

• we offer everyone everywhere access to everything published in and about the Netherlands.

• we play a central role in the (scientific) information infrastructure of the Netherlands.

• we promote permanent access to digital information both nationally and internationally.

Page 4: From web archiving to web collecting

Web archiving in the KB

• Corresponds with our general collection policy• Archiving & making permanently accessible

• Since 2007• Ca. 6000 websites

• Using the Wayback Machine• At the moment onsite accessible in the KB for general

user• Datasets for academic research (e.g. Webart)

From web archiving to web collecting May 21st 2014

Page 5: From web archiving to web collecting

Limitations

•No full .nl domain harvest • Dutch websites also in .com and .net domain• No Dutch Deposit Law

•Opt Out System: • Notice sent to web owners, they can object to

being archived• Part of Dutch law, so only applicable to Dutch

websites

From web archiving to web collecting May 21st 2014

Page 6: From web archiving to web collecting

Selection by subject librarians (1)

• 1) Selection made by subject librarians:

• Focus on library collection profile• Dutch heritage, culture, language & history

• Special collections• Event harvesting (national & international (IIPC))

• E.g. 200 years Kingdom of the Netherlands, • the Netherlands in World War I, • the Olympics,…

From web archiving to web collecting Name and/or date

Page 7: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 8: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 9: From web archiving to web collecting

Selection by subject librarians (2)

• 1) Selection made by subject librarians:

• Focus on library collection profile• Dutch heritage, culture, language & history

• Special collections• Event harvesting (national & international (IIPC))• Websites on special topics

• E.g. embassies, • Sinterklaas (Saint-Nicolas),• Product and trade associations,…

From web archiving to web collecting May 21st 2014

Page 10: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 11: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 12: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 13: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 14: From web archiving to web collecting

Selection by subject librarians (3)

• Selection made by subject librarians:

• Focus on library collection profile• Dutch heritage, culture, language & history

• Special collections• Event harvesting (national & international (IIPC))• Websites on special topics

• Canceled websites

From web archiving to web collecting May 21st 2014

Page 15: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 16: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 17: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 18: From web archiving to web collecting

Selection by subject librarians (3)

• Selection made by subject librarians:

• Focus on library collection profile• Dutch heritage, culture, language & history

• Special collections• Event harvesting (national & international (IIPC))• Websites on special topics

• Cancelled websites• Frysian websites (cooperation with Tresoar)

From web archiving to web collecting May 21st 2014

Page 19: From web archiving to web collecting

Selection by subject librarians (3)

• Selection made by subject librarians:

• Focus on library collection profile• Dutch heritage, culture, language & history

• Special collections• Event harvesting (national & international (IIPC))• Websites on special topics

• Cancelled websites• Frysian websites (cooperation with Tresoar)

How can we make our selection more representative and efficient?

From web archiving to web collecting May 21st 2014

Page 20: From web archiving to web collecting

Selection by relevance ranking

• Alexa: 500 most used websites in the Netherlands• Only 160 websites Dutch• Many marketing websites• Technical issues

• Wikipedia: 12000 websites being used as a reference in the Dutch Wikipedia• More objective• Diversity• Wikipedia community

From web archiving to web collecting May 21st 2014

Page 21: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 22: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 23: From web archiving to web collecting

What not to select?

• Technical: databases, webshops,… Publications vs. services• Collections archived by other Dutch institutions, e.g.

• Websites of Political Parties Archipol• Although we do archive governmental websites!

• Websites about or from Rotterdam Municipal archive• Although we do archive websites of national interest!

• Controversial websites, e.g.• Right or left wing extremists• Pedophilia• Motorcycle gangs

From web archiving to web collecting May 21st 2014

Page 24: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 25: From web archiving to web collecting

From web archiving to web collecting May 21st 2014

Page 26: From web archiving to web collecting

What not to select?

• Technical: databases, webshops,… Publications vs. services• Collections archived by other Dutch institutions, e.g.

• Websites of Political Parties Archipol• Although we do archive governmental websites!

• Websites about or from Rotterdam Municipal archive• Although we do archive websites of national interest!

• Controversial websites, e.g.• Right or left wing extremists• Pedophilia• Motorcycle gangs

• KB as quality mark?

From web archiving to web collecting May 21st 2014

Page 27: From web archiving to web collecting

Closing remarks

•Future: • Website with search options• Online if possible• Further cooperation with researchers• Further cooperation in terms of collection development

•No full archiving (because we have no legal framework for that), but building a permanent and accessible representative collection of Dutch websites

From web archiving to web collecting May 21st 2014

Page 28: From web archiving to web collecting

Thank you

Anna [email protected]

From web archiving to web collecting

May 21st 2014