22

Hard Content, Fab Front-end @ IIPC 2014

Embed Size (px)

DESCRIPTION

Presentation at the International Internet Preservation Consortium conference 2014 in Paris on the web archiving project the Netherlands Institute for Sound and Vision did together with Dutch public broadcaster NTR.

Citation preview

Page 1: Hard Content, Fab Front-end @ IIPC 2014
Page 2: Hard Content, Fab Front-end @ IIPC 2014

HARD CONTENT, FAB FRONT-ENDArchiving websites of the Dutch Public Broadcasters

10-04-2023

Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and VisionIIPC | 21 May 2014 | BnF, Paris

Page 3: Hard Content, Fab Front-end @ IIPC 2014

10 april 2023

Nederlands Instituut voor Beeld en Geluid

3

Sound and Vision

• 70% of Dutch AV heritage• > 850,000 hours

• 2M photos• 20,000 objects

• Large paper archives

Page 4: Hard Content, Fab Front-end @ IIPC 2014
Page 5: Hard Content, Fab Front-end @ IIPC 2014

“The Archive as a Laboratory”

Web archiving since 2008 (LiWA, several pilots) with various objectives

Page 6: Hard Content, Fab Front-end @ IIPC 2014

NTR PILOT(2013-2014)

10-04-2023

WHY:• Saving websites selected to be taken offline• Getting insights in user requirements• Create great front and back-end• Provide public access• Shape future plans

Page 7: Hard Content, Fab Front-end @ IIPC 2014

WEBSITES

10-04-2023

Page 8: Hard Content, Fab Front-end @ IIPC 2014

CRAWLING ISSUES

Page 9: Hard Content, Fab Front-end @ IIPC 2014

ACCESS ISSUES

Page 10: Hard Content, Fab Front-end @ IIPC 2014

USER REQUIREMENTS, PT. 1

Phase 1: Focus group

Page 11: Hard Content, Fab Front-end @ IIPC 2014
Page 12: Hard Content, Fab Front-end @ IIPC 2014
Page 13: Hard Content, Fab Front-end @ IIPC 2014

USER REQUIREMENTS SUMMARY

• Communication and informatione.g. “As a user, I can suggest a website that should be archived”

• Metadatae.g. “As a user, I can see the crawl date for each archived URL”

• Searchinge.g. “As a user, I can search full-text through a single archived website”

• Visualisatione.g. “As a user, I can see side-by-side comparisons of the same URL that was archived at different moments in time”

Page 14: Hard Content, Fab Front-end @ IIPC 2014

FRONT-END AND BACK-ENDDEVELOPMENT

Page 15: Hard Content, Fab Front-end @ IIPC 2014

FRONT-END AND BACK-ENDDEVELOPMENT

Page 16: Hard Content, Fab Front-end @ IIPC 2014

FRONT-END AND BACK-ENDDEVELOPMENT

Page 17: Hard Content, Fab Front-end @ IIPC 2014

FRONT-END AND BACK-ENDDEVELOPMENT

Page 18: Hard Content, Fab Front-end @ IIPC 2014

FRONT-END AND BACK-ENDDEVELOPMENT

Page 19: Hard Content, Fab Front-end @ IIPC 2014

USER REQUIREMENTS, PT. 2

Phase 2: Usability teststhink-aloud, 60-90 minutes

x 2:• 37, PostDoc web archive research project• 58, Multimedia editor at a Dutch public broadcaster

x 3:• 44, Crawl engineer• 50, Manager digital projects at a Dutch public broadcaster• 58, Freelance (archive) researcher & journalist

Page 20: Hard Content, Fab Front-end @ IIPC 2014

LESSONS-LEARNED

UI/UX+ Clean, visual look- More functionality explanations

COMMUNICATION+ FAQ contains good info about web archiving- Info about status + plans/ More info about scope and size of web archive

METADATA+ Overview of outgoing links- TMI/ Creation + last change of website

SEARCHING+ Fast!+ Thumbnail previews- Search by URL- More filtering options- Relevance ranking

VISUALISATION/ More stats, e.g., % text- Highlight differences crawls

USERS & USAGE+ Current groups representative- No av-streaming big loss for all/ Add more fine-grained subgroups

Page 21: Hard Content, Fab Front-end @ IIPC 2014

FUTURE WORK WEB ARCHIVES:CONTEXT COLLECTIONS

“Public broadcaster web archives will help you learn where you come from” -- Usability test participant

• We need to be more dynamic than the websites we archive• We can and must achieve public access• We are moving from pilot to standard practice• Connect crawls to catalogue• Increase public broadcaster cooperation

Page 22: Hard Content, Fab Front-end @ IIPC 2014

Thanks!@lottebelice | [email protected]

@benglabs