DESCRIPTION
Presentation at the International Internet Preservation Consortium conference 2014 in Paris on the web archiving project the Netherlands Institute for Sound and Vision did together with Dutch public broadcaster NTR.
HARD CONTENT, FAB FRONT-END
Archiving websites of the Dutch Public Broadcasters
10-04-2023
Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision
IIPC | 21 May 2014 | BnF, Paris
Nederlands Instituut voor Beeld en Geluid
Sound and Vision
• 70% of Dutch AV heritage
• > 850,000 hours
• 2M photos
• 20,000 objects
• Large paper archives
“The Archive as a Laboratory”
Web archiving since 2008 (LiWA, several pilots) with various objectives
NTR PILOT (2013-2014)
WHY:
• Saving websites selected to be taken offline
• Gaining insight into user requirements
• Creating a great front-end and back-end
• Providing public access
• Shaping future plans
WEBSITES
CRAWLING ISSUES
ACCESS ISSUES
USER REQUIREMENTS, PT. 1
Phase 1: Focus group
USER REQUIREMENTS SUMMARY
• Communication and information
  e.g. “As a user, I can suggest a website that should be archived”
• Metadata
  e.g. “As a user, I can see the crawl date for each archived URL”
• Searching
  e.g. “As a user, I can search full-text through a single archived website”
• Visualisation
  e.g. “As a user, I can see side-by-side comparisons of the same URL that was archived at different moments in time”
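The metadata and visualisation stories above (crawl date per archived URL, comparing the same URL at different moments) could be prototyped roughly as follows. This is only an illustrative sketch, not the pilot's actual implementation: the 14-digit `YYYYMMDDhhmmss` timestamp format, the function names `crawl_date` and `compare_captures`, and the sample page texts are all assumptions made for the example.

```python
import difflib
from datetime import datetime


def crawl_date(timestamp: str) -> datetime:
    """Parse a 14-digit YYYYMMDDhhmmss capture timestamp (assumed format)."""
    return datetime.strptime(timestamp, "%Y%m%d%H%M%S")


def compare_captures(old, new):
    """Return unified-diff lines between two (timestamp, text) captures.

    Each capture is a hypothetical (timestamp, extracted_text) pair for
    the same URL; the crawl dates are used as labels in the diff header.
    """
    old_ts, old_text = old
    new_ts, new_text = new
    return list(difflib.unified_diff(
        old_text.splitlines(),
        new_text.splitlines(),
        fromfile=crawl_date(old_ts).isoformat(),
        tofile=crawl_date(new_ts).isoformat(),
        lineterm="",
    ))


# Hypothetical captures of one URL, crawled at two moments in time.
capture_2013 = ("20130501120000", "Welcome\nProgramme guide\nContact")
capture_2014 = ("20140301090000", "Welcome\nProgramme archive\nContact")

diff_lines = compare_captures(capture_2013, capture_2014)
for line in diff_lines:
    print(line)
```

Lines prefixed with `-` appear only in the older capture and lines prefixed with `+` only in the newer one; a front-end could render these as highlighted differences next to the two crawl dates.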
FRONT-END AND BACK-END DEVELOPMENT
USER REQUIREMENTS, PT. 2
Phase 2: Usability tests
think-aloud, 60-90 minutes
x 2:
• 37, PostDoc web archive research project
• 58, Multimedia editor at a Dutch public broadcaster
x 3:
• 44, Crawl engineer
• 50, Manager digital projects at a Dutch public broadcaster
• 58, Freelance (archive) researcher & journalist
LESSONS-LEARNED
UI/UX
+ Clean, visual look
- More functionality explanations
COMMUNICATION
+ FAQ contains good info about web archiving
- Info about status + plans
/ More info about scope and size of web archive
METADATA
+ Overview of outgoing links
- TMI
/ Creation + last change of website
SEARCHING
+ Fast!
+ Thumbnail previews
- Search by URL
- More filtering options
- Relevance ranking
VISUALISATION
/ More stats, e.g., % text
- Highlight differences between crawls
USERS & USAGE
+ Current groups representative
- No AV streaming: a big loss for all
/ Add more fine-grained subgroups
FUTURE WORK WEB ARCHIVES: CONTEXT COLLECTIONS
“Public broadcaster web archives will help you learn where you come from” -- Usability test participant
• We need to be more dynamic than the websites we archive
• We can and must achieve public access
• We are moving from pilot to standard practice
• Connect crawls to catalogue
• Increase public broadcaster cooperation