Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
Archiving the Rio 2016 Olympics: Scaling up IIPC collaborative collection
development
Alex Thurman & Helena Byrne
The Olympics and the Web
Copyright © 2017 Shape Collage Inc.
IIPC Access Working Group Olympics Collections
IIPC Collections
Collections publicly accessible at
http://netpreserve.org/projects/collaborative-collections
&
https://archive-it.org/home/iipc
•Olympics/Paralympics collections (2010-2016)
•European Refugee Crisis (2015-2016)
•World War One Commemoration (2015- )
•International Cooperation Organizations (2015- )
Anticipating Rio2016
Why The Olympics?
Participating IIPC Members
BIBLIOTHÈQUE ET ARCHIVES NATIONALES DU QUÉBEC (BANQ)
NASJONALBIBLIOTEKET (THE NATIONAL LIBRARY OF NORWAY)
BIBLIOTECA NACIONAL DE ESPAÑA (NATIONAL LIBRARY OF SPAIN)
NATIONAL DIET LIBRARY, JAPAN
BIBLIOTHÈQUE NATIONALE DE FRANCE (NATIONAL LIBRARY OF FRANCE)
NATIONAL LIBRARY OF AUSTRALIA
THE BRITISH LIBRARY NATIONAL LIBRARY OF NEW ZEALAND
COLUMBIA UNIVERSITY LIBRARIES NATIONAL LIBRARY OF SCOTLAND
DEUTSCHE NATIONALBIBLIOTHEK (GERMAN NATIONAL LIBRARY)
NATIONAL LIBRARY OF SERBIA
EESTI RAHVUSRAAMATUKOGU (NATIONAL LIBRARY OF ESTONIA)
NETARCHIVE.DK (ROYAL DANISH LIBRARY)
LIBRARY OF CONGRESS (including LC OVERSEAS OFFICES)
SCHWEIZERISCHE NATIONALBIBLIOTHEK (SWISS NATIONAL LIBRARY)
NÁRODNÍ KNIHOVNA CESKÉ REPUBLIKY (NATIONAL LIBRARY OF THE CZECH REPUBLIC) ….. and in 2018 YOU!
Project Plan
Collaborative Tools Used
How We Collected
• IIPC member institutions submitted
nominations through a shared Google Spreadsheet.
• Non-IIPC members and the public submitted
nominations through a Google Form.
IIPC Engagement Strategy
• Instructional videos with audio commentary. • Print Summary. • Emails. • Public blog posts. • Geo-referencing the nominations.
External Engagement Strategy
• Google Forms
• Public blog posts
• Twitter chat
• At least 3 non IIPC institutions
•Anonymous form
•176 Nominations
• 24 different countries covered
• 4 different languages
• Chinese, Korean and Georgian
• Multiple Sports, Rugby Sevens,
Boxing and Football most
popular nominations.
Total: 125 IIPC nominations: 101 Public nominations: 24
Countries Covered
blog posts direct contact social
media
https://netpreserveblog.wordpress.com/2016/11/08/rio-2016-round-up/
Promotion
Metadata fields
fields for public display nonpublic
free text fields
URL Title / Title (English) Country Language
Contributing institution
predefined dropdown options
Event Website type Subject Olympic / Paralympic sport
Crawl depth scope
Website type & Subject
Crawl depth scope options Archive-It seed type [crawl scope] Use cases # of Rio2016
seeds
Standard [full seed host/directory]
Full sites of athletes, teams, federations
1189
One page only Individual articles on Rio2016 topics from media, gov
1196
One page + [seed page plus one click of all links on page]
Media pages linking to multiple articles on Rio2016-related topic/tag Social media feeds posting external links
2363
total 4748
Overview of IIPC collecting (2016-2017 subscription)
Collection crawl overview
month # of crawls new data added
July 2016 3 665 GB
August 2016 (Olympics: Aug. 6-21) 20 1630 GB
September 2016 (Paralympics: Sept. 7-18) 7 565 GB
October 2016 9 266 GB
November 2016-June 2017 268 (QA patch crawls)
<1 GB
total 307 3.1 TB
File type distribution in sample crawls
crawl date
# of seeds preset data limit
file type distribution
July 7-10 1422 512 GB video 312 GB text 65 GB image 63 GB audio 46 GB pdf 17 GB
July 7 54 (YouTube, mostly team channels)
100 GB video 99+ GB
Aug. 16-18 786 (Twitter) 350 GB video 326 GB text 13 GB image 9 GB pdf <1 GB
Estimated proportion of total collection data (3.1 TB) that is video: 75%
What Next?
http://img.112.international/original/2016/06/02/235135.jpg / http://www.marketingdelosdeportes.com/wp-content/uploads/2016/08/TOKIO.jpg
https://stillmed.olympic.org/media/Images/OlympicOrg/Games/Winter/Beijing_2022/Beijing_2022_emblem.jpg?interpolation=lanczos-
none&resize=240:240
Contact Details
Alex Thurman
Columbia University Libraries
[email protected] @athurman
Helena Byrne
The British Library
[email protected] @HBee2015