Building Archivable Websites Nicholas Taylor Web Archiving Service Manager Digital Library Systems and Services Drupal Camp April 19, 2014

DESCRIPTION

Presentation for Stanford Drupal Camp on how and why to build archivable websites.

Page 1: Building Archivable Websites

Building Archivable Websites

Nicholas Taylor
Web Archiving Service Manager
Digital Library Systems and Services

Drupal Camp
April 19, 2014

Page 2: Building Archivable Websites

Why Build ARCHIVABLE WEBSITES?

“Frosted Spiders' Web” by Jess Wood under CC BY 2.0

Page 4: Building Archivable Websites

maintain web usability

“Broken Web Connections? Welcome to 2009...” by Paul:Ritchie under CC BY-NC-ND 2.0

Page 7: Building Archivable Websites

recover your lost website

“Warrick”

Page 8: Building Archivable Websites

refer to earlier website versions

“The Iraq War: Wikipedia Historiography” by STML under CC BY-SA 2.0

Page 9: Building Archivable Websites

institutional history

Internet Archive Wayback Machine: “Stanford University Homepage”

Page 10: Building Archivable Websites

websites are cultural artifacts

“The World Wide Web project”

Page 11: Building Archivable Websites

facilitate compliance

Page 12: Building Archivable Websites

optimize for other crawlers

“SEO on a railway platform” by superboreen under CC BY-NC-ND 2.0

Page 13: Building Archivable Websites

How to IMPROVE ARCHIVABILITY

“metal web” by paul:74 under CC BY-NC-SA 2.0

Page 14: Building Archivable Websites

follow web standards and accessibility guidelines

“Web Standards Fortune Cookie” by Flickr user Matt Herzberger under CC BY-SA 2.0

Page 15: Building Archivable Websites

use a site map, transparent links, and contiguous navigation

“Card sorting” by Flickr user Manchester Library under CC BY-SA 2.0
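
A site map gives a crawler an explicit list of pages to fetch, so it does not have to discover them through navigation alone. The following is a minimal sketch, assuming the sitemaps.org XML format; the function name `build_sitemap` and the example.org URLs are illustrative and not taken from the slides.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical pages; a real site would list its own canonical URLs.
    print(build_sitemap(["https://example.org/", "https://example.org/about"]))
```

Listing only plain, stable URLs here also keeps the links "transparent": nothing in the file depends on script execution to be resolved.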

Page 16: Building Archivable Websites

maintain stable URLs and redirect when necessary

“San Francisco-Oakland Bay Bridge 1442a” by Flickr user Don Barrett under CC BY-NC-ND 2.0
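
When a URL does have to change, a permanent redirect lets both visitors and crawlers follow the old address to the new one. A rough sketch of the idea, assuming a simple lookup table; the paths in `REDIRECTS` and the function `respond` are hypothetical, and a production site would normally configure this in the web server or CMS rather than in application code.

```python
from http import HTTPStatus

# Hypothetical mapping of retired paths to their current locations.
REDIRECTS = {
    "/staff.html": "/people",
    "/news/2013/index.html": "/news/archive/2013",
}


def respond(path, redirects=REDIRECTS):
    """Return (status, headers) for a requested path.

    Retired paths get a 301 with a Location header; everything else
    is treated as a normal 200 response in this sketch.
    """
    target = redirects.get(path)
    if target is not None:
        return HTTPStatus.MOVED_PERMANENTLY, {"Location": target}
    return HTTPStatus.OK, {}


if __name__ == "__main__":
    print(respond("/staff.html"))
    print(respond("/people"))
```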

Page 18: Building Archivable Websites

be careful w/ robot exclusion rules

“drupal/robots.txt at 7.x”
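
Robot exclusion rules that block directories holding stylesheets, scripts, or images can leave an archived copy without its presentation. The sketch below uses Python's standard `urllib.robotparser` to list which URLs a given rule set would block. The rules in `RULES` are an illustrative excerpt in the style of a CMS default file, and the user-agent name `example-archiver` and the example.org URLs are assumptions.

```python
from urllib.robotparser import RobotFileParser

# Illustrative rules; directories like these often contain CSS and JavaScript.
RULES = """\
User-agent: *
Disallow: /misc/
Disallow: /modules/
Disallow: /themes/
"""


def blocked_urls(rules, user_agent, urls):
    """Return the subset of `urls` that `user_agent` may not fetch."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


if __name__ == "__main__":
    candidates = [
        "https://example.org/about",
        "https://example.org/themes/site/style.css",
        "https://example.org/misc/jquery.js",
    ]
    for url in blocked_urls(RULES, "example-archiver", candidates):
        print("blocked:", url)
```

Running a check like this against the assets a page actually loads shows whether an archival crawler that obeys robots.txt could capture everything needed to render it.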

Page 19: Building Archivable Websites

minimize reliance on external assets necessary for presentation

Internet Archive Wayback Machine: “Stanford Department of English”

Page 20: Building Archivable Websites

minimize reliance on external assets necessary for presentation

“Stanford Department of English”
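
One way to see how much a page depends on other hosts is to list the assets it loads from outside its own domain. This is a minimal sketch using only the standard library; the class `AssetCollector`, the function `external_assets`, and the example hosts are hypothetical, and only `img`, `script`, and `link` tags are inspected.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class AssetCollector(HTMLParser):
    """Collect asset URLs whose host differs from the page's host."""

    # Tag name -> attribute that carries the asset URL.
    # Note: <link href> also covers non-stylesheet links in this sketch.
    ASSET_ATTRS = {"img": "src", "script": "src", "link": "href"}

    def __init__(self, page_url):
        super().__init__()
        self.page_url = page_url
        self.external = []

    def handle_starttag(self, tag, attrs):
        attr = self.ASSET_ATTRS.get(tag)
        value = dict(attrs).get(attr) if attr else None
        if not value:
            return
        absolute = urljoin(self.page_url, value)
        if urlparse(absolute).netloc != urlparse(self.page_url).netloc:
            self.external.append(absolute)


def external_assets(page_url, html):
    """Return absolute URLs of assets served from a different host."""
    collector = AssetCollector(page_url)
    collector.feed(html)
    return collector.external


if __name__ == "__main__":
    page = (
        '<link rel="stylesheet" href="/css/site.css">'
        '<script src="https://cdn.example.net/lib.js"></script>'
        '<img src="logo.png">'
    )
    print(external_assets("https://example.org/dept/", page))
```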

Page 22: Building Archivable Websites

specify HTTP response headers for caching and content encoding

“time capsule on Alcatraz” by Flickr user inajeep under CC BY 2.0
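
Explicit caching and encoding headers tell a crawler when content last changed and how the body is encoded. A sketch of what such a response might carry, assuming gzip compression and a UTF-8 HTML body; the function `build_response` and the default `max_age` value are illustrative choices, not recommendations from the slides.

```python
import gzip
from email.utils import formatdate


def build_response(body_text, modified_epoch, max_age=3600):
    """Return (headers, body) for a gzip-compressed HTML response.

    `modified_epoch` is the last-modified time in seconds since the epoch.
    """
    body = gzip.compress(body_text.encode("utf-8"))
    headers = {
        "Content-Type": "text/html; charset=utf-8",
        "Content-Encoding": "gzip",
        "Content-Length": str(len(body)),
        # HTTP dates are always expressed in GMT.
        "Last-Modified": formatdate(modified_epoch, usegmt=True),
        "Cache-Control": f"max-age={max_age}",
    }
    return headers, body


if __name__ == "__main__":
    headers, _ = build_response("<p>hello</p>", modified_epoch=0)
    for name, value in headers.items():
        print(f"{name}: {value}")
```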

Page 23: Building Archivable Websites

embed metadata, especially character encoding

“Keep the Packaging!” by Flickr user davidd under CC BY 2.0
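
An archived page may later be replayed without its original HTTP headers, so a character encoding declared inside the HTML itself travels with the content. The following sketch checks whether a page declares one; `CharsetFinder` and `declared_charset` are hypothetical names, and only the two common `meta` forms are recognized.

```python
from html.parser import HTMLParser


class CharsetFinder(HTMLParser):
    """Record the first character encoding declared in a <meta> tag."""

    def __init__(self):
        super().__init__()
        self.charset = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.charset:
            return
        attrs = dict(attrs)
        if attrs.get("charset"):
            # HTML5 form: <meta charset="utf-8">
            self.charset = attrs["charset"].lower()
        elif (attrs.get("http-equiv") or "").lower() == "content-type":
            # Older form: <meta http-equiv="Content-Type" content="...; charset=...">
            content = (attrs.get("content") or "").lower()
            _, _, value = content.partition("charset=")
            self.charset = value.strip() or None


def declared_charset(html):
    """Return the declared charset in lowercase, or None if absent."""
    finder = CharsetFinder()
    finder.feed(html)
    return finder.charset


if __name__ == "__main__":
    print(declared_charset('<head><meta charset="UTF-8"></head>'))
```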

Page 24: Building Archivable Websites

use durable data formats

“Lascaux cave painting” by Flickr user Christine McIntosh under CC BY-ND 2.0

Page 25: Building Archivable Websites

prefer responsive design over user-agent personalization

“«Responsive web design» - 217/366” by Flickr user Roger Ferrer Ibáñez under CC BY-NC-SA 2.0

Page 28: Building Archivable Websites

Heritrix

Wikimedia Commons: “File:Heritrix-screenshot.png”

Page 30: Building Archivable Websites

HTTrack

“HTTrack Website Copier”

Page 31: Building Archivable Websites

Wayback

“Internet Archive Wayback Machine”

Page 32: Building Archivable Websites

Web Archiving Integration Layer

“Web Archiving Integration Layer”

Page 33: Building Archivable Websites

Memento

“Memento”
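
Memento (RFC 7089) lets a client ask for the version of a URL closest to a given date by sending an `Accept-Datetime` header to a TimeGate. The sketch below only constructs such a request and does not send it; the TimeGate address `https://timegate.example.org/timegate/` and the function `timegate_request` are placeholders, not a real service or API.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
from urllib.request import Request


def timegate_request(timegate, original_url, when):
    """Build a request asking `timegate` for `original_url` as of `when`.

    `when` must be a timezone-aware datetime in UTC.
    """
    return Request(
        timegate + original_url,
        headers={"Accept-Datetime": format_datetime(when, usegmt=True)},
    )


if __name__ == "__main__":
    request = timegate_request(
        "https://timegate.example.org/timegate/",  # hypothetical TimeGate
        "https://example.org/",
        datetime(2014, 4, 19, tzinfo=timezone.utc),
    )
    print(request.full_url)
    print(request.header_items())
```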

Page 34: Building Archivable Websites

assess archivability w/ Archive Ready

“Archive Ready”