32
Code4Lib 2010 22 - 25 Feb Asheville, NC Patrick Hochstenbach

20100306 Datasalon 4 : code4lib

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: 20100306 Datasalon 4 : code4lib

Code4Lib 201022 - 25 Feb

Asheville, NCPatrick Hochstenbach

Page 2: 20100306 Datasalon 4 : code4lib

Code4lib

• 150 deelnemers US, Canada, Australië, Japan, Denemarken, België

• Stanford, Michigan, Cornell, Princeton,LOC,NYPL,Statsbiblioteket,...

• Programmeurs, Systems librarians, Digital Architects, Technical Managers

• Do-It-Yourself solutions

Page 3: 20100306 Datasalon 4 : code4lib

Topics

SOLR Cloud

Data Cleaning/Mining

Herding CatsDrupal

Ruby

Python

IRC

Virtual Book Shelfs

Levenshtein Distance

Mobile

Fedora

Blacklight Fedora

Page 4: 20100306 Datasalon 4 : code4lib

SOLR

• Zeer veel SOLR projecten

• Eigen ontwikkeling

• Open Source: VuFind, Blacklight

• Catalogi, Union Catalogs, Full-Text (Hathi), Multi-Media Archives

Page 5: 20100306 Datasalon 4 : code4lib

Project Blacklight

• University of Virginia + Stanford

• Ruby On Rails

• Niet: SOLR indexeren? == zelf uitzoeken

• http://code.google.com/p/solrmarc/

• Wel: Out-of-the-box discovery interface

• http://demo.blacklightopac.org/

• Facets, Exports, Google-like search

Page 6: 20100306 Datasalon 4 : code4lib

Project Blacklight

• http://nwda.projectblacklight.org/?f[format_facet][]=Postcards

• = Blacklight + ContentDM

• http://searchworks.stanford.edu/

• = Union Catalog California

• Jangle Feed voor integratie met bestaande ILS systemen

Page 7: 20100306 Datasalon 4 : code4lib

SOLR Sessions

• Eric Hatcher, Lucid Imagination

• Scalability/Performance

• Memory Issues

• Stemming/parsing

• Ranking

• Query Parsers

Page 8: 20100306 Datasalon 4 : code4lib

SOLR Sessions

• ClusteringComponent (a.k.a ‘on the fly machine intelligent “facets”’)

• ReversedWildcardFilterFactory (prefix queries “*thing”)

• Hunspell support (stemming, spellchecking, normalization support)

• http://www.slideshare.net/erikhatcher/solr-

Page 10: 20100306 Datasalon 4 : code4lib

Metadata Editing

• SOLR

• Apache Cocoon

• Fedora

• XML (METS+ TEI)

• Integration with Flickr, YouTube, iTunes

• Images, Video, Audio, Text

Page 11: 20100306 Datasalon 4 : code4lib

Metadata Editing

• 1.5 TB

• 100.000 items = 15MB/scan

• 35 collecties

• Doen batch scanning, maar willen nu on-demand scanning gaan doen en hebben een schaalbare oplossing nodig

• Trident: metadata tool that scales

Page 12: 20100306 Datasalon 4 : code4lib

Metadata Editing

Page 13: 20100306 Datasalon 4 : code4lib

Metadata Editing

Page 14: 20100306 Datasalon 4 : code4lib

Metadata Editing

Page 15: 20100306 Datasalon 4 : code4lib

Metadata Editing

Page 16: 20100306 Datasalon 4 : code4lib

Metadata Editing

Page 17: 20100306 Datasalon 4 : code4lib

Metadata Editing

Page 18: 20100306 Datasalon 4 : code4lib

Professionalisering IT

• Vampires vs. Werewolves

• Vampires = Developers

• Werewolves = Sysadmins

• Innovation is about risk

• You don’t take risks with people you don’t trust

Page 19: 20100306 Datasalon 4 : code4lib
Page 20: 20100306 Datasalon 4 : code4lib

Professionalisering IT

• Testing

• Nagios monitoring

• Hudson continuous code integration

• Documentation: Wiki’s

• Puppet configuration management

Page 21: 20100306 Datasalon 4 : code4lib
Page 22: 20100306 Datasalon 4 : code4lib
Page 23: 20100306 Datasalon 4 : code4lib
Page 24: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

• Michael Doran, UTA

• 53 % Amerikaanse studenten heeft SmartPhone

• Mobiel Internet Explodeert

• Wat nu?

Page 25: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

• Kopen?

• Boopsie

• Blackboard Mobile

• Design?

• Native Apps?

• Web Apps?

Page 26: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

• iPhone, Android, Nokia, Blackberry, ...

• Toekomst is mobile web design

• Minimalist design: do one thing and do it well

Page 27: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

• Toolkits:

• iUI: iPhone User Interface Framework

• iWebKit

• jQTouch

Page 28: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

Page 29: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

• Testen, testen, testen

• Emulators:

• webOs: http://developer.palm.com

• iPhone: http://developer.apple.com/iphone

• Android: http://developer.android.com/

• Web-based simulators (worthless..maybe except the Opera Mini

Page 30: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

viewport

Page 31: 20100306 Datasalon 4 : code4lib

Mobile Web Apps

Larger buttons for finger tapping

Page 32: 20100306 Datasalon 4 : code4lib

Mobile Web AppsShelfLister version 2.0

6