roderic-page
BHL, BioStor, and beyond
#BHLat10
@rdmpage
http://iphylo.blogspot.com
#iamataxonomist
Pinnotheres atrinicola Page, 1983
http://www.facebook.com/photo.php?pid=13530101&fbid=10150231079625521&op=1&o=global&view=global&subj=1112517192&id=6810205203
One species of pea crab had a parasite. What is it?
Sur un type nouveau d'Epicarides Rhopalione uromyzon n. g. n. sp., parasite sous-abdominal d'un Pinnothere
Rhopalione in BHL
Why BHL is cool #1
Accessibility
First impressions: meh
OMG it's full of plants
It's all old stuff
Where the $#@! are the articles?
More hack, less yack
"[to] be able to move some subset of the world from the leverage point of the command line."
Steven E. Jones, The Emergence of the Digital Humanities
Why BHL is cool #2
It is hackable
No articles? No problem!
Data is available for download
Also an API (and OAI-PMH, yuck!)
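As a rough sketch of what "hackable" can look like in practice, the snippet below builds an OAI-PMH harvesting request. The verb `ListRecords` and the metadata prefix `oai_dc` come from the OAI-PMH standard; the base URL is an assumption made for illustration and should be checked against BHL's current documentation.

```python
from urllib.parse import urlencode

# Assumed endpoint, for illustration only; check BHL's documentation.
BHL_OAI_BASE = "https://www.biodiversitylibrary.org/oai"


def oai_list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL (no network access here)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        # Optional OAI-PMH "set" argument for selective harvesting.
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


url = oai_list_records_url(BHL_OAI_BASE)
print(url)
```

The function only builds the URL, so it can be tried without hitting the server; fetching and parsing the XML response is left out.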
So, let's go find the articles
Find articles - simples
BHL: Title, Volume, Page
Article: Journal, Volume, Start page, End page
Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library doi:10.1186/1471-2105-12-187
Mapping between BHL and articles
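A minimal sketch of the matching idea: given an article citation (journal, volume, start page, end page) and the page records of scanned volumes, pick the scanned pages whose printed page numbers fall inside the article's range. The record layout and field names (`page_id`, `title`, `volume`, `page_number`) are hypothetical, not BHL's actual schema, and the sample data is invented. Real matching has to cope with OCR errors, missing page numbers, and variant journal titles, none of which this handles.

```python
from dataclasses import dataclass


@dataclass
class ScannedPage:
    page_id: int      # hypothetical identifier for a scanned page
    title: str        # journal title recorded for the scanned volume
    volume: str
    page_number: int  # printed page number on the scan


def find_article_pages(pages, journal, volume, start_page, end_page):
    """Return scanned pages matching the citation, ordered by page number."""
    hits = [
        p for p in pages
        if p.title.casefold() == journal.casefold()
        and p.volume == volume
        and start_page <= p.page_number <= end_page
    ]
    return sorted(hits, key=lambda p: p.page_number)


# Invented sample data for illustration.
pages = [
    ScannedPage(101, "Journal of Examples", "12", 44),
    ScannedPage(102, "Journal of Examples", "12", 45),
    ScannedPage(103, "Journal of Examples", "12", 46),
    ScannedPage(201, "Journal of Examples", "13", 45),
]
article = find_article_pages(pages, "journal of examples", "12", 45, 46)
print([p.page_id for p in article])
```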
http://biostor.org/reference/102054
BioStor and Pinterest
BioStor and JournalMap
BioStor and BHL
articles
First impressions: meh
OMG it's full of plants
It's all old stuff
Where the $#@! are the articles?
Not so cool
Scanning currently dominated by USDA
BHL-Europe: unhackable zombie
Where next?
Findability: DOIs for articles
10.5962/bhl.part.14773
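To illustrate why DOIs help findability, here is a small sketch that turns a DOI string into a resolvable link through the doi.org resolver. The function name `doi_to_url` and the loose validation pattern are assumptions made for this example, not part of BHL or BioStor.

```python
import re
from urllib.parse import quote

# Loose sanity check only: "10.", a registrant code, a slash, then a suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def doi_to_url(doi):
    """Turn a DOI (with or without a 'doi:' prefix) into a doi.org URL."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:].strip()
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"does not look like a DOI: {doi!r}")
    # Percent-encode anything unusual in the DOI, keeping the slash.
    return "https://doi.org/" + quote(doi, safe="/")


print(doi_to_url("10.5962/bhl.part.14773"))
print(doi_to_url("doi:10.1186/1471-2105-12-187"))
```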
Mickey Mouse is evil
http://artlawjournal.com/mickey-mouse-keeps-changing-copyright-law/
100,000 articles from http://biostor.org (BHL)
1923 to today
http://biostor.org
Synthetic documents
S. Michael, Machines as readers: A solution to the copyright problem
"we proposed to scan works digitally to extract their intellectual content, and then generate by machine synthetic works that capture this content and distribute them free of copyright"
Cited, linkable specimens
NMNH Vertebrate Zoology Herpetology Collections: 11194
CAS Herpetology Collection Catalog: 9619
MCZ Herpetology Collection: 6720
Herpetology Collection (University of Kansas Biodiversity Research Center): 5818
http://iphylo.blogspot.co.uk/2012/02/gbif-specimens-in-biostor-who-are-top.html
The case for a PubMed Central for Biodiversity
Isn't that, um, PubMed Central?...
Europe PMC
PubMed Central for biodiversity
Taxonomic names
Geographic localities
Specimen codes
Handle XML, PDF, OCR text
Store facts as well as documents
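As one hedged illustration of "facts as well as documents", the sketch below pulls specimen codes out of OCR text with a regular expression. The acronym list, the pattern, and the sample sentence (including its specimen numbers) are all invented for this example; real specimen codes vary far more than this pattern allows.

```python
import re

# Illustrative acronym list; a real system would need a much longer one.
ACRONYMS = ("USNM", "CAS", "MCZ", "KU")

# An acronym, an optional space or hyphen, then at least three digits.
SPECIMEN_CODE = re.compile(r"\b(" + "|".join(ACRONYMS) + r")[ -]?(\d{3,})\b")


def find_specimen_codes(text):
    """Return specimen codes found in text, normalised to 'ACRONYM NUMBER'."""
    return [f"{m.group(1)} {m.group(2)}" for m in SPECIMEN_CODE.finditer(text)]


# Invented sample text standing in for a line of OCR output.
ocr_text = "Holotype MCZ 12345, paratypes USNM-67890 and KU 4321; see p. 12."
codes = find_specimen_codes(ocr_text)
print(codes)
```

Each extracted code could then be stored alongside the page it came from, so that specimens become cited, linkable things.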
"Google figured out how to manage abundance while every other media company in the world was trying to manufacture scarcity, and for that we should be grateful."
Siva Vaidhyanathan, The Googlization of Everything (And Why We Should Worry)