Upload
trish-rose-sandler
View
628
Download
4
Embed Size (px)
DESCRIPTION
The data model and user interface for the Biodiversity Heritage Library (BHL) portal at http://www.biodiversitylibrary.org/ was originally designed to accommodate books and journals found in botanical garden libraries and natural history museums. As the size and reputation of the BHL grew, there were many publishers and individuals who wanted to contribute to the BHL but their content consisted of publication types at more granular levels, such as articles, book chapters, and dissertations. In order to ingest and serve these materials, in early 2011, BHL launched a separate portal called Citebank hosted at citebank.org. Currently, Citebank contains over 180,000 citations linked to content files, either hosted at citebank.org or hosted externally. While feedback on Citebank has been positive, users indicated a desire to combine both the services of the BHL portal and the services of the Citebank portal into a single interface in order to enable a unified search for all biodiversity literature. To respond to these needs, the BHL has begun expansion of its data model in the BHL portal to accommodate articles, book chapters, treatments and other segment-like material so that they can be searched alongside its traditional book and journal content. Parallel to this activity the NSF-funded Global Names Architecture (GNA) Project has enlisted Citebank to fulfill the role of a global biodiversity repository for bibliographic citations. In support of this, Citebank will provide a key functional component to the GNA - that of reconciliation services for citations. Once reconciled, citations can be linked either to scanned page images in the BHL, or to PDFs uploaded by users. If neither exists, citations can point to other digital representations online. Experience with Citebank has resulted in many lessons learned about working with diverse publication types; data formats; and contributors with varying levels of technical competencies. Those lessons were incorporated into a functional requirements document that is being used to inform development of the BHL data model. This talk will outline the functional requirements needed for a global citation repository for biodiversity and how those requirements will better serve the needs of the biodiversity community.
Citation preview
The Biodiversity Heritage Library and bibliographic citations: towards new user services in support of a Global Names
Architecture
Trish Rose-Sandler, William UlateMissouri Botanical Garden
2012 Annual TDWG MeetingBeijing, ChinaOct.21,2012
A brief history…
The Biodiversity Heritage Library
www.biodiversitylibrary.org
The Biodiversity Heritage Library
Genesis: “BHL Article Repository”
• Idea first introduced at TDWG 2008, Fremantle (by BHL, many have discussed for years)
• YouTube for biodiversity articles• Needed (need) a way to access articles in BHL– “BHL has no articles.”– BHL has hundreds of thousands of articles but you
can’t search for them via author, article title search– Can find via “article coordinates” using BHL’s UI &
OpenURL resolver: Journal / Volume / Start Page / Year
CiteBank
• Objectives– Create a repository for community-vetted
taxonomic bibliographies.– Ability to ingest, display, download, and index
articles so that the BHL can operate as an article repository.
– Provide links to content published online through other repositories.
• Launched on December 6th 2010
Citations today: http://citebank.org
Citations Providers
SpecimenDatabases
CommercialAggregators
Software ToolsOpen Access
Digital Libraries
Indices
Nomenclators
SpecimenDatabases
CommercialAggregators
Software ToolsOpen Access
Digital Libraries
Indices
Nomenclators
Open AccessPublishers
International Collaborative Projects
Lessons Learned
• Biblio/Drupal data model insufficient for mass of data envisioned for all biodiversity, too flat and difficult to expand in collaboration with Biblio development community
• Data providers want their content findable and managed in the Biodiversity Heritage Library, not a system alongside BHL
• Maintaining two platforms for biodiversity literature threatens sustainability of the literature resources over the longer term
Citations, Articles and Chapters in BHL
Global Names Architecture
Support citation reconciliation
.
.
.
.
.
.
.L. Sp. Pl. 2: 971. 1753
Linneaus, C. Species Plantarum, vol. 2 p. 971. 1753
Linné, Carl von. Sp. Pl. Vol. 2 Page 971. 1753
Caroli Linnaei, Species Plantarum exhibentes plantas rite cognitas, ad genera relatas, cum Differentis Specificis, Nominibus Trivialibus, Synonymis Selectis, Locis Natalibus, secundum SYSTEMA SEXUALE digestas.. 2:971. 1753
Zea mays
Where are we?
• Articles– Extend BHL data model to store article metadata– Build process to harvest data from BioStor
• Create user interfaces for adding article metadata and associated files– Define functional requirements as improvements to Drupal-based
Citebank– Define process flow for adding article metadata and associated
files– Implement UI changes
• Change BHL UI to accommodate article search• Change BHL UI to accommodate article display (TOC)
What is needed for a citation repository?
Admin. Interface– IMPORT AND MAPPING TOOL• Preview/Accept/Reject/Undo/Report on Import• No standard schema, MODS or Bibtex• Drag & drop GUI or mapped source and target field config.
– USER MANAGEMENT• Self-Registration• Admin. Approval & Deletion• User Roles Assignment
– GLOBAL UPDATES
What is needed for a citation repository?
General User Interface– IMPORT• Upload/Preview/Accept/Reject/Undo/Report on Import
– CREATE CITATION• By filling a Form, via BibTex
– BROWSE• Faceted: title,author,subject, year, contributor, my citations
What is needed for a citation repository?
• CITATION TYPES– Journal Article, Book Chapter, Conference Proceedings,
Conference Paper, Thesis, Government Report, Note, etc.
• OAI HARVESTING– Harvest and serve data through OAI-PMH
• SPECIFICATIONS FOR DATA PROVIDERS PAGE
• CONTRIBUTORS PAGE– Recognize ALL contributions
• REPORTING– Statistics Page by Citation and Publication type– Recent/Latest Uploads
Where are we going?
• Integrate BHL’s Services with ZooBank
• Authoritative list of titles in common use for nomenclatural acts (“TL3”)
• Harvest relevant content from Mendeley
• Integrate services and interfaces with the GNUB data model
• Interoperate with citation parsing tools & services
Thank you
[email protected] Data AnalystArt of Life Project CoordinatorMissouri Botanical Garden
2012 Annual TDWG MeetingBeijing, ChinaOctober 21st, 2012
[email protected] BHL Project ManagerBHL Technical DirectorMissouri Botanical Garden