12
Lowering barriers to publishing biological data on the web Brad Chapman Department of Molecular Biology Massachusetts General Hospital Boston, MA USA [email protected] http://friendfeed.com/chapmanb 27 June 2009

Lowering barriers to publishing biological data on the web

Embed Size (px)

DESCRIPTION

Short 10 minute talk encouraging bioinformatics programmers to organize and reuse code targeted at making data easily available on the web. Current open source technologies are combined into a higher level framework. An example implementation using Google App Engine and existing bioinformatics libraries is presented.

Citation preview

Page 1: Lowering barriers to publishing biological data on the web

Lowering barriers topublishing biological data on

the web

Brad Chapman

Department of Molecular BiologyMassachusetts General Hospital

Boston, MA [email protected]

http://friendfeed.com/chapmanb

27 June 2009

Page 2: Lowering barriers to publishing biological data on the web

Motivation

Page 3: Lowering barriers to publishing biological data on the web

Motivation

I Web accessible

I Interoperable in standard formats

I Displays for browsing

I Analyses

I Scale

Page 4: Lowering barriers to publishing biological data on the web

Current state: Reusable libraries

I Parse file formats

I Run programs

I Build analysis pipelines

I Communities

Python examples

I Biopython

I bx-python

I pygr

I PyCogent

Page 5: Lowering barriers to publishing biological data on the web

Current state: Database schemas

I Represent biological data

I Expand analyses beyond flat files

I Interoperate with standards

BioSQL Chado

Page 6: Lowering barriers to publishing biological data on the web

Current state: Web applications

Page 7: Lowering barriers to publishing biological data on the web

Faster and Bigger

Page 8: Lowering barriers to publishing biological data on the web

Proposal

I ProvideI Reusable presentation componentsI Quickly deployable frameworks

I IntegrateI Bioinformatics librariesI Database schemasI Web development frameworks

Page 9: Lowering barriers to publishing biological data on the web

Proposal

Page 10: Lowering barriers to publishing biological data on the web

http://biosqlweb.appspot.com/

Page 11: Lowering barriers to publishing biological data on the web

Challenges: Design

I ReusableI Components: avoid large frameworkI Multi-language: javascript front end

I AccessibleI Automated data retrieval (REST)I Standard formats (GFF, RDF)

I AvailableI Creative Commons

http://creativecommons.org/about/licenses

I Open Data Commonshttp://www.opendatacommons.org/licenses/

Page 12: Lowering barriers to publishing biological data on the web

Challenges: Community questions

How do we. . .

I provide plug-in components?

I leverage existing code?

I make reuse easier?

I communicate about these issues?