Beyond the Tsunami: Dealing with Life Sciences Data

Beyond the Tsunami: Developing the Infrastructure to Deal with Life Sciences Data

Christopher Southan and Graham Cameron, EMBL-European Bioinformatics Institute (EBI), Cambridge, U.K.

EBI and Sanger at Hinxton: Engaging with the Data Challenges

• Technology for sequence data generation and reduction
• Repositories, storage, archiving
• Databases, entity linking, infrastructure and utility
• Biocuration, annotation, standards, ontologies
• Experimental biological data from research groups
• Data exploitation, mining and visualisation
• Biological hypothesis iteration

EMBL-Bank: 10 years of Rapid Growth

Release 101, Aug 2009: 163 million entries, 283 billion bases

GU057010; SV 1; linear; viral cRNA; STD; VRL; 1701 BP.
08-OCT-2009 (Rel. 102, Created)
08-OCT-2009 (Rel. 102, Last updated, Version 1)
Influenza A virus (A/Chengdu/03/2009(H1N1)) segment 4 hemagglutinin (HA)
Jiang T., Qin C., Li X., Zhao H., Yu M., Deng Y., Yu X., Han J., Qin E., Zhu Q.;
"A community transmission of influenza A (H1N1) virus in a boarding school in China, 22-27 July 2009"

AF177758; SV 1; linear; mRNA; STD; HUM; 1868 BP.
10-SEP-1999 (Rel. 61, Created)
07-OCT-2008 (Rel. 97, Last updated, Version 6)
Homo sapiens ubiquitin specific protease 16 (USP16) mRNA, complete cds.
PUBMED; 10786635.
Smith T.S., Southan C.;
"Sequencing, tissue distribution and chromosomal assignment of a novel ubiquitin-specific protease USP23";
Biochim. Biophys. Acta 1490(1-2):184-88(2000).
Ensembl-Gn; ENSG00000143258; Homo_sapiens.
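Each of the two records above opens with a semicolon-separated summary in the layout of an EMBL flat-file ID line: accession, sequence version, topology, molecule type, data class, taxonomic division and length. The following is a minimal sketch of how such a line could be split into named fields. The names `EmblIdLine` and `parse_id_line` are hypothetical and introduced only for illustration; this is not an EBI tool or library API.

```python
from typing import NamedTuple


class EmblIdLine(NamedTuple):
    """Fields of an EMBL-style ID line (hypothetical container for illustration)."""
    accession: str
    version: int
    topology: str
    molecule_type: str
    data_class: str
    division: str
    length_bp: int


def parse_id_line(line: str) -> EmblIdLine:
    """Split a line such as 'GU057010; SV 1; linear; ...; 1701 BP.' into fields."""
    text = line.strip()
    # Flat files prefix the line with the two-letter code "ID"; the slide omits it.
    if text.startswith("ID "):
        text = text[2:].strip()
    fields = [field.strip() for field in text.split(";")]
    if len(fields) != 7:
        raise ValueError(f"expected 7 fields, found {len(fields)}: {line!r}")
    accession, sv, topology, molecule_type, data_class, division, length = fields
    version = int(sv.split()[1])                     # "SV 1" -> 1
    length_bp = int(length.rstrip(".").split()[0])   # "1701 BP." -> 1701
    return EmblIdLine(accession, version, topology, molecule_type,
                      data_class, division, length_bp)


if __name__ == "__main__":
    for raw in ("GU057010; SV 1; linear; viral cRNA; STD; VRL; 1701 BP.",
                "AF177758; SV 1; linear; mRNA; STD; HUM; 1868 BP."):
        print(parse_id_line(raw))
```

The sketch rejects lines that do not have exactly seven fields, so a record from a different format fails loudly instead of being parsed incorrectly.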

New Technology > New Data Archives

European Nucleotide Archive Snapshot, March 2009

Figure: data volume (TB) for assembled sequence, capillary traces and next-generation reads (values recovered from the chart: 1.9 and 35)

Accelerating Genome Coverage

Jan 2009, 4370 projects

from EBI/Sanger

The 1000 Genomes Project: Cataloging Human Genetic Variation

• Initial human genome: 10 years and 40 gigabases
• Over the next two years the equivalent of two human genomes will be produced every 24 hours
• Completed dataset will be 6 trillion DNA bases, 500 TB
• 60-fold more than 28 years of EMBL-Bank
• Expected to cover 1200 genomes
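The figures in this list can be related to each other with a little arithmetic. The sketch below derives two quantities that the slide does not state: the average storage per base implied by 6 trillion bases in 500 TB, and the comparison baseline implied by the 60-fold figure. It assumes decimal terabytes (1 TB = 10^12 bytes) and treats "60-fold" as an exact multiplier. Both are assumptions, and the function names are hypothetical.

```python
TOTAL_BASES = 6 * 10**12   # "6 trillion DNA bases" (from the slide)
TOTAL_TB = 500             # "500 TB" (from the slide)
BYTES_PER_TB = 10**12      # assumption: decimal terabytes
FOLD_INCREASE = 60         # "60-fold more" (from the slide), treated as exact


def bytes_per_base(total_tb: int = TOTAL_TB, total_bases: int = TOTAL_BASES) -> float:
    """Average storage footprint per sequenced base, in bytes."""
    return total_tb * BYTES_PER_TB / total_bases


def implied_baseline_bases(total_bases: int = TOTAL_BASES,
                           fold: int = FOLD_INCREASE) -> float:
    """Size of the comparison dataset implied by the fold increase, in bases."""
    return total_bases / fold


if __name__ == "__main__":
    print(f"Storage per base: {bytes_per_base():.1f} bytes")
    print(f"Implied baseline: {implied_baseline_bases():.2e} bases")
```

Under these assumptions the dataset averages roughly 83 bytes of storage per base, far more than the single byte needed to store one nucleotide character, and the 60-fold comparison corresponds to a baseline of about 100 billion bases.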

Data Exploitation: EBI Accesses

Figure: hit-rates for web pages and web services over the last 4 years (axis range 200,000 to 1,200,000)

• Genomes
• Nucleotide sequence
• Expression
• Proteomes
• Protein families and domains
• Protein structure
• Protein interactions
• Chemical entities
• Pathways
• Systems
• Literature, ontologies

Towards a sustainable infrastructure for biological information in Europe, to support life science, translation to medicine, the environment, bio-industries and society.

Conclusions

• The International Nucleotide Sequence Database Collaboration will exceed 300 billion bases in 2009.
• Storage at the EBI has doubled annually and is now 5 Petabytes.
• Next-Generation Sequencing is increasing data production ~10-fold.
• By 2010 the full genomic variation in over 1000 people will be revealed and genomes from over 1000 species completed.
• An increase in data mining is needed to facilitate conversion into knowledge.
• The European ELIXIR project and other global initiatives to enhance the sustainable infrastructure for biological databases are essential.
• The impact of data-intensive computing on the Life Sciences will be profound and transforming.
• Exploitation will bring major benefits for biology, medicine, agriculture, biofuels and environmental science.
