Andy Jenkinson, EBI An Introduction to DAS

Page 1: An Introduction to DAS

Andy Jenkinson, EBI

An Introduction to DAS

Page 2: An Introduction to DAS

Summary of Topics

• What is Data Integration?

• Problems in Data Integration

• An architectural overview of DAS

• Brief History of DAS

Page 3: An Introduction to DAS

What is Data Integration?

Page 4: An Introduction to DAS

All These are Data Integration

• Reading some papers so you can write a report

• Exploring some database websites so you can learn about a topic

• Downloading some data from different databases so you can analyse it

• Downloading some data from different databases so you can combine it with your own

Page 6: An Introduction to DAS

Data Integration

• “Automatic” data integration
  • pulling in data from different locations
  • processing it
  • creating a resource derived from the data
  • done via computers, not humans

• e.g. creating/updating a data warehouse

[Diagram: Warehouse, with PDB, Ensembl and UniProt]

Page 7: An Introduction to DAS

Warehouse model

Page 8: An Introduction to DAS

Data Integration: like herding cats

Page 9: An Introduction to DAS

Databases are all different

Page 10: An Introduction to DAS

Databases evolve

Page 11: An Introduction to DAS

Data ages

Page 12: An Introduction to DAS

Databases are big

Page 13: An Introduction to DAS

Distributed Annotation System

• Distributed

• Client-Server architecture

• Federation

• RESTful web services
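As a rough illustration of what "RESTful web services" means for a client, a DAS request is just an HTTP GET on a predictable URL. The sketch below builds such a URL for the `features` command. The server `http://das.example.org` and the source name `my_source` are hypothetical, and the `/das/<source>/features?segment=<id>:<start>,<stop>` layout follows the DAS 1.x convention as commonly documented; check it against the specification before relying on it.

```python
from urllib.parse import quote


def das_features_url(server: str, source: str, segment_id: str,
                     start: int, stop: int) -> str:
    """Build a DAS 'features' request URL for a single segment.

    Assumed layout: <server>/das/<source>/features?segment=<id>:<start>,<stop>
    """
    if start > stop:
        raise ValueError("start must not exceed stop")
    segment = f"{segment_id}:{start},{stop}"
    # Keep ':' and ',' readable; they are part of the segment syntax.
    return (f"{server.rstrip('/')}/das/{quote(source)}"
            f"/features?segment={quote(segment, safe=':,')}")


# Hypothetical server and source names, used only for illustration.
url = das_features_url("http://das.example.org", "my_source", "1", 1000, 2000)
print(url)
```

Because every source answers the same URL pattern, a client only needs the server address and source name to query a provider it has never seen before.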

Page 14: An Introduction to DAS

Warehouse model

Page 15: An Introduction to DAS

DAS model

Page 16: An Introduction to DAS

Architectural Overview

Page 17: An Introduction to DAS

DAS

• Databases are all different
  • DAS is a uniform facet of a database – always the same

• Databases change their structure
  • when the database changes, DAS stays the same

• Databases are updated
  • DAS data comes directly from the provider so is always fresh

• Databases are big
  • DAS uses real-time targeted queries
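The points above can be sketched from the client's side: because each source returns the same format, one parser handles every provider, and the client merges the results itself. The two XML strings below are hand-written, simplified stand-ins for responses to a targeted query on one region. They are not real server output, and the element names (`SEGMENT`, `FEATURE`, `TYPE`, `START`, `END`) are modelled on the DAS 1.x features format as an assumption. The source names `genes` and `variants` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Simplified, hand-written sample responses (assumed DAS 1.x-like layout).
SAMPLE_GENES = """
<DASGFF><GFF><SEGMENT id="1" start="1000" stop="2000">
  <FEATURE id="gene1" label="Gene 1">
    <TYPE id="gene">gene</TYPE><START>1100</START><END>1800</END>
  </FEATURE>
</SEGMENT></GFF></DASGFF>
"""

SAMPLE_VARIANTS = """
<DASGFF><GFF><SEGMENT id="1" start="1000" stop="2000">
  <FEATURE id="snp1" label="SNP 1">
    <TYPE id="snp">snp</TYPE><START>1500</START><END>1500</END>
  </FEATURE>
</SEGMENT></GFF></DASGFF>
"""


def parse_features(xml_text: str, source_name: str) -> list[dict]:
    """Parse one features response into plain dictionaries."""
    root = ET.fromstring(xml_text)
    features = []
    for segment in root.iter("SEGMENT"):
        for feature in segment.iter("FEATURE"):
            features.append({
                "source": source_name,
                "segment": segment.get("id"),
                "id": feature.get("id"),
                "type": feature.find("TYPE").get("id"),
                "start": int(feature.findtext("START")),
                "end": int(feature.findtext("END")),
            })
    return features


# The same parser is applied to both sources; the client does the merging.
merged = sorted(
    parse_features(SAMPLE_GENES, "genes")
    + parse_features(SAMPLE_VARIANTS, "variants"),
    key=lambda f: f["start"],
)
for f in merged:
    print(f["source"], f["id"], f["type"], f["start"], f["end"])
```

In a real client the XML would come from an HTTP request for just the region of interest rather than from a string constant, which is what keeps queries small even when the underlying database is large.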

Page 18: An Introduction to DAS

History

Developed circa 1999 for sharing genome annotations

Expanded 2004 onwards
  • more data types
  • better metadata
  • addition of Registry

DAS/2 project
  • split from DAS, not backwards compatible
  • inspired some DAS developments

Page 19: An Introduction to DAS

To Summarise…

The Distributed Annotation System is…
  • A network of biological data sources
  • An example of federation
  • A collection of REST web services

The DAS Protocol is…
  • An integration platform
  • A client-server protocol
  • An agreed standard

Page 20: An Introduction to DAS

Image Credits

• Flickr/muir.ceardach
• Flickr/Horia Varlan
• Flickr/Alessandro Pinna
• Fotopedia/Jean-Marie Hullot
• listicles.com/?p=3485
• Google Earth/Cnes/Spot Image
• Olivier H. Beauchesne