And now, for something completely different…

And now, for something completely different…. Web 2.0 + Web 3.0 = Web 5.0? The HSFBCY + CIHR + Microsoft Research SADI and CardioSHARE Projects Mark Wilkinson



And now, for something completely different…


Web 2.0 + Web 3.0 = Web 5.0?

The HSFBCY + CIHR + Microsoft Research SADI and CardioSHARE Projects

Mark Wilkinson & Bruce McManus

Heart + Lung Institute

iCAPTURE Centre, St. Paul’s Hospital, UBC


Who am I?


Mark Wilkinson


Carole Goble


“Shopping for …data should be as easy as shopping for shoes!!”

Carole Goble


Carole’s Shoes


My shoes


It is hard to fill Carole’s shoes!


Who am I?


Geneticist & Molecular Biologist


Informatician


Software


Middleware


Robert Stevens


Shouldn’t be seen!


Then why am I presenting to you?


enables useful and exciting behaviours


Cool!


For cardiovascular researchers


Please indulge me as I expose my underwear


…BRIEFly ;-)

(sorry, I couldn’t resist)


The Problem


The Problem


The Holy Grail: (circa 2002)

Align the promoters of all serine threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.

Retrieve and align 2000nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M | A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.


The Problem


The Problem


The Solution??


Not really…


Why not?

Heart

Heart


Occam’s Razor

“Pluralitas non est ponenda sine necessitate.”

“Plurality should not be posited without necessity.”


“Biology is hard!”


So… what can we do?


What we need


Our “information systems” are DATA-centric


Our “information systems” are NOT KNOWLEDGE-centric


(Source: Clarke and Rollo, Education and Training, 2001)


The REAL Solution

Knowledge

Machine-readable Knowledge


How do we make knowledge machine-readable?


Ontology


Ontology (Gr: “things which exist” + -logy)

An explicit formal specification of how to represent the objects, concepts and other entities that are assumed to exist in some area of interest and the relationships that hold among them.


Problem…


Ontology Spectrum

[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints

Originally from the AAAI 1999 Ontologies Panel by Gruninger, Lehmann, McGuinness, Uschold, Welty; updated by McGuinness. Description in: www.ksl.stanford.edu/people/dlm/papers/ontologies-come-of-age-abstract.html

WHY?

Because I say so!

Because it fulfils XXX


My Definition of Ontology (for this talk)

Ontologies explicitly define the things that exist in “the world”

based on what properties each kind of thing must have


Ontology Spectrum

[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints


My goal with this talk


Clay Shirky

“Ontology is Over-rated”

http://www.shirky.com/writings/ontology_overrated.html


COST

[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints


COMPREHENSIBILITY

[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints


Likelihood of being “right”

[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints


Here’s my argument…


Semantic Web?

An information system where machines can receive information from one source, re-interpret it, and correctly use it for a purpose that the source had not anticipated.


Semantic Web?

If we cannot achieve those two things, then IMO we don’t have a “semantic web”, we only have a distributed (??), linked database… and that isn’t particularly exciting…


Where is the semantic web?

[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints

REASON: “Because I say so” is not open to re-interpretation


The games we have been playing for ~2 years


Find. Integrate. Analyse.

Founding partner

SADI


Data + Knowledge for Cardiologists.

Founding partner

CardioSHARE


Brief History of

CardioSHARE


Automating the interaction between biologists and their preferred data and analytical resources

(Running since 2002)


Moby defines an ontology of datatypes, and a WS registry that is aware of that ontology


I’m studying a mutation in my favorite organism

I want to know what mutations in the equivalent gene look like in another organism


Oh no…


The Moby Semantic Web Service Approach


WORKFLOW


Analytical Workflow construction Using Moby

No explicit coordination between providers

Automated discovery of appropriate tools

Automated execution of selected tools

The machine “understands” the data-type you have in-hand, and assists you in choosing the next step in your analysis.
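As a rough illustration of what a datatype-aware registry makes possible, here is a minimal Python sketch. It is not Moby's actual API: the datatype names, the service names, and the `next_steps` helper are all hypothetical, chosen only to show how an is-a hierarchy of datatypes lets a machine suggest the next tool for the data in hand.

```python
# Hypothetical datatype ontology: child -> parent ("is-a"). Names are illustrative only.
DATATYPE_PARENT = {
    "DNASequence": "NucleotideSequence",
    "NucleotideSequence": "GenericSequence",
    "AminoAcidSequence": "GenericSequence",
    "GenericSequence": "Object",
}

# Hypothetical registry: service name -> (input datatype, output datatype).
REGISTRY = {
    "runBlast": ("GenericSequence", "BlastReport"),
    "translate": ("NucleotideSequence", "AminoAcidSequence"),
    "predictDomains": ("AminoAcidSequence", "DomainReport"),
}


def ancestors(datatype):
    """Return the datatype followed by all of its ancestors in the is-a hierarchy."""
    chain = [datatype]
    while datatype in DATATYPE_PARENT:
        datatype = DATATYPE_PARENT[datatype]
        chain.append(datatype)
    return chain


def next_steps(datatype_in_hand):
    """List services whose declared input is the datatype in hand or one of its ancestors."""
    accepted = set(ancestors(datatype_in_hand))
    return sorted(name for name, (inp, _out) in REGISTRY.items() if inp in accepted)


print(next_steps("DNASequence"))
```

No provider needs to know about any other provider here; each one only declares its input and output datatypes, and the registry does the matching.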


This got me thinking…


Datatypes…

Biology?!?!?


CardioSHARE

The Cardiovascular Semantic Health And Research Environment


Theatre Analogy


The Components of a Play

The Script

The Stage

The Casting Director & directors

The Actors


The Components of CardioSHARE

Cardiovascular Ontologies (The Scripts)

SHARE (The Stage & Set Decorator)

SADI (The Casting Director)

Data (The Actors)


DATA

the “actors” in CardioSHARE


ACTORS:

No intelligence

No Use or Function (…without a script)

more successful when clean!!


No Intelligence? No Use?

Data exhibits “Late binding”


Late binding:

“purpose and meaning” of the data is not determined until the moment it is required


Benefit/Consequence of Late binding:

data is amenable to constant re-interpretation


How do we achieve this?

Because we DO NOT CLASSIFY our data… we simply hang properties on it
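A small Python sketch of the late-binding idea, under stated assumptions: the record, the property names, and both interpretations are hypothetical, and the thresholds are arbitrary illustrative numbers rather than clinical guidance. The point is only that the data carries properties and no class, so two parties can interpret the same record differently at the moment they ask.

```python
# A data record: just an identifier with properties hung on it. No class is asserted.
record = {"id": "patient:001", "systolic_mmHg": 152, "diastolic_mmHg": 95}

# Two hypothetical, independently published interpretations of the same properties.
# The thresholds are arbitrary illustrative numbers, not clinical guidance.
INTERPRETATIONS = {
    "HighBloodPressure_A": lambda r: r["systolic_mmHg"] >= 140 or r["diastolic_mmHg"] >= 90,
    "HighBloodPressure_B": lambda r: r["systolic_mmHg"] >= 160 or r["diastolic_mmHg"] >= 100,
}


def classify(r, interpretation):
    """Bind meaning at the moment of asking: apply one interpretation to the raw properties."""
    return INTERPRETATIONS[interpretation](r)


for name in INTERPRETATIONS:
    print(name, classify(record, name))
```

Neither interpretation had to be anticipated when the record was created, and adding a third would not require touching the data.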


[Figure: spectrum, least to most formal] Catalog/ID → Terms/glossary → Thesauri (“narrower term” relation) → Informal is-a → Formal is-a → Formal instance → Frames (properties) → Value restrictions → Selected logical constraints (disjointness, inverse, …) → General logical constraints

These are classification systems

These are axioms that enable interpretation


The Components of CardioSHARE

Cardiovascular Ontologies (The Scripts)

SHARE (The Stage & Set Decorator)

SADI (The Casting Director)

Data (The Actors)


SADI

“Semantic Automated Discovery and Integration”


SADI =…but smarter…


SADI

Instead of specifying the analysis or database

you specify the Biological/clinical RELATIONSHIP of interest


“Take this gene sequence, run the BLAST algorithm to search for genes with similar sequence, and give me the result”


SADI

“get me the homologues for this gene”

SADI knows that the BLAST algorithm is used to search for homologous gene sequences

SADI finds a BLAST server on the Web and executes your search for you


SADI

By analogy:

SADI, the casting director, knows which actors are best able to fit the part, automatically finds them, and casts them in the play


SADI Web Services

WS Discovery is aided by reasoning

WS interfaces are defined in OWL as two classes: Input and Output

Input and Output classes include the property restriction axioms that define the input to/output from the service

“mini ontology”
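To make the Input/Output idea concrete, here is a deliberately simplified Python sketch. Real SADI interfaces are OWL classes and discovery uses an OWL reasoner; this sketch replaces both with plain sets of required and produced property names, and every service and property name in it is hypothetical.

```python
# Each hypothetical service publishes a "mini ontology": the properties an input
# must already have, and the properties the service attaches to its output.
SERVICE_INTERFACES = {
    "bmiService": {"input": {"heightM", "weightKg"}, "output": {"hasBMI"}},
    "sequenceService": {"input": {"geneId"}, "output": {"hasSequence"}},
    "blastService": {"input": {"hasSequence"}, "output": {"similarTo"}},
}


def find_services(instance_properties, wanted_property):
    """Return services that can generate wanted_property for data having these properties.

    A stand-in for reasoning: the data "is an instance of" a service's Input class
    when it carries every property that class requires.
    """
    have = set(instance_properties)
    return sorted(
        name
        for name, iface in SERVICE_INTERFACES.items()
        if wanted_property in iface["output"] and iface["input"] <= have
    )


print(find_services({"heightM", "weightKg"}, "hasBMI"))
```

The subset test is only an approximation of class membership; an OWL reasoner can also use subclass, equivalence, and value restrictions that this sketch ignores.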


SADI Web Services

The WS consumes input data (native RDF) and adds a new property (predicate+object) to it.
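The consume-and-annotate behaviour can be sketched in plain Python, with RDF triples modelled as `(subject, predicate, object)` tuples. This is an assumption-laden toy rather than SADI code: the `http://example.org/` namespace, the property names, and the BMI calculation are all invented for illustration.

```python
from collections import defaultdict

EX = "http://example.org/"  # hypothetical namespace, for illustration only


def bmi_service(input_triples):
    """Toy service: for every subject with height and weight, attach a hasBMI property.

    The output triples are about the SAME subject as the input; the service only
    adds a new predicate + object to it.
    """
    props = defaultdict(dict)
    for subject, predicate, obj in input_triples:
        props[subject][predicate] = obj

    output = []
    for subject, po in props.items():
        if EX + "heightM" in po and EX + "weightKg" in po:
            bmi = po[EX + "weightKg"] / po[EX + "heightM"] ** 2
            output.append((subject, EX + "hasBMI", round(bmi, 1)))
    return output


patient = EX + "patient/001"
incoming = [
    (patient, EX + "heightM", 2.0),
    (patient, EX + "weightKg", 80.0),
]
print(bmi_service(incoming))
```

Because the result is attached to the original subject, merging the output with the input yields a single, richer description of the same thing.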


The Components of CardioSHARE

Cardiovascular Ontologies (The Scripts)

SHARE (The Stage & Set Decorator)

SADI (The Casting Director)

Data (The Actors)


SHARE

SHARE is a completely generic “stage” upon which a “play” is going to be performed

It’s the place where the actors are assembled to do their thing for an audience


SHARE

Simply a place where you can ask ANY question (i.e. the Play)

and see the result


SHARE

At the moment it isn’t particularly impressive…

By analogy, we have a stage, but we haven’t hired any set decorators yet!

You’ll see what I mean in a minute…


DEMO


Recap: what we just saw

A SPARQL database query was entered into the SHARE environment

The query was passed to SADI and was interpreted based on the properties being asked about

SADI searched for, found, and accessed the databases and/or analytical tools required to generate those properties

“The play was performed”
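The steps in this recap can be mimicked with a toy Python resolver. It is a sketch under heavy assumptions: the real system takes SPARQL and discovers Web Services, whereas here the "query" is just an ordered list of property names, the "services" are local functions, and all identifiers and data are made up.

```python
# Hypothetical back-end data that only the individual services know about.
SEQUENCE_OF = {"gene:R": "seq:R", "gene:Q": "seq:Q"}
SIMILAR = {"seq:R": ["seq:Q"]}


def sequence_service(subject):
    """Toy service generating the hasSequence property."""
    return [SEQUENCE_OF[subject]] if subject in SEQUENCE_OF else []


def similarity_service(subject):
    """Toy service generating the similarTo property."""
    return SIMILAR.get(subject, [])


# Registry keyed by the property each service is able to generate.
REGISTRY = {"hasSequence": sequence_service, "similarTo": similarity_service}


def answer(start, predicate_path):
    """Follow a chain of properties from a starting subject, calling services on demand."""
    frontier = [start]
    triples = []
    for predicate in predicate_path:
        service = REGISTRY[predicate]  # discovery: which service generates this property?
        next_frontier = []
        for subject in frontier:
            for obj in service(subject):
                triples.append((subject, predicate, obj))
                next_frontier.append(obj)
        frontier = next_frontier
    return frontier, triples


results, generated = answer("gene:R", ["hasSequence", "similarTo"])
print(results)
print(generated)
```

Nothing is stored ahead of time in a central database; the triples that answer the query exist only because the services were invoked while resolving it.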


Recap: what we just saw

We asked, and answered, a complex “database query”

WITHOUT A DATABASE!!


The Holy Grail:

Align the promoters of all serine threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.

Retrieve and align 2000nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M | A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.


The Components of CardioSHARE

Cardiovascular Ontologies (The Scripts)

SHARE (The Stage & Set Decorator)

SADI (The Casting Director)

Data (The Actors)


Cardiovascular Ontologies:

Allow us to refer to complex clinical/molecular phenomena inside our query

(i.e. Biological Knowledge!)

Allow experts to define these concepts, and then have their expert knowledge re-used by non-experts in their own queries.


HUH?


WORKFLOW

QUERY:

Retrieve images of mutations from genes in organism XXX that share homology to this gene in organism YYY

Concept:

“Homologous Mutant Image”


Phrased in terms of properties:

Find image P where {

Gene Q hasImage image P

Gene Q hasSequence Sequence Q

Gene R hasSequence Sequence R

Sequence Q similarTo Sequence R

Gene R = “my gene of interest” }


…but these are simply axioms…

HomologousMutantImage is {

Gene Q hasImage image P

Gene Q hasSequence Sequence Q

Gene R hasSequence Sequence R

Sequence Q similarTo Sequence R

Gene R = “my gene of interest” }
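One way to see why these property patterns behave like a reusable definition is to write the pattern once as a named function over a set of triples. The Python below is only a sketch: the triples, identifiers, and helper names are hypothetical, and in the talk's setting the definition would be an OWL class rather than a function.

```python
# Hypothetical triples; in practice these would be generated on demand by services.
TRIPLES = {
    ("gene:R", "hasSequence", "seq:R"),
    ("gene:Q", "hasSequence", "seq:Q"),
    ("gene:Z", "hasSequence", "seq:Z"),
    ("seq:Q", "similarTo", "seq:R"),
    ("gene:Q", "hasImage", "img:P"),
    ("gene:Z", "hasImage", "img:Z"),
}


def objects(subject, predicate):
    """All objects o such that (subject, predicate, o) is known."""
    return {o for (s, p, o) in TRIPLES if s == subject and p == predicate}


def subjects(predicate, obj):
    """All subjects s such that (s, predicate, obj) is known."""
    return {s for (s, p, o) in TRIPLES if p == predicate and o == obj}


def homologous_mutant_images(gene_of_interest):
    """The named concept: images of genes whose sequence is similar to that of gene R."""
    images = set()
    for seq_r in objects(gene_of_interest, "hasSequence"):   # Gene R hasSequence Sequence R
        for seq_q in subjects("similarTo", seq_r):           # Sequence Q similarTo Sequence R
            for gene_q in subjects("hasSequence", seq_q):    # Gene Q hasSequence Sequence Q
                images |= objects(gene_q, "hasImage")        # Gene Q hasImage image P
    return images


print(homologous_mutant_images("gene:R"))
```

Once the pattern has a name, a non-expert only needs to ask for `homologous_mutant_images` of a gene and never has to restate the five conditions.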


QUERY:

Retrieve homologous mutant images for gene XXX


CardioSHARE

We construct small, independent OWL classes representing these kinds of concepts

These classes simplify the construction of complex queries by “encapsulating” data discovery, retrieval, and analysis pipelines into simple, easy-to-understand words and phrases.


CardioSHARE

These Classes are shared on the Web so that third parties, potentially with different expertise, can use the expertise of the person who designed the Class.

Easily share your expertise with others!

Easily utilize the expertise of others!

Page 126:

CardioSHARE

We are not building massive ontologies!

Publish small, independent Class definitions

Cheap

Scalable

Flexible

Don’t try to describe all of biology!

Page 127:

Importantly!

CardioSHARE/SADI releases Semantic Systems from the constraints of pure logical reasoning

If the defining property restriction of your class maps to, e.g., a statistical or other analytical service, then classification into that class can be achieved through non-logical means
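
A minimal sketch of that idea, under loud assumptions: the class name `ElevatedPressurePatient`, the property `hasElevatedPressure`, the service function, and the 140 mmHg threshold are all hypothetical stand-ins chosen for illustration. The point is only that the property a class is defined on is filled in by a computation (here a mean) rather than by a logical reasoner.

```python
from statistics import mean


def elevated_pressure_service(readings_mmhg, threshold=140.0):
    """Hypothetical analytical service: derives a boolean property from raw readings.
    The threshold is an illustrative value, not a clinical recommendation."""
    return mean(readings_mmhg) >= threshold


# Hypothetical class definition: membership requires a property with a given value.
CLASS_DEFINITIONS = {
    "ElevatedPressurePatient": ("hasElevatedPressure", True),
}

# Hypothetical mapping from a property name to the service that can generate it.
SERVICES = {
    "hasElevatedPressure": lambda record: elevated_pressure_service(record["systolic"]),
}


def classify(record, class_name):
    """Classify a record into class_name, calling a service if the defining
    property is not yet attached to the record."""
    prop, required_value = CLASS_DEFINITIONS[class_name]
    if prop not in record:
        # The property is generated by analysis, not inferred by logic.
        record[prop] = SERVICES[prop](record)
    return record[prop] == required_value


patient_a = {"id": "A", "systolic": [150, 145, 160]}
patient_b = {"id": "B", "systolic": [118, 122, 120]}
print(classify(patient_a, "ElevatedPressurePatient"))  # True
print(classify(patient_b, "ElevatedPressurePatient"))  # False
```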

Page 128:

Biology is hard! (Too hard to define purely logically)

Page 129:

DEMO #2

Page 130:

The Holy Grail:

Align the promoters of all serine/threonine kinases involved exclusively in the regulation of cell sorting during wound healing in blood vessels.

Retrieve and align 2000nt 5' from every serine/threonine kinase in Mus musculus expressed exclusively in the tunica [I | M | A] whose expression increases 5X or more within 5 hours of wounding but is not activated during the normal development of blood vessels, and is <40% homologous in the active site to kinases known to be involved in cell-cycle regulation in any other species.

Page 131:

Page 132:

CardioSHARE architecture: Increasingly complex ontological layers organize data into richer concepts, even hypotheses

[Figure: layered diagram. Labels: Database 1, Database 2, Analysis Algorithm XX; SADI Web “agents”; Blood Pressure; Hypertension; Ischemia; Hypothesis]

Page 133:

Recap

SADI interprets queries (SPARQL + OWL Class Definitions)

Determine which properties are available, and which need to be discovered/generated

Discovery of services via on-the-fly “classification” of local data with small OWL Classes representing service interfaces
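
The recap above can be sketched as a naive forward-chaining loop: start from the properties already on hand, and keep calling any service whose input is available until the properties the query needs have been generated. Everything here is an assumption made for illustration. The registry entries, the service names, and the matching on plain strings are hypothetical; they stand in for the OWL Classes that describe service interfaces and are not the actual SADI mechanism.

```python
# Hypothetical service registry: each entry says which property a service
# needs as input and which property it attaches as output.
SERVICE_REGISTRY = [
    {"name": "getSequence", "consumes": "geneId", "produces": "hasSequence"},
    {"name": "findSimilar", "consumes": "hasSequence", "produces": "similarTo"},
]


def plan(known_properties, required_properties):
    """Return (service names to call in order, properties that cannot be produced)."""
    known = set(known_properties)
    required = set(required_properties)
    steps = []
    progress = True
    # Keep going while something is still missing and the last pass added something.
    while not required <= known and progress:
        progress = False
        for service in SERVICE_REGISTRY:
            # A service is applicable when its input exists and its output does not yet.
            if service["consumes"] in known and service["produces"] not in known:
                steps.append(service["name"])
                known.add(service["produces"])
                progress = True
    return steps, required - known


steps, missing = plan({"geneId"}, {"hasSequence", "similarTo"})
print(steps)    # ['getSequence', 'findSimilar']
print(missing)  # set()
```

This sketch calls every applicable service rather than only the ones the query needs, which keeps it short; a real planner would prune the services that do not lead to a required property.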

Page 134:

Recap

CardioSHARE encapsulates individual data retrieval and analysis workflows into OWL Classes

An ontology of 1 class

Low-cost, high accuracy

Page 135:

Semantic Web

An information system where machines can receive information from one source, re-interpret it, and correctly use it for a purpose that the source had not anticipated.

Page 136:

What we achieve

Re-interpretation:

The SADI data-store simply collects properties and matches them up with OWL Classes in a SPARQL query and/or from an individual service provider’s WS interface

Page 137:

What we achieve

Novel re-use:

Because we don’t pre-classify, there is no way for the provider to dictate how their data should be used. They simply add their properties into the “cloud” and those properties are used in whatever way is appropriate for me.

Page 138:

What we achieve

Data remains distributed – no warehouse!

Data is not “exposed” as a SPARQL endpoint, so providers keep greater control over computational resources

Yet data appears to be a SPARQL endpoint… no modification of SPARQL or reasoner required.

Page 139:

Recap

CardioSHARE allows researchers to:

- Ask questions in natural, intuitive ways

- Execute complex analyses without a bioinformatician

- Access output from databases and analytical tools in exactly the same way

- Share hypotheses and models in an EXPLICIT manner

- Evaluate others’ hypotheses over your own data

Page 140:

Join us!

SHARE and SADI are Open-Source projects undertaken with the collaboration of, and for the benefit of, the biological and bioinformatics research community.

YOUR participation is welcome as we move from prototype to “the real world”!

Page 141:

Fin