On our way to Information Overload?

Or to prevent it by Appropriate use of Technology?


Page 1

On our way to Information Overload?

Page 2

Or to prevent it by Appropriate use of Technology?

Page 3

C19881 0.99
C92992 0.67
C02002 0.66
C99229 0.44
C00392 0.33
C93939 0.21

consolidated knowledge

Collexis Fingerprints (CFP’s)

Page 4

English

French

Spanish

People: medical researchers around the world

Activities in electronic text, such as projects, publications, Medline abstracts...

Multilingual Thesaurus Indexer: matches keywords, translates them to identical numbers and ranks them by their relevance

English: Disease: #12674; Malaria: #24530; Hospital: #19994
French: Maladie: #12674; Paludisme: #24530; Hôpital: #19994
Spanish: Enfermedad: #12674; Paludismo: #24530; Hospital: #19994

...
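The indexing step can be sketched in a few lines of Python. This is only an illustration, not the Collexis implementation: the term table holds just the entries shown on the slide, the function name `index_text` is made up here, and relevance is assumed to be the concept count relative to the most frequent concept, which the slides do not specify.

```python
import re
from collections import Counter

# Term-to-number table copied from the slide; a real thesaurus is far larger.
THESAURUS = {
    "disease": 12674, "maladie": 12674, "enfermedad": 12674,
    "malaria": 24530, "paludisme": 24530, "paludismo": 24530,
    "hospital": 19994, "hôpital": 19994,
}


def index_text(text):
    """Return [(concept_number, relevance)], most relevant first.

    Assumption: relevance = occurrences of the concept divided by the
    occurrences of the most frequent concept in the text.
    """
    words = re.findall(r"\w+", text.lower())
    counts = Counter(THESAURUS[w] for w in words if w in THESAURUS)
    if not counts:
        return []
    top = max(counts.values())
    ranked = [(concept, n / top) for concept, n in counts.items()]
    # Sort by descending relevance, then by concept number for stable output.
    return sorted(ranked, key=lambda pair: (-pair[1], pair[0]))


english = index_text("Malaria cases: malaria patients in the hospital")
french = index_text("Paludisme: patients atteints de paludisme à l'hôpital")
print(english)
print(french)
```

Because both texts reduce to the same keyword numbers, the English and French results are identical, which is the point of the common language.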

The Common Language: each activity is represented as a set of keyword numbers ranked by their relevance

#4256: 1.0, #3627: 0.8, #19994: 0.5, #28746: 0.3, #32874: 0.1, #32874: 0.1, #32874: 0.1

#14325: 1.0, #3627: 0.8, #19994: 0.5, #28746: 0.3, #32874: 0.1, #32874: 0.1, #32874: 0.1

#85643: 1.0, #3627: 0.8, #19994: 0.5, #28746: 0.3, #32874: 0.1, #32874: 0.1, #32874: 0.1

#17345: 1.0, #3627: 0.8, #19994: 0.5, #28746: 0.3, #32874: 0.1, #1c8456: 0.1, #00356: 0.1

„Collexion“ of activities

You:

#17345: 1.0, #3627: 0.8, #19994: 0.5, #28746: 0.3, #32874: 0.1

Your activity as text

Submitted and indexed to keyword numbers

Find similar activities and the people behind them

Cross-language networking
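Matching "your" fingerprint against the collection could look like the sketch below. The slides do not say which similarity measure is used; cosine similarity over the weighted keyword numbers is an assumption, and the activity names and function names are hypothetical. The weights are taken from the slide, with the repeated trailing entries dropped.

```python
import math


def cosine(fp_a, fp_b):
    """Cosine similarity between two fingerprints (dicts: number -> weight)."""
    dot = sum(w * fp_b.get(concept, 0.0) for concept, w in fp_a.items())
    norm_a = math.sqrt(sum(w * w for w in fp_a.values()))
    norm_b = math.sqrt(sum(w * w for w in fp_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, collection, top_n=3):
    """Rank the activities in the collection by similarity to the query."""
    scored = [(name, cosine(query, fp)) for name, fp in collection.items()]
    return sorted(scored, key=lambda pair: -pair[1])[:top_n]


# Shared tail of keyword numbers, as on the slide.
tail = {3627: 0.8, 19994: 0.5, 28746: 0.3, 32874: 0.1}
collection = {
    "activity_a": {4256: 1.0, **tail},   # hypothetical activity names
    "activity_b": {14325: 1.0, **tail},
    "activity_c": {85643: 1.0, **tail},
    "activity_d": {17345: 1.0, **tail},
}
you = {17345: 1.0, **tail}

for name, score in most_similar(you, collection):
    print(f"{name}: {score:.3f}")
```

The activity that shares the top keyword number ranks first; the others still score well above zero because they share the lower-ranked numbers.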

Page 6

BIOSEMANTICS
• “Cellese”: the language that cells use to communicate internally and externally.
• The Molecular Language and its biological MEANING
• The Group
– Jan Kors PhD
– Erik van Mulligen PhD
– Bob Schijvenaars PhD
– Marc Weeber PhD
– Christiaan v.d. Eyck MSc
– Rob Jelier PhD
– Barend Mons PhD
– Johan van der Lei PhD

Page 7

SERENDIP

Beyond Publication
Semantic meta-analysis of massive data and information sources for discovery

Bsik 2003

A consortium to combine State-of-the-art Information and Knowledge Mining Technologies

To support:

•Thesaurus and ontology enrichment

•Disambiguation of concepts

•Semantic meta-analysis of massive information

To enable:

•Information-based discovery

•Evidence based policy making

Page 8

Thesaurus and Ontology Enrichment

• New concepts
• Synonyms
• Homonyms
• Genes, Proteins
• Pictures

Page 9

[Diagram: SERENDIP workflow and partners]

Steps: Freetext; 1 Fingerprints (known concepts); 2 NLP; 3 Validation; 4 FUA

Intermediate products: Unexplained Text (XML); Potential concepts

Thesauri:
• MeSH
• HUGO
• SwissProt
• SAGE
• Others

Partners: E-BioSci, EMBO, Elsevier, TNO, LUMC, HUGONC, Genebio, AMC, EUR, UVA

Page 10
Page 11
Page 12

Too much to read: major trends foreseen:

• From Reading to Consulting
• From Reading to Meta-analysis
• From Text to Knowledge Representations

Page 13

C19881 0.99
C92992 0.67
C02002 0.66
C99229 0.44
C00392 0.33
C93939 0.21

Semantic types
Co-occurrence data

The first step: to the Conceptual Semantic Network
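How co-occurrence data turns fingerprints into a network can be sketched as follows. This is a minimal illustration under stated assumptions: each document is reduced to the set of concept IDs in its fingerprint, an edge weight is simply the number of documents in which two concepts appear together, and the three toy documents are invented (only the concept IDs come from the slide).

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(documents):
    """Count, for every concept pair, the documents that contain both.

    documents: iterable of iterables of concept IDs.
    Returns a Counter keyed by (concept_a, concept_b) with concept_a < concept_b.
    """
    edges = Counter()
    for concepts in documents:
        # set() so a concept repeated within one document counts once.
        for a, b in combinations(sorted(set(concepts)), 2):
            edges[(a, b)] += 1
    return edges


# Invented toy corpus using concept IDs from the slide.
documents = [
    ["C19881", "C92992"],
    ["C19881", "C92992", "C02002"],
    ["C02002", "C99229"],
]

network = cooccurrence_edges(documents)
for (a, b), weight in sorted(network.items()):
    print(a, b, weight)
```

Concept pairs that never share a document get no edge at all, which is what makes the "missing" links in such a network interesting later on.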

Page 14

Calcium deposition Pleocytosis Basal Ganglia Encephalopathy Cerebrospinal Fluid Tomography, X-Ray Computed Parents Family Aicardi Goutieres syndrome Ferrocalcinotic deposition Spastic quadriplegia Fahr disease Microcephaly AGS1

xG-protein coupled receptors G-substrate Lipoid dermatoarthritis Receptors Complement Factor B RNA, Complementary Xenopus oocyte AGS1

SwissProt: Activator of G-protein signaling 1 (AGS1)

*225750

AICARDI-GOUTIERES SYNDROME 1; (AGS1) : OMIM

Aicardi Goutieres syndrome 1 Heterogeneity Linkage (Genetics) Clinical diagnosis Family 2 AGS1 **Lod Score Genetic Heterogeneity analysis Toxoplasmosis Calcium deposition 3 Encephalopathy 4 Cadmium Genus: Human cytomegalovir... Cerebrospinal fluid abnorm. 5.. Interferon-alpha Chromosomes Viral Child Head Tricuspid Valve Stenosis

Page 15

Fingerprinting

disambiguation

ACS

META-ANALYSIS

Page 16

Applications

• Cross-language, jargon and cross-system matching (implemented): www.sharingpoint.shared-global.org

• Information-based discovery (Research)

• Community building (Experts, Policy Making)

• Trendwatching and Indicators (Policy Making)

Page 17

Seed-Term based Conceptual Semantic Networks

Page 18

??

Clustering of genes on-the-fly

Page 19

Predicting new knowledge ?

Page 20

III= Distribution over distance categories of concept-pairs without co-occurrence in the learning set.

IV = Distance categories of concept pairs related to the probability that there is no explicit relationship or co-occurrence in Medline (zero ratio). A ratio of 0 means that an automatic Medline query for the concept pair joined by “AND” returns 0 hits.
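One reading of the zero ratio is the fraction of concept pairs in a distance category whose "AND" query returns no hits. The sketch below computes it under that assumption. The category names, the concept pairs and the hit table are invented for illustration, and `fake_hit_count` stands in for a real Medline query, which is not shown here.

```python
from collections import defaultdict


def zero_ratio_by_category(pairs, hit_count):
    """Fraction of concept pairs per distance category with zero hits.

    pairs: iterable of (category, concept_a, concept_b).
    hit_count: callable returning the number of hits for "a AND b".
    """
    totals = defaultdict(int)
    zeros = defaultdict(int)
    for category, a, b in pairs:
        totals[category] += 1
        if hit_count(a, b) == 0:
            zeros[category] += 1
    return {category: zeros[category] / totals[category] for category in totals}


# Hypothetical hit table standing in for automatic Medline queries.
FAKE_HITS = {frozenset(("malaria", "hospital")): 120}


def fake_hit_count(a, b):
    return FAKE_HITS.get(frozenset((a, b)), 0)


# Invented pairs in two invented distance categories.
pairs = [
    ("near", "malaria", "hospital"),
    ("near", "malaria", "disease"),
    ("far", "hospital", "disease"),
]

ratios = zero_ratio_by_category(pairs, fake_hit_count)
print(ratios)
```

Passing the query function in as a parameter keeps the counting logic testable without network access.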

Page 21

New Drug discovery ?

Page 22

Semantic Filtering

Page 23
Page 24

Knowledge Maps, Nature Biotechnology Map

Page 25

Knowledge Maps: Medline Bioterrorism Map 1997

Page 26

Knowledge Maps: Medline Bioterrorism Map 2001

Page 27

Private Research

DC

Public

E-BioSci, Pharma etc.

ORIEL, SERENDIP, FP6 etc.

I-Research, Ministries, WHO, FAO etc.

SHARED, BIREME/VHL, EDCTP, Oxford initiative etc.