Upload
others
View
5
Download
0
Embed Size (px)
Citation preview
Xavier Aimé, Orphanet Inserm US14 Jean Charlet, APH-‐HP / Inserm UMR 872 éq20 Francky Trichet, Pascale Kuntz , LINA CNRS Frédéric Fürst , MIS Ferdinand Dhombres, Inserm UMR 872 éq20
Rare Diseases KM: the Contribu3on of Proximity Measurements in OntoOrpha and OMIM
Plan
• Proximity is not similarity • Proxima • OntoOrpha • OMIM corpus • Experiments • Conclusion
2 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Proximity is not similarity
WordSimilarity-‐353 Test Collec5on : Similarity ( cup , coffee ) ~ 7
• Lev Finkelstein, Evgeniy Gabrilovich, Yossi MaOas, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin, "Placing Search in Context:
The Concept Revisited", ACM TransacOons on InformaOon Systems, 20(1):116-‐131, January 2002
3 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
similar close
Proxima • Structure:
• Terms:
• Instances:
4 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Gestalt Laws of Perceptual Organiza5on A B
-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
Proxima • Structure:
• Terms:
• Instances:
5 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
A B
-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐A-‐-‐-‐-‐-‐-‐
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐B-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
A B ?
Ontology OntoOrpha (1/2)
6 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
h\p://www.orpha.net -‐ 5,914 entries, >12,000 daily visitors, 36 partners countries
Ontology OntoOrpha (2/2)
7 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Corpus
8 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
hQp://omim.org/ -‐ 26,151 entry
Ques3on 1
• InformaOon retrieval process: – Seman3c network from OntoOrpha – Weigh3ng rela3ons between 2 concepts
27/08/2012 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 9
?
Results
10 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Ques3on 2
• 2 concepts: – similar / close? – rela3on between these concepts? merge concepts? à EvaluaOon of the conceptualizaOon quality
27/08/2012 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 11
A B ?
A B
Similarity / Proximity
12 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Similarity / Proximity
13 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Merge concepts? Rela3on of specializa3on?
Similarity / Proximity
14 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Rela3on?
SubclassOf, same class ?
Data and Method
• Sample: – 5,000 random pairs of concepts without direct rela3on – 5,000 random pairs of concepts with direct rela3on
• Proximity: – Proxima (expres) + OntoOrpha + OMIM
• Similarity: – Jaccard (intens) + Jaro-‐Winckler (expres) + OntoOrpha
27/08/2012 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 15
Results
16 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Results
17 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Pathology – Pathology : • Emery-‐Dreifuss muscular dystrophy • X-‐linked Emery-‐Dreifuss muscular dystrophy hasSubType OK
Results
18 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Concept – Subconcept : Cardiac septal defect – Atrial septal defect : SubclassOf OK
Results
19 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Rela3on(s) between concepts OK
Conclusion & Future work
ü -‐ a method to evaluate the quality of a conceptualiza3on -‐ a method to ponderate the rela3ons -‐ two use cases with a measure of proximity
û -‐ dependant of a corpus (and corpus quality) -‐ dependant of the similarity measure -‐ can not be used with all pairs of concepts ( n.(n-‐1)/2 possibili3es)
heurisOcs
20 X. Aimé et al. -‐ MIE 2012 -‐ Pisa 27/08/2012
Xavier Aimé, Orphanet Inserm US14 Jean Charlet, APH-‐HP / Inserm UMR 872 éq20 Francky Trichet, Pascale Kuntz , LINA CNRS Frédéric Fürst , MIS Ferdinand Dhombres, Inserm UMR 872 éq20
Rare Diseases KM: the Contribu3on of Proximity Measurements in OntoOrpha and OMIM