33
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart

Applications

  • Upload
    keena

  • View
    14

  • Download
    0

Embed Size (px)

DESCRIPTION

Applications. Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart. Typical Applications of Ontologies. Agent communication Data integration Description of service capabilities for matching and composition purposes Formal verification of process descriptions - PowerPoint PPT Presentation

Citation preview

Page 1: Applications

Applications

Chapter 9, Cimiano Ontology Learning Textbook

Presented by Aaron Stewart

Page 2: Applications

Typical Applications of Ontologies

• Agent communication• Data integration• Description of service capabilities for matching

and composition purposes• Formal verification of process descriptions• Unification of terminology across communities

Page 3: Applications

Text Applications of Ontologies

• Information Retrieval (IR)• Clustering and Classification of Documents• Semantic Annotation• Natural Language Processing

Page 4: Applications

Task-Based Evaluation(Porzel and Malaka 2005)

Page 5: Applications

Task-Based EvaluationRequirements

1. Algorithm output can be quantified2. Task can use background knowledge3. Ontology is an additional parameter4. Output can be traced to the ontology

Page 6: Applications

Contents

1. Text Clustering and Classification2. Information Highlighting for Supporting

Search3. Related Work

Page 7: Applications

Text Clustering and Classification

• What is the difference?

Page 8: Applications

Text Clustering

Page 9: Applications

Text Classification

Arrows Weather Flat shapes 3-D forms Smile!

Page 10: Applications

Dot Kom Project

• One of many competitions

Page 11: Applications

Approaches

• Bag of words• Manually engineered MeSH Tree Structures• Automatically constructed ontologies

Page 12: Applications

What is a “Bag of Words” anyway?

the

quickbrown

fox

Page 13: Applications

Bag of Words

the quick brown fox jumps over the lazy dog

(2)

Page 14: Applications

Building Hierarchies

Page 15: Applications

Note on Ontologies

• Our ontologies (“micro”)– Like a database record schema

• Their ontologies (“macro”)– Like WordNet

Page 16: Applications

Clustering

• Hierarchical Agglomerative Clustering• Bi-Section K-means• “A Comparison of Document Clustering

Techniques”– www.cs.sfu.ca/~wangk/894report/chen1.pdf

Page 17: Applications

Document Representations

• Bag of Words• Certain words + ontology -> extended features• Strategies: add, replace, only

Page 18: Applications

Vectors and Cosine Similarity

Page 19: Applications

Classification Results (Categories)

Page 20: Applications

Classification Results (Documents)

Page 21: Applications

Cluster Metrics

P : computer-generated clustersL : human-created clustersP, L : sets of clusters (partitioning)

Page 22: Applications

Clustering Results

Page 23: Applications

Clustering Results

Page 24: Applications

Information Highlighting for Supporting Search

• Challenge:– 10 minute limit– KMi Planet News web site– Compile a list of important• People• Technologies

Page 25: Applications

Information Highlighting for Supporting Search

• Tools:– Regular browser– Magpie– ESpotter– C-PANKOW

Page 26: Applications

Teams

• A : web browser only• B : web browser with AKT information• C : web browser with AKT++ information

Page 27: Applications

AKT++ Lexicon

Page 28: Applications

Scores

Page 29: Applications

Conclusions (for this section)

• Generated ontologies can be comparable to hand-crafted ontologies

• Humans can trust the computer too much! (Group C drop in score)

Page 30: Applications

Related Work

• Query Expansion• Information Retrieval• Text Clustering and Classification• Natural Language Processing

Page 31: Applications

Natural Language Processing

• Ambiguity resolution– Bank

• Compounds– Headache medicine

• Vague words– With, of, has– Selectional restrictions

• Anaphora

Page 32: Applications

More Applications

• Word sense disambiguation• Classification of unknown words• Named Entity Recognition (NER)• Anaphora Resolution• Question Answering– Who wrote the Hobbit?– Tolkien is the author of the Hobbit.

• Information Extraction– AUTOSLOG, ASIUM

Page 33: Applications

Analysis/Conclusion

• Pro/con: – Focused on two systems– Passing survey of others