12
Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Embed Size (px)

Citation preview

Page 1: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Controlled Vocabulary

Giri PalanisamyEda C. Melendez-Colom

Corinna GriesDuane CostaJohn Porter

Page 2: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Desired Result- Dream Systems

• Ecological Ontologies – as an endpoint – But better goal for now set up community site for

annotating data that could provide information for ontology construction

• Concept mapping

– Can pull out keywords and have users list synonym• Corrina has student working on text analysis, including

proximity between words• Developing “related words”

– Lets you make choices about how words should be used

– Synonyms don’t come together in a text

Page 3: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Desired Result/Dream System

• NBII Thesaurus web service– Already have a head start– May be more productive for LTER to help make them

have a better system – that LTER can use– LTER is already in NBII system and there are

capabilities to link there– EIONET also has thesaurus served through NBII– Will be adding another……

• SEEK has annotation language that they use inside KEPLER…..– Also may be working annotating attributes– Used to enforce consistency

Page 4: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Duane’s Dream

• Rich and complete browse hierarchy for use in Metacat interface– Not 10 levels! Maybe 4 or 5 levels

• Enhance metacat queries to extend keywords with potential related keywords

• Keyword enrichment tool that would enrich keyword section of EML document– Add keywords– Tool suggests additional keywords to add

Page 5: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Thesauri/Ontologies

Datasets

Page 6: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Issues

• Different standards for online ontologies (SKOS vs TAPR etc.)– Can you convert? NBII is looking at….

• Would like to have option of matching thesarus keywords in EML documents

• Thesaurus is not explicitly a hierarchy….

Page 7: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Discussion on Automation

• NBII has worked on some tools…

• Could make enrichment of EML documents by keywords a USER function– Learn from users

• Now have audited metacat searches so have a database with 3 months worth of queries

Page 8: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Demo of Systems

• CAP Semantic Research

• http://149.169.202.24:8080/ecologyes– Development server

• NBII Thesaurus Site

• http://nbii.ornl.gov/thesaurus

Page 9: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Ideas

• Publication on LTER vocabulary and relationship to NBII Thesaurus and other resources– Send list to NBII, they will return report on hits– How can LTER contribute?

• Corrina’s system could be used to help propose new information

• Can add information to NBII Thesaurus….

Page 10: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Challenges

• Evaluating lists/thesauri/ontologies that would benefit LTER

• Linking existing EML documents with context from a list/thesaurus/ontology

• Developing a dataset hierarchy from the interaction of LTER data catalog with list/thesaurus/ontology

Page 11: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Steps• Need training on how to use web services for

NBII access– Duane, Corinna, Inigo

• Send list of terms for checking in NBII– Need to finalize multi-word keyword list– Revise Token/Word list – to update– Human input?

• Further Discussion – Workshop at NCEAS?– Relationships between LTER, SEEK and NBII

• Editing, sharing, CAP work

– How to harvest user input to help “educate” system– CAP Student can participate

Page 12: Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter

Next Steps

• Corinna will check with SEEK on opportunities there

• Giri will check with Mike Frame on NBII buy-in

• VTC Last Week of August 2007??– Develop plans for future activities

• Workshops• Visits• Activities