Upload
vocong
View
215
Download
2
Embed Size (px)
Citation preview
1Dobler, Gogl, Scheiber and Tymaniuk
Ontology Engineering WS 10/11
Dobler, Gogl, Scheiber & Tymaniuk
Linked Open Innsbruck
Educational Institutions
2Dobler, Gogl, Scheiber and Tymaniuk
Outline
# Title
1 Introduction
2 Approach
3 The Ontology
4 Application & Usage Scenarios
3 Data Sources & Integration
4 Outlook & Conclusion
5 References
4Dobler, Gogl, Scheiber and Tymaniuk
Introduction
• Linked Open Innsbruck
– Make Innsbruck and relevant data a part of the Linked Open Data cloud
• Broad offer of educational institutions in Innsbruck
• Educational Institution := organization dedicated to education– ≡ School := An institution for instruction in a skill or business
– ⊃ Academic institution := An educational institutions dedicated to education and
research, which grants academic degrees
– Educational institutions can broken down by the types:
• Fund type : public, private, parochial, etc.
• Academic type: non-academical, primary-, secondary-, tertiary education, etc.
• Modus type: full-time school, night school, vacation school, etc.
• Chosen Ontology editor and format
– Protégé 4.1
– OWL
5Dobler, Gogl, Scheiber and Tymaniuk
The choice of the OE tool (Protégé 4.1)
• Open-source OWL ontology-engineering tool
• Built on top of OWL API
• User-friendly navigation and results visualization (multiple alternative
views of the same ontology, cloning of components)
• Large Protégé community
• However occasional crashes are possible
7Dobler, Gogl, Scheiber and Tymaniuk
Approach – Ontology Development 101 – Step 1
Step 1: Determine the domain and scope of the ontology
• Domain
– “Educational Institutions” with special emphasis on Innsbruck
• For what we are going to use the ontology?
– Wide spanning knowledge base with easy information access
• Who will use and maintain the ontology?
– Use: public
– Maintenance: Government, e.g. Austrian department of education, and institutions
themselves
• For what types of questions the information in the ontology should
provide answers?
– Competency questions!
8Dobler, Gogl, Scheiber and Tymaniuk
Approach – Ontology Development 101 – Step 1
Step 1: Determine the domain and scope of the ontology
• Possible competency questions:
– If I want to go to the University of Vienna, what pre-requirements do I have to
consider? Where do I get the prerequisites for it?
– Is the “Gymnasium der Franziskaner Hall” public funded or is it a parochial school?
– I want to become a lawyer – what do I have to do?
– I want to become a lawyer - where can I do so?
– I want to visit a college with lowest number of drop-outs in Austria / Innsbruck?
– I want to follow my idols’ footsteps, e.g. Heinz Fischer. Which educational
institutions did he visit and which education does he have?
– I want to sent my daughter to Gymansium Kettenbrück, which transportation
possibilities are there and how often has she to change bus?
– I like the book of Gustav Ganz “Art of Teaching”, where does the author work
and what lectures does he give?
– Where is the grammar school with the best online receptions and valuations?
– What is the shortest way of education if I want to become an artist?
9Dobler, Gogl, Scheiber and Tymaniuk
Approach – Ontology Development 101 – Step 2
Step 2: Consider reusing existing ontologies
• Browse around school & educational related ontologies on repositories
like Swoogle, Dr Watson, Protégé page, Unversity of Manchester,…
• Chosen one: Education Ontology by consulting firm Dan McCreary
– Ontology originally designed for the Minnesota Department of Education
• What: Extension of NIEM to include K-12 education metadata
• Why: Dramatically lower data integration costs and make longitudinal data analysis more cost-
effective
• Scope: K-12 education: sum of primary and secondary education
10Dobler, Gogl, Scheiber and Tymaniuk
Thing
ActivityStartDate
ActivityEndDate
PersonBirthDate
PropertyType
Activity Document PersonOrganization
ConceptType
StudentStateAssignedIDEnrollmentStateDate
Student Teacher
Education Extensions
Enrollment
Approach – Ontology Development 101 – Step 2 (2)
Education Ontology by consulting firm Dan McCreary
• Advantages
– Based on standards:
• NIEM: XML-based information exchange framework from the United States across all levels
of government and with private industry
• ISO/IEC 11179: An international standard for representing
metadata for an organization in a metadata registry
– Professional → quality
– Basic vocabulary
– Expandable regarding
• Institutions/schools (especially tertiary, alternative)
• Austrian school types
• Relationships & properties
• Disciplines
High Level Structure of the NIEM and the extension (source: Dan McCreary)
11Dobler, Gogl, Scheiber and Tymaniuk
Approach – Step 3 to 6
• Step 3: Enumerate
important terms in the
ontology
– Wikipedia as source, with its
portals for School, Education &
University
• Step 4: Define classes and
class hierarchy
– Top-down development
• Step 5: Define attributes and
relationships
– Austrian education system
• Step 6: Define the
restrictions of the properties
12Dobler, Gogl, Scheiber and Tymaniuk
Approach – Step 7 & Publishing
• Step 7: Create instances
– Medienkolleg Innsbruck
– Franziskaner Gymnasium Hall
– Gymnasium Kettenbrücke
– UNO Innsbruck Internation Summer School
– Teachers, books, academic degrees, valuations, …
• Publish ontology on the Web
– http://homepage.uibk.ac.at/~csag3861/ontologies/educational_institution.owl
– Submit URL at SWOOGLE
– Submit URL at schemaweb.info
15Dobler, Gogl, Scheiber and Tymaniuk
The Ontology - Cloud View
The cloud view shows the usage depending on classes
18Dobler, Gogl, Scheiber and Tymaniuk
Overview - Data Sources for Application
Educational
Institutions
DBpedia
Geo-
names
FOAF
profiles
The
Open
Library
Gutenberg
ProjectCiteSeer
bmukk.
gv.atschule.at
bsr.tsn.at
tibs.at…
Statistik
Austria
…stats.
oecd.org
spickmich
.atschule
.netschulradar
.at…
•School types
•Disciplines
•Persons
(graduated at, alma
mater, studied
education, has academic
degree,…)
•Persons
•Locations •Educational
Institutions
•Statistics(# graduates,
dropouts, budget,
…)
•School Evaluations
•Teacher Evaluations
•Published Books
(published at,
from teacher)
Transportation
Ontology
Innsbruck
•ways to school
19Dobler, Gogl, Scheiber and Tymaniuk
Application – www.EduWiki.at
Semantic MediaWiki: http://www.EduWiki.at
“The open platform for education in Innsbruck and beyond”
20Dobler, Gogl, Scheiber and Tymaniuk
Application – www.EduWiki.at (2)
• Features:
– Visual display of information
• E.g.: Semantic Maps, Semantic Result Formats, Course of Education Diagram
– Improved data structure
• E.g.: reducing the need for complex classification system by semantic queries
– Improved Search
• E.g.: own queries, visual query builder
– Inter-language consistency
• E.g.: data is independent of language
Semantic Map (source: http://mapping.referata.com/wiki/Semantic_Maps)
– External reuse
• E.g.: serve as data source for other apps
– Integrate and mash-up data
• E.g.: data is not isolated, serve as a central
information hub
– Automatically-generated content and lists
• E.g. stays up-to-date, less errors than human
input
21Dobler, Gogl, Scheiber and Tymaniuk
Usage Scenario – Case Example Lawyer
Requires Academic
Degree
Has Profession
• Competency Question: I want to become a lawyer – what do I have to do?
Can be Learned at
!
22Dobler, Gogl, Scheiber and Tymaniuk
Usage Scenario – Case Example Lawyer (2)
Requires Academic
Degree
Has Academic
Degree
Grants Academic
Degree
23Dobler, Gogl, Scheiber and Tymaniuk
Usage Scenario – Case example (2)
– Competency Question: I want to follow my idols’ footsteps, e.g. Heinz Fischer. Which
educational institutions did he visit and which education does he have?
taughtAt
24Dobler, Gogl, Scheiber and Tymaniuk
Usage scenario – Case example (3)
Competency Question: I like the book of Gustav Ganz “Art of Teaching”, where does the
author work and what lectures does he give?
26Dobler, Gogl, Scheiber and Tymaniuk
Data Sources for Usage Scenario
Educational
Institutions
DBpedia
Geo-
names
FOAF
profiles
The
Open
Library
Gutenberg
ProjectCiteSeer
bmukk.
gv.atschule.at
bsr.tsn.at
tibs.at…
Statistik
Austria
…stats.
oecd.org
spickmich
.atschule
.netschulradar
.at…
•School types
•Disciplines
•Persons
(graduated at, alma
mater, studied
education, has academic
degree,…)
•Persons
•Locations •Educational
Institutions
•Statistics(# graduates,
dropouts, budget,
…)
•School Evaluations
•Teacher Evaluations
•Published Books
(published at,
from teacher)
Transportation
Ontology
Innsbruck
•ways to school
27Dobler, Gogl, Scheiber and Tymaniuk
Data Sources on Educational Institutions in Ibk
• Bundesministerium für Unterricht, Kunst und Kultur – http://www.bmukk.gv.at/
• Tiroler Bildungswegweiser– http://bsr.tsn.at/tbw/
• Österreichisches Schulportal– http://schule.at/
• Tiroler Bildungsservice• www.tibs.at
• Portalseite der Berufsbildenden Schulen– http://www.berufsbildendeschulen.at
• Stadtbranchenbuch Innsbruck: – http://www.stadtbranchenbuch.at/innsbruck/S/472.html
• WIFI courses– http://www.tirol.wifi.at/eshop/kursbuch.aspx
• BFI courses– http://www.bfi-tirol.at/weiterbildung/kurssuche.html
• Österreichische Schulen im WWW– http://www.pinoe-hl.ac.at/schulen/
• Schulendatei– http://www.schulen-online.at
• …
28Dobler, Gogl, Scheiber and Tymaniuk
Data Sources on Educational Institutions in Ibk (2)
• Statistik Austria (Eduaction & Cultur)
– http://www.statistik.at/web_de/statistiken/bildung_und_kultur/
• OECD Stats (Education & Training)
– http://stats.oecd.org
• Spickmich: School- and teacher evaluations
– http://www.spickmich.at
• Schule.net: "Fair teacher evaluation in the Internet"
– http://www.schule.net
• Schulradar: "Find the best school for your kid!"
– http://www.schulradar.at
• …
30Dobler, Gogl, Scheiber and Tymaniuk
Data Sources for Usage Scenario
Educational
Institutions
DBpedia
Geo-
names
FOAF
profiles
The
Open
Library
Gutenberg
ProjectCiteSeer
bmukk.
gv.at
bsr.tsn.at
tibs.at…
Statistik
Austria
…stats.
oecd.org
spickmich
.atschule
.netschulradar
.at…
•School types
•Disciplines
•Persons
(graduated at, alma
mater, studied
education, has academic
degree,…)
•Persons
•Locations •Educational
Institutions
•Statistics(# graduates,
dropouts, budget,
…)
•School Evaluations
•Teacher Evaluations
•Published Books
(published at,
from teacher)
Transportation
Ontology
Innsbruck
•ways to school
schulen-
online.at
31Dobler, Gogl, Scheiber and Tymaniuk
– All schools in Austria (public, private), no tertiary institutions
– “Schulendatei Online”
– http://www.schulen-online.at/sol/pub_ssuche_init.do?method=schuleSuchenOeffentlich
Data Integration – Example - Schulendatei
32Dobler, Gogl, Scheiber and Tymaniuk
Manually:
• fill in the web form -> “copy and paste”
– Set “Schulkennzahl” from 1 – 9 for the county + Wildcards
Example: enter string: 7***** in field “Schulkennzahl” all schools in Tirol (public/private)
will be presented by the search engine.
Semi-automatic:
• Write a script that invokes the URLs and stores the pages (Gnu Wget)• http://www.schulen-
online.at/sol/pub_ssuche_exec.do?method=schuleSuchenOeffentlich&formItem=&sortierung=
KENNZAHL&treffer=50&schulkennzahl=&bezeichnung=&plz=&ort=&strasse=&schul
art=XX&bundesland=XX&bezirk=0&privat=XX -> first 50 hits
• http://www.schulen-
online.at/sol/pub_ssuche_list.do?step=next&method=schuleSuchenOeffentlich -> next
50 hits
How to extract data from Schuldatei
33Dobler, Gogl, Scheiber and Tymaniuk
• Extract information of stored Websites
– Copy and Paste….
or
– With a script
– Each record is stored in a table row block, for instance
<tr>…. VS<br>7122 Gols am See, Obere Hauptstraße 5</td><td
valign="top" style="width:290px"> Tel.: 02173/2275, Fax: 02173/2275<br> <a
href="mailto:[email protected]">[email protected]</a> <br> </td> </tr>
- Save in a file or database, import info into Ontology.
How to extract data from Schuldatei (2)
35Dobler, Gogl, Scheiber and Tymaniuk
• Ontology has at the moment the basic constructs
– But is still expandable
• E.g. import more ontologies
• OWL has to be interlinked for integrating in Linked Data Cloud
– E.g. http://dbpedia.org for schooltypes and discpilines
• Data has to be integrated
– i.e from Schuldatei, Statistik Austria, schulradar,…
Outlook
36Dobler, Gogl, Scheiber and Tymaniuk
• Protégé is a helpful and easy to use tool and well documented
• Scope of ontology tends to grow during its construction
• Decision problem: translate German terms or not?
– e.g. Gymnasium Grammar school
• A lot of different educational data sources → huge ontology potential
(e.g. when it comes to data integration)
• Data extraction method depends strongly on Website
• Most data sources do not have semantic tags implemented
Conclusion
38Dobler, Gogl, Scheiber and Tymaniuk
• T. Berners-Lee, J. Hendler, O. Lassila. The Semantic Web, Scientific
American, 2001.
• D. Fensel. Ontologies: A Silver Bullet for Knowledge Management
and Electronic Commerce, 2nd Edition, Springer 2003.
• T.R. Gruber, Toward principles for the design of ontologies used or
knowledge sharing? , Int. J. Hum.-Comput. Stud., vol. 43, no. 5-
6,1995
• Natalya F. Noy and Deborah L. McGuinness. ``Ontology
Development 101: A Guide to Creating Your First Ontology''.
Stanford Knowledge Systems Laboratory Technical Report KSL-01-
05 and Stanford Medical Informatics Technical Report SMI-2001-
0880, March 2001.
• Gomez-Perez et al., Ontological Engineering, Springer, 2004.
References
39Dobler, Gogl, Scheiber and Tymaniuk
• WWW links:– http://linkeddata.org
– http://protege.stanford.edu
– http://semantic-mediawiki.org/
– http://mapping.referata.com/wiki/Semantic_Maps
– http://www.danmccreary.com/presentations/semweb2006/
– http://protegewiki.stanford.edu/wiki/OntoGraf
– http://protegewiki.stanford.edu/wiki/Cloud_Views
– http://homepage.uibk.ac.at/~csag3861/ontologies/educational_institution.owl
– http://www.schulen-
online.at/sol/pub_ssuche_init.do?method=schuleSuchenOeffentlich
– http://www.gnu.org/software/wget/
References