View
218
Download
3
Tags:
Embed Size (px)
Citation preview
WNT TRAININGWang Notation Tool
Developed by
Piyushee Jha
Acknowledgments:National Science FoundationRensselaer Polytechnic InstituteBrigham Young University
2
What is WNT?
• Purpose: convert tables to layout independent form so several tables can be merged to create ontologies – Semantic Web
• Input: tables from the web (HTML)
• Output: Wang notation, XML schema
3
Wang Notation & XML
• Wang Category Notation(Year, {(1991,), (1992,)})
(Term, {(Winter,), (Spring,), (Fall,)})
(Mark, {(Assignments, {(Ass1,), (Ass2,), (Ass3,)}),
(Examinations, {(Midterm,), (Final,)}), (Grade,)})
• Wang Delta Notation (1st row)({Year.1991, Term.Winter, Mark.Assignments.Ass1}) = 85
({Year.1991, Term.Winter, Mark.Assignments.Ass2}) = 80
({Year.1991, Term.Winter, Mark.Assignments.Ass3}) = 75
({Year.1991, Term.Winter, Mark.Examinations.Midterm}) = 60
({Year.1991, Term.Winter, Mark.Examinations.Final}) = 75
({Year.1991, Term.Winter, Mark.Grade}) = 75
• XML Document (very small portion shown)<CategoryParentNodes> <CategoryParentNode CategoryParentNodeOID="C1"> <CategoryNodes> <CategoryNode CategoryNodeOID="C1.1"></CategoryNode> <CategoryNode CategoryNodeOID="C1.2"></CategoryNode> </CategoryNodes> </CategoryParentNode> <CategoryParentNode CategoryParentNodeOID="C2"> <CategoryNodes> <CategoryNode CategoryNodeOID="C2.1"></CategoryNode> <CategoryNode CategoryNodeOID="C2.2"></CategoryNode> <CategoryNode CategoryNodeOID="C2.3"></CategoryNode> </CategoryNodes> </CategoryParentNode>
…
4
Ontology
• Data model that contains a set of concepts and the relation between those concepts
5
Why WNT?
• Difficult to automatically determine Wang notation and XML schema for several tables
• Tables are different depending on who creates them so the interactive Wang Notation Tool (WNT) was developed
• WNT is semi-automatic, requires user input and an understanding of tables to make corrections
6
Table Concepts – Category Cells• Cells that show what the information in the
table represents
• Categories are Year, Term, and Mark.
• Subcategories are 1991 & 1992 for Year, Winter, Spring, Fall for Term and so on.
• Each category is contained within a RECTANGLE in the table
7
Categories as Trees
• Every category can be thought of as a tree
8
Table Concepts – Delta Cells• Cells that have information (all cells that
are not category cells)
• Marked in green below
• Users do not have to tell WNT which cells are delta cell; users are only responsible for category cells
9
Using WNT• WNT is programmed in Matlab. • First image to pop-up is shown below• User must enter their name in Matlab command window• Original table will be displayed while a table is being
processed
10
Using WNT (2)• Next the original clickable table appears
11
Choosing Categories• User will select categories by clicking on the top
left and bottom right corners of the rectangle containing a category (black cells)
• User can undo mistakes by clicking a cell again but ONLY in reverse order of cells clicked
12
Trees & Indented Notation• Each category is displayed for approval/correction
• Correct categories using Figure 3 to reflect what you think the correct “tree” for the category is
13
Virtual Headers• A category without a distinct header needs a virtual header
• It is a “rootless” tree
2000 2001 2002 2003 2004
Females (number)
Females (number)
Females (number)
Females (number)
Females (number)
Virtual Header
14
Virtual Headers (2)• Both categories below (yellow and green) require
virtual headers
15
Virtual Header (3)• Sometimes virtual headers are added automatically by WNT and
just need to be renamed.
Note that this figure requires further corrections
16
Virtual Headers (4)• Other times, they need to be added manually
using the Error Correction GUI (Figure 2)
17
Verification
• Before moving on to the next table, the user can verify that the table was interpreted correctly with the aid of a final GUI (shown on the next 2 slides)
• After user is satisfied, they can move onto the next table
• If they are not satisfied, they can start over
18
Verification (2)• Any delta cell that is clicked (blue) will “light” up
its corresponding category cells.
19
Verification (3)• Any category cell that is clicked (blue) will “light”
up its corresponding category and delta cells.
20
End of Interaction
No further interaction is required of the user.
Now, I will demo a few tables using WNT.