20
WNT TRAINING Wang Notation Tool Developed by Piyushee Jha nowledgments : ional Science Foundation sselaer Polytechnic Institute gham Young University

WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

  • View
    218

  • Download
    3

Embed Size (px)

Citation preview

Page 1: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

WNT TRAININGWang Notation Tool

Developed by

Piyushee Jha

Acknowledgments:National Science FoundationRensselaer Polytechnic InstituteBrigham Young University

Page 2: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

2

What is WNT?

• Purpose: convert tables to layout independent form so several tables can be merged to create ontologies – Semantic Web

• Input: tables from the web (HTML)

• Output: Wang notation, XML schema

Page 3: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

3

Wang Notation & XML

• Wang Category Notation(Year, {(1991,), (1992,)})

(Term, {(Winter,), (Spring,), (Fall,)})

(Mark, {(Assignments, {(Ass1,), (Ass2,), (Ass3,)}),

(Examinations, {(Midterm,), (Final,)}), (Grade,)})

• Wang Delta Notation (1st row)({Year.1991, Term.Winter, Mark.Assignments.Ass1}) = 85

({Year.1991, Term.Winter, Mark.Assignments.Ass2}) = 80

({Year.1991, Term.Winter, Mark.Assignments.Ass3}) = 75

({Year.1991, Term.Winter, Mark.Examinations.Midterm}) = 60

({Year.1991, Term.Winter, Mark.Examinations.Final}) = 75

({Year.1991, Term.Winter, Mark.Grade}) = 75

• XML Document (very small portion shown)<CategoryParentNodes> <CategoryParentNode CategoryParentNodeOID="C1"> <CategoryNodes> <CategoryNode CategoryNodeOID="C1.1"></CategoryNode> <CategoryNode CategoryNodeOID="C1.2"></CategoryNode> </CategoryNodes> </CategoryParentNode> <CategoryParentNode CategoryParentNodeOID="C2"> <CategoryNodes> <CategoryNode CategoryNodeOID="C2.1"></CategoryNode> <CategoryNode CategoryNodeOID="C2.2"></CategoryNode> <CategoryNode CategoryNodeOID="C2.3"></CategoryNode> </CategoryNodes> </CategoryParentNode>

Page 4: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

4

Ontology

• Data model that contains a set of concepts and the relation between those concepts

Page 5: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

5

Why WNT?

• Difficult to automatically determine Wang notation and XML schema for several tables

• Tables are different depending on who creates them so the interactive Wang Notation Tool (WNT) was developed

• WNT is semi-automatic, requires user input and an understanding of tables to make corrections

Page 6: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

6

Table Concepts – Category Cells• Cells that show what the information in the

table represents

• Categories are Year, Term, and Mark.

• Subcategories are 1991 & 1992 for Year, Winter, Spring, Fall for Term and so on.

• Each category is contained within a RECTANGLE in the table

Page 7: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

7

Categories as Trees

• Every category can be thought of as a tree

Page 8: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

8

Table Concepts – Delta Cells• Cells that have information (all cells that

are not category cells)

• Marked in green below

• Users do not have to tell WNT which cells are delta cell; users are only responsible for category cells

Page 9: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

9

Using WNT• WNT is programmed in Matlab. • First image to pop-up is shown below• User must enter their name in Matlab command window• Original table will be displayed while a table is being

processed

Page 10: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

10

Using WNT (2)• Next the original clickable table appears

Page 11: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

11

Choosing Categories• User will select categories by clicking on the top

left and bottom right corners of the rectangle containing a category (black cells)

• User can undo mistakes by clicking a cell again but ONLY in reverse order of cells clicked

Page 12: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

12

Trees & Indented Notation• Each category is displayed for approval/correction

• Correct categories using Figure 3 to reflect what you think the correct “tree” for the category is

Page 13: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

13

Virtual Headers• A category without a distinct header needs a virtual header

• It is a “rootless” tree

2000 2001 2002 2003 2004

Females (number)

Females (number)

Females (number)

Females (number)

Females (number)

Virtual Header

Page 14: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

14

Virtual Headers (2)• Both categories below (yellow and green) require

virtual headers

Page 15: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

15

Virtual Header (3)• Sometimes virtual headers are added automatically by WNT and

just need to be renamed.

Note that this figure requires further corrections

Page 16: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

16

Virtual Headers (4)• Other times, they need to be added manually

using the Error Correction GUI (Figure 2)

Page 17: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

17

Verification

• Before moving on to the next table, the user can verify that the table was interpreted correctly with the aid of a final GUI (shown on the next 2 slides)

• After user is satisfied, they can move onto the next table

• If they are not satisfied, they can start over

Page 18: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

18

Verification (2)• Any delta cell that is clicked (blue) will “light” up

its corresponding category cells.

Page 19: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

19

Verification (3)• Any category cell that is clicked (blue) will “light”

up its corresponding category and delta cells.

Page 20: WNT TRAINING Wang Notation Tool Developed by Piyushee Jha Acknowledgments: National Science Foundation Rensselaer Polytechnic Institute Brigham Young University

20

End of Interaction

No further interaction is required of the user.

Now, I will demo a few tables using WNT.