Page 1: The Semantic Web Professor James Hendler hendler hendler

The Semantic Web

Professor James Hendler

http://www.cs.umd.edu/~hendler

http://owl.mindswap.org

Page 2: The Semantic Web Professor James Hendler hendler hendler

Jim Hendler, 5/13/03

MIND SWAP

• Maryland Information and Network Dynamics Laboratory, Semantic Web and Agents Project
  • J. Hendler
  • B. Parsia
  • Jennifer Golbeck
  • Aditya Kalyanpur
  • Grecia Lapizco-Encinas
  • Katy Newton
  • Evren Sirin

• Corporate Research Partners:
  • Fujitsu Laboratory of America, College Park
  • Lockheed Martin Advanced Technology Laboratories
  • NTT Corp
  • SAIC Corp.

• http://owl.mindswap.org: Owl-powered Semantic Web page (under development)

• http://www.mindswap.org: Traditional web page

Ronald Alford, Ross Baker, Amy Alford, Matt Westhoff, Michael Grove, Ronald Reck

Page 3: The Semantic Web Professor James Hendler hendler hendler

Building the Semantic Web

Page 4: The Semantic Web Professor James Hendler hendler hendler

The Changing Web

[Figure: timeline marked 1990, 2000 and 2010, with regions labeled "Images and documents" and "Data and programs"]

Page 5: The Semantic Web Professor James Hendler hendler hendler

The Semantic Wave

(Berners-Lee, 03)

UMCP has been a key contributor

Page 6: The Semantic Web Professor James Hendler hendler hendler

XML is NOT semantics

Page 7: The Semantic Web Professor James Hendler hendler hendler

<photo>
  <subject> http://www.w3.org/~timbl </subject>
  <name> Tim Berners-Lee </name>
  …
</photo>

XML is NOT semantics

Page 8: The Semantic Web Professor James Hendler hendler hendler

<photo>
  <subject> http://www.w3.org/~timbl </subject>
  <name> Tim Berners-Lee </name>
  …
</photo>

XML is NOT semantics

XML Schema is DOCUMENT checking
• photo has multiple subject fields
• photo has one physical location
• etc.

Page 9: The Semantic Web Professor James Hendler hendler hendler

<photo>
  <subject> http://www.w3.org/~timbl </subject>
  <name> Tim Berners-Lee </name>
  …
</photo>

XML is NOT semantics

XML Schema is DOCUMENT checking
• photo has multiple subject fields
• photo has one physical location
• etc.

WHICH SAYS NOTHING ABOUT TALKS, SUBJECTS, PEOPLE, EVENTS, etc.
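
The point of these slides can be illustrated with a small sketch. It assumes two hypothetical documents (the second vocabulary, `picture`/`depicts`/`label`, is invented for illustration) that describe the same photo. A structural check sees only tag names and nesting, so it cannot tell that the two documents say the same thing.

```python
import xml.etree.ElementTree as ET

# Two hypothetical documents describing the same photo.
# DOC_B's tag names are invented for this illustration.
DOC_A = (
    "<photo><subject>http://www.w3.org/~timbl</subject>"
    "<name>Tim Berners-Lee</name></photo>"
)
DOC_B = (
    "<picture><depicts>http://www.w3.org/~timbl</depicts>"
    "<label>Tim Berners-Lee</label></picture>"
)


def tag_shape(xml_text):
    """Return element names in document order: what a structural check sees."""
    root = ET.fromstring(xml_text)
    return [element.tag for element in root.iter()]


def text_values(xml_text):
    """Return the non-empty text content of all elements, sorted."""
    root = ET.fromstring(xml_text)
    return sorted(
        element.text.strip()
        for element in root.iter()
        if element.text and element.text.strip()
    )


shape_a = tag_shape(DOC_A)
shape_b = tag_shape(DOC_B)
same_shape = shape_a == shape_b                        # structure differs
same_values = text_values(DOC_A) == text_values(DOC_B)  # content is identical
print(shape_a, shape_b, same_shape, same_values)
```

Nothing in either document tells a program that `subject` and `depicts` mean the same relation, or that the URL denotes a person. That information has to come from somewhere outside the XML structure.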

Page 10: The Semantic Web Professor James Hendler hendler hendler

Event:title

Event:WebPage

< > rdf:type photo:Photograph, Photo:File http://…/images#image1, Photo:topic :event1#event:speaker.

Event1 a Event:event; date “May 7-11”, speaker http://…#timbl.html Title “WWW 2002…”

TimBL rdf:type w3c-ont:person; name “Tim Berners-Lee” …

<daml:ObjectProperty rdf:ID="photograph">
  <rdfs:domain rdf:resource="#Picture"/>
  <rdfs:range rdf:resource="…#person"/>
</daml:ObjectProperty>

<rdf:Description rdf:about="http://www.w3.org/2001/03/earl/0.95#Person">
  <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/>
  <rdfs:subClassOf rdf:resource="http://www.w3.org/2001/03/earl/0.95#Assertor"/>
</rdf:Description>

<s:Class rdf:about="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Conference">
  <s:comment>describes a generic concept about events</s:comment>
  <s:subClassOf rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Event"/>
  <a:disjointFrom rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Workshop"/>
  <a:restrictedBy rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#genid18"/>

The SEMANTICS is in the links (e.g. to ontologies)!
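
A minimal sketch of what "semantics in the links" can mean in practice: statements are stored as (subject, predicate, object) triples, and a program follows `rdfs:subClassOf` links into an ontology to learn more about a resource than its own description states. The `Person` to `Assertor` link mirrors the EARL fragment above. The `ex:` names and the prefixed shorthand are assumptions made for illustration, not the slide's actual data.

```python
# Triples are (subject, predicate, object).
# "ex:" resources are hypothetical; the Person -> Assertor link is from the slide.
TRIPLES = {
    ("ex:timbl", "rdf:type", "earl:Person"),
    ("earl:Person", "rdfs:subClassOf", "earl:Assertor"),
    ("ex:image1", "rdf:type", "photo:Photograph"),
    ("ex:image1", "photo:topic", "ex:timbl"),
}


def superclasses(cls, triples):
    """Follow rdfs:subClassOf links transitively, starting from cls."""
    found, frontier = set(), [cls]
    while frontier:
        current = frontier.pop()
        for subject, predicate, obj in triples:
            if (
                subject == current
                and predicate == "rdfs:subClassOf"
                and obj not in found
            ):
                found.add(obj)
                frontier.append(obj)
    return found


def types_of(resource, triples):
    """Return the stated types of a resource plus those implied by the ontology."""
    direct = {o for s, p, o in triples if s == resource and p == "rdf:type"}
    inferred = set(direct)
    for cls in direct:
        inferred |= superclasses(cls, triples)
    return inferred


timbl_types = types_of("ex:timbl", TRIPLES)
print(sorted(timbl_types))
```

The resource is only declared to be a `Person`; that it is also an `Assertor` comes from the linked ontology, not from the instance data.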

Page 11: The Semantic Web Professor James Hendler hendler hendler

From the small and structured

<daml:Class rdf:about="#Book">
</daml:Class>

<daml:Class rdf:about="#Author">
  <rdfs:subClassOf rdf:resource="#Person"/>
</daml:Class>

<daml:Property rdf:about="#age">
  <rdfs:domain rdf:resource="#Book"/>
  <rdfs:range rdf:resource="#age_value_id"/>
</daml:Property>

<rdfs:Property rdf:about="#award">
  <rdfs:domain rdf:resource="#Book"/>
  <rdfs:range rdf:resource="#award_value_id"/>
</rdfs:Property>

• Database schema for International Digital Children’s Library Metadata repository
  • ~40 classes
  • ~25 properties (corresponding to database keys)
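
Even a small schema like this carries usable semantics. The sketch below shows the standard reading of `rdfs:domain` and `rdfs:range`: using a property implies a type for its subject and for its value. The property declarations follow the fragment above (assuming the `award` domain is meant to be `#Book`); the statements about `#book42` are hypothetical.

```python
# Property declarations taken from the slide's fragment.
SCHEMA = {
    "#age": {"domain": "#Book", "range": "#age_value_id"},
    "#award": {"domain": "#Book", "range": "#award_value_id"},
}

# Hypothetical instance statements, for illustration only.
STATEMENTS = [
    ("#book42", "#age", "#ages_3_to_5"),
    ("#book42", "#award", "#some_award"),
    ("#book42", "#unknownProperty", "#x"),
]


def infer_types(statements, schema):
    """Apply domain/range declarations: each use of a property types both ends."""
    inferred = {}
    for subject, prop, obj in statements:
        declaration = schema.get(prop)
        if declaration is None:
            continue  # no declaration, so nothing can be inferred
        inferred.setdefault(subject, set()).add(declaration["domain"])
        inferred.setdefault(obj, set()).add(declaration["range"])
    return inferred


book_types = infer_types(STATEMENTS, SCHEMA)
print(book_types)
```

Note that domain and range act as inference rules here, not as validation constraints: nothing is rejected, but `#book42` is concluded to be a `#Book` without anyone having said so.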

Page 12: The Semantic Web Professor James Hendler hendler hendler

To the large and complex

<owl:Class rdf:ID="CD80_Antigen">
  <rdfs:subClassOf rdf:resource="#Cell_Adhesion_Molecule"/>
  <rdfs:label>CD80 Antigen</rdfs:label>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>
      <owl:someValuesFrom rdf:resource="#T_Cell"/>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>
      <owl:someValuesFrom rdf:resource="#B_Cell"/>
    </owl:Restriction>
  </rdfs:subClassOf>
  <rdfs:subClassOf>
    <owl:Restriction>
      <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>
      <owl:someValuesFrom rdf:resource="#Macrophage"/>
    </owl:Restriction>
  </rdfs:subClassOf>
</owl:Class>

<CD80_Antigen rdf:ID="CD80_AntigenData">
  <Swiss_Prot>P33681</Swiss_Prot>
  <DEFINITION>
    <![CDATA[<def-source>NCI</def-source> <def-definition>A member of the immunoglobulin supergene family. Co-regulator of T cell activation with CD86. Plays a critical role in autoimmune, humoral, and transplant responses. Expressed on activated T and B cells and on macrophages. (from PROW)</def-definition>]]>
  </DEFINITION>

• US National Cancer Institute Ontology
  • ~17,000 classes
  • >1 million synonyms and definitions
  • RDF graph >500,000 triples
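
As a rough illustration of what the `owl:someValuesFrom` restrictions above assert, the sketch below lists, for one individual, which restrictions have no explicitly stated witness. The restriction list follows the class definition; `#sample_protein`, `#tissue1` and `#tissue2` are hypothetical. This is a simple lookup over stated facts, not an OWL reasoner.

```python
# (property, filler class) pairs from the CD80_Antigen definition.
CD80_RESTRICTIONS = [
    ("#Protein_Expressed_In_Tissue", "#T_Cell"),
    ("#Protein_Expressed_In_Tissue", "#B_Cell"),
    ("#Protein_Expressed_In_Tissue", "#Macrophage"),
]

# Hypothetical instance data, for illustration only.
PROPERTY_VALUES = {
    ("#sample_protein", "#Protein_Expressed_In_Tissue"): ["#tissue1", "#tissue2"],
}
TYPES = {"#tissue1": {"#T_Cell"}, "#tissue2": {"#B_Cell"}}


def unwitnessed(individual, restrictions, property_values, types):
    """Return restrictions for which no stated value belongs to the filler class."""
    missing = []
    for prop, filler_class in restrictions:
        values = property_values.get((individual, prop), [])
        has_witness = any(
            filler_class in types.get(value, set()) for value in values
        )
        if not has_witness:
            missing.append((prop, filler_class))
    return missing


missing = unwitnessed("#sample_protein", CD80_RESTRICTIONS, PROPERTY_VALUES, TYPES)
print(missing)
```

Under OWL's open-world reading, an unwitnessed restriction is not an error. If `#sample_protein` is a `CD80_Antigen`, a reasoner concludes that some macrophage value exists even though none is named in the data.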

Page 13: The Semantic Web Professor James Hendler hendler hendler

And many more

• DAML ontologies for:
  • Army Thesaurus: 18,000 concepts
  • CIA World Fact Book in DAML: 1400 classes, 2500 instances
  • UJTLs: 326 classes, all Joint and Naval Task lists as instances
  • Military Geofile: ~50 installation types; 56,700 Geoloc instances
  • NIMA Geonames: 36,750 Afghanistan targets
  • And many more (220 in library)
    • WordNet, NCI Cancer Thesaurus, Heads of State, Weather Conditions, CYC, UNSPSC, Genealogy, baseball, Chemistry, …

Page 14: The Semantic Web Professor James Hendler hendler hendler

The Growing Semantic Web

• Jan 1, 03: Crawler finds 5.8M+ DAML statements on 20,000+ web pages
  • Doesn’t include many instance KBs tied to ontologies
  • Doesn’t include many very large RDFS-based KBs that include some OWL

• Ontology library at http://www.daml.org has 209 ontologies (March 2003)
  • Open for anyone to create
  • Open for anyone to use

• OWL is being supported by large corporation labs
  • Web tool developers: IBM, HP, Sun, Intel, Fujitsu
  • Content providers: Daimler-Chrysler, Nokia, Motorola, EDS, Agfa

• OWL is starting to be used by thesaurus developers
  • Cf. National Cancer Institute metathesaurus released in OWL Lite
  • CALL Thesaurus in DAML
  • NASA thesaurus available in DAML

• Use of semantic markup for Web Services beginning to move beyond basic research
  • DAML-S cited as required reading for Web Services Choreography WG
  • DAML-S being demoed to WSDL WG tomorrow!!

Page 15: The Semantic Web Professor James Hendler hendler hendler

BUNCH OF SLIDES DELETED

Page 16: The Semantic Web Professor James Hendler hendler hendler

The Semantic Web “Revolution”

Scientific American Article notes

[Joint starting place:]

I. Semantic Web Vision (TBL)

II. What are the enablers? (in sequence)
  Screen Scraping (Ora and TBL)
  Data on Web (Ora and TBL)
  Zip code link between Data Bases (TBL)
  Ontology Independence (JAH)
  Effect of Scale (TBL)

“Then, a miracle occurs”

III. What can you do with it? (not necessarily in sequence)
  Self-describing documents (JAH)
  Logic to encode… (TBL)
  Services and Advertising (Ora)
  Devices (Ora)
  Digital Signatures, Authentication, and Trust (TBL)

Page 17: The Semantic Web Professor James Hendler hendler hendler

Tools are critical

Tools for marking up images, figures and other non-textual data… must be built in a way that they tie into the "business processes" of the working scientist -- that is, rather than learning a whole new set of tools, the basic web tools of the scientist must include mechanisms that make it EASIER for the scientist to produce web content (appropriately marked up) while authoring papers, performing experiments, creating and logging data, and the other day to day activities of the working researcher.

Science and the Semantic Web, Hendler, 03

The same is true of the military user

Page 18: The Semantic Web Professor James Hendler hendler hendler

Automatically-organized portals

Page 19: The Semantic Web Professor James Hendler hendler hendler

New Info Management Paradigms

Page 20: The Semantic Web Professor James Hendler hendler hendler

And other “cool stuff”

Page 21: The Semantic Web Professor James Hendler hendler hendler

Tools for visualization and use

A number of tools for querying, browsing and visualizing SW data are being developed

Page 22: The Semantic Web Professor James Hendler hendler hendler

Handle larger size/complexity

Tools are now scaling as well

Page 23: The Semantic Web Professor James Hendler hendler hendler

PhotoSMORE: Marking up (sub)images

Page 24: The Semantic Web Professor James Hendler hendler hendler

And the info flows…

Page 25: The Semantic Web Professor James Hendler hendler hendler

And the info flows… correctly

Page 26: The Semantic Web Professor James Hendler hendler hendler

Putting all this together…

Page 27: The Semantic Web Professor James Hendler hendler hendler

…is starting to work

document

service

database

Are there any mountains in Europe bigger than Mt. Blanc?
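
One way to picture how such a question could be answered is to merge statements from a document, a service and a database into one pool of triples and query the pool. The sketch below assumes all three sources already use a shared vocabulary (`heightMeters`, `locatedIn`), which is exactly what ontologies and mappings are meant to provide. The Mont Blanc height is an approximate figure, and the `ex:Peak…` entries and their values are hypothetical placeholders.

```python
# Statements as they might arrive from three kinds of source.
# All values are illustrative; the ex:Peak... entries are hypothetical.
DOCUMENT_FACTS = [("Mont Blanc", "heightMeters", 4808)]  # approximate
DATABASE_FACTS = [
    ("ex:PeakA", "heightMeters", 5000),
    ("ex:PeakB", "heightMeters", 4000),
    ("ex:PeakC", "heightMeters", 6000),
]
SERVICE_FACTS = [
    ("Mont Blanc", "locatedIn", "Europe"),
    ("ex:PeakA", "locatedIn", "Europe"),
    ("ex:PeakB", "locatedIn", "Europe"),
    ("ex:PeakC", "locatedIn", "Asia"),
]


def taller_in_region(region, reference, facts):
    """Return names located in region whose stated height exceeds the reference's."""
    heights = {s: o for s, p, o in facts if p == "heightMeters"}
    in_region = {s for s, p, o in facts if p == "locatedIn" and o == region}
    threshold = heights[reference]
    return sorted(
        name
        for name in in_region
        if name in heights and heights[name] > threshold
    )


merged = DOCUMENT_FACTS + DATABASE_FACTS + SERVICE_FACTS
answer = taller_in_region("Europe", "Mont Blanc", merged)
print(answer)
```

No single source could answer the question alone: the heights and the locations come from different places and only combine because they name the same resources and properties.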

Page 28: The Semantic Web Professor James Hendler hendler hendler

Wrapping/Mapping tool

DAML Ontology (DAML-S)

XML Schema (WSDL)

Mappings for attributes

Define transformation functions

Manual editing if necessary
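
The three steps named on this slide (attribute mappings, transformation functions, manual editing) can be sketched as follows. This is not the tool's actual design: the record fields, the `ex:` property names and the `wrap` helper are all assumptions made for illustration.

```python
# Hypothetical mapping table: XML attribute -> (ontology property, transformation).
MAPPINGS = {
    "temp_f": (
        "ex:temperatureCelsius",
        lambda value: round((float(value) - 32) * 5 / 9, 1),
    ),
    "city": ("ex:location", str.strip),
}


def wrap(record, mappings, manual_overrides=None):
    """Translate a flat record into ontology properties.

    Returns the translated description and the attributes that had no mapping,
    so a person can deal with those by hand.
    """
    described, unmapped = {}, []
    for attribute, raw_value in record.items():
        if attribute in mappings:
            ontology_property, transform = mappings[attribute]
            described[ontology_property] = transform(raw_value)
        else:
            unmapped.append(attribute)
    # Manual edits are applied last so they win over automatic mappings.
    described.update(manual_overrides or {})
    return described, unmapped


described, unmapped = wrap(
    {"temp_f": "212", "city": " College Park ", "misc": "n/a"},
    MAPPINGS,
    manual_overrides={"ex:source": "manual"},
)
print(described, unmapped)
```

Reporting unmapped attributes instead of silently dropping them is what makes the "manual editing if necessary" step possible.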

Page 29: The Semantic Web Professor James Hendler hendler hendler

“Ad hoc Programming”

Advanced information management capabilities: Discovery, Filtering, Composition
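
A minimal sketch of the three capabilities, assuming each service advertises a semantic input type and output type (the kind of description DAML-S is meant to carry). The service names, types and the `free` flag are hypothetical. Discovery finds services by output type, filtering narrows the pool by a property, and composition chains services whose types line up.

```python
from collections import deque

# Hypothetical service advertisements.
SERVICES = [
    {"name": "geocode", "input": "ex:Address", "output": "ex:Coordinates", "free": True},
    {"name": "weather", "input": "ex:Coordinates", "output": "ex:Forecast", "free": True},
    {"name": "premium_weather", "input": "ex:Coordinates", "output": "ex:Forecast", "free": False},
]


def discover(services, output_type):
    """Discovery: every service that produces the requested type."""
    return [service for service in services if service["output"] == output_type]


def compose(services, start_type, goal_type):
    """Composition: breadth-first search for a chain from start_type to goal_type."""
    queue = deque([(start_type, [])])
    seen = {start_type}
    while queue:
        current_type, chain = queue.popleft()
        if current_type == goal_type:
            return chain
        for service in services:
            if service["input"] == current_type and service["output"] not in seen:
                seen.add(service["output"])
                queue.append((service["output"], chain + [service["name"]]))
    return None  # no chain of services connects the two types


forecast_providers = [s["name"] for s in discover(SERVICES, "ex:Forecast")]
free_services = [s for s in SERVICES if s["free"]]  # filtering
plan = compose(free_services, "ex:Address", "ex:Forecast")
print(forecast_providers, plan)
```

The chain is assembled at request time from the advertisements alone, which is one reading of "ad hoc programming": nobody wrote the address-to-forecast program in advance.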

Page 30: The Semantic Web Professor James Hendler hendler hendler

Ad hoc programming + logic