HYDROLOGIC METADATA CATALOG AND SEMANTIC SEARCH SERVICES IN CUAHSI HIS

Page 1: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

SAN DIEGO SUPERCOMPUTER CENTER

HYDROLOGIC METADATA CATALOG AND SEMANTIC SEARCH SERVICES IN CUAHSI HIS

CUAHSI HIS: Sharing hydrologic data
http://his.cuahsi.org/

Thomas Whitenack

David Valentine, Ilya Zaslavsky, Michael Piasecki, David G. Tarboton, Jeffery S. Horsburgh, Timothy Whiteaker, Daniel Ames, David R. Maidment

Page 2: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

CUAHSI HIS

The CUAHSI Hydrologic Information System (HIS) is an internet-based system that supports the sharing of hydrologic data. It comprises hydrologic databases and servers connected through web services, as well as software for data publication, discovery, and access.

[Diagram: HIS architecture. Components: HIS Central, HydroServer, HydroDesktop. Platforms: Data Discovery and Integration, Data Publication, Data Synthesis and Research. Labels: Data Services, Metadata Services, Metadata Search, Service registration, Catalog harvesting, Service and data theme metadata, Data carts, Water Data Services, Spatial Data Services. Callout: like the search portals Google, Yahoo, Bing.]

Page 3: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

What is the Hydrologic Metadata Catalog?

• Database for the HIS Central registry and Search Services.

• Stores Site, Variable, and Series information, plus general metadata for each registered service. Data Values are not in the Catalog.

• Its purpose is to enable searching across federated services and to return the information that leads client applications to the data values.

Page 4: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

HIS Central

HIS Central is a web application where you can register Water Data Services into the Hydrologic Metadata Catalog.

Page 5: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Registering a WaterML Service at HIS Central

Page 6: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Hydrologic Metadata Catalog: Harvesting

Each registered water data service is harvested using the standard Water Data Service methods:

◦ GetSites
  Returns a list of the site records available from the service.
◦ GetSiteInfo
  This request is made for each site. It returns:
  • All variables monitored at the site
  • Period of record for each variable
  • The number of values available
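
The harvest loop described on this slide can be sketched roughly as follows. This is a minimal illustration and not the HIS Central implementation: `FakeService`, `SeriesInfo`, `harvest`, and the dictionary keys are hypothetical names, and a real harvester would call GetSites and GetSiteInfo on the registered web service rather than on an in-memory object.

```python
from dataclasses import dataclass


@dataclass
class SeriesInfo:
    """One variable monitored at one site, as reported by GetSiteInfo."""
    variable_code: str
    begin_date: str
    end_date: str
    value_count: int


class FakeService:
    """Hypothetical in-memory stand-in for a registered water data service."""

    def __init__(self, sites):
        # sites maps a site code to the list of SeriesInfo for that site
        self._sites = sites

    def get_sites(self):
        # Plays the role of GetSites: list every site the service offers
        return sorted(self._sites)

    def get_site_info(self, site_code):
        # Plays the role of GetSiteInfo: variables, period of record, counts
        return self._sites[site_code]


def harvest(service_code, service):
    """Build catalog rows (metadata only, no data values) for one service."""
    rows = []
    for site_code in service.get_sites():
        for info in service.get_site_info(site_code):
            rows.append({
                "service": service_code,
                "site": site_code,
                "variable": info.variable_code,
                "begin": info.begin_date,
                "end": info.end_date,
                "value_count": info.value_count,
            })
    return rows


demo = FakeService({
    "site-A": [SeriesInfo("temp", "2001-01-01", "2009-12-31", 3200)],
    "site-B": [
        SeriesInfo("temp", "2005-06-01", "2009-12-31", 1600),
        SeriesInfo("flow", "1990-01-01", "2009-12-31", 7300),
    ],
})
catalog_rows = harvest("demo", demo)
print(len(catalog_rows))
```

Each row is one series (one variable at one site), which is why the catalog can describe what data exists without storing any of the values.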

Page 7: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Hydrologic Metadata Catalog: Core Data Schema

Page 8: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Ontology

• Keyword hierarchy used to categorize monitored variables and assist in their discovery.
• Each Variable is “tagged” to a keyword concept.

Page 9: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Storing the Ontology in the database

Concepts, Hierarchy, ConceptPaths
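
The slide names only three structures (Concepts, Hierarchy, ConceptPaths) and gives no columns, so the following is a guess at how they might relate, not the actual schema. It assumes that Concepts maps an id to a keyword, that Hierarchy holds parent/child pairs, and that ConceptPaths is a root-to-concept path derived from the other two. The function name and the keywords other than “HydroSphere” are illustrative.

```python
def build_concept_paths(concepts, hierarchy, separator="/"):
    """Derive a root-to-concept path string for every concept.

    concepts:  dict of concept id -> keyword
    hierarchy: list of (parent id, child id) pairs, assumed acyclic
    """
    parent_of = {child: parent for parent, child in hierarchy}
    paths = {}
    for concept_id in concepts:
        chain = []
        current = concept_id
        # Walk upward until a concept with no parent (the root) is reached
        while current is not None:
            chain.append(concepts[current])
            current = parent_of.get(current)
        paths[concept_id] = separator.join(reversed(chain))
    return paths


concepts = {1: "HydroSphere", 2: "Physical", 3: "Temperature"}
hierarchy = [(1, 2), (2, 3)]
concept_paths = build_concept_paths(concepts, hierarchy)
print(concept_paths[3])
```

Precomputing paths like this trades storage for speed: finding everything under a branch becomes a prefix match instead of a recursive walk.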

Page 10: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Ontology Service Methods

• getSearchableTerms
  ◦ Returns a list of all searchable keyword concepts. Searchable concepts include “branch” concepts as well as “leaf” concepts. Higher-level branches are excluded because they are too broad.

• getOntologyTree
  ◦ Given a “branch” concept, returns the ontology terms below it in a tree structure. (Passing “HydroSphere” returns the entire ontology.)

• getWordList
  ◦ Given a substring such as “temp”, returns all keywords that contain that sequence of characters. This is intended as a usability feature for client applications.
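
To make the three methods concrete, here is a small sketch over a toy ontology. It is an illustration under stated assumptions, not the service's code: only “HydroSphere” comes from the slides, the other keywords are invented, the depth cutoff used to exclude broad branches is a guess, and case-insensitive matching in the word list is also an assumption.

```python
# Hypothetical toy ontology: parent keyword -> child keywords.
# Only "HydroSphere" comes from the slides; the rest is illustrative.
CHILDREN = {
    "HydroSphere": ["Physical", "Chemical"],
    "Physical": ["Temperature", "Discharge"],
    "Chemical": ["Nutrient"],
    "Temperature": ["Water Temperature", "Air Temperature"],
}


def get_ontology_tree(branch):
    """Return the terms below `branch` as nested dicts."""
    return {child: get_ontology_tree(child) for child in CHILDREN.get(branch, [])}


def _depths(root="HydroSphere", depth=0):
    """Yield (keyword, depth) for every concept under `root`, root included."""
    yield root, depth
    for child in CHILDREN.get(root, []):
        yield from _depths(child, depth + 1)


def get_searchable_terms(min_depth=2):
    """List concepts deep enough to be useful search terms.

    The cutoff depth is an assumption; the slides only say that
    higher-level branches are excluded for being too broad.
    """
    return sorted(name for name, depth in _depths() if depth >= min_depth)


def get_word_list(substring):
    """Return every keyword containing `substring`, ignoring case."""
    needle = substring.lower()
    return sorted(name for name, _ in _depths() if needle in name.lower())


print(get_word_list("temp"))
print(get_searchable_terms())
```

In this toy tree, “Temperature” is a branch that remains searchable, while “HydroSphere”, “Physical”, and “Chemical” fall above the cutoff.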

Page 11: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Search Service Methods (1/3)

GetWaterOneFlowServiceInfo
◦ Returns a list of all the services that are registered with HIS Central.

GetServicesInBox
◦ Same as the GetWaterOneFlowServiceInfo method, but restricted by a geographic envelope.

Both methods return the following information: WSDL endpoint for the Water Data service, title, name, organization, contact info, estimated number of values, number of sites, number of variables, and geographic extent.
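
The envelope restriction can be illustrated with a simple bounding-box test. This sketch assumes a service matches when its geographic extent intersects the query box; the slides do not say whether the real method uses intersection or containment. `Box`, `ServiceInfo`, `get_services_in_box`, and the example.org endpoints are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Geographic envelope in decimal degrees."""
    xmin: float  # west longitude
    ymin: float  # south latitude
    xmax: float  # east longitude
    ymax: float  # north latitude

    def intersects(self, other):
        # Two boxes overlap when they overlap on both axes
        return (self.xmin <= other.xmax and other.xmin <= self.xmax
                and self.ymin <= other.ymax and other.ymin <= self.ymax)


@dataclass(frozen=True)
class ServiceInfo:
    """A few of the fields the slide lists for each registered service."""
    title: str
    wsdl: str
    extent: Box


def get_services_in_box(services, box):
    """Keep only services whose extent overlaps the query envelope."""
    return [s for s in services if s.extent.intersects(box)]


services = [
    ServiceInfo("West network", "http://example.org/west?WSDL",
                Box(-125.0, 32.0, -114.0, 42.0)),
    ServiceInfo("East network", "http://example.org/east?WSDL",
                Box(-80.0, 35.0, -70.0, 45.0)),
]
found = get_services_in_box(services, Box(-120.0, 30.0, -110.0, 35.0))
print([s.title for s in found])
```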

Page 12: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Search Service Methods (2/3)

• GetSitesInBox
  ◦ Requires:
    • Geographic extent (box)
    • Concept Keyword (can be empty)
    • NetworkIDs (used to restrict returned values, can be empty)
  ◦ Returns the information necessary to display sites on a map and to request more information about series.

Page 13: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Search Service Methods (3/3)

• GetSeriesCatalogForBox
  ◦ The primary method for searching the catalog. Returns series record information. The client application uses this information to request the data values from the registered service.
  ◦ You provide:
    • Geographic extent (box)
    • Temporal extent (begin/end dates)
    • Concept Keyword (can be empty)
    • NetworkIDs (used to restrict returned values, can be empty)
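
The four inputs combine as independent filters, which the following sketch tries to show. It is an assumption-laden illustration: `search_series` and the record keys are hypothetical, the concept match is exact (a semantic search would presumably also match concepts beneath the keyword in the ontology), and a series is kept when its period of record overlaps the requested window.

```python
from datetime import date


def search_series(records, box, begin, end, concept="", network_ids=()):
    """Filter series records by box, time window, concept, and network.

    box is (xmin, ymin, xmax, ymax) in decimal degrees; begin and end
    are datetime.date values. An empty concept or empty network_ids
    means "do not filter on this".
    """
    xmin, ymin, xmax, ymax = box
    hits = []
    for rec in records:
        in_box = (xmin <= rec["longitude"] <= xmax
                  and ymin <= rec["latitude"] <= ymax)
        # Keep a series when its period of record overlaps the window
        overlaps = rec["begin"] <= end and rec["end"] >= begin
        concept_ok = not concept or rec["concept"] == concept
        network_ok = not network_ids or rec["network_id"] in network_ids
        if in_box and overlaps and concept_ok and network_ok:
            hits.append(rec)
    return hits


records = [
    {"site": "A", "longitude": -117.2, "latitude": 32.9,
     "begin": date(2000, 1, 1), "end": date(2005, 12, 31),
     "concept": "Temperature", "network_id": 1},
    {"site": "B", "longitude": -117.0, "latitude": 33.1,
     "begin": date(1990, 1, 1), "end": date(1995, 12, 31),
     "concept": "Temperature", "network_id": 1},
    {"site": "C", "longitude": -97.7, "latitude": 30.3,
     "begin": date(2000, 1, 1), "end": date(2009, 12, 31),
     "concept": "Discharge", "network_id": 2},
]
hits = search_series(records, (-118, 32, -116, 34),
                     date(2004, 1, 1), date(2006, 1, 1),
                     concept="Temperature")
print([rec["site"] for rec in hits])
```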

Page 14: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

What info is in a Series Record, you ask?

Everything required to create a datacart.

SeriesRecord
◦ ServCode (string): service's unique code, e.g. “nwis”
◦ ServURL (string): WSDL address of service
◦ Location (string): site code
◦ VarCode (string): variable code associated with the series
◦ Varname (string): variable name
◦ beginDate (string): start date of series
◦ endDate (string): end date of series (as of last harvest)
◦ Authtoken (string): unimplemented
◦ ValueCount (int): number of values in series
◦ Sitename (string): site name
◦ Latitude (double)
◦ Longitude (double)
◦ datatype (string)
◦ valuetype (string)
◦ samplemedium (string)
◦ timeunits (string)
◦ conceptKeyword (string): ontology keyword to which this variable is tagged
◦ genCategory (string)
◦ TimeSupport (string)
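
As a rough illustration of how a client might hold a series record and turn it into a data cart entry, consider the sketch below. The field names and types follow the slide; the `data_cart_entry` helper, its output keys, and every sample value other than the “nwis” service code are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SeriesRecord:
    """Field names are kept exactly as listed on the slide."""
    ServCode: str
    ServURL: str
    Location: str
    VarCode: str
    Varname: str
    beginDate: str
    endDate: str
    Authtoken: str
    ValueCount: int
    Sitename: str
    Latitude: float
    Longitude: float
    datatype: str
    valuetype: str
    samplemedium: str
    timeunits: str
    conceptKeyword: str
    genCategory: str
    TimeSupport: str


def data_cart_entry(record):
    """Pick out what a client needs to request the values later."""
    return {
        "endpoint": record.ServURL,
        "site": record.Location,
        "variable": record.VarCode,
        "start": record.beginDate,
        "end": record.endDate,
    }


# Placeholder values for illustration only
example = SeriesRecord(
    ServCode="nwis", ServURL="http://example.org/service?WSDL",
    Location="SITE-1", VarCode="VAR-1", Varname="Example variable",
    beginDate="2000-01-01", endDate="2009-12-31", Authtoken="",
    ValueCount=3650, Sitename="Example site",
    Latitude=32.9, Longitude=-117.2,
    datatype="", valuetype="", samplemedium="", timeunits="",
    conceptKeyword="Temperature", genCategory="", TimeSupport="",
)
entry = data_cart_entry(example)
print(entry["endpoint"], entry["site"], entry["variable"])
```

The entry carries the service address plus the site, variable, and date range, which is the information the client needs to go back to the registered service for the actual values.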

Page 15: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Hydrologic Metadata Catalog Stats

• Services
• Variables
• Sites
• Series
• Values referenced

474,812

1,889,199

8,516,440

4,622,778,988

Page 16: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Future Development

• Need to standardize the services to use the WaterML data exchange format.
• Need to harvest data directly from HydroServer capabilities services.
• Need to extend the search to support geometries other than an envelope (HUCs, counties, etc.).

Page 17: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Conclusions

• Searching across multiple, federated services is made possible by harvesting and indexing metadata from registered services.

• Metadata is data. The catalog pushes the limits of what counts as metadata.

Page 18: HYDROLOGIC  METADATA  CATALOG AND  SEMANTIC  SEARCH SERVICES IN CUAHSI  HIS

Questions?

[email protected]

• http://hiscentral.cuahsi.org

• http://hiscentral.cuahsi.org/webservices/hiscentral.asmx