1
Abstract Archiving and Near Real Time Visualization of USGS Instantaneous Data Ilya Zaslavsky, David Ryan, Thomas Whitenack, David Valentine, Matthew Rodriguez San Diego Supercomputer Center, UCSD, San Diego, CA The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities in the US and abroad The CUAHSI Hydrologic Information System (HIS) project has been developing databases, services and online and desktop software applications supporting standards- based publication and access to large volumes of hydrologic data from US federal agencies and academic partners. In particular, the CUAHSI WaterML 1.x schema specification for exchanging hydrologic time series, earlier published as an OGC Discussion Paper (2007), has been adopted by the United States Geological Survey to provide web service access to USGS daily values and instantaneous data. The latter service, making available raw measurements of discharge, gage height and several other parameters for over 10,000 USGS real time measurement points, was announced by USGS, as an experimental WaterML- compliant service, at the end of July 2009. We demonstrate an online application that leverages the new service for nearly continuous harvesting of USGS real time data, and simultaneous visualization and analysis of the data streams. To make this possible, we integrate service components of the CUAHSI software stack with Open Source Data Turbine (OSDT) system, an NSF- supported software environment for robust and scalable assimilation of multimedia data streams (e.g. from sensors), and interfacing with a variety of viewers, databases, archival systems and client applications. Our application continuously queries USGS Instantaneous water data service (which provides access to 15-min measurements updated at USGS every 4 hours), and maps the results for each station-variable combination to a separate "channel", which is used by OSDT to quickly access and manipulate the time series. About 15,000 channels are used, which makes it by far the largest deployment of OSDT. Using RealTime Data Viewer, users can now select one or more stations of interest (e.g. from upstream or downstream from each other), and observe and annotate simultaneous dynamics in the respective discharge and gage height values, using fast forward or backward modes, real-time mode, etc. Memory management, scheduling DataTurbine in Environmental Observing Projects CUAHSI HIS Service Oriented Architecture: General Outline Links CUAHSI HIS: http ://his.cuahsi.org/ HIS Wiki @ SDSC : http://river.sdsc.edu/wiki Open Source DataTurbine: http://www.dataturbine.org Real-time Data Viewer (RDV): http://code.google.com/p/rdv / The USGS realtime application: http://geo.sdsc.edu/jnlp/usgs.jnlp Integrating USGS Real Time data streams with OSDT Solution for accessing both streaming and static data, from different vendor systems, via a common interface Released under Apache 2.0 Open Source License Provides real high performance data streaming, 10+MB/sec, 1000 frames/sec Supported by NASA SBIR, 15 years in development It is one of just a handful comprehensive solutions for streaming data Supports multiple types of streams: real-time monitoring, video and multimedia, telemetry, instant messages, etc. etc. Can be accessed via URLs (e.g. can stream to browser) Scalable: DataTurbine servers can be interconnected to handle large streams OSDT-managed data displayed in CUAHSI DASH (Data Access system for Hydrology) application (the graph shows a simulated air temperature stream) Live data and video from Santa Margarita Ecological Reserve (NEON) Integration of Heterogeneous Devices Project NI cRIO Cam pbell CR510 Apprise Templine D avis weather station Vaisala W X T510 Vaisala PTB 210 Axis 241 (video) Greenspan Dissolved O xygen Sensor GLEON X X X X X CREON X X X NEON X X X N EES X X PRAGM A X X X X X NEON – Ecology: http://neoninc.org NEES – Earthquake Engineering: http://it.nees.org/ GLEON – Hydroecology: http://gleon.org/ CREON – Coral reefs: http://www.coralreefeon.org/ MoveBank – Animal tracking: Bridges and Civil Infrastructure – Engineering: http ://www.princeton.edu/~wikelski/research/index.htm http ://healthmonitoring.ucsd.edu/ PRAGMA – Pacific Rim Applications and Grid Middleware Assembly: http ://pragma-grid.net ID# H51H- 0861 Water Markup Language (WaterML) is a standard XML schema for hydrologic time series, designed to support exchange of hydrologic information between servers and clients within HIS. It contains general constructs describing sites, variables, time series, and data values. Designed to be the least barrier for adoption by hydrologists, WaterML relies on both the ODM relational schema adopted for managing data from academic projects, and the data and metadata available from government hydrologic repositories at the national and state levels. WaterML 1.0 specification is available as an OGC Discussion Paper at http://www.opengeospatial.org/standards/d p. A recently established OGC/WMO Hydrology Domain Working Group (HDWG) focuses on further development of international standards for hydrologic data exchange, WaterML 2.0:. http://external.opengeospatial.org/twiki_ public/bin/view/HydrologyDWG/WebHome Rainfa ll & Snow Water quantity and quality Remote sensing Meteorolo gy Soil water CUAHSI HIS develops service-oriented architecture for hydrologic research and education, to enable publication, discovery, retrieval, analysis and integration of hydrologic data. The project team has defined a common information model for organizing hydrologic observation data, designed a common exchange protocol (Water Markup Language), created a collection of SOAP web services (WaterOneFlow services) that provide uniform access to different federal, state and local hydrologic data repositories, and developed mechanisms for ontology-based data registration and discovery. Test bed HIS Servers Central HIS servers ArcGIS Matlab IDL, R MapWindow Excel Programmin g (Fortran, C, VB) Desktop clients Customizable web interface (DASH) HTML - XML W S D L - S O A P Modeling (OpenMI) Global search (Hydroseek) WaterOneFlow Web Services, WaterML Controll ed vocabula ries Metada ta catalo gs Ontolo gy ETL services HIS Lite Servers External data providers Other popular online clients ODM DataLoader Streaming Data Loading Ontology tagging (Hydrotagger ) WSDL and ODM registration Data publishing ODMTools Server config tools HIS Central Registry & Harvester End-to-end three-tier solution: from publication and registration to discovery and analysis of hydrologic observation data CUAHSI HIS Infrastructure for Hydrologic Data WaterOneFlow Web Services provide a uniform way to query and access observations data and metadata within HIS, regardless of intricacies of individual data sources. The following core methods are supported: GetSites GetSiteInfo GetVariableInfo GetValues WaterOneFlow method calls return WaterML- compliant documents to HIS clients. The list of data sources that are accessed via WaterOneFlow services include: USGS National Water Information system (Daily Values, Ground Water, Instantaneous Irregular Data, Unit Values), EPA STORET, USDA Snowpack Telemetry (SNOTEL), NCDC ASOS, MODIS, DAYMET, NAM12K, as well as over 40 observation networks contributed by academic research projects and made accessible via ODM Generic Web Services. The WaterOneFlow services and WaterML are being adopted by HIS partners, including government agencies. WaterOneFlow Services Water Markup Language Open Source DataTurbine in CUAHSI HIS Projects HydroDesktop How it works: USGS NWIS experimental instantaneous web services (GetValues REST service) are used to continuously harvest discharge and gage height information from 7330 USGS stations where both parameters are measured. A catalog of USGS stations and variables is created. • The custom OSDT source application, written using RBNB Simple API (SAPI) connects to the data store to retrieve and cache the 15-minute time series. • The time series are represented as OSDT channels. • The time series for each station, organized by states and counties, are available in RealTime Data Viewer, a common OSDT client. • The custom ODM Sync application loads the data into CUAHSI ODM (Observations Data Model) database, making it available via an additional WaterOneFlow web service, which can be shown in other CUAHSI HIS tools such as DASH (below). RDV displays discharge and gage height variables from several stations simultaneously, in one or more panels, and supports fast playback and rewind of time series, at specified speed and time window (above). In addition, RDV can display video/photo, X-Y, tabular, spectrum, text and web data panels, and annotate events. RDV supports deriving channels from primary channels, for example to convert discharge or gage height units the metric system (left)

Abstract Archiving and Near Real Time Visualization of USGS Instantaneous Data Ilya Zaslavsky, David Ryan, Thomas Whitenack, David Valentine, Matthew Rodriguez

Embed Size (px)

Citation preview

Page 1: Abstract Archiving and Near Real Time Visualization of USGS Instantaneous Data Ilya Zaslavsky, David Ryan, Thomas Whitenack, David Valentine, Matthew Rodriguez

Abstract

Archiving and Near Real Time Visualization of USGS Instantaneous Data

Ilya Zaslavsky, David Ryan, Thomas Whitenack, David Valentine, Matthew RodriguezSan Diego Supercomputer Center, UCSD, San Diego, CA

The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities in the US and abroad The CUAHSI Hydrologic Information System (HIS) project has been developing databases, services and online and desktop software applications supporting standards-based publication and access to large volumes of hydrologic data from US federal agencies and academic partners. In particular, the CUAHSI WaterML 1.x schema specification for exchanging hydrologic time series, earlier published as an OGC Discussion Paper (2007), has been adopted by the United States Geological Survey to provide web service access to USGS daily values and instantaneous data. The latter service, making available raw measurements of discharge, gage height and several other parameters for over 10,000 USGS real time measurement points, was announced by USGS, as an experimental WaterML-compliant service, at the end of July 2009. We demonstrate an online application that leverages the new service for nearly continuous harvesting of USGS real time data, and simultaneous visualization and analysis of the data streams.

To make this possible, we integrate service components of the CUAHSI software stack with Open Source Data Turbine (OSDT) system, an NSF-supported software environment for robust and scalable assimilation of multimedia data streams (e.g. from sensors), and interfacing with a variety of viewers, databases, archival systems and client applications. Our application continuously queries USGS Instantaneous water data service (which provides access to 15-min measurements updated at USGS every 4 hours), and maps the results for each station-variable combination to a separate "channel", which is used by OSDT to quickly access and manipulate the time series. About 15,000 channels are used, which makes it by far the largest deployment of OSDT. Using RealTime Data Viewer, users can now select one or more stations of interest (e.g. from upstream or downstream from each other), and observe and annotate simultaneous dynamics in the respective discharge and gage height values, using fast forward or backward modes, real-time mode, etc.

Memory management, scheduling service-based retrieval from USGS web services, and organizing access to 7,330 selected stations, turned out to be the major challenges in this project. To allow station navigation, they are grouped by state and county in the user interface. Memory footprint has been monitored under different Java VM settings, to find the correct regime. These and other solutions are discussed in the paper, and accompanied with a series of examples of simultaneous visualization of discharge from multiple stations as a component of hydrologic analysis.

DataTurbine in Environmental Observing Projects

CUAHSI HIS Service Oriented Architecture: General Outline

LinksCUAHSI HIS: http://his.cuahsi.org/

HIS Wiki @ SDSC : http://river.sdsc.edu/wiki

Open Source DataTurbine: http://www.dataturbine.org

Real-time Data Viewer (RDV): http://code.google.com/p/rdv/

The USGS realtime application: http://geo.sdsc.edu/jnlp/usgs.jnlp

Integrating USGS Real Time data streams with OSDT

• Solution for accessing both streaming and static data, from different vendor systems, via a common interface

• Released under Apache 2.0 Open Source License• Provides real high performance data streaming, 10+MB/sec, 1000 frames/sec • Supported by NASA SBIR, 15 years in development• It is one of just a handful comprehensive solutions for streaming data• Supports multiple types of streams: real-time monitoring, video and

multimedia, telemetry, instant messages, etc. etc.• Can be accessed via URLs (e.g. can stream to browser)• Scalable: DataTurbine servers can be interconnected to handle large streams• Can manipulate the streams: fast forward or slow motion playback

(TiVo-like)• Secure access to DataTurbine Server, based on user credentials

OSDT-managed data displayed in CUAHSI DASH (Data Access system for Hydrology) application (the graph shows a simulated air temperature stream)

Live data and video from Santa Margarita Ecological Reserve (NEON)

Integration of Heterogeneous DevicesProject NI

cRIO Campbell CR510

Apprise Templine

Davis weather station

Vaisala WXT510

Vaisala PTB210

Axis 241 (video)

Greenspan Dissolved Oxygen Sensor

GLEON X X X X X CREON X X X NEON X X X NEES X X PRAGMA X X X X X

NEON – Ecology: http://neoninc.org NEES – Earthquake Engineering: http://it.nees.org/ GLEON – Hydroecology: http://gleon.org/ CREON – Coral reefs: http://www.coralreefeon.org/ MoveBank – Animal tracking: Bridges and Civil Infrastructure – Engineering: http://www.princeton.edu/~wikelski/research/index.htm http://healthmonitoring.ucsd.edu/ PRAGMA – Pacific Rim Applications and Grid Middleware Assembly: http://pragma-grid.net

ID# H51H-0861

Water Markup Language (WaterML) is a standard XML schema for hydrologic time series, designed to support exchange of hydrologic information between servers and clients within HIS. It contains general constructs describing sites, variables, time series, and data values.

Designed to be the least barrier for adoption by hydrologists, WaterML relies on both the ODM relational schema adopted for managing data from academic projects, and the data and metadata available from government hydrologic repositories at the national and state levels. WaterML 1.0 specification is available as an OGC Discussion Paper at http://www.opengeospatial.org/standards/dp. A recently established OGC/WMO Hydrology Domain Working Group (HDWG) focuses on further development of international standards for hydrologic data exchange, WaterML 2.0:. http://external.opengeospatial.org/twiki_public/bin/view/HydrologyDWG/WebHome

Rainfall & Snow

Water quantity and quality

Remote sensing

Meteorology

Soil water

CUAHSI HIS develops service-oriented architecture for hydrologic research and education, to enable publication, discovery, retrieval, analysis and integration of hydrologic data. The project team has defined a common information model for organizing hydrologic observation data, designed a common exchange protocol (Water Markup Language), created a collection of SOAP web services (WaterOneFlow services) that provide uniform access to different federal, state and local hydrologic data repositories, and developed mechanisms for ontology-based data registration and discovery.

Test bed HISServers

Central HIS servers

ArcGIS

Matlab

IDL, R

MapWindow

Excel

Programming (Fortran, C, VB)

Desktop clients

Customizable web interface (DASH)

HTML - XML

WS

DL

- SO

AP

Modeling (OpenMI)

Global search (Hydroseek)

WaterOneFlow Web Services, WaterML

Con

trol

led

voca

bula

ries

Met

adat

aca

talo

gs

Ont

olog

y

ET

L s

ervi

ces

HIS LiteServers

External data providers

Other popular online clients

ODM DataLoader

Streaming Data Loading

Ontology tagging (Hydrotagger)

WSDL and ODM registration

Data publishing

ODMTools

Server config tools

HIS CentralRegistry & Harvester

End-to-end three-tier solution: frompublication and registration to discovery and analysis of hydrologic observation data

CUAHSI HIS Infrastructure for Hydrologic Data

WaterOneFlow Web Services provide a uniform way to query and access observations data and metadata within HIS, regardless of intricacies of individual data sources. The following core methods are supported: • GetSites • GetSiteInfo• GetVariableInfo• GetValues

WaterOneFlow method calls return WaterML-compliant documents to

HIS clients. The list of data sources that are accessed via WaterOneFlow services include: USGS National Water Information system (Daily Values, Ground Water, Instantaneous Irregular Data, Unit Values), EPA STORET, USDA Snowpack Telemetry (SNOTEL), NCDC ASOS, MODIS, DAYMET, NAM12K, as well as over 40 observation networks contributed by academic research projects and made accessible via ODM Generic Web Services. The WaterOneFlow services and WaterML are being adopted by HIS partners, including government agencies.

WaterOneFlow Services Water Markup Language

Open Source DataTurbine in CUAHSI HIS

Projects

Hyd

roD

eskt

op

How it works:• USGS NWIS experimental instantaneous web services (GetValues REST service) are used

to continuously harvest discharge and gage height information from 7330 USGS stations where both parameters are measured. A catalog of USGS stations and variables is created.

• The custom OSDT source application, written using RBNB Simple API (SAPI) connects to the data store to retrieve and cache the 15-minute time series.

• The time series are represented as OSDT channels.• The time series for each station, organized by states and counties, are available in

RealTime Data Viewer, a common OSDT client.• The custom ODM Sync application loads the data into CUAHSI ODM (Observations Data

Model) database, making it available via an additional WaterOneFlow web service, which can be shown in other CUAHSI HIS tools such as DASH (below).

RDV displays discharge and gage height variables from several stations simultaneously, in one or more panels, and supports fast playback and rewind of time series, at specified speed and time window (above). In addition, RDV can display video/photo, X-Y, tabular, spectrum, text and web data panels, and annotate events.

RDV supports deriving channels from primary channels, for example to convert discharge or gage height units the metric system (left)