22
Seeing the Big Picture Sensing, Linking, Analyzing and Visualizing Big Data Dr. Paul Janecek

Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

Seeing the Big Picture Sensing, Linking, Analyzing and Visualizing Big Data

Dr. Paul Janecek

Page 2: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 2

– Introduction

• Seeing Details and Context in Big Data

– Example: Real-Time Monitoring

• Sensing: In-Situ Data

• Linking: IOOS

• Analyzing: Pattern Recognition & Analysis

• Visualizing: Data Portal

– Challenges

Page 3: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 3

Page 4: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:
Page 5: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 5

• Detecting Hazardous Algal Blooms (HAB)

• In-Situ Sensors – Environmental Sample Processor (ESP)

• Networked Ocean Data – Integrated Ocean Observing Initiative (IOOS)

– Ocean Observatories Initiative (OOI)

• Data Visualization Portal – Spyglass Data Portal

• Developed by Think Blue Data, Thailand

• Impact: – Detection reduced from 3-5 days to 3.5 hours

Page 6: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 6

• Variety

– Structured and Unstructured data

• Time series, Log Files, Images

• Volume

– Massive historical archives of open data

• Velocity

– Real-time and near-real-time data streams

Page 7: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 7

• ESP: an Underwater Ecogenomic Robot

– Detects DNA and Toxins

– Sends result as image

– Near real-time data

Image Source: Center for Environmental Visualization, UW, in (Scholin, 2008)

Page 8: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

Image Source: WHOI

Page 9: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 9 Image Source: OOI, Center for Environmental Visualization, UW

Page 10: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

Image Source: IOOS

Page 11: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 11

• Data Sources

– Data Capture

– Data Storage Standards

– Metadata

– Quality

• Data Access

– Networking

– Security

Page 12: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

Image Source: IOOS

Page 13: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 13

Pacific Pacific Coast Central Atlantic

AOOS (Alaska) NANOOS (Northwest Pacific) GLOS (Great Lakes) NERACOOS (Northeast)

Integrate, Catalog,

Provide Public Access

CeNCOOS (Central California) GCOOS (Gulf of Mexico) MARACOOS (Mid-Atlantic)

PacIOOS (Pacific) Southern California CariCOOS (Caribbean) SECOORA (Southeast)

Image Source: Regional IOOS Portals

Page 14: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 14

IOOS® Data Integration Framework (DIF) Final Assessment Report

11/22/2010 p. 15

In Phase 1 of the project, the surface current observations for the project geographical area were provided by NDBC

and CO-OPS in IOOS DIF format for gridded data, i.e. netCDF/CF. In the Phase 2, the surface currents modeled by

CSDL were used instead of observations. The CSDL model output data was provided by CSDL in the same

netCDF/CF format, and was accessible via IOOS DIF standard access service, i.e. OPeNDAP/WCS. However, the

netCDF files to feed the particle tracker were retrieved via FTP because of limitations on the client side. The project

architecture and data flow diagram is presented in Figure 3-1.

Figure 3-1 HAB-FS Data Flow Diagram

3.2.3. Anticipated Benefits

A number of benefits were expected as a result of this project; categorized benefits are shown in Table 3-2.

# Anticipated Benefit Category1

1 A measurable increase in the spatial and temporal accuracy (skill) and precision (reproducibility)

of HAB forecasts A

2 Increased probability of providing an accurate nowcast for the time periods when satellite

imagery is not available due to clouds A

3 Extension of forecast spatial range to the areas where the forecast has been previously

unavailable, e.g., Tampa Bay A

4 Extension of forecast temporal range from 3 days to 5 days and more A

5 Increased HAB forecast objectivity C 1See Table 3-1

Table 3-2 HAB-FS Benefits

Example: Tracking Hazardous Algal Blooms

Image Source: IOOS Data Integration Framework, Final Assessment Report

Page 15: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 15

• Data Access

– Data Discovery

– Standards

• Data Integration

– Vocabularies

– Data Preparation

Page 16: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 16

• Actionable Data from In-Situ Sensors

– Parsing, Data Preparation

– Pattern Recognition, Signal Detection

– Update Monitoring Dashboard

Image Source (top right): Metfies et al.,Ecology of Harmful Algae, 2006

Page 17: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 17

Scatterplot Matrix

Time Series

Calendars Horizon Chart

Details

Time Series Data Histogram

Images

Organism Concentration

Spot Box Plot Image Spot Map

Organism Summary Data

Spot Detail Data

Sensor Components

Log File Details

Timeline of Events

Time Series Data

Image Data

Diagnostic Data

ESP Sensor Data

Page 18: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 18

• Portal Data

– ESP Data

• Organisms

• Images

• CTD, “Can”

• Diagnostics

– IOOS Data

• Context

Metadata Facets

Spatial Facet

Time Series Data Overview

Result Sets

Page 19: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:
Page 20: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 20

• Web-based visualization

– Massive distributed data sets

– Bandwidth constraints

– Highly Interactive Graphics

Page 21: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

l 21

Challenges Research & Technology Applications

Variety Structured: (Time Series) Unstructured: (Log Files)

Data and Metadata Standards Data Services & Discovery

Data Integration Data Mining

Process Mining Faceted Search

Distributed Databases Distributed Processing

Distributed Architecture Sensor Networks

Process Integration

Visualization Architecture

Data Analysis Process Analysis

Business Intelligence Decision Support

Monitoring

Volume Massive Data Sets

Velocity Real-Time Data

Page 22: Seeing the Big Picture - AIT Solutionssolutions.ait.ac.th/resources/pdf/11.Seeing the Big... · 2013-08-21 · l 2 –Introduction •Seeing Details and Context in Big Data –Example:

Thank You

Dr. Paul Janecek CEO, Think Blue Data Visiting Faculty, Computer Science and Information Management [email protected]