22
1 Spatial Data Management in the Cloud

1 Spatial Data Management in the Cloud. 2 GeoEye Analytics Mission: To provide superior advanced geospatial intelligence services and solutions to Defense,

  • View
    212

  • Download
    0

Embed Size (px)

Citation preview

1

Spatial Data Management inthe Cloud

2

GeoEye Analytics

Mission:

To provide superior advanced geospatial intelligence services and solutions to Defense, Intelligence, Homeland Security, and Civilian Agencies.

Solutions:

We provide a wide range of products & services including:– Geospatial Intelligence and All-Source Analysis

– Predictive Analytics

– Geospatial Content Management

– Advanced Research & Development

3

Geospatial Content Management

Context

• Geographic information is a key feature of 80-90% of all government data

• Federal Agencies are spending over $3.0 billion annually on collection and management of geospatial data

• Existing data management solutions are inadequate– The vast majority of analyst’s time is spent looking for data (80/20 rule)

– Up to 80% of the cost associated with a GIS is the development and maintenance of data

• Consolidation of GIS assets under IT is further exasperating the problem– IT personnel are generally unfamiliar with GIS data and processes

– Lack of knowledge leads to resistance

4

Data Management Solutions

• EarthWhere

• Web-based content management system used to catalog, provision, and disseminate geospatial data

• Addresses key data management challenges by:– Automatically cataloging geospatial assets

– Enable analyst to quickly find the data they need

– Reduce data acquisition and maintenance costs

• Flexible deployment options– On- Premise Solution

Catalog and provision existing data holdings Deploys on existing hardware No external network access required

– Amazon EC2 Cloud High performance, highly scalable Untethered from centralized IT Cost effective and flexible pricing

5

Key EarthWhere Features

• Automated cataloging of imagery, maps, terrain and vector data

• Fast, easy-to-use, search and retrieval

• Provision source data into derivative products on-the-fly

• Scalable and extensible Service Oriented Architecture (SOA)

• Distributed processing

• Role-based data level security

• OGC Compliant Web Services

• Proven and trusted solution throughout DOD and Civilian Federal agencies

6

Advanced Cataloging Capabilities

• Automated discovery of data– High performance directory crawlers

– User defined file filters

– Option to catalog relational databases (e.g. ArcSDE)

• Rapid extraction and normalization of metadata– Data remains in its native format

– Metadata is stored in a relation database

• Advanced Scheduling Capabilities– Run-once, at a specific time/interval, or continuously

• Option to Provision on Ingest– Format conversion and reprojection on Ingest

– Organize data based on metadata attributes

• Multi-Threaded and Scalable– Ingest tasks can be deployed across multiple servers

7

Rapid Data Discovery and Access

• Intuitive Workflow– Zoom to your area of interest

– Specify search criteria

– Download data it’s native format

– Option to create derivative products Specify output format, project, mosacking options, etc

• Simple Web Interface– Supports multiple browsers including Firefox and Internet Explorer

– No plugins required (Standard HTML and JavaScript)

• Sophisticated Search Capabilities Data Source

Data Type

Acquisition Date

Catalog Date

Location

Description

Provider

Sensor Name

Sensor Type

Sensor Azimuth/Elevation

Sun Azimuth/Elevation

Cloud Cover

Resolution and Quality

Target ID/BE Number

8

Amazon EC2 Cloud Deployment Option

• Scalable to meet expanding needs of organization– Can boost capacity by expanding the hardware set without

having to rebuild entire architecture and server environment

– Only need to upgrade AWS instance

• Increased Mobility– Customers can access data inside and outside organization

– Supports VPN IP security models to protect sensitive data

• Reduces organization’s IT requirements– Infrastructure shifted to the cloud

– Reduction in internal network bottlenecks

• Reduce Costs– Usage is paid on a subscription basis and reflects demand

– “Pay as you go” usage

– Reduced internal IT infrastructure costs

• Reliability– AWS offers 99.99% online reliability

9

How Does it Work?

•High availability, high performance•Secure & scalable•Cost effective•Your own volume•Transparent, predictable cost•Maintain control & ownership of your dataCustomer Data•Preconfigured•Customer managed, including automatic updates•Scalable

EarthWhere AMI•Customer purchases storage on Amazon Cloud and uploads data•Customer acquires EarthWhere AMI•

10

- EarthWhere gives users the ability to download source or raw data through the use of the web application that is shipped with the software.

- This gives them the ability to either download the file in a bundled .zip format or launch it directly in applications like ArcMap or ERDAS Imagine

- This can also be customized as needed through the available API

Exposing large raw data sets via a cloud infrastructure

11

• All data that is ingested into the EarthWhere system is automatically available through the available WMS. 

• Along with the Data WMS you have the ability to cache data which enables large data layers and services to be viewed rapidly.

Exposing data services from large data sets via a cloud infrastructure

12

• GeoEye has the expert data managers and resources to provide a data hosting and management service. 

• Part of the EarthWhere system gives you the ability to discover data automatically, just point the software to data directories and let it scan.  

• The data discovery can be setup to run on a scheduled basis, ensuring when users search the web application for data they find the latest and newest data. 

• When logging into the web application users have a variety of options presented to them for searching for data, the most widely used being AOI selected on a map and narrowed down by the use of the metadata attributes.

Hosting, search and discovery of large data sets via cloud storage

13

Exploitation of OpenSource Media and GeoINT

Agency consumption of publicly acquired crowd sourced data

A broader range of media sources from social media, file sharing, and web content can augment open media sources, all in an unclassified network

We bring open source exploitation to the analyst• Real time foreign media• Real time social media• Real time geotagged media, photos, and open

maps• Petabytes of data on a cloud system

14

Problem: Search vs. Discovery

• Current State: Painful Search

• Analysts searching for answers in millions of documents.

• Critical information takes too long to find. Making unexpected connections is nearly impossible.

Example: Human Terrain Analysis @ Skope

Use intelligence message traffic and previous research to synthesize evidence of the location, density, and disposition of tribes in a spatial area of interest. In its current state, this takes days to weeks.

• Desired State: Visual Discovery

• Find answers faster, and discover new relationships that were unexpected

FILTER

15

Canvas

What does it do?

Discovers knowledge in (large) collections of unstructured data.

How does it do that?

Analyzes and visualizes large amounts of text. It includes traditional search capabilities.

How is it built?

On a cluster of computers working together to solve large problems just like Google, Amazon, and Yahoo. Users interact with it through a web browser.

Why is it important?

If “the answer” is hidden among hundreds to millions of documents, Canvas can help you discover it.

Where can I use it?

Lots of Text; Message Traffic; Finished Intel Product Portals; Metadata

Transition opportunity to message traffic programs and analyst workgroups.

Applicable to NIPR, SIPR, JWICS

16

Canvas Key Differentiators

• Canvas does both!

• If you’re comfortable with more traditional workflows based on keyword searches (Google-style), Canvas does that faster than ever before.

• If you don’t know what you’re looking for, Canvas helps you get there.

• Find something valuable that you didn’t know to look for.

SearchersI know what I want, is it in here?

Explorers I have a collection, what’s in it?

17

Exploitation of OpenSource Media and GeoINT

Professional Media

Social Media

OpenStreetMap

Geotagged GroundPhotography

Entity Extraction | Geocoding | Phone and Coordinate Filtering

Mill

ions

of F

iles

18

• Legion is being developed for NGA as part of the effort to develop new tools to solve analytical challenges along with MrGeo.

• Goal is to accelerate time intensive operations to change expectations.

• Current applications– Viewshed – Radio Frequency Propagation– Landform Analysis– Helo LZ site selection

Exposing large raw data sets via a cloud infrastructure

19

• GeoEye has tools for determining changes over time and represented for imagery and presents these

• Changes with color coded graphics. There can also be set up a sliding scale to show gradual changes of imagery

• Output charts (pie and bar) to represent specific details of change and representative data

Analysis of changes between legacy and current imagery

20

• EarthWhere provides this functionality in a cloud environment

Normalization of multiple images using different formats of a common location

21

• EarthWhere provides this functionality in a cloud environment

Collaboration and common operating picture with large data sets using cloud infrastructure

22

Questions?

Please contact:

Jon PolayGeoEye Analytics

Director, Federal Government(703) 462-1872

[email protected]