9
Mansour Raad [email protected] Kyunam Kim [email protected] Geospatial Analytics and AI at Scale with Big Data Toolkit

Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

  • Upload
    others

  • View
    23

  • Download
    2

Embed Size (px)

Citation preview

Page 1: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

Mansour Raad [email protected] Kim [email protected]

Geospatial Analytics and AI at Scalewith Big Data Toolkit

Page 2: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

• Esri Professional Services solution that allows customers to analyze, aggregate, and enrich big data within their existing big data analytics platform

[email protected] for inquiries

What is Big Data Toolkit (BDT)?

Page 3: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

Target Audience

Unifying Big Data Analytics Platform + ArcGIS Platform together

HDFSHiveSpark Massive Analytics

Dissemination

Advanced AnalyticsBig Data Toolkit

• No Spatial Indexing

ArcGIS Enterprise

Spatiotemporal

Enterprise Geodatabase

OMG!Really?

Automatic publishing of Map/Feature services

……

Open-sourcegeo tools

• No Input Data Prep / Movement

Page 4: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

Capabilities

Nearest Coordinate

Point in Polygon

Distance Relation

GeneralizeAdd

Calculate Field

Web Mercator

Hex

Extent Filter

Point Exclude

Feature Filter

Point Include

Smart

Snapping(Map

Matching)

Time Filter

Project

Stats

Delete

Dissolve

Geocoding & Reverse

Geocoding

Routing(requires ArcGIS

Enterprise)

Service Area

Calculation(requires ArcGIS

Enterprise)

Smart Similarity Analysis

Standard Distance

Clustering

GWR

Moving Averages Clip Future

Capability

Geodetic Area

Simplify

Intersection

SQL

File Geodataba

se

Shapefiles

Enterprise

Geodatabase

csv tsv

parquet

Hive

Enterprise Geodatabase

Native Geometry

Format

ArcGIS

Spatiotemporal

Big Data Store

csv

tsvparquet

Sources Processors Sinks

Page 5: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

Smart Snapping (Map Matching)

Page 6: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

Big Data Toolkit Stack

Page 7: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

- NYC Taxi 35M- ZIPCODE 360 Polygons with 2100+ Demographic Variables

- Add New Column- Point-in-Polygon- Summary Statistics based on 200m & 500m hexagons- Persist to Enterprise Geodatabase (SQL Server)- Visualize Hexagon Aggregations in ArcGIS

Demo 1

Page 8: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

- Sneak Peek at New Big Data Toolkit 2.0- New Target Audience: Data Scientists- Use Big Data Toolkit in Notebook

Demo 2

Page 9: Geospatial Analytics and AI at Scale with Big Data Toolkit...Geospatial Analytics and AI at Scale with Big Data Toolkit Author: Esri Subject: 2020 Esri Developer Summit -- Presentation

• RDD -> DataFrame, SQL• Project Tungsten

- Memory Management and Binary Processing- Cache-aware computation- Code generation- https://databricks.com/blog/2015/04/28/project-tungsten-bringing-spark-closer-to-bare-

metal.html

New Big Data Tookit v2.0