31

"Interactive Deep Analytics" Dashboard

Embed Size (px)

Citation preview

Page 1: "Interactive Deep Analytics" Dashboard
Page 2: "Interactive Deep Analytics" Dashboard

Named leader in report.

Founded in 2009  Acquired by AOL in 2014  

Using Big Data stack since 2009

75 people - 30 R&D

1.5T of daily data >100 data sources

Page 3: "Interactive Deep Analytics" Dashboard

Helping marketers to optimize their spend

Across: Channels | Devices Online + Offline

Convertro

Page 4: "Interactive Deep Analytics" Dashboard

ü Clear

ü Actionable

ü Great UX/I

Successful Dashboard

Page 5: "Interactive Deep Analytics" Dashboard

Rendering time  

Storage  

Cost  

UX/I  

Considerations

Processing time  

Page 6: "Interactive Deep Analytics" Dashboard

Insights  

Comparison  

Over  2me  

Explore  

“S2cky”  Configura2on  

RT  metrics  

Integrate  

Page 7: "Interactive Deep Analytics" Dashboard

USE CASE #1

Page 8: "Interactive Deep Analytics" Dashboard

Speed   Batch  

Page 9: "Interactive Deep Analytics" Dashboard

Materialization to one table is too costly (belated massive updates)

Leverage Vertica’s sorted data structure

Join data in run time ( O(n) )

Query  

Spend    Batch  

Revenue  Speed  

Query* Merged  Structure  

Spend    Batch  

Revenue  Speed  

* λ architecture

Page 10: "Interactive Deep Analytics" Dashboard

USE CASE #2

Page 11: "Interactive Deep Analytics" Dashboard

Different    metrics  with    

1:N  rela2onships  

Page 12: "Interactive Deep Analytics" Dashboard

Avoid joins in query time ( if possible )

Pre joining and aggregate by dimensions

Pre joining does not necessarily explode

your data store

Visits,    Conversions,  Impressions  

Conversions   Impressions  ⨝   ⨝  

Σ  

Visits  

Page 13: "Interactive Deep Analytics" Dashboard

USE CASE #3

Page 14: "Interactive Deep Analytics" Dashboard

Many  Dimensions    

Page 15: "Interactive Deep Analytics" Dashboard

Limit number of returned records to screen – vizualize the most significant data  

 Allow to dump data with different QOS

Page 16: "Interactive Deep Analytics" Dashboard

Allow to choose up to X dimensions – not all  

For each page allow to choose different relevant dimensions  

Build different data structures for different pages

Page 17: "Interactive Deep Analytics" Dashboard

USE CASE #4

Page 18: "Interactive Deep Analytics" Dashboard

Same  data  different  rendering  

Page 19: "Interactive Deep Analytics" Dashboard

Same  data  different  rendering  

Page 20: "Interactive Deep Analytics" Dashboard

Query locality caching  

Backend does data rendering    

Shared configuration across widgets    

MPP has a limited query schedulers  

Table   Query   Cache  

Σ   Widget  1  

Widget  2  

Page 21: "Interactive Deep Analytics" Dashboard

USE CASE #5

Page 22: "Interactive Deep Analytics" Dashboard

Real  Time    data  points    2cker  

Page 23: "Interactive Deep Analytics" Dashboard

Sometimes you don’t have to be 100% accurate or consistent

try using:

Extrapolation

Sampling

Different data stores

Heuristics

logs   Speed  layer   Ticker  Every  X  minutes  

Real  2me    extrapola2on  

Page 24: "Interactive Deep Analytics" Dashboard

Hydro – Data Rendering Service

Hydro

EXTRACT  

TRNSFORM

RENDER  

ETL  Web/App  Server  

API  

DB1  

DB2  

Connect to any data source Multi level caching and invalidation Applying data transformation and rendering Logic sharing

Page 25: "Interactive Deep Analytics" Dashboard

Understand the requirements

One technology doesn't fit all

One data structure doesn't fit all

Good UX takes into account Data and Technology considerations

Page 26: "Interactive Deep Analytics" Dashboard
Page 28: "Interactive Deep Analytics" Dashboard

Data Processing and Mining

Page 29: "Interactive Deep Analytics" Dashboard

Analytics DB - Vertica

Built  for  analy>cs  Storage  /  Query  engine  /  Op2mizer    

Column  oriented  store  Sorted  

True  MPP  

 Deals  well  with  high  cardinality  and  sparse  data  

*not an open source

Page 30: "Interactive Deep Analytics" Dashboard

Real Time metrics

Page 31: "Interactive Deep Analytics" Dashboard

Web Stack

Server      Pandas  Hydro  

Client    Backbone  marioneVe  RequireJs  handlebars      highcharts  d3    underscore  TwiVer  Bootstrap  SlickGrid  ...  

Architecture  

Visualizaion