30
Data Warehousing on Hadoop The Future of Data Warehousing.

Future of data warehousing.pptx (1) uli

Embed Size (px)

Citation preview

Page 1: Future of data warehousing.pptx (1)  uli

Data Warehousing on HadoopThe Future of Data

Warehousing.

Page 2: Future of data warehousing.pptx (1)  uli

Big Data in the Olden Days

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

_________________________________________________________________________________________________

Page 3: Future of data warehousing.pptx (1)  uli

Big Data Today

Machine Translation

______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 4: Future of data warehousing.pptx (1)  uli

Big Data Today

Voice of Patient

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 5: Future of data warehousing.pptx (1)  uli

The Rise of Big Data

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 6: Future of data warehousing.pptx (1)  uli

The Perfect Data Storm

With digitisation we now have an abundance of data (exponential growth).

Globalisation & Machine Data

Distributed Computing.

Moore’s Law

New breakthroughs in Artificial Intelligence (neural networks)

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 7: Future of data warehousing.pptx (1)  uli

Big Data vs EDW

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 8: Future of data warehousing.pptx (1)  uli

EDW Ralph

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 9: Future of data warehousing.pptx (1)  uli

EDW Bill

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 10: Future of data warehousing.pptx (1)  uli

ETL vs ELT

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 11: Future of data warehousing.pptx (1)  uli

SMP vs MPP______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 12: Future of data warehousing.pptx (1)  uli

RDBMS - Swiss Data Knife

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 13: Future of data warehousing.pptx (1)  uli

Limitations – Persistent Storage______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 14: Future of data warehousing.pptx (1)  uli

Limitations – Unstructured Data

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 15: Future of data warehousing.pptx (1)  uli

Limitations – Unstructured Data

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 16: Future of data warehousing.pptx (1)  uli

Limitations – ETL______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 17: Future of data warehousing.pptx (1)  uli

Limitations – ETL

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 18: Future of data warehousing.pptx (1)  uli

Limitations – BI

$$$$ Cloud

Server-less

Open Source

______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 19: Future of data warehousing.pptx (1)  uli

Limitations – Graph

- Verbose- Performance (self-joins)

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 20: Future of data warehousing.pptx (1)  uli

Limitations – Graph

MATCH (kenau:Person {name:"Keanu Reeves"})-[:ACTED_IN]->(movie)<-[:ACTED_IN]-(coStar)RETURN coStar.name;Cypher Query Language

______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 21: Future of data warehousing.pptx (1)  uli

Limitations – Graph

Data Orchestration

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 22: Future of data warehousing.pptx (1)  uli

Limitations – Graph

Data Catalog

Data Lineage

Master Data Management

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 23: Future of data warehousing.pptx (1)  uli

Other Limitations______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 24: Future of data warehousing.pptx (1)  uli

Limitations Agility

Business Requirements

Data Analysis

Source to target map

Data Model

Development (ETL, BI, Reports, Dashboards)

Testing

Deployment

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 25: Future of data warehousing.pptx (1)  uli

Self-Service Sandboxes

Use Cases

Data Profile/Analysis

Ad-hoc Analytics

Data Science

Data Exhaust for EDW

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 26: Future of data warehousing.pptx (1)  uli

Analytics Sandboxes

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

__________________________________________________________________________

EDW/BI Data Discovery/Science

Data Format/Quality

Cleansed, Processed, Integrated

Raw, Unknown

Data Types Structured Any

Method Known Unknowns Unknown Unknowns EDA

Data Scope EDW All, New data sources, EDW, long tail of data

Time to Insight Long – EDW Lifecycle Shorter

Data Transformations

Formal: ETL, Code Ad-hoc: Iterative, Self-Service, GUI

Tool ETL & BI Tool Data Discovery ,Science, Preparation Platform

Self-Service Ad-hoc queries All data

Testing Unit, Integration, UAT Less Formal

Audience Business User, Power User

Data Scientist, Data Analyst,Data Developer

______________________________________________________________________________________________

Page 27: Future of data warehousing.pptx (1)  uli

Is EDW Obsolete?______________________________________________________________________________________________

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

Page 28: Future of data warehousing.pptx (1)  uli

In Summary

Use the appropriate technology for the problem at hand… and yes, there is a fancy word for this

Polyglot Persistence

The Law of the Instrument

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Page 29: Future of data warehousing.pptx (1)  uli

Sonra Offerings

p: +353 1 254 2897t: @sonra_io

e: [email protected]: www.sonra.io

_________________________________________________________________________________________________

______________________________________________________________________________________________

Data Warehouse in the Cloud Quick Start Packages

Training: Big Data for Data Warehouse Professionals

Page 30: Future of data warehousing.pptx (1)  uli

WWW.SONRA.IOWWW.SONRA.IO