21

Technical Evangelist twitter: @tgrallemail: [email protected]@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Embed Size (px)

Citation preview

Page 1: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase
Page 2: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Technical Evangelist

twitter: @tgrallemail: [email protected]

Tugdual “Tug” Grall

BigData - NoSQLHadoop - Couchbase

Page 3: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

About me

• Tugdual “Tug” Grall

- Couchbase

- Technical Evangelist

- eXo

- CTO

- Oracle

- Developer/Product Manager

- Mainly Java/SOA

- Developer in consulting firms

• Web

- @tgrall

- http://blog.grallandco.com

- tgrall

• NantesJUG co-founder

• Pet Project :

• http://www.resultri.com

Page 4: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

<50%?

2027

95%Relational

Technology

$30B Database Market Being Disrupted

2012

All new database growth will be NoSQLAll new database growth will be NoSQLAll new database growth will be NoSQLAll new database growth will be NoSQL

Relational Technology

Relational Technology

Relational Technology

NoSQLTechnology

Other

Page 5: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

ClouderaHortonworks

Mapr

Operational vs. Analytic Databases

CouchbaseMongoDB

CassandraHbase

AnalyticAnalyticDatabasesDatabases

Get insights Get insights from datafrom data

Real-time, Real-time, Interactive Interactive DatabasesDatabases

Fast access Fast access to datato data

NoSQL

Page 6: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

What Is Biggest Data Management Problem Driving Use of NoSQL in Coming Year?

Lack of flexibility/rigid schemas

Inability to scale out data

Performance challenges

Cost All of these Other

49%

35%

29%

16%12% 11%

Source: Couchbase Survey, December 2011, n = 1351.

Page 7: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Hadoop & NoSQL

Page 8: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

What is Sqoop?

Sqoop is a tool designed to transfer data between Hadoop and relational databases.

You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS.

sqoop.apache.org

Page 9: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

What is Sqoop?

Traditional ETL

Application DataData

T

Page 10: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

What is Sqoop?

A different paradigm

Data

Application

Data

Page 11: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

What is Sqoop?

A very scalable different paradigm

Data

Application

Data

Application

Data

Application

Data

Page 12: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

What is Sqoop?

Where did the Transform go?

Application

Data

TTT TTT TTT TTT

Page 13: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Sqoop Details

• Sqoop

• Default connection is via JDBCLots of custom connectorsCouchbase, VoltDB, VerticaTeradata, NetezzaOracle, MySQL, Postgres

Page 14: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Ad and offer targeting

events

profiles, campaigns

profiles, real time campaign statistics

40 milliseconds to respond with the decision.

2

3

1

Page 15: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Moving Parts

Page 16: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Content and Recommendation Targeting

Page 17: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Content Driven Site: Moving Parts

Page 18: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Couchbase

Page 19: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Couchbase Server Core Principles

Easy Scalability

Consistent High Performance

Always On

24x365

Grow cluster without application changes, without downtime with a single click

Consistent sub-millisecond read and write response times

with consistent high throughput

No downtime for software upgrades, hardware maintenance, etc.

Flexible Data Model

JSON document model with no fixed schema.

Page 20: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Couchbase Handles Real World Scale

Page 21: Technical Evangelist twitter: @tgrallemail: tug@couchbase.comtug@couchbase.com Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

Q&A