Why NoSQL and MongoDB for Big Data

Preview:

DESCRIPTION

This ingite length deck talks about why we have seen so much database innovation and the genesis of the NoSQL movement over the last 5 year. While there are many great NoSQL products it speaks to why MongoDB is dominating the space and is the heir apparent to the RDBMS for modern operational data.

Citation preview

Why NoSQL

2

Dawn of Databases to Present

Brewer’s CapbornWWW

born

10genfounded

1965 1970 1975 1980 1985 1990 1995 2000 2005 2010

SQLinvented

Oraclefounded

PC’s gaintraction

Client Server

DynamicWeb Content

3 tierarchitecture

Web applications

SOA

CloudComputing

released

NoSQLMovement

BigTable

IDS(network)

IMS(hierarchica

l)

MUMPS

Codd’s paper

IDMS(network)

3

Big Data

Sensor Data (volume, velocity)Situational Awareness (Variety, Volume)

SIGINT(V ) Asset Management (variety, velocity)

OSINT( 3V )

Social Media (

3V )3

Modern Data

4

Relational Database Challenges

Data Types

• Unstructured data

• Semi-structured data

• Polymorphic data

Volume of Data

• Petabytes of data

• Trillions of records

• Millions of queries per second

Agile Development

• Iterative

• Short development cycles

• Changing data model

New Architectures

• Horizontal scaling

• Commodity servers

• Cloud computing

5

The Evolution of Databases

2010

RDBMS

NoSQL

OLAP/BI

Hadoop

2000

RDBMS

OLAP/BI

1990

RDBMS

Operational Data

Datawarehouse

Online

Offline

6

Fully Featured NoSQL

Data Model{ first_name: ‘Paul’, surname: ‘Miller’, city: ‘London’, location: [45.123,47.232], cars: [ { model: ‘Bentley’, year: 1973, value: 100000, … }, { model: ‘Rolls Royce’, year: 1965, value: 330000, … } }}

Rich Queries• Find Paul’s cars• Find everybody in London with a car

built between 1970 and 1980

Geospatial• Find all of the car owners within 5km of

Trafalgar Sq.

Text Search• Find all the cars described as having

leather seats

Aggregation• Calculate the average value of Paul’s

car collection

Native Indexes• Secondary• Compound• Geospatial

• Full Text• Hash• Covering

Security• Kerberos• FIPS 140-2• Field Level Security

• LDAP• Auditing• RBAC

7

Indeed.com TrendsTop Job Trends

1. HTML 5

2. MongoDB

3. iOS

4. Android

5. Mobile Apps

6. Puppet

7. Hadoop

8. jQuery

9. PaaS

10. Social Media

NoSQL Space

LinkedIn Job Skills

MongoDB

Competitor 1

Competitor 2

Competitor 3

Competitor 4

Competitor 5

All Others

Google Search

MongoDB

Competitor 1

Competitor 2

Competitor 3

Competitor 4

Jaspersoft Big Data Index

Direct Real-Time Downloads

MongoDB

Competitor 1

Competitor 2

Competitor 3

8

Open Source Software

Technology must scale

Cost must scale!

Recommended