32
NoSQL, NewSQL, Big Data… Total Data The Future of Enterprise Data Management NoSQL Road Show MaLhew AsleL, The 451 Group [email protected] © 2011 by The 451 Group. All rights reserved

NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

  • Upload
    others

  • View
    14

  • Download
    0

Embed Size (px)

Citation preview

Page 1: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

NoSQL,  NewSQL,    Big  Data…  Total  Data  The  Future  of  Enterprise  Data  Management    

NoSQL  Road  Show  

MaLhew  AsleL,  The  451  Group  [email protected]  

©  2011  by  The  451  Group.  All  rights  reserved    

Page 2: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    ©  2011  by  The  451  Group.  All  rights  reserved    

Overview  

  NoSQL  and  NewSQL  databases   AdopPon  and  development  drivers  

  Big  Data  and  Total  Data   DefiniPon  and  implicaPons  

2  

Page 3: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

© 2011 by The 451 Group. All rights reserved ©  2011  by  The  451  Group.  All  rights  reserved    

451  Research  is  focused  on  the  business  of  enterprise  IT  innovaPon.  The  company’s  analysts  provide  criPcal  and  Pmely  insight  into  the  compePPve  dynamics  of  innovaPon  in  emerging  technology  segments.  

The  451  Group  

Tier1  Research  is  a  single-­‐source  research  and  advisory  firm  covering  the  mulP-­‐tenant  datacenter,  hosPng,  IT  and  cloud-­‐compuPng  sectors,  blending  the  best  of  industry  and  financial  research.    

The  UpPme  InsPtute  is  ‘The  Global  Data  Center  Authority’  and  a  pioneer  in  the  creaPon  and  facilitaPon  of  end-­‐user  knowledge  communiPes  to  improve  reliability  and  uninterrupPble  availability    in  datacenter  faciliPes.  

TheInfoPro  is  a  leading  IT  advisory  and  research  firm  that  provides  real-­‐world  perspecPves  on  the  customer  and  market  dynamics  of  the  enterprise  informaPon  technology  landscape,  harnessing  the  collecPve  knowledge  and  insight  of  leading  IT  organizaPons  worldwide.  

ChangeWave  Research  is  a  research  firm  that  idenPfies  and  quanPfies  ‘change’  in  consumer  spending  behavior,  corporate  purchasing,  and  industry,  company  and  technology  trends.    

Page 4: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Coverage  areas  

 MaLhew  AsleL   Senior  analyst,  enterprise  so]ware   With  The  451  Group  since  2007   www.about.me/maLasleL   www.twiLer.com/masleL  

  Commercial  AdopPon  of  Open  Source  (CAOS)   AdopPon  by  enterprises   AdopPon  by  vendors  

  InformaPon  Management   Database   Data  warehousing   Data  caching  

4  

Page 5: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Relevant  reports  

  NoSQL,  NewSQL  and  Beyond   Assessing  the  drivers  behind  the  development  and  adopPon  of  NoSQL  and  NewSQL  databases,  as  well  as  data  grid/caching  technologies  

 Released  April  2011   Role  of  open  source  in  driving  innovaPon   [email protected]  

Page 6: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

NoSQL,  NewSQL  and  Beyond  

NoSQL    New  breed  of  non-­‐relaPonal  database  products  

  RejecPon  of  fixed  table  schema  and  join  operaPons    

  Designed  to  meet  scalability  requirements  of  distributed  architectures  

  And/or  schema-­‐less  data  management  requirements    

NewSQL  

 New  breed  of  relaPonal  database  products  

 Retain  SQL  and  ACID   Designed  to  meet  scalability  requirements  of  distributed  architectures  

 Or  improve  performance  so  horizontal  scalability  is  no  longer  a  necessity    

…  and  Beyond  

 In-­‐memory  data  grid/cache  products   PotenPal  primary  pladorm  for  distributed  data  management      

Page 7: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

NoSQL,  NewSQL  and  Beyond  

NoSQL    Big  tables  –  data  mapped  by  row  key,  column  key  and  Pme  stamp    

  Key-­‐value  stores  -­‐  store  keys  and  associated  values    

  Document  store  -­‐  stores  all  data  as  a  single  document    

  Graph  databases  -­‐  use  nodes,  properPes  and  edges  to  store  data  and  the  relaPonships  between  enPPes  

NewSQL   MySQL  storage  engines  -­‐  scale-­‐up  and  scale-­‐out  

 Transparent  sharding  -­‐  reduce  to    manual  effort  required  to  scale  

 Appliances  -­‐  take  advantage  of  improved  hardware  performance,  solid  state  drives  

 New  databases  -­‐  designed  specifically  for  scale-­‐out  

Data  grid/cache    spectrum  of  data  management  capabiliPes,  from  non-­‐persistent  data  caching  to  persistent  caching,  replicaPon,  and  distributed  data  and  compute  grid  

Page 8: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Photo credit: Foxtongue on Flickr http://www.flickr.com/photos/foxtongue/4844016087/

Page 9: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Scalability  

9  

Hardware  economics  –  scale  out  

  Example  project/service/vendor:  •  BigTable,  HBase,  Riak,  MongoDB,  Couchbase,  Hadoop  

•  Amazon  RDS,  Xeround,  SQL  Azure,  NimbusDB  

•  Data  grid/cache  

  Associated  use  case:  •   Large-­‐scale  distributed  data  storage  •   Analysis  of  conPnuously  updated  data  •   MulP-­‐tenant  PaaS  data  layer  

Page 10: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Performance  

10  

MySQL  limitaPons,  performance  at  scale  

  Example  project/service/vendor:  •  Hypertable,  Couchbase,  Riak,  Membrain,  MongoDB,  Redis  

•  Data  grid/cache  •  VoltDB,  Clustrix  

  Associated  use  case:  •  Real  Pme  data  processing  of  mixed  read/write  workloads  

•  Data  caching  •  Large-­‐scale  data  ingesPon  

Page 11: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Relaxed  consistency  

11  

CAP  Theorem  –  relax  consistency  to    maintain  availability  

  Example  project/service/vendor:  •  Dynamo,  Voldemort,  Cassandra,  Riak  

•  Amazon  SimpleDB  

  Associated  use  case:  •  MulP-­‐data  center  replicaPon    

•  Service  availability  •  Non-­‐transacPonal  data  off-­‐load  

Page 12: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Agility  

12  

Polyglot  persistence,  schema-­‐less  

  Example  project/service/vendor:  •  MongoDB,  CouchDB,  Cassandra,  Riak  

•  Google  App  Engine,  SimpleDB,  SQL  Azure  

  Associated  use  case:  •  Mobile/remote  device  synchronizaPon  

•  Agile  development  

•  Data  caching  

Page 13: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Intricacy  

13  

Big  data,  total  data  

  Example  project/service/vendor:  •  Neo4j,  GraphDB,  InfiniteGraph  •  Apache  Cassandra,  Hadoop,  Riak  •  VoltDB,  Clustrix  

  Associated  use  case:  •  Social  networking  applicaPons  •  Geo-­‐locaPonal  applicaPons  •  ConfiguraPon  management  database  

Page 14: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Necessity  

14  

Open  source  

  The  failure  of  exisPng  suppliers  to  address  emerging  requirements  

  Example  projects:  •  BigTable:  Google  •  Dynamo:  Amazon  

•  Cassandra:  Facebook  •  HBase:  Powerset  •  Voldemort:  LinkedIn  

•  Hypertable:  Zvents  •  Neo4j:  Windh  Technologies  

Page 15: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Big  Data…  Total  Data  

Page 16: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Current  data  management  trends  

16  

The  volume,  variety  and  velocity  of  data  is  growing  rapidly  

Data  processing  capabiliPes  have  never  been  beLer  

 The  value  of  data  has  never  been  beLer  understood  

  The  data  deluge  problem  is  also  a  big  data  opportunity  

RISK  OPPORTUNITY  

Page 17: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Big  Data?  

17  

 More  than  just  rising  data  volumes  

  Big  Data  ≠  Volume  

Page 18: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Big  Data?  

18  

  Also  variety  of  data  types/sources  and  velocity  of  data  updates  

  Big  Data  =  Volume  ±  Variety  ±  Velocity  

Page 19: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Current  data  management  trends  

19  

The  volume,  variety  and  velocity  of  data  is  growing  rapidly  

 Data  processing  capabiliPes  have  never  been  beLer  

 The  value  of  data  has  never  been  beLer  understood  

 ‘Big  Data’  covers  a  diverse  set  of  products  that  can  be  applied  to  different  problems  

  ‘Big  Data’  highlights  the  problem  –  volume/variety/velocity,    and  promises  a  soluPon  –  value,    but  doesn’t  provide  a  path  in  between  

RISK  OPPORTUNITY  

Page 20: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Total  Data?  

20  

  Not  just  another  name  for  Big  Data  

  It’s  about  how  you  deliver  value  from  that  data  

Page 21: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Total  Data?  

21  

  Also  the  desire  of  the  user  to  store  and  process  all  their  data  

  Value  =  (Volume  ±  Variety  ±  Velocity)  x  Totality  

Page 22: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Total  Data?  

22  

  And  the  desire  to  explore  their  data  for  new  value  

  Value  =  (Volume  ±  Variety  ±  Velocity)  x  (Totality  +  ExploraPon)  

Page 23: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Total  Data?  

23  

 Within  tolerable  Pme  frames  

  Value  =  (Volume  ±  Variety  ±  Velocity)  x  (Totality  +  ExploraPon)                Time  

Page 24: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

What  is  Total  Data?  

24  

 Within  tolerable  Pme  frames  

  Value  =  (Volume  ±  Variety  ±  Velocity)  x  (Totality  +  ExploraPon)                Time  

  Total  Data  is  making  the  most  efficient  use  of  exisPng  and  new  data  management  resources  to  deliver  value  from  data  

  If  your  data  is  big,  the  way  you  manage  it  should  be  total  

  Inspired  by  ‘Total  Football’  

Page 25: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Source: Wikimedia. Attribution: Bundesarchiv, Bild 183-N0716-0314 / Mittelstädt, Rainer / CC-BY-SA http://commons.wikimedia.org/wiki/File:Bundesarchiv_Bild_183-N0716-0314,_Fu%C3%9Fball-WM,_BRD_-_Niederlande_2-1.jpg

Page 26: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Total  Football  meets  Total  Data  

  “You  make  space,  you  come  into  space.  And  if  the  ball    doesn’t  come,  you  leave  this  space  and  another  player    will  come  into  it.”  

 Bernadus  Hulshoff,  Ajax    1966-­‐77  

  Abandonment  of  restricPve  (self-­‐imposed)  rules  about  individual  roles  and  responsibility  

  PromoPon  of  individuality  within  the  overall  context    of  the  system  

  Enabled  and  relied  on  fluidity  and  flexibility  to  respond  to  changing  requirements  

  Reliant  on,  and  exploited,  improved  performance  levels    26  

Page 27: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

The  old  way  

27  

ReporPng/BI  App  

ETL/data  integraPon   EDW  

Data  archive  

ReporPng/BI  

ReporPng/BI  

ReporPng/BI  

RelaPonal  database  

App  

App  

App  

App  

App  

Data  mart  

Data  mart  

ReporPng/BI  

ReporPng/BI  

RelaPonal  database  

RelaPonal  database  

Page 28: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

The  old  way  

28  

Data archive

Operational database

Analytic database

Business intelligence

Page 29: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

The  new  way  

29  

ETL/data  integraPon   EDW  

Queryable  archive  

ReporPng/BI  

Hadoop  

Exploratory  analyPcs  pladorm  

App   Stream  processing  

Datastructure  

Datastructure  

Virtual  Data  Hub  

Data  mart  

Data  mart  

ReporPng/BI  

ReporPng/BI  

RelaPonal  database  

App  

App  

App  

App  

App  

NewSQL  database  

NoSQL  database  

Non-­‐relaPonal  database  

App  

RelaPonal  database  

Cache  ReporPng/BI  

App  

RelaPonal  database  

Page 30: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

The  new  way  

30  

Operational database

Analytic database

Hadoop

Event stream processing

Exploratory analytics

Data virtualization

Non-relational database

Data cache/grid

Data archive

Datastructure In-database analytics

Page 31: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

Relevant  reports  

  Total  Data   Explaining  the  the  total  data  management  approach  to  dealing  with  the  impact  of  big  data  on  the  data  management  landscape  

 Coming  late  2011   [email protected]  

  Free  copy  for  complePng  our  Total  Data  survey:  

  www.bit.ly/451data  

COMING LATE 2011

Page 32: NoSQL,NewSQL, BigData…TotalDatanosqlroadshow.com/dl/basho-roadshow-amsterdam-2011/slides/Mat… · ©"2011"by"The"451"Group."All"rights"reserved"" NoSQL,NewSQL," Big"Data…"Total"Data"

©  2011  by  The  451  Group.  All  rights  reserved    

FULL  TIME  

Thank  you   QuesPons?  

www.451research.com @maslett