
The Traditional RDBMS Wisdom is All Wrong

by Michael Stonebraker

HPTS · October 2, 2013

Traditional RDBMS Wisdom

- Data is in disk block format (heavily encoded)
- With a main memory buffer pool of blocks
- Query plans
  - Optimize CPU, I/O
  - Fundamental operation is read a row
- Indexing via B-trees
  - Clustered or unclustered

Traditional RDBMS Wisdom

- Dynamic row-level locking
- ARIES-style write-ahead log
- Replication (asynchronous or synchronous)
  - Update the primary first
  - Then move the log to other sites
  - And roll forward at the secondary(ies)

Traditional RDBMS Wisdom

- Describes MySQL, DB2, Postgres, SQLServer, Oracle, …
- Focus of most college-level DBMS courses
  - Including M.I.T.
- Focus of most DBMS textbooks

Traditional RDBMS Wisdom

- Is obsolete
  - i.e., completely wrong

DBMS Market (about third-sies)

- Data Warehouses
  - Column stores will take over, and they don't look like the traditional wisdom
- Everything else
  - Hadoop, graph stores, No-SQL, array stores, …
- OLTP
  - Focus of this talk!

Reality Check on OLTP Data Bases

- TP data base size grows at the rate transactions increase
- 1 Tbyte is a really big TP data base
- 1 Tbyte of main memory is buyable for around $30K (or less)
  - (say) 64 Gbytes per server in 16 servers (arithmetic sketched below)
- If your data doesn't fit in main memory now, then wait a couple of years and it will…
- Facebook is an outlier
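The sizing claim is just arithmetic; here is a quick check using the slide's own example figures (16 servers at 64 Gbytes each; the ~$30K price is the slide's ballpark, not current market data):

```python
# Sanity check on the main-memory sizing claim, using the slide's figures.
servers = 16
gb_per_server = 64               # Gbytes of DRAM per server (slide's example)
total_gb = servers * gb_per_server
print(total_gb)                  # 1024 Gbytes, i.e. ~1 Tbyte across the cluster

cluster_price = 30_000           # slide's ballpark figure, in dollars
print(cluster_price / total_gb)  # ~$29 per Gbyte of DRAM
```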

Reality Check – Main Memory Performance

- TPC-C CPU cycles
  - On the Shore DBMS prototype
  - "Elephants" should be similar


Motivated H-Store/VoltDB

- Main memory Linux SQL DBMS
- Multi-node and sharded
- Stored procedure interface
- Pure ACID
- Fast
  - ~100X the elephants on TPC-C
  - ~10X No-SQL without giving up ACID
  - Scales to 3M TPC-C transactions per second
- Biggest use case is game state!

OLTP Data Bases -- 4 Big Decisions

- Main memory vs. disk orientation
  - Anti-caching is the answer
- Recovery strategy
  - ARIES is dead; long live transaction logging
- Replication strategy
  - Active-active is the answer
- Concurrency control strategy
  - Determinism wins; nobody uses row-level locking


To Go Fast

- Must focus on overhead
  - Better B-trees affect only a small fraction of the path length
- Must get rid of all four pie slices (buffer pool management, locking, latching, and write-ahead logging)
  - Anything less gives you a marginal win
- You cannot run a disk-based DBMS with a buffer pool!!!!


What if My Data Doesn’t Fit?

- Use a disk-based DBMS and go slow
- Use anti-caching

Anti-Caching (VLDB ‘14)

- Main memory format for data
- When memory fills, gather cold tuples and write them to an archive (in main memory format)
- When a transaction has a "miss", abort it but continue with "fake processing" to find all the absent data (sketched below)
- Get and "pin" the needed data
- Reschedule the transaction when all needed data is in main memory
- Numbers from H-Store implementation
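A minimal sketch of this control flow, with transactions written as generators so a miss can be detected mid-execution. All names here are hypothetical; the real H-Store implementation tracks cold tuples in an LRU chain and unarchives asynchronously:

```python
# Sketch of anti-caching: on a miss, abort but keep running in "fake" mode
# to discover every absent tuple, then fetch, pin, and reschedule.
HOT = {"a": 1}       # tuples resident in memory (main-memory format)
ARCHIVE = {"b": 2}   # evicted cold tuples, still in main-memory format
PINNED = set()       # tuples held in memory for a rescheduled transaction

def execute(txn):
    while True:
        gen, misses, aborted = txn(), [], False
        try:
            key = next(gen)
            while True:
                if key in HOT:
                    key = gen.send(HOT[key])   # real value
                else:
                    aborted = True             # txn will be aborted...
                    misses.append(key)
                    key = gen.send(None)       # ...but fake processing goes on
        except StopIteration as done:
            if not aborted:
                PINNED.clear()                 # release pins; eviction OK again
                return done.value
        for k in misses:                       # get and pin the absent data
            HOT[k] = ARCHIVE.pop(k)
            PINNED.add(k)
        # loop around: reschedule now that all needed data is in main memory

def transfer():
    """A transaction as a generator: yields keys, receives tuple values."""
    a = yield "a"
    b = yield "b"
    return (a or 0) + (b or 0)

print(execute(transfer))   # first run misses "b"; the rerun returns 3
```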


Advantages

- Better main memory management
  - 1 hot tuple won't force 99 cold tuples to stay in main memory with it
- No conversion of data back and forth between main memory and disk format

Disadvantage

- Largest query (and all indexes) must still fit in main memory at one time
- This is not a data warehouse!!
  - Easy to fix with time travel


Conclusion

- There may be corner cases where anti-caching loses to a disk architecture
  - But we can't find one
- Main memory DBMSs are the answer!!!!
  - Hekaton, Hana, SQLFire, MemSQL, VoltDB, …

Some Data From Nirmesh Malviya

- Implemented ARIES in VoltDB
- Compared against the VoltDB scheme
  - Asynchronous checkpoints
  - Command logging (sketched below)
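A hedged sketch of the difference being measured, assuming a deterministic stored-procedure engine (names and record layouts are invented for illustration; VoltDB's actual formats differ). Command logging writes one compact record naming the procedure and its parameters; ARIES-style physiological logging writes a before/after image for every tuple the transaction touches:

```python
# Command logging vs. ARIES-style physiological logging (illustrative only).
import json

command_log = []         # one record per transaction
physiological_log = []   # one record per tuple modified

def log_command(proc, params):
    # Enough to replay the transaction -- *if* execution is deterministic.
    command_log.append(json.dumps({"proc": proc, "params": params}))

def log_physiological(table, key, before, after):
    # Before/after images of every touched tuple; replay needs no re-execution.
    physiological_log.append(
        json.dumps({"table": table, "key": key,
                    "before": before, "after": after}))

# A TPC-C New-Order with 10 order lines: one record vs. ten.
log_command("NewOrder", {"w_id": 3, "d_id": 7, "items": list(range(10))})
for i in range(10):
    log_physiological("ORDER_LINE", [3, 7, i], None, {"qty": 1})

print(len(command_log), len(physiological_log))   # 1 10
```

This is the trade-off in the figures below: command logging does less work per transaction at run time, but recovery must re-execute procedures from the last checkpoint instead of just installing after-images.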

[Figure: TPC-C throughput (thousands of tpmC, 0-1000) vs. client rate (thousands of tps, 0-60), for command logging, physiological logging, and no logging]

[Figure: Recovery rate (thousands of tpmC, 0-1600) vs. client rate during the run before the crash (thousands of tps, 0-60), for command logging and physiological logging]

Some Data From Nirmesh Malviya

- 1.5X run-time performance gain for command logging
- 1.5X penalty at recovery time
- Almost all OLTP applications demand HA
  - Only run recovery for cluster-wide failures
  - E.g., a power outage
- Bye-bye Mohan

How to Implement HA

- Active-passive
  - As in the traditional wisdom
- Active-active
  - Send update transactions to all copies
  - Each executes the transaction logic

How to Implement HA

- Active-passive
  - Write Nirmesh's data log over the network and roll forward at the backup node
- Active-active
  - Send only the transaction, not the effect of the transaction (sketched below)
  - Allows read queries to be sent to any replica
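A minimal sketch of what crosses the network in each scheme, assuming deterministic execution so that active-active replicas stay identical (hypothetical API; real systems batch, compress, and acknowledge):

```python
# What ships over the network: the transaction itself (active-active)
# versus the transaction's effects (active-passive). Illustrative only.
class Replica:
    def __init__(self):
        self.state = {}

    def execute(self, proc, params):
        # Active-active: every replica runs the transaction logic itself.
        if proc == "add":
            key, amount = params
            self.state[key] = self.state.get(key, 0) + amount

    def apply_log(self, records):
        # Active-passive: the backup just installs the primary's effects.
        for key, value in records:
            self.state[key] = value

# Active-active: ship the small invocation to all copies.
a1, a2 = Replica(), Replica()
for r in (a1, a2):
    r.execute("add", ("balance", 100))

# Active-passive: primary executes, then ships per-tuple effects.
primary, backup = Replica(), Replica()
primary.execute("add", ("balance", 100))
backup.apply_log(primary.state.items())

assert a1.state == a2.state == backup.state == {"balance": 100}
```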

My Intuition – Active-Active will Cream Active-Passive

- Extend Nirmesh's numbers to network traffic
  - 1.5 becomes 2 or 3 at run time
  - Roll forward stays at 1.5
- I.e., active-active will win
  - Would be nice to prove this!!!


Concurrency Control

- MVCC popular (NuoDB, Hekaton)
- Timestamp order popular (H-Store/VoltDB; sketched below)
- Lightweight combinations of timestamp order and dynamic locking (Calvin, DORA)
- I don't know anybody who is doing normal dynamic locking
  - It's too slow!!!!
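A minimal sketch of timestamp-order execution at a single partition, in the spirit of H-Store (names are illustrative): transactions get a stamp on arrival and run serially in stamp order, so there are no row locks or latches at all:

```python
# Timestamp-order concurrency control at one partition (illustrative).
import heapq, itertools

class Partition:
    def __init__(self):
        self.state = {}
        self.queue = []                  # min-heap ordered by timestamp
        self.clock = itertools.count()   # monotonic timestamp source

    def submit(self, txn):
        heapq.heappush(self.queue, (next(self.clock), txn))

    def run_all(self):
        while self.queue:
            ts, txn = heapq.heappop(self.queue)
            txn(self.state)              # serial execution: trivially ACID

p = Partition()
p.submit(lambda s: s.update(x=1))
p.submit(lambda s: s.update(x=s["x"] + 1))
p.run_all()
print(p.state)   # {'x': 2} -- identical on any replica given the same stamps
```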

The Nail in the Coffin

- Timestamp order is compatible with active-active
  - As are all deterministic CC schemes
- Row-level locking and MVCC are not
  - They need a two-phase commit between the replicas (demonstrated below)
  - Slow, slow, slow
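A tiny demonstration of the divergence problem: a lock scheduler may legitimately interleave the same two transactions differently on two replicas, so their states drift apart unless the replicas coordinate on every transaction (e.g., with two-phase commit). A deterministic global order removes the need for that coordination:

```python
# Two valid serial orders of the same transactions leave different states.
double = lambda s: s.__setitem__("x", s["x"] * 2)
add10  = lambda s: s.__setitem__("x", s["x"] + 10)

replica_a = {"x": 1}; double(replica_a); add10(replica_a)  # order: double, add10
replica_b = {"x": 1}; add10(replica_b); double(replica_b)  # order: add10, double
print(replica_a, replica_b)   # {'x': 12} vs. {'x': 22} -- replicas diverged

# With one agreed timestamp order, every replica picks the same schedule,
# so no cross-replica two-phase commit is needed.
```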

Net-Net on OLTP

- Main memory DBMS
  - With anti-caching
  - And command logging
- Deterministic concurrency control
- HA via active-active
- Has nothing to do with the traditional wisdom!!!

Summary

- What we teach our DBMS students is all wrong
- Legacy implementations from the elephants are all wrong
