OLTP on Hardware Islands
Danica Porobic, Ippokratis Pandis*, Miguel Branco, Pınar Tözün, Anastasia Ailamaki
Data-Intensive Application and Systems Lab, EPFL*IBM Research - Almaden
Hardware topologies have changed
Topology and core-to-core latency:
• CMP: low (~10ns)
• SMP of CMPs (Islands): variable (10-100ns)
• SMP: high (~100ns)
Variable latencies affect performance & predictability
Should we favor shared-nothing software configurations?
The Dreaded Distributed Transactions
[Figure: throughput vs. % multisite transactions for shared-everything and shared-nothing]
• Shared-nothing OLTP configurations need to execute distributed transactions
– Extra lock-holding time
– Extra system calls for communication
– Extra logging
– Extra 2PC logic/instructions
• Shared-everything at least provides robust performance
– Some opportunities for constructive interference (e.g. Aether logging)
Shared-nothing configurations are not a panacea!
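The extra messages, log writes, and lock-holding time listed above all stem from two-phase commit. A minimal coordinator-side sketch (hypothetical `Participant` class, not Shore-MT's actual code) makes the overhead count explicit:

```python
# Two-phase-commit coordinator sketch (illustrative only, not Shore-MT's
# implementation). Every message round and forced log write below is work
# that a single-site transaction never pays.

class Participant:
    """One partition touched by the distributed transaction."""
    def __init__(self, name):
        self.name = name
        self.log = []                    # stand-in for the participant's WAL

    def prepare(self):
        self.log.append("prepare")       # extra log record, force-flushed
        return True                      # vote yes

    def commit(self):
        self.log.append("commit")        # second extra log record

    def abort(self):
        self.log.append("abort")


def two_phase_commit(participants):
    # Phase 1: one message round; all locks stay held while we wait.
    votes = [p.prepare() for p in participants]
    decision = "commit" if all(votes) else "abort"
    # Phase 2: second message round; only now can locks be released.
    for p in participants:
        p.commit() if decision == "commit" else p.abort()
    return decision
```

For a transaction touching N instances this is two message rounds and 2N log writes, which is why throughput drops as the multisite percentage grows.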
Deploying OLTP on Hardware Islands
HW + Workload -> Optimal OLTP configuration
Shared-everything
✓ Single address space, easy to program
✗ Random read-write accesses
✗ Many synchronization points
✗ Sensitive to latency
Shared-nothing
✓ Thread-local accesses
✗ Distributed transactions
✗ Sensitive to skew
[Figure: performance ranges from best to worst depending on thread-to-core assignment]
Outline
• Introduction
• Hardware Islands
• OLTP on Hardware Islands
• Conclusions and future work
Multisocket multicores
Communication latency varies significantly
[Figure: two sockets of a multisocket multicore, each with per-core L1/L2 caches, a shared L3, and a memory controller, connected by inter-socket links. Communication costs <10 cycles between cores sharing a cache, ~50 cycles within a socket, and ~500 cycles across sockets.]
Placement of application threads
[Figure: counter microbenchmark throughput (Mtps) and TPC-C Payment throughput (Ktps) on the 8-socket x 10-core and 4-socket x 6-core machines, comparing spread, island, and OS-chosen thread placement. Island placement wins by 39-47%; OS placement is unpredictable.]
Islands-aware placement matters
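Islands-aware placement amounts to pinning a worker's threads to the cores of one socket instead of letting the OS spread them. A sketch, assuming sockets own contiguous core IDs (real machines may interleave IDs, so production code should query the topology, e.g. via hwloc or /sys):

```python
import os

def island_cores(socket_id, cores_per_socket):
    """Cores belonging to one socket, assuming contiguous core numbering.
    This numbering is an assumption; real topologies should be queried."""
    start = socket_id * cores_per_socket
    return list(range(start, start + cores_per_socket))

def pin_to_island(socket_id, cores_per_socket):
    """Restrict the calling process/thread to one island's cores."""
    cores = island_cores(socket_id, cores_per_socket)
    if hasattr(os, "sched_setaffinity"):     # available on Linux
        os.sched_setaffinity(0, cores)       # 0 = the calling thread
    return cores
```

Pinning one instance per island keeps all of its cache-coherence traffic on the cheap intra-socket paths.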
Impact of sharing data among threads
[Figure: throughput (log scale) of the counter microbenchmark (Mtps) and TPC-C Payment, local-only (Ktps), on the 8-socket x 10-core and 4-socket x 6-core machines, for a single counter, a counter per socket, and a counter per core. A counter per socket is 18.7x faster than a single counter (vs. 8x sockets), a counter per core is 516.8x faster (vs. 80x cores), and shared-nothing beats shared-everything on Payment by 4.5x.]
Fewer sharers lead to higher performance
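The counter experiment can be reproduced in miniature: one lock-protected counter shared by all threads (shared-everything) versus one counter per thread, summed at the end (shared-nothing). A toy sketch, not the paper's benchmark:

```python
import threading

N_THREADS = 4
INCREMENTS = 10_000

def run_shared():
    """Single counter, single lock: every increment is a sync point."""
    total = [0]
    lock = threading.Lock()
    def work():
        for _ in range(INCREMENTS):
            with lock:
                total[0] += 1
    threads = [threading.Thread(target=work) for _ in range(N_THREADS)]
    for t in threads: t.start()
    for t in threads: t.join()
    return total[0]

def run_per_thread():
    """One counter per thread: no sharing; aggregate once at the end."""
    counters = [0] * N_THREADS
    def work(i):
        c = 0
        for _ in range(INCREMENTS):
            c += 1
        counters[i] = c
    threads = [threading.Thread(target=work, args=(i,))
               for i in range(N_THREADS)]
    for t in threads: t.start()
    for t in threads: t.join()
    return sum(counters)
```

Both variants compute the same total, but the per-thread version has no lock traffic or cache-line ping-pong; that gap is where the 18.7x and 516.8x differences come from on real hardware. (In CPython the GIL masks much of the effect; the sketch only illustrates the structure.)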
Outline
• Introduction
• Hardware Islands
• OLTP on Hardware Islands
– Experimental setup
– Read-only workloads
– Update workloads
– Impact of skew
• Conclusions and future work
Experimental setup
• Shore-MT
– Top-of-the-line open-source storage manager
– Enabled shared-nothing capability
• Multisocket servers
– 4-socket, 6-core Intel Xeon E7530, 64GB RAM
– 8-socket, 10-core Intel Xeon E7-L8867, 192GB RAM
• Disabled hyper-threading
• OS: Red Hat Enterprise Linux 6.2, kernel 2.6.32
• Profiler: Intel VTune Amplifier XE 2011
• Workloads: TPC-C, microbenchmarks
Microbenchmark workload
• Single-site version
– Probe/update N rows from the local partition
• Multi-site version
– Probe/update 1 row from the local partition
– Probe/update N-1 rows uniformly from any partition
– Partitions may reside on the same instance
• Input size: 10 000 rows/core
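The single-site/multi-site split above can be sketched as a row-selection routine (a hypothetical stand-in; the real benchmark runs inside Shore-MT):

```python
import random

ROWS_PER_PARTITION = 10_000   # input size: 10 000 rows/core

def pick_rows(n_rows, local_partition, n_partitions, multisite, rng=random):
    """Return (partition, row) pairs for one microbenchmark transaction.

    Single-site: all N rows come from the local partition.
    Multi-site:  1 row is local; the other N-1 are drawn uniformly from
                 any partition, which may again land on the local instance.
    """
    if not multisite:
        return [(local_partition, rng.randrange(ROWS_PER_PARTITION))
                for _ in range(n_rows)]
    picks = [(local_partition, rng.randrange(ROWS_PER_PARTITION))]
    for _ in range(n_rows - 1):
        picks.append((rng.randrange(n_partitions),
                      rng.randrange(ROWS_PER_PARTITION)))
    return picks
```

Because remote rows are drawn uniformly, larger N drags more distinct instances into each transaction, which drives the cost curves in the later slides.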
Software system configurations
• 1 Island: shared-everything
• 4 Islands
• 24 Islands: shared-nothing
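On the 24-core (4-socket x 6-core) machine, the three configurations correspond to grouping the cores into 1, 4, or 24 instances. A sketch of that grouping (assuming contiguous core numbering):

```python
def islands(n_cores, n_instances):
    """Split n_cores cores into n_instances equal-sized islands.
    Assumes contiguous core numbering and even divisibility."""
    assert n_cores % n_instances == 0
    size = n_cores // n_instances
    return [list(range(i * size, (i + 1) * size))
            for i in range(n_instances)]
```

`islands(24, 1)` is shared-everything, `islands(24, 4)` puts one instance per socket, and `islands(24, 24)` is per-core shared-nothing.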
Increasing % of multisite xcts: reads
[Figure: throughput vs. % multisite read transactions for 1, 4, and 24 Islands. 1 Island pays contention for shared data; 24 Islands run without locks or latches but suffer messaging overhead as multisite transactions grow; 4 Islands need fewer messages per transaction.]
Finer-grained configurations are more sensitive to distributed transactions
Where are the bottlenecks? Read case
[Figure: time per transaction (µs) at 0%, 50%, and 100% multisite transactions for 4 Islands, 10 rows, broken down into logging, locking, communication, transaction management, and transaction execution.]
Communication overhead dominates
Increasing size of multisite xct: read case
[Figure: time per transaction (µs) vs. number of rows retrieved (0-100) for 1, 4, 12, and 24 Islands. Cost grows as more instances participate in each transaction and levels off once all instances are involved; 1 Island shows physical contention.]
Communication costs rise until all instances are involved in every transaction
Increasing % of multisite xcts: updates
[Figure: throughput (KTps) vs. % multisite update transactions for 1, 4, and 24 Islands. Shared-nothing instances need no latches, but multisite updates add two rounds of messages, extra logging, and locks held longer.]
Distributed update transactions are more expensive
Where are the bottlenecks? Update case
[Figure: time per transaction (µs) at 0%, 50%, and 100% multisite transactions for 4 Islands, 10 rows, broken down into logging, locking, communication, transaction management, and transaction execution.]
Communication overhead dominates
Increasing size of multisite xct: update case
[Figure: time per transaction (µs) vs. number of rows updated (0-100) for 1, 4, 12, and 24 Islands. More instances per transaction raise cost and contention; 1 Island benefits from efficient logging with Aether*.]
*R. Johnson, et al: Aether: a scalable approach to logging, VLDB 2010
Shared-everything exposes constructive interference
Effects of skewed input
[Figure: throughput (KTps) vs. skew factor (0-1) for 1, 4, and 24 Islands, local-only and 50% multisite. Under skew, a few instances are highly loaded and hot data becomes contended; larger instances can balance the load.]
4 Islands effectively balance skew and contention
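The skew factor on the x-axis is a Zipf parameter: partition i is requested with probability proportional to 1/i^s. A sketch of the resulting per-instance load (an assumed request model, not the paper's exact generator):

```python
import random

def zipf_weights(n, s):
    """Unnormalized Zipf weights for n partitions with skew factor s.
    s = 0 is uniform; larger s concentrates requests on few partitions."""
    return [1.0 / (i ** s) for i in range(1, n + 1)]

def load_per_instance(n_partitions, n_instances, s, n_requests, rng):
    """Draw skewed partition requests and tally load per instance,
    where each instance owns a contiguous block of partitions."""
    weights = zipf_weights(n_partitions, s)
    parts_per_inst = n_partitions // n_instances
    load = [0] * n_instances
    for _ in range(n_requests):
        p = rng.choices(range(n_partitions), weights=weights)[0]
        load[p // parts_per_inst] += 1
    return load
```

With s = 0 the load is even across instances. As s grows, a per-core configuration (24 Islands) leaves one instance hot while the rest idle, whereas a per-socket configuration (4 Islands) mixes hot and cold partitions inside each instance and so balances better.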
OLTP systems on Hardware Islands
• Shared-everything: stable, but non-optimal
• Shared-nothing: fast, but sensitive to workload
• OLTP Islands: a robust middle ground
– Runs on close cores
– Small instances limit contention between threads
– Few instances simplify partitioning
• Future work:
– Automatically choose and set up the optimal configuration
– Dynamically adjust to workload changes
Thank you!