41
1 RAC on Extended Distance Clusters Erik Peterson RAC Development Oracle Corporation

RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Embed Size (px)

Citation preview

Page 1: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

1

RAC on

Extended Distance Clusters

Erik PetersonRAC DevelopmentOracle Corporation

Page 2: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

2

Agenda

• Benefits of RAC on extended clusters• Design considerations• Empirical performance data• Live customer examples• Positioning w.r.t. DataGuard• Summary

Page 3: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

3

Benefits of RAC on Extended Clusters

• Full utilization of resources no matter where they are located

Site A

All Work gets Distributed to All Nodes

One PhysicalDatabase

Site B

Page 4: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

4

Benefits of RAC on Extended Clusters

• Faster recovery from site failure than any other technology in the market

Site A

Work Continues on Remaining Site

One PhysicalDatabase

Site B

Page 5: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

5

Design Considerations

Page 6: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

6

Design Considerations

• Connectivity• Disk Mirroring• Quorum

Page 7: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

7

Connectivity• Redundant connections for public traffic,

interconnect and I/O

Site A Site BDual SAN Connections

Dual Private Interconnects

Dual Public Connections

Page 8: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

8

Connectivity• Distances > 10km require Dark Fiber (DWDM or CWM). • Extra benefit of separate dedicated channels on 1 fibre• Essential to setup buffer credits for large distances

Site A Site B

DWDM

DWDM

Page 9: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Connectivity Caveats• Distance

– Single fiber limit (100km?)

• Performance– Need to Minimize Latency.

• Direct effect on synchronous disk mirroring and Cache Fusion operation

• Direct point to point connection => Additional routers, hubs, or extra switches add latency

• Cost– High cost of DWDM if not already present in the

infrastructure

Page 10: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Disk Mirroring

• Need copy of data at each location• 2 options exist:

– Host Based Mirroring (CLVM)– Remote Array Based Mirroring

Page 11: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Host Based Mirroring

• Standard cluster aware host based LVM solutions (requires a CLVM)

• Disks appear as one set• All writes get sent to both sets of disks

Page 12: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Array Based Mirroring

• All I/Os get sent to one site, mirrored to other• Examples: EMC SRDF• Longer outage in case of failure of primary site

Primary Secondary

Page 13: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Mirroring Example:Large UK Bank

• 2 nodes AIX• Tested both• 9 km – Host Based Mirroring – Shark Storage

(<1 minute down)• 20 km – Array Based Mirroring (PPRC) w/

ERCMF (extended remote copy facility) that avoids doing a manual restart by suspending I/Os until PPRC has done the switch. (1-5 minutes down)

Page 14: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Cluster Quorum: Recommendations

• What happens if all communications between sites is lost?

Page 15: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Cluster Quorum: Recommendations

• Use a third site for quorum device for maximum availability

ThirdSite

Page 16: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Empirical Performance Data

• Unit Tests (Oracle/HP Test results)– Cache Fusion– I/O

• Overall Application Tests (from 4 different sets of tests)

Page 17: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

17

Empirical Performance DataCache Fusion Unit Test

0

1

2

3

4

5

6

Local 25km 50km 100kmDistance

MS

Low Load - 1 ICLow Load - 2 ICHigh Load - 1 ICHigh Load - 2 IC

~1ms increased memory-to-memory block transfer latency over 100km for all casesResults from joint Oracle/HP testing

Page 18: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

18

Empirical Performance DataI/O Unit Test

02468

101214

Local 50km 100kmDistance

I/O L

aten

cy (m

s)

I/O latency increased by 43% over 100km. Note: Without buffer credits this tested at120-270% I/O latency degradation Results from joint Oracle/HP testing

Page 19: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

19

Empirical Performance DataOverall Results: Joint Oracle/HP Testing

For 100km …• Memory-to-memory messaging latency increased

~ 1ms• I/O latency increased in the ballpark of 43% .

This is ~ 4-5 ms

Page 20: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

20

Empirical Performance DataOverall Application Effect

0%10%20%30%40%50%60%70%80%90%

100%

Local 25km 50km 100kmDistance

% o

f Loc

al P

erfo

rman

ce

HP/Oracle RAC Test

Untuned examplew/o buffer credits

Page 21: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

21

Empirical Performance DataOverall Application Effect

0%10%20%30%40%50%60%70%80%90%

100%

Local 20/25km 40km 80kmDistance

% o

f Loc

al P

erfo

rman

ce

Veritas RAC Test

IBM/Oracle RAC Test

Note: differences in results are due to differences in test cases, not in clusterware used

Tuned examplewith buffer credits

Page 22: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Comic Relief

• 3 nodes Sun Solaris • 8km DWDM link• brownouts of around 11 seconds• 10% introduced performance hit • Active/Active: host based mirroring using

Veritas Volume Manager

Page 23: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

23

Comic Relief (UK) – Sun 8km

Fibre Channel Switch for SAN disk access FC-SW over DWDM

Sun-13 Sun-14

DWDM

Dedicated Gigabit Ethernet Switch for memory interconnect access

Sun-02

Single database mirrored physically in two locations, 8km apart

DWDM

Page 24: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Latency Tests

sun-13 sun-14 sun-02COMPROD

1COMPROD

2COMPROD

3395 / s 1.2 ms 356 / s 1.4 ms 320 / s 1.5 ms 310 / s 1.6 ms 376 / s 1.6 ms

Measurements from COMPROD1 running on sun-13

· Oracle instance running with application activity · Oracle instance running but with no application activity

Oracle instance shutdown

Page 25: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

25

Live Customer Examples

Page 26: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

First Known Client

• The Rover Group did the first known implementation with a similar architecture in the mid 1990’s using Oracle7 Parallel Server.

Page 27: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Austrian Railways

• 6 nodes Tru64 • OPS => RAC migration• 1.6 km 24 mono-mode fiber optic cable running

Memory Channel , 3 nodes on each side• 2 SAN fabrics• Host based mirroring• 13 DB, one RAC, one OPS

Page 28: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

ESPN

• American sports broadcasting network• 9iRAC provides the sports ticker (that shows

current scores) that is always on the ESPN channel.

• 2 Node IBM AIX, dual gigabit interconnect• Distance: Across the Street• Host Based Mirroring

Page 29: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Strathclyde University

• Running Oracle RAC on 2 node Sun Solaris nodes approximately 1km apart. Previously ran OPS in this environment. Sun Cluster 3, Veritas Volume manager to perform mirroring,

Page 30: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Extended RAC - SAP customers

• BASF (Germany, 2 x 2 nodes IBM AIX (8 way)) - 2 TB. Both production and test clusters have nodes 2km apart.

Page 31: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Other Examples

• Vodaphone Italy – 2 node Sun Solaris, Sun Cluster, 2.2km, Host Based Mirroring (Veritas)

• Nordac - Germany, 4 node HP Tru64, 300m • University of Melbourne - Oracle E*Business

Suite 11i on 3 nodes Tru64, 0.8 km• China Mobile (Shanghai) - 3 node IBM AIX

using HA GEO for mirroring. 2 corners of Shanghai (15-20km apart) - Host Based Mirroring

Page 32: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Other Examples

• Western Canada Lotterie Corporation - 4 node OpenVMS on 10 KM apart

• Deutsche Bank (Germany) - 2 node Sun Solaris cluster at 12Km apart.

Page 33: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

33

RAC on Extended Clusters Positioning

W.R.T. Data Guard

Page 34: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Additional Benefits Data Guard Provides

• Greater Disaster Protection– Greater distance– Additional protection against corruptions

• Better for Planned Maintenance– Full Rolling Upgrades

• More performance neutral at large distances– Option to do asynchronous

• If you cannot handle the costs of a DWDM network, Data Guard still works over cheap standard networks.

Page 35: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Hybrid: Extended RAC + DataGuard:• One cluster, one RAC, one primary database• All nodes are used for the primary RAC• Separate Data Guard Database, connected to all

nodes

Primary DB Primary DB DG Copy

Page 36: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Switch to DataGuard

• If need comes to switch to Data Guard copy• All available nodes can host Data Guard RAC

cluster.

Primary DB Primary DB DG Copy

Page 37: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

Hybrid Advantages

• Protection Against Corruptions• Better Ability to Support Planned Maintenance

(Rolling Upgrades)

• Distance is still limited

Page 38: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

When does it not work well?

• Distance is too great– No fixed cutoff, but as distance increases you are

slowing down both cache fusion & I/O activity. The impact of this will vary by application. Prototype first if doing this over ~50km.

• Public Networks– To much latency added between the nodes.

Page 39: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

39

Summary

RAC on Extended Cluster

• It works! – proven at customer sites & partner labs.

• Good design is key! Bad design can lead to a badly performing system.

• Data Guard offers additional benefits

Page 40: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

40

References1. Joseph Algieri & Xavier Dahan, Extended MC/ServiceGuard cluster configurations

(Metro clusters), Version 1.4, January 2002 <InternalPaper>2. Sun Microsystems, Metro clusters Based on Sun Cluster 3.0 Software, 20023. Michael Hallas and Robert Smyth, Comic Relief Red Nose Day 2003 (RND03),

Installing a Three-Node RAC Cluster in a Dual-Site Configuration using an 8 Km DWDM Link, Issue 1, April 2003

4. Paul Bramy (Oracle), Christine O’Sullivan (IBM), Thierry Plumeau (IBM) at the EMEA Joint Solutions Center Oracle/IBM, Oracle9i RAC Metropolitan Area Network implementation in an IBM pSeries environment, July 2003

5. Veritas, VERITAS Volume Manager for Solaris: Performance Brief – Remote Mirroring Using VxVM, December 2003

6. CTC TechRep:How to design a disaster tolerant solution with Oracle9i RAC and HP ContinentalClusters

7. Mai Cutler (HP), Sandy Gruver (HP), Stefan Pommerenk (Oracle) “Extended Distance RAC” Eliminating the current physical restriction of Oracle Real Application Cluster

8. Oracle Maximum Availability Architecture (OTN)

Page 41: RAC on Extended Distance Clusters - Antapexantapex.org/Extended_RAC_1.pdf · • 13 DB, one RAC, ... Extended RAC + DataGuard: ... Stefan Pommerenk (Oracle) “Extended Distance RAC”

41

QuestionsAnswers

Discussion