26
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Joe Spiezio, Solutions Architect - AWS [email protected] Haider Witwit, Solutions Architect - AWS [email protected] June 20, 2016 Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS

Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Embed Size (px)

Citation preview

Page 1: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Joe Spiezio, Solutions Architect - [email protected]

Haider Witwit, Solutions Architect - [email protected]

June 20, 2016

Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS

Page 2: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Session agendaContext: on-premises Disaster Recovery (DR) using AWS

Why AWS for recovery of on-premises IT infrastructure

The ascending levels of DR

DR/Continuity scenarios

Demo

Q&A

Page 3: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

TerminologyBusiness Continuity

Business Continuity ensures that an organization's critical business functions continue to operate or recover quickly despite serious incidents.

Disaster RecoveryDisaster Recovery (DR) enables the recovery or continuation of vital technology infrastructure and systems following a natural or human-induced disaster.

Recovery Point Objective Recovery Time ObjectiveRTO is a targeted duration in which a business process must be restored after a disaster or disruption.

RPO is the maximum targeted period in which data might be lost from an IT service due to a major incident.

Page 4: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Understanding RTO and RPO

Disaster

Down time

Transactions lost

RPOa

RTO

Page 5: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Plan for various types of disasters

Page 6: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

History of DR

There have been many challenges for traditional DR for enterprisesBuilding and maintaining regional data centersFailed DR testsNot meeting RPO & RTOHigh technical debt

Page 7: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

AWS compared to traditional disaster recovery

Conventional

High cost to build disaster recovery sites or data centers (CAPEX)High cost of storage, backup, archival and retrieval tools, and processes (OPEX)Difficult planning, procurement and deploymentChallenging to verify DR plansSingle level of DR across the organization

AWS

Low cost upfront investment (CAPEX)On-demand costs (OPEX)Consistent experience across AWS environmentsRecovery automationSeparate levels of DR per application or business unit

Page 8: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

DR topology map

ELB/Appliance

EC2/Auto Scaling

Route 53

Load Balancers

Web/App Servers

Your Data Centers

DNS

DB failover nodes

AD failover nodes

Availability Zones

Multi-regionDisaster Recovery

Data Centers

AD/Authentication

Database Servers

Page 9: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Ascending levels of DR options

Backup & Restore

Pilot Light

Warm Standby

Multi-Site

Backup of on-premises data to AWS to use in a DR event

Replicate data and minimal running services into AWS, ready to take over and flare up

Replicate data and services into AWS ready to take over

Replicated and load balanced environments that are both actively taking production traffic

RPOa

RTO

$COST

24 hours 24 hours

$

RPOa

RTO

$COST

12 hours 4 hours

$$

RPOa

RTO

$COST

1-4 hours 15 min

$$$

RPOa

RTO

$COST

<15 min 0-5 min

$$$$

Business continuity

begins

Un-interrupted Business

continuity

Page 10: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Backup & Restore Pilot Light Warm Standby Multi-Site

S3StorageGateway

Glacier EBS Volumes

Route 53 Direct Connect

VPN

Net

wor

king

Sto

rage

Multiple Direct

Connect locations

Com

pute

Auto Scaling

ELBEC2

Dep

loym

ent /

M

anag

eme

nt

CloudFormation IAM

Added through the levels of DR

VPC

Page 11: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Backup and restore architecture

~$200 / MonthIn US-EAST

+VPN

On-premises Active

Production

www.example.com

Corporate data center AWS region

AWS DR failover

AppServers

DBServer

VPN Connecti

on

Storage GatewayiSCSI

BackupSystem

S3 / Bucket

Glacier / Archive

WebServers Internet traffic

S3 (1TB)$31/Month

Glacier (2TB)$22/Month

Storage Gateway$125/Month

S3 / Bucket

S3 (1TB)$31/Month

1TB Data

Volume

Page 12: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Backup and restore detailsSuitable for:

• Solutions that can sustain higher technical debt• Lower business critical nature• Low cost DR option

Leverage existing investments in• De-duplication• Compression• WAN Acceleration

Page 13: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Pilot light architecture

Data Replication

On-premises Active

Production Route 53

www.example.com

Corporate data center

1 TB DataVolume

AWS region

WebServers

AWSActive

Production

Direct Connect

AppServers

DBServer

1TB Data

Volume

DBServer

Page 14: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Pilot light architecture

$309 / MonthIn US-EAST

+DirectConnect

Data Replication

ELB

On-premises Active

Production Route 53

www.example.com

Corporate data center

1 TB DataVolume

WebServers

AWS region

WebServers

AWSActive

Production

Direct Connect

AppServers

DBServer

AppServers

1TB Data

Volume

DBServer EBS (GP2)

$100/Month

EC2 (m4.xlarge)$205/Month

EC2 (t2.medium)$0/Month

ELB (100GB Data)$0/Month

EC2 (t2.small)$0/Month

ELB (100GB Data)$0/Month

R53 (1M Query)$4/Month

CloudFormation

Page 15: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Pilot light details

ConsiderationsSuitable for:Solutions that need lower RTO & RPOhigher business critical natureMid-range cost DR option

3rd Party & MarketplaceCloudEndureRacemiZertoOthers

Page 16: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Warm standby architecture

$410 / MonthIn US-EAST

+DirectConnect

ELB

On-premises Active

Production Route 53

www.example.com

Corporate data center

1 TB DataVolume

WebServers

AWS region

WebServers

AWSActive

Production

AppServers

DBServer

AppServers

1TB Data

Volume

DBServer EBS (GP2)

$100/Month

EC2 (m3.xlarge)$205/Month

EC2 (t2.medium)$41/Month

ELB (100GB Data)$19/Month

EC2 (t2.small)$22/Month

ELB (100GB Data)$19/Month

R53 (1M Query)$4/Month

CloudFormation

Data Replication

Direct Connect

Page 17: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Multi-site architecture

$473 / MonthIn US-EAST

+DirectConnect

Data Replication

ELB

On-premises Active

Production Route 53

www.example.com

Corporate data center

1 TB DataVolume

WebServers

AWS region

WebServers

AWSActive

Production

Direct Connect

AppServers

DBServer

AppServers

1TB Data

Volume

DBServer EBS (GP2)

$100/Month

EC2 (m3.xlarge)$205/Month

EC2 (t2.medium)$82/Month

ELB (100GB Data)$19/Month

EC2 (t2.small)$44/Month

ELB (100GB Data)$19/Month

R53 (1M Query)$4/Month

CloudFormation

Page 18: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Warm standby and multi-site details

ConsiderationsSuitable for:Solutions that require RTO & RPO in minutesCore business critical functionsHigher cost DR option

PartnersPartner ecosystem

Page 19: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Lessons Learned

3rd Party solutionsPartner engagementOpportunity to automate technical debtCustomer experiences

Page 20: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

AWS Partner Ecosystem

Page 21: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Demonstration

Page 22: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

corporate data center AWS cloud

virtual private cloud

VPC subnet

VPC subnet

VPC subnet10.219.10.x

VPC subnet10.219.11.x

AD1

DB110.119.11.123

APP110.119.11.121

Load Balancer

APP210.119.11.122

AD2

DB210.219.9.12

3

AmazonRoute 53

AWS Direct Connect

ELB

DR.demo.awscloudlab.com

Auto Scaling group

SQL AlwaysON ListenerAuto-failover

10%90%

Page 23: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

corporate data center AWS cloud

virtual private cloud

VPC subnet

VPC subnet

VPC subnet10.219.10.x

VPC subnet10.219.11.x

AD1

DB110.119.11.123

APP110.119.11.121

Load Balancer

APP210.119.11.122

AD2

DB210.219.9.12

3

AmazonRoute 53

AWS Direct Connect

ELB

DR.demo.awscloudlab.com

Auto Scaling group

SQL AlwaysON ListenerAuto-failover

10%90%

X0% 100%

X

Page 24: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

corporate data center AWS cloud

virtual private cloud

VPC subnet

VPC subnet

VPC subnet10.219.10.x

VPC subnet10.219.11.x

AD1

DB110.119.11.123

APP110.119.11.121

Load Balancer

APP210.119.11.122

AD2

DB210.219.9.12

3

AmazonRoute 53

AWS Direct Connect

ELB

DR.demo.awscloudlab.com

Auto Scaling group

SQL AlwaysON ListenerAuto-failover

10%90%

X0% 100%

X

Page 25: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Q&A

Page 26: Disaster Recovery, Continuity of Operations, Backup, and Archive on AWS | AWS Public Sector Summit 2016

Thank you!