23
© Copyright 2009 EMC Corporation. All rights reserved. Efficient Backup with Data Deduplication Which Strategy is Right for You?

Efficient Backup with Data Deduplication Which Strategy is ... · Data Domain –Covers entire environment, works with existing backup software and infrastructure ... VMware and TSM

  • Upload
    buidiep

  • View
    215

  • Download
    0

Embed Size (px)

Citation preview

© Copyright 2009 EMC Corporation. All rights reserved.

Efficient Backup with Data

Deduplication

Which Strategy is Right for You?

© Copyright 2009 EMC Corporation. All rights reserved.2

Data Growth– Typically represents a factor of 4 to 30 times

production capacity

– Daily, weekly, and monthly full backups kept for months or years

New requirements to keep more data for longer periods

Server Virtualization– Dynamic environment causes increased complexity

– VM sprawl creates protection challenges

– Server consolidation creates high server utilization leaving little bandwidth for backup

Digital Information Created and Replicated Worldwide

2,500

2,000

1,500

1,000

500

0

Exabyte

s

20122011201020092008

Source: IDC Digital Universe white paper, sponsored by EMC, May 2009

5-FOLD Growth

in 4 YEARS

Virtual

Server

A

Virtual

Server

B

Virtual

Server

C

Old ParadigmPhysical Environment: Low overall

server utilization and plenty of

bandwidth for backup

New ParadigmVirtual Environment: High overall

server utilization and little bandwidth for

backup

20 percent resource utilization 80 percent resource utilization

100%

80%

40%

0%

60%

20%

CP

U U

tilizati

on

100%

80%

40%

0%

60%

20%

CP

U U

tilizati

on

Server

A

Server

B

Server

C

ESXServer

Hardware

Shared Physical Resources

Major trends influencing the transformation of backup environments

Why So Much Interest in Data Deduplication?

© Copyright 2009 EMC Corporation. All rights reserved.3

Where is Deduplication Applied Today?

Backup

To address major inefficiencies and costs due to redundant data

Offered at both source/target depending upon use case

Delivered as integrated backup solution or as hw target for incumbent backup software

Archive

To provide single instancing for long-term retention

Reduces long-term storage costs

Ability to guarantee single instance of data for compliance

Primary storage

To provide increased primary storage efficiency; store more data, retain data longer

Non-disruptive; maintain performance with significantly reduced capacity requirements

Reduced storage acquisition cost; longer intervals between storage capacity upgrades

Deduplication

Across Use Cases

Provides Additive

Savings and

Readily Co-exist

Session Focus

© Copyright 2009 EMC Corporation. All rights reserved.4

EMC‟s Definition of Deduplication

“The process of detecting and

identifying the unique data segments

within a given set of information,

enabling the elimination of redundancy

when stored or moved.”

Before:total segments = 39

After:Unique segments = 6

Data Set 3

Data Set 2

Data Set 1

Deduplication

© Copyright 2009 EMC Corporation. All rights reserved.5

Only unique data segments are backed up

Data already backed up, so only a unique ID pointer is stored (20 bytes)

First instance Duplicate instance Modified instance

March 2009 March 2009 April 2009

Unique data stored on disk, available for immediate recovery

A B C D

AB

CD

A B

C D

E

A B

C D

E B

C D

E

New data segment identified and backed up

How Data Deduplication Works

© Copyright 2009 EMC Corporation. All rights reserved.6

Factors Impacting Dedupe Ratios for Backup

Type of data

Small variations can have big impact

Data change rate

More user created content* = higher

deduplication ratio

Less change = higher deduplication ratio

Retention policy

Longer retention

policy = higher

deduplication ratio

More full backups = higher deduplication

ratio

Full to incremental backup Ratio

*Encrypted and

compressed data not ideal

dedupe candidates

© Copyright 2009 EMC Corporation. All rights reserved.7

Retain more.

Replicate smarter.

Recover reliably.

Efficiently move data offsite – faster

Only move deduplicated data for 99%

bandwidth efficiency and cost-effective DR

Leverage end-to-end data verification

Continuous fault-detection and healing ensure

data recoverability to meet SLAs

Retain backups longer, using less disk

10-30x data reduction eliminates the use of tape

for operational recovery

EMC Data Domain and Avamar Deduplication

© Copyright 2009 EMC Corporation. All rights reserved.8

Retain. Replicate. Recover.

Dedupe everything without

changing anything

Never backup the

same data twice

Simplify backup, archiving and DR with

easy integration across workloads,

infrastructures, and backup software

Revolutionize your backup by moving less

data to solve your toughest VMware, NAS,

and remote office backup challenges

Data Domain

Deduplication Storage Systems

Avamar

Deduplication Backup Software

EMC Data Domain and Avamar

© Copyright 2009 EMC Corporation. All rights reserved.9

EMC Data Domain

Supports backup and archive software– Backup Software: NetWorker, Symantec, CommVault, IBM TSM, …

– Application utilities: Oracle RMAN, SQL Server, …

– F5 ARX file virtualization

– Archive: SourceOne, Symantec Enterprise Vault, Mimosa, …

– Data Domain Retention Lock software option

Supports any protocol– SAN: VTL software option

– NAS: NFS, CIFS

– Custom: NetBackup OpenStorage software option

Scaleable for Local and Distributed Recovery– Up to 5.4 TB/hour

– Up to 71 TB addressable capacity per system

– Data Domain Replicator software option

Advanced dedupe architecture for high speed & resilience– Stream Informed Segment Layout (SISL) scaling architecture

– Data Invulnerability Architecture

Inline deduplication storage systems

Data Domain

Deduplication Storage Systems

© Copyright 2009 EMC Corporation. All rights reserved.10

Integrated software & hardware solution with global source-based deduplication

– Deduplicates across sites and servers globally

– Effective full backup every time

– Single step recovery

– Backup process reduces data sent over the network and stored

– Variable-length subfile segments for optimal deduplication

• Integrated high availability and reliability – RAIN for high availability and fault tolerance

– Avamar server and data recoverability verified daily

– Replication between servers

Flexible deployment options– Avamar software

– Avamar Data Store

– Avamar Virtual Edition for VMware environments

EMC Avamar

Deduplication backup software

Avamar

Deduplication Backup Software

© Copyright 2009 EMC Corporation. All rights reserved.11

Integrated deduplication

Optimize dedupe for the greatest benefit

Single pane of glass

EMC NetWorker

Manage from a single pane of glass

File Systems and Applications

Avamar

NetWorker

Data Domain

© Copyright 2009 EMC Corporation. All rights reserved.12

EMC Data Protection Advisor

Complete view into the total backup environment –EMC solutions and beyond

Centralizes monitoring, reporting and analysis

Manage SLAs, Capacity Planning, Optimization

Unified Data Protection Management

Lower effort and cost

Reduce Risk

Manage Complexity

Single view across EMC backup to disk with deduplication solutions

Avamar Data Domain Disk Library NetWorker

© Copyright 2009 EMC Corporation. All rights reserved.13

LAN or SAN

Existing

Backup

Software

LAN

Use Case:

Dedupe with Existing Backup Software

Challenges

Backup storage growth and costs

Pressures on datacenter space and energy

Committed to current backup software

Why Dedupe Storage Systems

Dasy integration across workloads, infrastructures, and backup software

Address storage inefficiencies due to redundant data

Efficient replication – reduces or eliminates associated tape costs

Apps, File Systems,

Virtual Servers

DR

WAN

© Copyright 2009 EMC Corporation. All rights reserved.14

Fast, scalable, efficient large database backup

Use Case:

Large, high-change rate database backup

Challenges

Backup speed critical to stay within tight backup windows

High change rate database generally have less redundant data

Why Dedupe Storage Systems

Meet backup windows with high-throughput while eliminating redundant data

Physical and Virtual Servers

Efficiently scale for enterprise environments

SAN VTL or LAN options

“Plug and Play” – leverage existing backup software

IBM DB2

Informix

DR

WAN

© Copyright 2009 EMC Corporation. All rights reserved.15

Use Case:

Limited Bandwidth LAN and NDMP Backup

Challenges

Full NDMP backups are slow - consume significant network, filer and storage resources

Limited Bandwidth LAN backups face backup window challenges

Why Dedupe Backup Software

NDMP Backup: Eliminates time-consuming, full backups by only moving new data

Limited Bandwidth LAN: Speeds backup by deduplicating data at source before moving across LAN

Single-step restore from full backup image

Reduces/eliminates associated tape costs

Dedupe at source for faster

NDMP and LAN backups with

less network resources used

NAS Device

LAN

© Copyright 2009 EMC Corporation. All rights reserved.16

© Copyright 2009 EMC Corporation. All rights reserved.17

Use Case:

Remote Offices

Challenge

Remote office backups cost and risk

Cost of people and equipment at each location

Bandwidth too costly for centralized backup

Risk that backups not happening at all

Why Backup to Data Center with Dedupe

Reduces data moved by up to 500x for efficient back up over existing bandwidth

Eliminate local tape backup, equipment and shipment

Remove risk that data is unprotected

REMOTE OFFICES

DATA CENTER

Cost & Risk

Dedupe

WAN

© Copyright 2009 EMC Corporation. All rights reserved.18

Use Case:

VMware environments

Challenges Higher server utilization leaves resource

contention issues during backup

Not meeting SLAs

Limits server consolidation goals, increases costs

Virtual

Server

A

Virtual

Server

B

Virtual

Server

C

Old Paradigm New Paradigm

20 percent resource utilization 80 percent resource utilization

100%

80%

40%

0%

60%

20%

CP

U U

tilizati

on

100%

80%

40%

0%

60%

20%

CP

U U

tilizati

on

Server

A

Server

B

Server

C

ESXServer

Hardware

Shared Physical Resources

Dedupe within the

VM eliminates bottleneck

Why Dedupe Backup Software Dedupe within the VM to never backup the

same data twice Also supports console, VCB and snap

backups

Up to 90% faster backups

Up to 95% less data moved

Removes a barrier to greater server consolidation

© Copyright 2009 EMC Corporation. All rights reserved.19

EMC Data Domain and Avamar

After

Data Domain

– Covers entire environment, works with existing backup software and infrastructure

– Replicates offsite for fast and safe DR

– 7 PB protected

Avamar

– Solved VMware backup challenge in data centers

– Backs up 200+ remote sites daily

– Avamar $3.5M TCO savings over 3 years (VMware use Case)

USE CASE

Worldwide Financial Institution

Before

Main data center, regional data centers and over 200 remote offices

VMware and TSM backup and recovery bottlenecks limiting consolidation

Tape used in all locations and shipped offsite

Tapes containing data on 12 million customers lost in transport to offsite storage

More

Secure

Less

Risk

3.5MLess

$$$

More

Performance

Less

Network

Tape

Minimized

Less

Management

© Copyright 2009 EMC Corporation. All rights reserved.20

Depends on:

– Current backup challenges and environment

– Application and data type

– Service level requirements

Backup, e-mail, and file system assessments, TCO tools

EMC Assessment for Deduplication Service

EMC Operational Assurance for Avamar

Let Us Help You Determine the Right Solution

Which deduplication solution is right for you?

Deduplication

Dataset 1

Dataset 2

Dataset 3

© Copyright 2009 EMC Corporation. All rights reserved.21

EMC Education Services

Develop and validate your Deduplication and Backup Recovery expertise

1. Information Storage Technology „Open‟ Curriculum

2. EMC Technology-Specific Learning Paths

3. EMC Proven Professional Certification Program

Enhance your Deduplication and Backup Recovery capabilities

Featured Learning Paths

– Avamar (deduplication) Administration

– Backup and Recovery – NetWorker

– EMC Disk Library

– EMC RecoverPoint

– Replication Manager

Take the next step

Visit http://education.EMC.com/BackupRecovery

© Copyright 2009 EMC Corporation. All rights reserved.22

Why EMC for Backup-to-Disk with Deduplication

Market Leadership – Avamar: #1 in source-based deduplication

– Data Domain: #1 in target-based deduplication

– Disk Library: #1 in VTL

EMC offers the broadest set of integrated backup-to-disk solutions with deduplication

– Avamar, Data Domain

– Disk Library, NetWorker

EMC is the only vendor providing solutions to ALL customer needs

– From “refresh to re-design”

– Tailored to the size, need, and budget of the customer

Inline deduplication is a competitive differentiator

– A fundamental technology appearing across the portfolio in both hardware and software