© Copyright 2011 EMC Corporation. All rights reserved.
EMC DATA DOMAIN OVERVIEW
EMC Data Domain: Leadership and Innovation
A history of industry firsts, 2003–2011
• First deduplication NAS
• First deduplication volume replication
• Largest deduplication array
• First deduplication directory replication
• First deduplication virtual tape library
• First deduplication nearline storage
• Fastest backup controller
• Cascaded replication
• First long-term retention system for backup and archive
• First distributed processing
Deduplication Dramatically Reduces Storage Capacity Requirements
Deduplication: 10–30 times less data stored versus fulls + incrementals with typical retention policies
[Chart: data stored (scale 0–30) versus weeks in use (1–20), deduplication storage compared with traditional storage]
Backup Data Reduction/Deduplication
Time Series of Large Enterprise Implementation
In the last three years, in-use rates for backup with deduplication have risen from 15% to 48%
[Chart: stacked bars per survey period; categories: In Use Now, In Pilot/Evaluation, In Near-term Plan, In Long-term Plan, Past Long-term Plan, Not in Plan]
In Use Now by period: 2H '07, 15%; 2H '08, 24%; 1H '09, 27%; 2H '09, 40%; 1H '10, 46%; 1H '11, 48%
Source: Wave 15 Storage Study – Q2 2011, published 5/16/11, large-enterprise sample; 2H '07, n=151; 2H '08, n=127; 1H '09, n=147; 2H '09, n=182; 1H '10, n=146; 1H '11, n=31; TheInfoPro (www.theinfopro.com)
Backup Data Reduction/Deduplication
Large Enterprise
The “in-use” rating for EMC is now over 3x that of its nearest competitor
[Chart: horizontal bars, 0% to 70%, for EMC and Competitors 1–7; categories: In Use Now (Not Including Pilots), In Pilot/Evaluation (Budget Has Already Been Allocated), Near-term Plan, Long-term Plan, Past Long-term Plan (> 18 Months Out)]
Source: Wave 15 Storage Study – Q2 2011, published 5/16/11, large-enterprise sample, n=31, TheInfoPro (www.theinfopro.com)
Purpose-Built Backup Appliances: Open Systems + Mainframe
2010 total market: $1.69B
EMC: 64.2%
[Chart: 2010 vendor shares for EMC, IBM, HP, Oracle, Quantum, Sepaton, FalconStor, Dell, and Others]
Source: Worldwide Purpose-Built Backup Appliance 2011–2015 Forecast and 2010 Vendor Shares, May 2011, IDC. Chart: Worldwide Supplier Revenue, Total PBBA Market
With Data Domain Deduplication Storage Systems, You Can…
• Retain longer: Keep backups onsite longer with less disk for fast, reliable restores, and eliminate the use of tape for operational recovery
• Replicate smarter: Move only deduplicated data over existing networks with up to 99% bandwidth efficiency for cost-effective disaster recovery
• Recover reliably: Continuous fault detection and self-healing ensure data recoverability to meet service level agreements
Deduplication Fundamentals
Data Domain Basics
Easy integration with existing environment
• Backup and archive applications: EMC, Symantec, CommVault, IBM, HP, Veeam, Quest
• Over Ethernet: CIFS, NFS, NDMP, DD Boost
• Over Fibre Channel: Virtual Tape Library (VTL)
• Replication
• Tiers: Control Tier, Target Tier, Disaster Recovery Tier
DD890 appliance
• 2U
• 2 to 14 ports: 10 and 1 Gigabit Ethernet; 8 Gb/s Fibre Channel
• RAID 6
• Up to 285 TB usable capacity with shelves
• 2 TB or 1 TB 7.2K rpm SATA HDD in shelf
• File system NVRAM
• N+1 fans and redundant, hot-plug power supplies
Data Deduplication: Technology Overview
Store more backups in a smaller footprint

Segments per backup (each letter is one data segment):
Friday Full Backup:         A B C D A E F G
Mon Incremental:            A B H
Tues Incremental:           C B I
Weds Incremental:           E G J
Thurs Incremental:          A C K
Second Friday Full Backup:  B C D E F L G H
Unique segments stored:     A B C D E F G H I J K L

Backup                  Logical Data   Estimated Reduction   Physical
FRIDAY FULL             1 TB           2–4x                  250 GB
Monday Incremental      100 GB         7–10x                 10 GB
Tuesday Incremental     100 GB         7–10x                 10 GB
Wednesday Incremental   100 GB         7–10x                 10 GB
Thursday Incremental    100 GB         7–10x                 10 GB
Second FRIDAY FULL      1 TB           50–60x                18 GB
TOTAL                   2.4 TB         7.8x                  308 GB
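The segment letters on this slide can be walked through in a few lines of Python. This is a minimal sketch for illustration, not Data Domain's implementation: it assumes each letter is one fixed segment, uses SHA-256 as a stand-in fingerprint function, and the names `BACKUPS` and `deduplicate` are hypothetical.

```python
import hashlib

# Segment letters copied from the slide; each letter stands for one data segment.
BACKUPS = [
    ("Friday full", "ABCDAEFG"),
    ("Mon incremental", "ABH"),
    ("Tues incremental", "CBI"),
    ("Weds incremental", "EGJ"),
    ("Thurs incremental", "ACK"),
    ("Second Friday full", "BCDEFLGH"),
]


def deduplicate(backups):
    """Store each distinct segment once and report what each backup added."""
    store = {}   # fingerprint -> segment (SHA-256 is an assumed fingerprint)
    report = []  # (backup name, segments received, segments newly stored)
    for name, segments in backups:
        new_segments = []
        for segment in segments:
            fingerprint = hashlib.sha256(segment.encode()).hexdigest()
            if fingerprint not in store:
                store[fingerprint] = segment
                new_segments.append(segment)
        report.append((name, len(segments), new_segments))
    return store, report


store, report = deduplicate(BACKUPS)
for name, received, new_segments in report:
    print(f"{name}: received {received}, newly stored {new_segments}")
print("unique segments stored:", "".join(sorted(store.values())))
```

In this toy run the second Friday full contributes only one new segment (L), which mirrors why the slide lists the highest reduction for repeat full backups.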
Retain: Store More for Longer with Less
Over one year of retention in 3U of Data Domain deduplication storage

Period    Backup       Cumulative Logical Data   Estimated Reduction   Physical
-         First Full   1 TB                      4x                    250 GB
Week 1    April 7      2.4 TB                    8x                    308 GB
Week 2    April 14     3.8 TB                    10x                   366 GB
Week 3    April 21     5.2 TB                    12x                   424 GB
Month 1   April 28     6.6 TB                    14x                   482 GB
Month 2   May 31       12.2 TB                   17x                   714 GB
Month 3   June 30      17.8 TB                   19x                   946 GB
Month 4   July 31      23.4 TB                   20x                   1,178 GB
TOTAL                  23.4 TB                   20x                   1,178 GB
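The reduction factors on this slide follow from dividing cumulative logical data by physical data. The short check below recomputes them from the slide's own figures; it assumes decimal units (1 TB = 1,000 GB), and the names `ROWS` and `reduction_ratio` are introduced here for illustration only.

```python
# (backup, cumulative logical GB, physical GB, reduction listed on the slide)
ROWS = [
    ("First Full", 1_000, 250, 4),
    ("April 7", 2_400, 308, 8),
    ("April 14", 3_800, 366, 10),
    ("April 21", 5_200, 424, 12),
    ("April 28", 6_600, 482, 14),
    ("May 31", 12_200, 714, 17),
    ("June 30", 17_800, 946, 19),
    ("July 31", 23_400, 1_178, 20),
]


def reduction_ratio(logical_gb, physical_gb):
    """Reduction factor: logical data protected per unit of physical storage."""
    return logical_gb / physical_gb


ratios = {name: reduction_ratio(logical, physical)
          for name, logical, physical, _ in ROWS}
for name, _, _, listed in ROWS:
    print(f"{name}: computed {ratios[name]:.1f}x, slide lists {listed}x")
```

Each computed ratio rounds to the value shown on the slide, so the table is internally consistent under the decimal-unit assumption.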
Data Integrity: Data Invulnerability Architecture
• End-to-end data verification
  – Checksum
  – Deduplication, write to disk
  – Verify
• Self-healing file system
  – Cleaning
  – Expired data
  – Defrag
  – Verify
• Other
  – RAID 6
  – NVRAM
  – Snapshots
[Diagram: end-to-end data verification. A checksum is generated for incoming data; layers shown are Deduplication, Local Compression, RAID, and File System; the verify step checks file system metadata integrity, user data integrity, and stripe integrity]
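The checksum, write, verify sequence on this slide can be illustrated with a generic write-then-read-back check. This is a sketch of the general technique only, not the Data Invulnerability Architecture itself: the function name `write_and_verify`, the use of SHA-256, and the single-file layout are all assumptions made for the example.

```python
import hashlib
import os
import tempfile


def write_and_verify(path, data):
    """Write data, read it back from the file, and compare checksums."""
    expected = hashlib.sha256(data).hexdigest()  # checksum generated on ingest
    with open(path, "wb") as handle:
        handle.write(data)
        handle.flush()
        os.fsync(handle.fileno())  # ask the OS to push the bytes to storage
    with open(path, "rb") as handle:
        actual = hashlib.sha256(handle.read()).hexdigest()
    if actual != expected:
        raise IOError(f"verification failed for {path}")
    return expected


with tempfile.TemporaryDirectory() as workdir:
    target = os.path.join(workdir, "segment.bin")
    checksum = write_and_verify(target, b"backup segment payload")
    print("verified", checksum[:12])
```

On an ordinary operating system the read-back may be served from the page cache rather than the disk, so a storage system that verifies at the device level has to go further than this example does.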
Network-Efficient Replication for True Disaster Recovery
Lowers WAN costs; improves service level agreements
• 95–99% cross-site bandwidth reduction (1–5% of the data crosses the WAN)
• Source: remote sites
• Destination: data center hub
• Supports hundreds of remote sites
• Replicates backup data and archive data
• Flexible replication: one-to-many, many-to-one, bi-directional, system-to-system, cascaded
[Diagram elements: Data Domain systems, WAN, Data Domain Global Deduplication Array]
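To put the 95–99% figure in concrete terms, the arithmetic below converts a bandwidth reduction into the volume that still crosses the WAN and the time it takes on a fixed link. The 500 GB nightly backup and the 100 Mbit/s link are hypothetical inputs chosen for the example, and the function names are not from the source.

```python
def wan_transfer_gb(logical_gb, bandwidth_reduction):
    """Data that still crosses the WAN after the stated reduction."""
    if not 0.0 <= bandwidth_reduction <= 1.0:
        raise ValueError("bandwidth_reduction must be between 0 and 1")
    return logical_gb * (1.0 - bandwidth_reduction)


def transfer_hours(gigabytes, link_mbit_per_s):
    """Hours needed to move `gigabytes` over a link of the given speed."""
    megabits = gigabytes * 8 * 1000  # 1 GB = 8,000 megabits (decimal units)
    return megabits / link_mbit_per_s / 3600


nightly_gb = 500  # hypothetical nightly backup at one remote site
link = 100        # hypothetical 100 Mbit/s WAN link
for reduction in (0.95, 0.99):
    sent = wan_transfer_gb(nightly_gb, reduction)
    print(f"{reduction:.0%} reduction: {sent:.0f} GB sent, "
          f"{transfer_hours(sent, link):.2f} h on the link")
```

The same functions can be applied per site and summed to estimate the load on a hub that receives from many remote sites.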
DD Boost Software
• Distributes parts of deduplication process to backup server or application clients
  – Licensable software works across Data Domain portfolio
• Supports majority of backup software market
  – EMC Avamar and NetWorker
  – Symantec NetBackup and Backup Exec
• Speeds backups by up to 50 percent
• Process more backups with existing resources
  – 20–40% less overall impact to backup server
  – 80–99% less LAN bandwidth
• Enables Data Domain replication management from the backup application
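The idea of moving part of the deduplication work to the backup server can be sketched as a generic source-side exchange: the client fingerprints its segments, asks the target which fingerprints it lacks, and sends only those. This illustrates the general technique and is not the DD Boost protocol; the `Target` class, `client_backup` function, SHA-256 fingerprints, and 4 KB segments are all assumptions.

```python
import hashlib


class Target:
    """Stand-in for the deduplication storage system."""

    def __init__(self):
        self.segments = {}  # fingerprint -> segment bytes

    def missing(self, fingerprints):
        """Return the fingerprints the target has not stored yet."""
        return [fp for fp in fingerprints if fp not in self.segments]

    def store(self, segments_by_fp):
        self.segments.update(segments_by_fp)


def client_backup(target, segments):
    """Fingerprint on the client, then send only segments the target lacks."""
    by_fp = {hashlib.sha256(s).hexdigest(): s for s in segments}
    needed = target.missing(list(by_fp))
    target.store({fp: by_fp[fp] for fp in needed})
    sent = sum(len(by_fp[fp]) for fp in needed)
    total = sum(len(s) for s in segments)
    return sent, total


target = Target()
first = [b"A" * 4096, b"B" * 4096, b"C" * 4096, b"A" * 4096]
second = [b"A" * 4096, b"B" * 4096, b"D" * 4096]
for label, stream in (("first", first), ("second", second)):
    sent, total = client_backup(target, stream)
    print(f"{label} backup: sent {sent} of {total} bytes over the LAN")
```

Only fingerprints and previously unseen segments travel over the network in this model, which is the mechanism behind LAN bandwidth savings of the kind the slide describes.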
Additional Data Domain Software Options
Data Domain Replicator
• Network-efficient and encrypted
• Transfers only compressed, deduplicated data over the WAN
• Consolidate up to 270 remote sites into a single system
Data Domain Virtual Tape Library
• Easily integrates with Fibre Channel
• Emulates multiple tape libraries
• Supports open systems and IBM i operating environments
Data Domain Encryption
• Inline encryption of data at rest
• Satisfies internal governance rules and compliance regulations
• Protects against theft or loss of a physical system
Data Domain Retention Lock
• File locking to satisfy IT governance and compliance policies
• Electronic data shredding
DD Archiver Overview
Cost-optimized, long-term retention
• Data Domain system for backup and archive
  – Active tier: short-term data protection; less than 90 days
  – Archive tier: scalable long-term retention; multiple years
• High-throughput deduplication storage
  – Up to 9.8 TB/hr
• Cost optimized for long-term retention
  – Up to 570 TB usable, 28.5 PB logical capacity
  – Low cost per gigabyte while maintaining high throughput
  – Fault isolation of archive units for long-term recoverability
• Leverage existing Data Domain system advantages
  – Supports DD Replicator and DD Retention Lock software options
  – Data Domain Data Invulnerability Architecture to ensure data integrity
Industry’s Most Scalable Inline Deduplication Systems

Model                        Speed (DD Boost)   Speed (other)   Logical capacity   Usable capacity
DD160                        1.1 TB/hr          667 GB/hr       40–195 TB          Up to 3.98 TB
DD620                        2.4 TB/hr          1.1 TB/hr       83–415 TB          Up to 8.3 TB
DD640                        3.4 TB/hr          2.3 TB/hr       0.32–1.6 PB        Up to 32.2 TB
DD670                        5.4 TB/hr          3.6 TB/hr       0.6–2.7 PB         Up to 55.9 TB
DD860                        9.8 TB/hr          5.1 TB/hr       1.4–7.1 PB         Up to 142 TB
DD890                        14.7 TB/hr         8.1 TB/hr       2.9–14.2 PB        Up to 285 TB
Global Deduplication Array   26.3 TB/hr         10.7 TB/hr      5.7–28.5 PB        Up to 570 TB
DD Archiver                  9.8 TB/hr          4.3 TB/hr       5.7–28.5 PB        Up to 570 TB

Software options: DD Boost, DD Virtual Tape Library, DD Replicator, DD Retention Lock, and DD Encryption
Product families: DD160 Appliance; DD600 Appliance Series; DD800 Appliance Series; Global Deduplication Array; DD Archiver
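The logical capacity ranges on this slide appear to correspond to roughly 10x to 50x of the usable capacity. That relationship is an inference from the published numbers rather than something the slide states; the check below simply divides the figures as printed, with `MODELS` and `implied_ratios` as illustrative names and decimal units assumed (1 PB = 1,000 TB).

```python
# (model, usable TB, logical low TB, logical high TB), copied from the slide
MODELS = [
    ("DD160", 3.98, 40, 195),
    ("DD620", 8.3, 83, 415),
    ("DD640", 32.2, 320, 1_600),
    ("DD670", 55.9, 600, 2_700),
    ("DD860", 142, 1_400, 7_100),
    ("DD890", 285, 2_900, 14_200),
    ("Global Deduplication Array", 570, 5_700, 28_500),
    ("DD Archiver", 570, 5_700, 28_500),
]


def implied_ratios(usable_tb, low_tb, high_tb):
    """Reduction factors implied by the logical range over usable capacity."""
    return low_tb / usable_tb, high_tb / usable_tb


for model, usable, low, high in MODELS:
    low_x, high_x = implied_ratios(usable, low, high)
    print(f"{model}: {low_x:.1f}x to {high_x:.1f}x")
```

Actual reduction depends on the data and retention policy, so these factors describe how the capacity figures relate to each other, not a guaranteed result.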
Deduplication Storage Evaluation Criteria
Methodology: Inline vs. Post-Process Deduplication
POST-PROCESS: Deduplication After Storing
• Store, then deduplicate: 3x disk accesses to shared store
• The more processes, the more resource contention
  − Copy to tape: Too slow to stream tape
  − Recovery: Service level agreement predictability
  − Replication: Poor time-to-disaster-recovery
  − Deduplication: If interleaved with backup or restore
• More administration to fight these issues
INLINE: Deduplication Before Storing
• Deduplicate, then store
• Other activities unimpeded
  − Predictable
  − Simpler
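A toy model makes the disk-access difference concrete. It assumes one plausible reading of "3x disk accesses": a post-process system writes the raw backup, reads it back, and then writes the deduplicated result, while an inline system writes only new segments. Real products differ in detail, and the function names here are hypothetical.

```python
def inline_io(segments):
    """Deduplicate before storing: only new segments reach disk."""
    stored = set()
    writes = 0
    for segment in segments:
        if segment not in stored:
            stored.add(segment)
            writes += 1
    return {"writes": writes, "reads": 0}


def post_process_io(segments):
    """Store first, then read everything back and write the unique segments."""
    landed = list(segments)
    writes = len(landed)        # pass 1: land the raw backup on disk
    reads = len(landed)         # pass 2: read it back to deduplicate
    writes += len(set(landed))  # pass 3: write the deduplicated copy
    return {"writes": writes, "reads": reads}


stream = "ABCDAEFG"  # segment letters borrowed from the Friday full example
print("inline:      ", inline_io(stream))
print("post-process:", post_process_io(stream))
```

The extra passes in the post-process model are the ones that compete with restores, replication, and tape copies for the same disks, which is the contention the slide lists.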
Performance: CPU-Centric vs. Spindle-Bound
[Chart: throughput (MB/s, axis from 50 to 6,000) versus number of disk spindles (50 to 200); Data Domain compared with most deduplication vendors; Fibre Channel and SATA]
Data Domain Systems Trajectory
Data Domain SISL Scaling Architecture: CPU-centric
Improvement since 2004:
• Throughput: ~175x
• Capacity: ~450x
[Chart: throughput (GB/s) over time, from the DD200 (2004) through 2010, 2011, 2014 (est.), and future; axis values 0.04, 1.5, 3, and 5; series labels: single-controller with standard protocols, DD Boost, dual-controller Global Deduplication Array]
Why Data Domain?
Less disk to resource, less to manage
• CPU-centric deduplication
• Inline deduplication
Simple, mature, and flexible
• Simple, mature appliance
• Any fabric, any software, backup or archive applications
Resilience and disaster recovery
• Storage of last resort
• Fast time-to-disaster recovery (DR) readiness
• Cross-site global compression
  – Data center or remote office
Data Domain Infrastructure and Ecosystem
Supports a variety of workloads and data types
• Workloads: VMware, Microsoft, Microsoft SharePoint, Oracle, SAP
• Primary storage: NAS, SAN, DAS
• Backup: midrange and mainframe (EMC Bus-Tech, IBM i)
• Archive
• Backup and archive application vendors: EMC, Symantec, CommVault, CA, HP, Vizioncore, F5 Networks, IBM, Atempo, BakBone
• Network
• Disaster recovery: replication over WAN
Enterprise Recoverability Readiness at Disaster Recovery Site
• Data Domain inline deduplicated replication
  – Replicate during backup
  – DR-ready
• “Adaptive” post-process deduplicated replication
  – Backup to cache: backup time 1.7 times longer than Data Domain
  – Deduplicate and replicate: less than 50% ingest speed; two times longer if uncompressed at fixed bandwidth
  – DR-ready
• “Scheduled” post-process deduplicated replication
  – Backup to cache: backup time 1.1 times longer than Data Domain
  – Deduplicate and replicate: less than 50% ingest speed; two times longer if uncompressed at fixed bandwidth
  – DR-ready
• VTL/tape/truck
  – Backup to VTL
  – Copy to tape
  – Truck to storage
  – Truck from storage
  – Recall tapes
  – DR-ready: ?
EMC Global Services
Strategize, Design, Implement, Manage
Service areas: Consulting; Technology Deployment; Managed Services; Maintenance and Support; Education
• Strategic Observation service establishes a roadmap/vision to meet your recovery objectives
• Operational Readiness service recommends a Reference Architecture that leverages EMC deduplication technologies and optimizes your implementation
• Best practice methodologies from architecture through integration
• Assessment, Design/Implementation, Operational Assurance, Health Check, Data Migration
• Residency Services provide onsite or remote skilled service professionals with proven best practices and technology expertise
• Remote Managed Services provide cost-effective, ITIL-based, 24x7 intelligent remote monitoring and operational infrastructure management
• 360° global, proactive, and preemptive procedures and solution support
• Open Storage Technology education, EMC technology-specific learning paths, EMC Proven Professional Certification
Why EMC Global Services?
Save money
• Significantly lower implementation and operating expenditures
• Fill internal resource gaps for less
• Protect investments in EMC solutions
Accelerate time to value
• Reduce deployment time
• Accelerate return on investment for new projects
• Ease the burden of compliance while protecting critical business information
Mitigate risk and get better results
• Configure the solution to meet your requirements
• Improve service levels; reduce management costs
• EMC best practices and unmatched product expertise = superior customer experience
• Reduce disruption while taking advantage of the features and benefits of the latest EMC products and solutions