22
1 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved. Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick icon to add picture EMC Data Domain Click icon to add picture Overview

EMC Data Domain Tech

Embed Size (px)

DESCRIPTION

pk

Citation preview

PowerPoint Presentation© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick icon to add picture
EMC Data Domain
Overview
*
© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick to edit Master text styles
EMC Data Domain
Scale and performance
Reduce storage required by 10–30x
Protect up to 100 PB of logical capacity in a single system
Complete backups faster—up to 31 TB per hour
Seamless integration
Reliable access and recovery
Efficient resource utilization
*
Data Domain systems are a protection storage platform for backup and archive data that reduce the amount of disk storage needed to retain and protect data by ratios of 10-30x and greater, making disk a cost-effective alternative to tape. These systems can protect up to 100 PB of logical capacity in a single system enabling customers to retain data online and onsite for longer retention periods, as well as providing faster and more reliable restores.
With throughput up to 31 TB/hour, Data Domain systems allow more backups to complete sooner while putting less pressure on limited backup windows.
EMC Data Domain Replicator software transfers only the deduplicated and compressed unique changes across any IP network, requiring a fraction of the bandwidth, time, and cost, compared to traditional replication methods. “Time-to-DR readiness” is greatly reduced when compared to other replication methods.
Data Domain’s Data Invulnerability Architecture – built into every Data Domain system – provides the industry’s best defense against data integrity issues ensuring you can access and recover your data when you need it.
*
EMC Data Domain:
Leadership and Innovation
First deduplication
Fastest backup
2013
2014
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
*
*
Why Protection Storage?
Inline deduplication minimizes storage footprint by 10–30x
Data Domain Data Invulnerability Architecture ensures data is recoverable and accessible
Consolidate backup, archive, and disaster recovery on a single system
Protect a wide variety of data sources
*
When we talk about ‘protection storage’, it’s all about three key pillars: enabling scale without cost and complexity, ensuring your data is recoverable AND accessible, and consolidating a broad range of data protection use case.
Data Domain systems fulfills these pillars in the following ways:
Inline deduplication minimizes your storage footprint by 10 – 30x enabling scale without cost and complexity.
The Data Domain Data Invulnerability architecture built into every Data Domain system provides the industry’s best defense against data integrity issues ensuring your data is both recoverable and accessible when you need it.
*
Data Domain Basics
Replication
Fibre Channel
Control Tier
Target Tier
*
Now let’s take a closer look at the Data Domain storage system itself moving from the outside in.
This is a picture of what you would see in a Data Domain deployment. A Data Domain appliance is a storage system with shelves of disks and a controller. It’s optimized, first to back up and second to archive applications, and supports the industry-leading backup, archiving and enterprise applications.
The list on the left is composed primarily of leading backup, archive, and enterprise applications—not only EMC’s offerings with EMC NetWorker & SourceOne, but also Symantec, CommVault, Oracle and so on…
On the way into the storage system, data can pass through either Ethernet or Fibre Channel. With Ethernet, it can use mass protocols and NFS or CIFS; it can also use optimized protocols or products, such as Data Domain Boost, a custom integration with leading backup applications. Fibre Channel connectivity enables a Data Domain system to act as a virtual tape library or you can use DD Boost over Fibre Channel to eliminate virtual tape management.
*
Data Deduplication: Technology Overview
H
I
J
K
L
Second FRIDAY FULL 1 TB 50–60x 18 GB
TOTAL 2.2 TB 7.6x 288 GB
FRIDAY FULL 1 TB 2–4x 250 GB
Second Friday Full Backup
*
A technology overview of data deduplication will help illustrate how you can store more backups in a smaller footprint with Data Domain.
Note to Presenter: Click now in Slide Show mode for animation.
On Friday, the backup application initiates the first full backup of 1 TB, but only 250 GB is stored on Data Domain. This occurs because as the data stream is coming into Data Domain, the system is deduplicating before storing data to disk. On average this results in a two- to four-times reduction in data on a first full backup.
Note to Presenter: Click now in Slide Show mode for animation.
Over the course of the week, 50 GB daily incremental backups result in a seven- to 10-times reduction and only require 5 GB to be stored. As the graphic on the left shows, during the week incremental backups contain data that was already protected from the first full backup.
Note to Presenter: Click now in Slide Show mode for animation.
Finally, on the second Friday, the second full backup contains almost all redundant data. Therefore of the 1 TB backup dataset, only 18 GB needed to be stored.
*
Data Domain Data Invulnerability Architecture
Industry’s Best Defense Against Data Integrity Issues
Stored Correctly
Stays Correct
Recovers Correctly
Recovery/Access
Verification



*
*
Inline vs. Post-Process Deduplication
Copy to tape: Too slow to stream tape
Recovery: Service level agreement predictability
Replication: Poor time-to-disaster-recovery
More administration to fight these issues
Deduplication
Store
Deduplication
*
One of the most conventional alternatives to the Data Domain inline deduplication storage system approach (shown on the left) is by using a methodology known as post-process (shown on the right). In the post-process architecture, data is stored to a disk before deduplication. Then after it is stored, it is read back internally, deduplicated, and written again to a different area.
Although this approach may sound appealing because it seems as if it would allow for faster backups and the use of less resources, it actually creates two problems:
First, a lot more disk is needed to store the multiple pools of data, and for speed, because most of the other vendor’s deduplication approaches are spindle-bound. Because of this, there’s typically a factor of three or four more disks in a post-process configuration than you’ll see in a Data Domain deployment.
*
CPU-Centric v Spindle-Bound Performance
*
This slide shows another way to look at the virtues of being CPU-centric.
As mentioned before, most of the deduplication competitors for backup targets are spindle-bound or disk-bound. It takes so many disk seeks to look up the information to tell whether data has been stored before or not, and to sort out and then minimize the data, that it takes a lot of disk drives or faster disk drives to get the job done.
This slide shows what’s happened in our competitive environment as a result. If they’re using SATA disk drives, most deduplication storage vendors tend to need three or four times as many drives as a Data Domain system to store the same amount of deduplicated data.
In some cases, for example, IBM’s ProtecTIER, storage systems use Fibre Channel drives instead of SATA. This can decrease the seek time, but it comes at a significantly higher cost. Data Domain systems, by being CPU-centric and minimizing disk usage to only what is required to store the actual data, end up having a smaller footprint. This can look like a weakness, but it’s actually a strength.
*
© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick to edit Master text styles Second level Third level Fourth level Fifth level
Delivering Data Protection as a Service
Secure Multi-Tenancy
Enables enterprises to deliver Data Domain in a private cloud
Enables service providers to deliver Data Domain in a private/public cloud
Features:
Tenant management and reporting
*
Secure multi-tenancy enables large enterprises and services providers to offer data protection as a service with Data Domain systems in a private or public cloud. With secure multi-tenancy, a Data Domain system can logically isolate data and restrict each tenant’s visibility and read/write access to only their data. In addition, secure multi-tenancy provides management and monitoring by tenant to enable chargeback, trending and other reporting.
*
Data Domain Software Options
Data Domain Encryption
Protects against theft or loss of a physical system
Data Domain Extended Retention
Long-term retention of backup
Data Domain Replicator
Network-efficient and encrypted
Consolidate up to 270 remote sites into a single system
Data Domain Retention Lock
Satisfies governance and compliance
Supports open systems and
IBM i operating environments
*
EMC offers six different Data Domain software options that can enhance the value of a Data Domain system in your environment.
The first is Data Domain Boost, which provides advanced integration with backup and enterprise applications for increased performance and ease of use.
DD Replicator software provides network-efficient, encrypted replication for backup, archive and disaster recovery. In addition, you can replicate up to 270 remote sites into a single Data Domain system for consolidated protection of your distributed enterprise.
DD Encryption software enhances the security of backup and archive data that resides on EMC Data Domain deduplication storage systems with encryption that is performed inline—before the data is written to disk. Encrypting data at rest satisfies internal governance rules and compliance regulations and protects against theft or loss of a physical system.
DD Extended Retention software, which enables long-term retention of backup data on the higher end Data Domain systems with up to 100 PB of logical capacity.
DD Retention Lock software provides governance and compliance retention to satisfy IT governance and compliance standards including SEC 17a-4(f) for archive data.
*
Data Domain Boost
Advanced integration with leading backup and enterprise applications
Speeds backups by up to 50%
Enables more efficient resource utilization
Provides application control of Data Domain replication process
DD Boost
*
Data Domain Boost is a software option supported across the entire Data Domain family, that distributes parts of the deduplication process out of the Data Domain system and onto the backup or application server enabling client-side deduplication. The speeds backups by up to 50% and enables more efficient resource utilization – including reducing the impact on the server by 20 to 40% and reducing the impact on the network by 80 to 99%.
In addition, DD Boost for backup applications enables the application to control the Data Domain replication process with full catalog awareness of both the local and remote copies of the backup.
*
© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick to edit Master text styles Second level Third level Fourth level Fifth level
Data Domain Boost Ecosystem
Data Domain Replicator
Reduces bandwidth requirements up to 99%
Protects sensitive data when replicating over untrusted networks
Accelerates time-to-disaster recovery (DR) readiness
Consolidates backup and archive data from hundreds of remote sites
Leverages multiple replication topologies
EMC Data Domain Replicator provides fast, network-efficient and encrypted replication for disaster recovery (DR), remote office backup and recovery, multisite tape consolidation, and long-term offsite retention. Data Domain Replicator asynchronously transfers only compressed, deduplicated data over a wide area network (WAN), which eliminates up to 99 percent of the bandwidth required compared to standard replication methods.
When replicating over untrusted networks, Data Domain Replicator can encrypt sensitive data. This encryption can be enabled on all or only a selected portion of the replicated data set.
For fast time-to-DR readiness, Data Domain Replicator provides logical throughput performance up to 52 TB per hour over a 10 Gb network in replication deployments where one Data Domain system is mirroring its data to another.
You can also consolidate data from up to 270 remote sites by simultaneously replicating data to a single, large Data Domain system at a central hub.
*
Data Domain Encryption
Encrypts all data stored on a Data Domain system
Encrypts data inline before it’s written to disk
Leverage the internally generated static default key or rotate keys for compliance
Backup
Archive
*
With EMC Data Domain Encryption, encryption of data-at-rest safeguards user data in the event of theft or loss of physical storage media. Additional privileged commands can lock and unlock the file system to further secure and protect user data during system transport.
Data Domain Encryption seamlessly integrates with the high-speed, inline deduplication process and encrypts data before it’s written to disk. Inline encryption provides a fast and secure solution that ensures that user data never resides in a vulnerable, unencrypted state on the disk subsystem.
By default, Data Domain Encryption software option encrypts all data on the system using an internally-generated encryption key. This encryption key is static and cannot be changed by the user.
*
Data Domain Extended Retention
Data Domain Controller
Active Tier
Retention Tier
Separate tiers of storage for long-term retention of data to eliminate reliance on tape
Cost-effective scalability
Granular replication for simplified disaster recovery
*
EMC Data Domain Extended Retention provides long-term retention of backup data and eliminate tape infrastructure for backup retention. This software is supported on the DD4200, DD4500, DD7200 and DD990 systems.
EMC Data Domain Extended Retention provides an internal tiering approach that enables cost-effective, long-term retention of backup data on an EMC Data Domain system. With it, customers can leverage Data Domain systems for long-term backup retention and minimize reliance on tape. Data Domain Extended Retention transparently incorporates two tiers of storage on a Data Domain system to achieve cost-effective scalability while delivering the throughput required to ingest hundreds of terabytes of backup data. This combination makes Data Domain systems the ideal tape elimination solution for long-term backup retention.
Data Domain Extended Retention provides transparent separation of short-term and long-term backup data by storing it in different tiers on Data Domain systems. Data is initially stored in the active tier for backup and operational recovery, then moved to an extremely scalable retention tier that’s optimized for long-term data retention—usually measured in years.
It ensures long-term data access and recoverability with fault isolation so that in the event of a failure or catastrophe the system continues to operate with all unaffected components.
*
© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick to edit Master text styles Second level Third level Fourth level Fifth level
Governance and Compliance for Archive Data
Data Domain Retention Lock
Efficiently store and manage governance and compliance archive data on a single Data Domain system
Meets the strictest regulatory requirements such as SEC 17a-4(f)
Litigation hold protects archive data during legal actions
Secure file locking of archive data at an individual file level
Integrates seamlessly with industry-leading archiving applications
Archive
Software
*
EMC Data Domain Retention Lock enables IT organizations to efficiently store and manage different types of governance and compliance archive data on a single EMC Data Domain system.
Data Domain Retention Lock Compliance edition meets the strictest retention requirements of regulatory standards such as SEC 17a-4(f) for electronic records including file, email, and content. Data Domain Retention Lock Compliance edition ensures that files on the Data Domain system that are locked by an archive application for a specified retention period cannot be deleted or overwritten under any circumstances until the retention period expires.
In addition, with Data Domain Retention Lock Compliance edition, litigation hold enables customers extend retention policies to protect compliant archive data during legal discovery.
Data Domain Retention Lock also enables secure file locking of archive data at an individual file level; enabling these files to be intermixed with unlocked files on the same Data Domain system.
*
Data Domain Virtual Tape Library
High-Speed, Inline Deduplication for SAN Environments
Eliminates physical tape challenges
Integrates seamlessly into existing Fibre Channel SAN environments
*
Data Domain Virtual Tape Library software eliminates the challenges of physical tape and can emulate up to 64 virtual tape libraries with up to 540 virtual tape drives, and unlimited tape cartridges.
EMC has qualified Data Domain Virtual Tape Library with leading open systems and IBM i enterprise backup applications, It integrates nondisruptively into existing Fibre Channel storage area network (SAN) backup environments.
*
© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick icon to add picture
Data Domain Management Center
Virtual Appliance for Aggregate Multi-system Management
Dashboards show the aggregate status of all Data Domain systems
Manages and monitors up to 75 Data Domain systems through a single interface
Role-based access control restricts access to authorized users
*
*
Data Domain Management Center is a virtual appliance that streamlines management and monitoring for environments with multiple Data Domain systems.
Data Domain Management Center provides three key benefits focused on enable IT to provide data management services:
Simplicity is created through the use of dashboards that show the aggregate status of all Data Domain systems in the environment as well as the ability to quickly drill-down into system specific details.
Scalability, which enables you to manage and monitor up to 75 Data Domain systems through a single interface.
*
Data Domain Systems
Backup
Archive
Database
Mainframe
*
Note to Presenter: There are six clicks in this animation; view in Slide Show mode for animation.
Let’s examine how Data Domain systems are the ideal protection storage platform for backup and archive data. On the top here, you can see a variety of data sources including databases, e-mail servers, virtual machines and more. On of the main strengths of Data Domain systems is that all of these data sources and a broad range of backup and archive use cases with leading backup and archive applications can be protected on a single system.
For both backup and archive use cases, one of the key differentiators Data Domain systems offer is the ability to encrypt data inline, meaning data is deduplicated then encrypted in real-time as it is written to disk. Furthermore, on the archive side, Data Domain systems can meet a variety of US and international compliance regulations for archive data – including SEC 17a-4(f).
*
Data Domain Systems
Data Domain Software
Usable capacity
*
*
© Copyright 2014 EMC Corporation. All rights reserved.
© Copyright 2014 EMC Corporation. All rights reserved.
Click to edit the outline text format Second Outline Level Third Outline Level Fourth Outline Level Fifth Outline Level Sixth Outline Level Seventh Outline LevelClick icon to add picture
Why Data Domain?
Industry-leading speed and scale
*
To summarize, Data Domain systems are high-speed, scalable systems that offer you the piece of mind that your data is easily accessible and recoverable. While network-efficient replication leads to time and costs savings. Data Domain also easily integrates with leading applications making them ideal for almost any environment. Finally, your archive data will be able to meet the strictest compliance and governance regulations.