Scality presentation cloud Computing Expo NY 2012 v1.0

Preview:

DESCRIPTION

This is the presentation i've made about Object Storage and why it's the next generation of storage for large-scale storage or unstructured data.

Citation preview

Slide 1

Cloud Storage Made Seamless

Marc VillemadeTechnology EvangelistScality

Ranajit NevatiaVP, MarketingPanzura

There are two types of data(roughly)

StructuredWe (sort of) know how to manage this

UnstructuredThis is the new beast we have issues with

Slide 2

How to define Structured Data?Structured data is a set of organized pieces of data

Relational databases are a perfect exampleAtomic pieces are, on their own, meaningless

Slide 3

What about Unstructured Data?Unstructured data is self-contained pieces of

data Self-descriptiveMeaningful in and of itselfTypically has metadata attached to it

Email, Videos, Presentations, Spreadsheets, satellite images…

An easy way to think about it is anything that can be stored in one file is unstructured data

Slide 4

Some numbers…In 2012, Humanity will generate 2.7 ZB of data 1

It is estimated that we permanently store ~ 1 ZB of it 2 (~40%)

80% of it is unstructured 1

500 Quadrillion files (500,000 million million files)

Next year and so on, it will grow by 50% y-o-y 1

It will double every 2 years in the next 10 years

Kind of unfathomable, ain’t it?

Slide 5

(1) IDC numbers – (2) University of Southern California (2007)

Humans like organized thingsWell, some of them at least…

Structured storage systems have been used for Unstructured Data Organized in file systems, hierarchies, directories Easier for us

And then new data creation patterns emerged early 2000s The model doesn’t fit anymore And here’s why

Slide 6

Typical SAN / NAS issues at Scale

Technology refresh and migration necessary to benefit from larger disks

Scheduled maintenance window nuisance

Limitations on # of files

Volume management is complex

Serial architecture compromises performance

RAID is less efficient for large drives

FC networks are expensive & point-to-point

Cost is prohibitive for large capacity

Slide 7

Humans like organized thingsWell, some of them at least…

Structured data storage systems are used for Unstructured. Organized in file systems, hierarchies, directories Easier for us

And then new data creation patterns emerged early 2000s The model doesn’t fit anymore SANs and NASes were not made to handle this

Slide 8

So what’s the solution?

We believe it’s Object StorageYahoo!, Amazon, Google.. were the pioneers

Main CharacteristicsFlat NamespaceInfinite ScalabilityElasticityCost-efficiencyData availability and durability

Slide 9

Scality’s Storage Vision

Slide 10

Their DCTheir App.YOUR Data

Their DCYOUR App.YOUR Data

YOUR DCYOUR App.YOUR Data

Scality has developed a distributed (scale–out) object-based storage software to turn x86 servers into Petabyte scale storage for unstructured data (files).(Scality is NOT designed for VM, VDI, Relational Database)

Slide 11

What is the Secret Sauce?

• Distributed System• Distributed metadata• No Single point of failure• Self healing• Organic upgrades

Slide 12

What’s unique about Scality RING

• Performance• ESG Lab report: we’re 10x faster than any other object store

• Hardware-agnostic• Software Vendor• Mixed hardware (disks, nodes)

• Erasure-Coding with No penalty on read• With only 60% overhead

• Tiering• Policy driven• Automatic, Transparent

Recommended