GlusterFS – a Scale-Out Data Platform

John Mark WalkerGluster Community Guy

10/25/12

Topics

● What is GlusterFS● Humble beginnings

● Evolution● Community Process● GlusterFS 3.3● The Future is Gluster

10/25/12

Simple Economics● Simplicity, scalability, less cost

Multi-TenantVirtualized Automated Commoditized

Scale on Demand In the Cloud Scale Out Open Source

10/25/12

Simplicity Bias

● FC, FCoE, iSCSI → HTTP, Sockets ● Modified BSD OS → Linux / User Space /

C, Python & Java● Appliance based → Application based

10/25/12

Scale-out Open Source is the winner

10/25/12

Bengaluru Office

Conference Room US Head Office

Bengaluru Office

Community Deployments

10/25/12

Not a Storage Company

● At first a cluster-building company● Engineering team excelled at

building open source HPC systems

10/25/12

Necessity:The Mother of

Invention

10/25/12

The big idea:Storage should be

simple

10/25/12

What is Simple Storage?

● Low-risk, easy to deploy and administer, data consistency, open source, software-only, user space

10/25/12

What is GlusterFS, Really?

Gluster is a unified, distributed storage system

● User space, global namespace, stackable, POSIX-y, scale-out NAS platform, inspired by GNU Hurd

10/25/12

Some Features

● No single point of failure● DHT

● Synchronous and asynchronous replication

● Proactive self-healing

10/25/12

What Can You Do With It? ● Media – Docs, Photos, Video● Shared storage – multi-tenant

environments● Big Data – Log Files, RFID Data● Objects – Long Tail Data

10/25/12

Standard Deployment● Distributed

over multiple servers

● Replicate volumes

● On top of disk FS (XFS, Ext4, ie. Xattrs)

● Multi-protocol access

Red Hat Proprietary17

Storage for Any EnvironmentScale-out NAS for On-premises and Public Clouds

●Standardized NAS infrastructure●On-premise and public cloud●POSIX-ish ●Apps move easily between environments●Replicate between both

Public CloudOn-premises

10/25/12

First Versions ● Toolkit for building storage systems● Very hacker-friendly● Community integral part of development

– Drove feature development– Repeatable use cases

10/25/12

Mid-2011 Snapshot ● Scale-out NAS● Distributed and replicated● NFS, CIFS and native GlusterFS ● User-space, stackable architecture● Lots of users, not many devs

→ A good platform to build on

10/25/12

GlusterFS 3.3: Building on the Foundation

● Granular locking● Proactive self-healing● Improved rebalancing● More access methods

10/25/12

Granular Locking– Server fails, comes back– Files evaluated– Block-by-block until healed

Server 1 Server 2

GlusterFS GlusterFS

Virtual Disk 1-1

Virtual Disk 1-2

Virtual Disk 2-1

Virtual Disk 2-2

Blocks compared

10/25/12

Proactive Self-healing– Performed server-to-server– Recovered node queries peers

/ Symlink 1

Hidden | Symlink 2 \ Symlink 3

File 1File 2File 3

Server 1 - good

Server 2 - recovered

Replicated

Server 3 - good

File 1File 2File 3

Server 4 - good

Self-healing

10/25/12

Easier Rebalancing

– Now faster● Previously, created entire new hash set, moving data unnecessarily

● Now recreates hash map and compares to old

– Easier to decommission server nodes– Proof point for synchronous translator

10/25/12

Unified File and Object (UFO)

– S3, Swift-style object storage– Access via UFO or Gluster mount

Client Proxy Account

Container

Object

HTTP Request

ID=/dir/sub/sub2/file

GlusterFS – a Scale-Out Data Platform - Linux Foundation · PDF file10/25/12 Enabling...

Documents

Porinting QEMU to Plan 9: QEMU Internals and Port Strategy

Project Aplo: GlusterFS Container Converged with OpenShift · Solution Provide a solution that will run GlusterFS as containers in OpenShift pods. Integrate GlusterFS deployment and

Glusterfs and Hadoop

QEMU-GlusterFS integration › en › latest › ...Enabling GlusterFS for Virtualization use QEMU-GlusterFS integration – Native integration, no FUSE mount – Gluster as QEMU block

SDK Application debug on Linux/QEMU TrainingQEMU to start You will see the QEMU start on the SDK console. To kill this session, select Stop QEMU in the same QEMU. The QEMU session

Glusterfs for sysadmins-justin_clift

GlusterFS Challenges and Futures · GlusterFS Challenges and Futures Jeff Darcy Storage Developer Conference September 17, 2012

GlusterFS – Architecture & Roadmap

QEMU Backend Block Device Driver In Practicebos.itdks.com/13e2bb7282b94eac802ee38afaff83c2.pdf · QEMU Utils qemu-img qemu-img create -f qcow2 -o size=100G vol.qcow2 qemu-img info

QEMU-GlusterFS integration

GlusterFS at OSUOSL

Implement GlusterFS on Azure · PDF fileImplement GlusterFS on Azure . 4 . Introduction . Recently, AzureCAT worked with a customer to set up GlusterFS distributed file system, a free,

Petascale Cloud Storage with GlusterFS

Marian Marinov Clusters With Glusterfs

QEMU QEMU and SystemCand SystemC

GlusterFS Update and OpenStack Integration

GlusterFS and RHS for SysAdmins - Red Hatpeople.redhat.com/ndevos/talks/fisl15/gluster_for_sysadmins... · GlusterFS: ... Gluster Red Hat Storage ... GlusterFS and RHS for SysAdmins

P-QEMU: A Parallel Multi-core System Emulator Based On QEMU

GlusterFS and Openstack Storage

Qemu structure