Big Data Processing in the Cloud: A Hydra/Sufia Experience


DESCRIPTION

This presentation addresses the challenge of processing big data in a cloud-based data repository. Using the Hydra Project's Hydra and Sufia Ruby gems, and working with the Hydra community, we built a dedicated repository for the project and set up background jobs. Our approach is to create metadata with these jobs, distributed across multiple computing cores. This lets us scale our infrastructure out on an as-needed basis and decouples automatic metadata creation from the response times seen by the user. Although derived metadata is not available immediately after ingestion, the object itself is. By distributing the jobs, we can compute complex properties without impacting the repository server. Hydra and Sufia gave us a head start: a simple self-deposit repository, complete with background job support via Redis and Resque.
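For context, a background job in this setup is just a plain Resque job class. The sketch below is illustrative only: the class name, queue name, and the Repository/Characterizer calls are hypothetical stand-ins, not Sufia's actual internals.

```ruby
# Minimal sketch of a Resque background job for metadata creation.
# Class name, queue name, and the characterization step are placeholders.
require 'resque'

class CreateMetadataJob
  @queue = :metadata   # name of the Redis-backed queue this job is pushed onto

  # Resque workers call self.perform with the arguments given at enqueue time.
  def self.perform(object_id)
    object = Repository.find(object_id)                    # hypothetical lookup
    object.metadata = Characterizer.run(object.content)    # hypothetical, CPU-heavy step
    object.save
  end
end
```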


BIG DATA PROCESSING IN THE CLOUD: A HYDRA/SUFIA EXPERIENCE

Helsinki, June 2014

Collin Brittle, Zhiwu Xie

WHO?

WHAT?

WHY?

SENSORS

SMART INFRASTRUCTURE

DATA SHARING

• Encourage exploratory and multidisciplinary research

• Foster open and inclusive communities around:

  • modeling of dynamic systems
  • structural health monitoring and damage detection
  • occupancy studies
  • sensor evaluation
  • data fusion
  • energy reduction
  • evacuation management
  • …

CHARACTERIZATION

• Compute intensive

• Storage intensive

• Communication intensive

• On-demand

• Scalability challenge

COMPUTE INTENSIVE

• About 6GB raw data per hour

• Must be continuously processed, ingested, and further processed

• User-generated computations

• Must not interfere with data retrieval

STORAGE INTENSIVE

• SEB will accumulate about 60TB of raw data per year

• To support researchers, we must keep raw data for an extended period of time, e.g., >= 5 years

• VT currently does not have an affordable storage facility to hold this much data

• Within XSEDE, only TACC’s Ranch can allocate this much storage
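• (For scale: the ~6GB/hour raw rate quoted earlier works out to 6GB × 24 × 365 ≈ 53TB of raw data per year, the same order as the ~60TB figure above.)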

COMMUNICATION INTENSIVE

• What if hundreds of researchers around the world each tried to download hundreds of TB of our data?

ON DEMAND

• Exploratory and multidisciplinary research cannot predict data usage beforehand

SCALABILITY

• How to deal with these challenges in a scalable manner?

BIG DATA + CLOUD

• Affordable

• Elastic

• Scalable

FRAMEWORK REQUIREMENTS

• Mix local and remote content

• Support background processing

• Be distributable

OBJECTS AND DATASTREAMS

[Diagram: a local object with metadata datastreams and a file datastream]

REMOTE STORAGE

[Diagram: the local repository linked to remote storage on Amazon EC2, S3, and Glacier]
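One way to realize the remote-storage half of this picture is to have a background job push a copy of a datastream to S3 (with a lifecycle rule migrating it onward to Glacier). The sketch below assumes the aws-sdk gem; the bucket name, region, key layout, and local_path argument are placeholders, not the project's actual configuration.

```ruby
# Sketch: archive a local datastream copy to Amazon S3 (aws-sdk v2 API).
require 'aws-sdk'
require 'resque'

class ArchiveToS3Job
  @queue = :archive

  def self.perform(object_id, local_path)
    s3  = Aws::S3::Resource.new(region: 'us-east-1')      # placeholder region
    key = "datastreams/#{object_id}/content"               # illustrative key layout
    s3.bucket('my-repository-archive').object(key).upload_file(local_path)
  end
end
```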

FRAMEWORK REQUIREMENTS

• Mix local and remote content

• Support background processing

• Be distributable

BACKGROUND PROCESSING

[Diagram: Clients talk to a Public Server backed by a Database; jobs are queued in Redis and processed by multiple Workers]
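In terms of the diagram above, the public Rails server only pushes a small job description into Redis and returns to the client; worker processes, on the same or other machines, pop jobs off the queue and do the heavy lifting. A hedged sketch, reusing the hypothetical job class from the description:

```ruby
# On the public server: enqueue and return to the user immediately.
Resque.enqueue(CreateMetadataJob, 'vt:12345')   # object id is illustrative

# On each worker machine, Resque's bundled rake tasks start workers, e.g.:
#   QUEUE=metadata rake environment resque:work       # one worker
#   QUEUES=* COUNT=4 rake environment resque:workers  # several workers on one box
```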

FROM QUEUES TO THE CLOUD

[Animation: data blocks moving through the queue and out to the cloud]

QUEUEING

FRAMEWORK REQUIREMENTS

• Mix local and remote content

• Support background processing

• Be distributable

FROM QUEUES TO THE CLOUD

[Animation: data blocks moving through queues]

DISTRIBUTED PROCESSING

[Diagram: Clients talk to a Public Server backed by a Database; a Redis Master (replicated to a Redis Slave) queues jobs for several Private Servers]
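To distribute processing this way, each private server only needs to point Resque at the shared Redis instance. A minimal initializer sketch; the hostname and port are placeholders, and the slave in the diagram is assumed to be a replica for redundancy rather than a separate queue.

```ruby
# config/initializers/resque.rb (sketch)
require 'resque'

# All public and private servers enqueue to, and work from, the same Redis master.
Resque.redis = Redis.new(host: 'redis-master.internal', port: 6379)
```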

SCALE UP

WE CHOSE SUFIA

WHAT IS SUFIA?

• Ruby on Rails framework…

• Based on Hydra…

• Using Fedora Commons…

• And Resque
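The stack named on this slide maps onto a fairly small Gemfile. A sketch only, with versions omitted; Sufia itself pulls in the Hydra and ActiveFedora dependencies that talk to Fedora Commons.

```ruby
# Gemfile (sketch)
source 'https://rubygems.org'

gem 'rails'
gem 'sufia'    # self-deposit repository features, built on Hydra and Fedora Commons
gem 'resque'   # Redis-backed background jobs, as described in the abstract
```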

FRAMEWORK REQUIREMENTS

• Mix local and remote content

• Support background processing

• Be distributable

QUESTIONS? rotated8 (who works at) vt.edu
