11
Bioconductor on the RosettaHUB community platform www.rosettahub.com Karim Chine RosettaHUB, Inc [email protected] BioC 2016 24 th June 2016

Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc [email protected] BioC 2016 24th June 2016

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

Bioconductor on the RosettaHUBcommunity platform

www.rosettahub.com

Karim ChineRosettaHUB, Inc

[email protected]

BioC 2016

24th June 2016

Page 2: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

A universal open platformfor scientific computing and collaboration

Computational ComponentsR packages, Bioconductor, Wrapped C,C++,Fortran code, Python modules, Matlab

Toolkits…

Open source or commercial

Computational ResourcesClusters, grids, private or public clouds

Free or pay-per-use

Computational GUIsHTML5 and Desktop Workbench

Built-in views /Plugins /Collaborative views

Shiny apps

Open source or commercial

Computational ScriptsR / Python / Matlab / Groovy

Computational APIsJava / SOAP / REST, Stateless and stateful

Computational StorageLocal, NFS, FTP, Amazon S3, EBS, HDFS

Generated Computational Web ServicesStateful or stateless, mapping of R objects/functions

RosettaHUB

Page 3: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

Infrastructures federation: RosettaHUB cloud

Public Clouds

Private Cloud

rosettahub.com

Page 4: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

AWS: programmable infrastructure

Command Line

Web Console

SDK

API

Page 5: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

RosettaHUB Command Line

RosettaHUB Web Console

RosettaHUB SDKs

RosettaHUB API

RosettaHUB: programming with data andinfrastructure

Page 6: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

Google Docs-like real time collaboration

rosettahub.com

Page 7: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

Traceable and reproducible data science

RosettaHUB

Machine Image A

R 3.1

Bioconductor 3.0

RosettaHUB Amazon Machine Images

RosettaHUB

Machine Instance 1

Bsaed on Image A

Amazon Elastic Block Stores

rosettahub.com

RosettaHUB

EBS 2

Data Set D2

RosettaHUB

Machine Image B

R 3.2

Bioconductor 3.1 RosettaHUB

Machine Image C

R 3.2

Bioconductor 3.2

RosettaHUB

EBS 2

Data Set D2

RosettaHUB

Machine Instance 2

Bsaed on Image A

Researcher

Reviewer

RosettaHUB

EBS 2

Data Set D2

RosettaHUB

EBS 1

Data Set D1

RosettaHUB

EBS 4

Data Set D4

RosettaHUB

EBS 3

Data Set D3

Page 8: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

A multi-language framework

Page 9: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

A universal data science engine

• Reactive data science micro services platform

• Based on Java/R/Python processes

• Events-driven remote objects/engines

• Fully Dockerized

• Collaborative spreadsheets

• Collaborative scientific graphics canvas

• Collaborative dashboards

• Collaborative widgets

Page 10: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

The platform architecture

Docker Swarm

Linux cluster

RosettaHUB Cloud Broker

Data Science Portal

Platform Message Broker

Liferay MySQL

Database

RosettaHUB MySQL Database

Ro

settaHU

B P

latform

System Administrator

Clouds Management

Console

Azure API

Job Sched

uler A

PI

OpenN

ebula API

Do

cker/Swarm

AP

I

Data Science Workbench

Views

Science Gateways

Factory

eLearning Apps

Social Apps MarketplaceReal-time

Collaboration Apps

LiferayAPI

RosettaHUB Public API

VM

VM

VM

GCE API OpenStack APIAWS API

Researcher Teacher Student

Page 11: Bioconductor on the RosettaHUB community platform...Bioconductor on the RosettaHUB community platform Karim Chine RosettaHUB, Inc karim.chine@rosettahub.com BioC 2016 24th June 2016

Inside the containers and engines

Rosetta Engine

PythonWolframLanguage

Julia SQLEmbeddedderbyDB

GraphDB

EmbeddedorientDB

JDBC SQLMySql

PostgreSQLRedshift

...

Scala/

Spark

Java Platform

LanguagesGroovyJython

...JNI JNIJNI

Jupyter Server

R Studio

shiny

R

JNI

VNC Server

JSON / NoSQL cloud databaseEquiv. Firebase

Server

ParaviewWeb ServerSsh

Unified Data Bus

Java Virtual Machine

SpreadsheetEngine

Cross-language Interactive/Collaborative

Widgets

Cross-language MacrosHTTPFile

Server

Rosetta Engine SOAP API

Rosetta Engine JSON HTTP API

Rosetta Engine Real-time Events Bus

Rosetta Gateway

FTP File ServerSecurity Policy

Manager

Docker Container

Virtual Machine