C.W. Smith, S. Tran, O. Sahni, and M.S. Shephard, Rensselaer Polytechnic Institute; Raminder Singh, Indiana University, [email protected]. Enabling HPC Simulation Workflows for Complex Industrial Flow

XSEDE15_PhastaGateway


Page 1: XSEDE15_PhastaGateway

C.W. Smith, S. Tran, O. Sahni, and M.S. Shephard, Rensselaer Polytechnic Institute

Raminder Singh, Indiana University, [email protected]

Enabling HPC Simulation Workflows for Complex Industrial Flow

Page 2: XSEDE15_PhastaGateway

Complex Flow Simulations

[Diagram: adaptive simulation workflow]

• Inputs: Physics and Model Parameters; Input Domain Definition with Attributes

• Components: Parasolid or GeomSim; MeshSim and MeshSim Adapt; PHASTA (NS, FE, Level set); Solution Transfer; Hessian-based error indicator; ParaView

• Parallel Data & Services: Domain Topology; Mesh Topology/Shape; Dynamic Load Balancing; Simulation Fields; Partition Control

• Data exchanged between components: attributed topology; non-manifold model construction; geometry updates; meshing operation geometric interrogation; mesh; mesh size field; mesh with fields; calculated fields; meshes and fields; solution transfer constraints

Page 3: XSEDE15_PhastaGateway

Project challenges

High barrier to run HPC workflows

– Requires knowledge of the file system, scheduler, scripting, runtime environment, compilers … for each HPC system

Other Challenges

– Must have a very high degree of automation: a human in the loop kills scalability and performance

– Need easy access to parallel computers

Page 4: XSEDE15_PhastaGateway

User specifies

• problem definition

• simulation parameters

• required compute resources

through the experiment creation web page

• Workflow steps are executed on the HPC system

• User is emailed

• Output is prepared for download, with the option to delete or archive

• Scales to multiple users and systems

Science gateway for PHASTA lowers the barrier
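The request-then-automate pattern on this slide can be sketched in a few lines of Python. This is an illustrative sketch only, not the gateway's actual code or the Airavata API: the class names, field names, and the `run_experiment` function are all hypothetical, and the workflow steps are passed in as placeholders for work that would run on the HPC system.

```python
from dataclasses import dataclass, field


@dataclass
class ExperimentRequest:
    """What a user enters on the experiment creation page (hypothetical fields)."""
    problem_definition: str       # e.g. name of the uploaded case
    simulation_parameters: dict   # solver settings
    resource: str                 # name of the target HPC system
    nodes: int                    # required compute resources
    walltime_minutes: int
    user_email: str


@dataclass
class ExperimentResult:
    steps_run: list = field(default_factory=list)
    output_ready: bool = False


def validate(request: ExperimentRequest) -> None:
    """Reject requests that could never be scheduled."""
    if not request.problem_definition:
        raise ValueError("a problem definition is required")
    if request.nodes < 1:
        raise ValueError("nodes must be at least 1")
    if request.walltime_minutes < 1:
        raise ValueError("walltime must be at least 1 minute")


def run_experiment(request, workflow_steps, send_email):
    """Run every step in order with no human in the loop, then notify the user."""
    validate(request)
    result = ExperimentResult()
    for name, step in workflow_steps:
        step(request)             # placeholder for work done on the HPC system
        result.steps_run.append(name)
    result.output_ready = True
    send_email(request.user_email, "Experiment finished; output is ready for download")
    return result
```

Because the steps and the email function are passed in as arguments, the same driver can serve several users and several systems; only the step implementations differ per machine.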

Page 5: XSEDE15_PhastaGateway

• Used the PHP Gateway framework with Airavata to develop the gateway and enable the PHASTA application

• Set up a community account to support the community

• Defined the resources to run the application

– TACC Stampede

– CCI IBM Blue Gene

• Defined the PHASTA application

PHASTA Solution

Page 6: XSEDE15_PhastaGateway

What is PGA?

• PGA is the sample gateway implemented to demonstrate Airavata middleware features.

• You can download and use it as is, or modify it according to your requirements.

• An Ansible script is available, along with a Docker image worked on by a GSoC student.

• PGA is developed using PHP.

• Visit PGA at:

– https://testdrive.airavata.org/


Page 7: XSEDE15_PhastaGateway

Landing page

Page 8: XSEDE15_PhastaGateway

User login and creation

Page 9: XSEDE15_PhastaGateway

Resource definition

Page 10: XSEDE15_PhastaGateway

Application definition

Page 11: XSEDE15_PhastaGateway

Create Experiment

Page 12: XSEDE15_PhastaGateway

Monitor Experiment

Page 13: XSEDE15_PhastaGateway

Experiment Statistics

Page 14: XSEDE15_PhastaGateway

Other Examples using PGA

Page 15: XSEDE15_PhastaGateway

Gateway Features for Default User

• In the gateway, the default user can:

– Create and Launch Experiments.

– Monitor Experiments.

– Create Projects (Experiment grouping).

– Clone, Cancel and Edit Experiment.

– Report Issues & Provide Feedback.
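The experiment actions listed above (create, launch, edit, clone, cancel) can be read as a small state machine. The Python sketch below is a hypothetical model for illustration; the state names and the rule that only unlaunched experiments can be edited are assumptions, not behaviour taken from PGA or Airavata.

```python
from dataclasses import dataclass, replace
from enum import Enum


class State(Enum):
    CREATED = "created"
    LAUNCHED = "launched"
    CANCELED = "canceled"


@dataclass(frozen=True)
class Experiment:
    name: str
    project: str      # projects group related experiments
    inputs: tuple
    state: State = State.CREATED


def launch(exp: Experiment) -> Experiment:
    if exp.state is not State.CREATED:
        raise ValueError("only a newly created experiment can be launched")
    return replace(exp, state=State.LAUNCHED)


def edit(exp: Experiment, inputs) -> Experiment:
    # Assumption: inputs are frozen once the experiment has been launched.
    if exp.state is not State.CREATED:
        raise ValueError("only an unlaunched experiment can be edited")
    return replace(exp, inputs=tuple(inputs))


def cancel(exp: Experiment) -> Experiment:
    if exp.state is not State.LAUNCHED:
        raise ValueError("only a launched experiment can be canceled")
    return replace(exp, state=State.CANCELED)


def clone(exp: Experiment, new_name: str) -> Experiment:
    # A clone copies project and inputs but starts over in the CREATED state.
    return Experiment(name=new_name, project=exp.project, inputs=exp.inputs)
```

Cloning is what makes rerunning a finished or canceled case cheap: the copy can be edited and launched again without re-entering every input.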


Page 16: XSEDE15_PhastaGateway

Apache Airavata

Page 17: XSEDE15_PhastaGateway

• Address user requests

• Allow staging data from user desktop to resource and vice-versa

• Tail remote application logs

• User key generation and CCI user accounts

Future work

Page 18: XSEDE15_PhastaGateway

Industry Challenge Talk Wednesday @ 4

Page 19: XSEDE15_PhastaGateway

Workflow Diagram for SEQC Transcriptome Assembly and Evaluation

Pre-processing. Input: sequencing reads, FASTQ files
• Adapter trimming (cutadapt software)
• Poly-A/T trimming, and removing mtRNA and rRNA (custom script)
• Error correction for RNA-Seq reads (SEECER)

Statistical comparison of all the ~60 assemblies (statistical testing for a population of assemblies)
• Novel score: Efficiently Covered Bases for All Genes (EC-BAG) score (custom script)
• Statistical testing, e.g. ANOVA

Passed QC? (custom script needed to check the QC criteria, e.g.: If (CEGMA_CEGs > 235) then CEGMA_flag = Passed)
• Yes: continue
• No: discard the assembly

Transcriptome assemblies. Input: trimmed sequencing reads, FASTQ files
• Assembling samples A and B for six centers, using different replicate combinations (Trinity software)
• ~60 transcriptome assemblies

Genome coverage and SNP detection for FASTQ trimmed input reads
• Mapping input reads to the reference genome (TopHat software)
• SNP detection (GATK software). Output: called SNP_Reads
• Genome coverage, using mapped reads (featureCounts, R Bioconductor package)

Quality control (QC). Input: assembled contigs files (FASTA format)
• DETONATE (DETONATE software, using the human reference genome)
• CEGMA (CEGMA software)
• Assembly statistical outputs (provided by Trinity for each assembly)
• Mapping reads back to the contigs (TopHat software)

Genome coverage and SNP detection for FASTA assembled contigs
• Mapping assembled contigs to the reference genome (GMAP software)
• SNP detection (GATK software). Output: called SNP_Contigs
• Genome coverage, using mapped contigs (featureCounts, R Bioconductor package)

SNP comparison
• Comparing detected SNP_Contigs with dbSNP (custom script and SnpSift)
• Comparing detected SNP_Reads with dbSNP (custom script and SnpSift)
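The "Passed QC?" decision in this diagram calls for a custom script. A minimal Python sketch of such a gate follows, using only the one criterion the slide gives as an example (CEGMA_CEGs > 235). The function names, the "Failed" label, and the dictionary input format are assumptions made for illustration; a real check would combine further QC criteria.

```python
CEGMA_MIN_CEGS = 235  # threshold from the example criterion on the slide


def cegma_flag(cegma_cegs: int, threshold: int = CEGMA_MIN_CEGS) -> str:
    """Return "Passed" when the CEG count is strictly above the threshold."""
    return "Passed" if cegma_cegs > threshold else "Failed"


def split_assemblies(ceg_counts: dict):
    """Split assemblies into those kept for analysis and those discarded.

    ceg_counts maps an assembly name to its CEGMA CEG count (hypothetical format).
    """
    kept, discarded = [], []
    for name, cegs in ceg_counts.items():
        if cegma_flag(cegs) == "Passed":
            kept.append(name)
        else:
            discarded.append(name)
    return kept, discarded
```

The comparison is strict, matching the slide's `>`: an assembly with exactly 235 CEGs is discarded.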

Page 20: XSEDE15_PhastaGateway

Thanks! Questions?

[email protected]

[email protected]

https://iu.box.com/xsede15