CL2QCD
Lattice QCD with OpenCL
https://github.com/CL2QCD/cl2qcd

Developers: Alessandro Sciarra, Christopher Czaban, Francesca Cuteri
Past developers: Matthias Bach, Christopher Pinke, Frederik Depta, Christian Schäfer, Lars Zeidlewicz

Alessandro Sciarra, Lattice Seminar, DESY (Zeuthen), 20.06.2016
Lattice group of Owe Philipsen, Goethe Universität Frankfurt
Motivation and Philosophy
Our physics motivation (I)
QCD THERMODYNAMICS
N_s^3 × N_t lattice → T = 1/(a·N_t), with 1 ≪ N_t ≪ N_s

[Figure: second order scenario vs. first order scenario]
Alessandro Sciarra (Goethe Universität) CL2 QCD HIC for FAIR – 20.06.2016 2 / 21
Our physics motivation (II)
[Figure: Columbia-like phase diagram in the (m_{u,d}, m_s, µ/T) space, showing the Roberge–Weiss (RW) plane, the Z_2 and O(4) regions, the points A and B, first order triple lines and the m_s = ∞ limit]
[M. D'Elia, O. Philipsen et al., Phys.Rev. D90 (2014)]

In the volume below the plane µ = 0, simulations are safe.
The A point in the Roberge–Weiss plane is at a value of the mass that can be simulated.
The position of the point B determines the type of transition at the N_f = 2 chiral point at µ = 0.
The idea is to study the line connecting A and B in the m_s = ∞ plane.
Why do we need a fast code?

Just to give an idea of how costly it is...

for Nt in ...; do                  # ~3 values
  for mu in ...; do                # ~6 values
    for mass in ...; do            # ~6 values
      for Ns in ...; do            # ~3 values, >= 3*Nt
        for T in ...; do           # ~5 values
          echo "Run the (R)HMC for >50k trajectories"
          # ...
        done
      done
    done
  done
done
# Consider that the typical time of a simulation varies from weeks to months.
Considering 3 months as the average time of a single simulation...
Running the loops in order as above → 405 years
Running the inner three loops in parallel → 4.5 years × a stretch factor (roughly 1.5, from feedback and technical overhead)
Should a code just work? NO

Any code in principle should be:
- Readable
- Maintainable
- Easy to extend
- Easy to use
- Hard to break
- Testable

CL2QCD: Lattice QCD with OpenCL, Clean Code 2 Limit Questions and Doubts
Outline of the talk

1 Motivation and our philosophy
2 CL2QCD features
3 Structure of the code: Physics, Hardware
4 Programming on GPU
5 Unit tests, maintainability and portability
6 Performances of the code: Multiple GPUs
7 Ongoing and future work
The code from the user point of view (I)
Features implemented in CL2QCD
Different fermion formulations
  ✓ Wilson fermions in standard formulation
  ✓ Staggered fermions in rooted standard formulation
  ✓ Twisted Mass Wilson fermions
Improved gauge actions (for Monte Carlo simulations)
  ✓ Tree-level Symanzik
  ✓ Iwasaki
  ✓ Doubly Blocked Wilson
Pure gauge simulations (heatbath and Monte Carlo)
Standard inversion and integration algorithms
ILDG-compatible input/output (using LIME)
RANLUX pseudo-random number generator
Even-odd preconditioning
Hasenbusch's trick for Wilson fermions
The code from the user point of view (II)
Installation procedure

# Clone the git repository
cd <CL2QCD_INSTALL_DIR>
git clone https://github.com/CL2QCD/cl2qcd.git
# Make a build directory
cd <CL2QCD_INSTALL_DIR>/cl2qcd
mkdir build
cd build/
#
# Run cmake; if not found automatically, cmake variables
# can be set via command line as "-D <CMAKE_VARIABLE>=<VALUE>"
#
cmake ..
# Build all executables
make -j
The compiler must be capable of basic C++11 features
Required libraries:
  OpenCL  → http://www.khronos.org/opencl/
  LIME    → http://usqcd.jlab.org/usqcd-docs/c-lime/
  libxml2 → http://xmlsoft.org/
  Boost   → http://www.boost.org/
  GMP     → http://gmplib.org/
  MPFR    → http://www.mpfr.org/
  Nettle  → http://www.lysator.liu.se/~nisse/nettle/
The code from the user point of view (III)
EXECUTABLES
  HMC              - only with (Twisted Mass) Wilson fermions
  RHMC             - only with standard staggered fermions
  SU(3) Heatbath   - possibility of doing overrelaxation
  Inverter         - measurement of fermionic observables
  Gaugeobservables - measurement of gauge observables
Executable Options
Each executable has its helper → ./<executable> -h
Each option has a default value, used if not specified.
The helpers also list options not relevant for that algorithm.
Some options are used for profiling/benchmarks only.
...
HMC options:
  --tau arg (=0.5)
  --reversibility_check arg (=0)
  --integrationsteps0 arg (=10)
  --integrationsteps1 arg (=10)
  --integrationsteps2 arg (=10)
  --hmcsteps arg (=10)
  --num_timescales arg (=1)
  --integrator0 arg (=leapfrog)
  --integrator1 arg (=leapfrog)
  --integrator2 arg (=leapfrog)
  --lambda0 arg (=0.19318332750378359)
  --lambda1 arg (=0.19318332750378359)
  --lambda2 arg (=0.19318332750378359)
  --use_gauge_only arg (=0)
  --use_mp arg (=0)
...
Possible input file:
...
fermact=rooted_stagg
num_tastes=2
solver=cg
cgmax=8000
measure_correlators=0
cg_iteration_block_size=50
num_timescales=2
tau=1
integrator0=twomn
integrator1=twomn
integrationsteps0=3
integrationsteps1=6
nspace=24
ntime=6
mass=0.6000
rhmcsteps=100000
savefrequency=500
savepointfrequency=200
startcondition=hot
...
The output levels of the code

Einhard © 2010 Matthias Bach

Levels: OFF, FATAL, ERROR, WARN, INFO, DEBUG, TRACE

Example output:
  [01:05:26] FATAL:
  [22:13:59] ERROR:
  [15:42:13] WARNING:
  [10:00:12] INFO:
  [03:35:17] DEBUG:
  [08:52:48] TRACE:

The undesired output is removed by pre-processor operations → NO OVERHEAD
The code structure

M. Bach, O. Philipsen, C. Pinke, and A. Sciarra, «CL2QCD - Lattice QCD based on OpenCL», http://arxiv.org/pdf/1411.5219.pdf
M. Bach, «Energy- and Cost-Efficient Lattice-QCD Computations using Graphics Processing Units», http://publikationen.ub.uni-frankfurt.de/frontdoor/index/index/docId/37074

META
  Parameters, Utilities, ILDG IO

PHYSICS
  Algorithms: implementation of algorithms using Lattices and Fermionmatrix objects
    HMC, RHMC, HEATBATH, SOLVER, INTEGRATOR, METROPOLIS
  Lattices: representations of lattice fields, operations on fields
    GAUGEFIELD, FERMIONFIELD, GAUGEMOMENTA
  Fermionmatrix: matrix operations on fermion fields
    WILSON, TWISTED MASS, STAGGERED
  Observables: GAUGEOBSERVABLES, ⟨ψ̄ψ⟩, CORRELATORS
  PRNG: RANLUX
  Noise Sources: POINT, Z2, GAUSSIAN, Z4

HARDWARE
  Buffers: representation of OpenCL buffers, device-dependent layout (AoS or SoA)
    SU3, SU3VEC, SPINOR, GAUGEMOMENTA, ...
  Lattices: representation of field buffers, host-device communications
    GAUGEFIELD, FERMIONFIELD, GAUGEMOMENTA
  Code: execution of specific OpenCL kernels, meta information about kernels
    SPINOR ALGEBRA, FERMIONS, MOLECULAR DYNAMICS, GAUGEFIELD, ...
  OpenCL Compiler: compilation of OpenCL kernels, reread functionality
  Device: representation of a specific device, provides Code modules
  System: representation of the current architecture, provides the available OpenCL devices

OPENCL KERNELS
  Code for execution on device, compiled at runtime
  PLAQUETTE, SAXPY, DSLASH, GAMMA5, ...
The Physics package

PHYSICS
  LATTICE FIELDS: ψ-fields, Uµ-fields, Hµ-fields
  FERMION MATRICES: Wilson, Twisted Mass, Staggered
  GAUGE OBSERVABLES: Plaquette, Polyakov Loop, Chiral Condensate
  ALGORITHMS: Integrator (Leapfrog, 2MN), Fermion force, Solver (CG, CG-M, BiCGstab)
  PRNG
  NOISE SOURCES: Point, Slices, Volume
  TESTS
The Hardware package

HARDWARE
  LATTICE BUFFERS: ψ-fields, Uµ-fields, Hµ-fields
  OPENCL BUFFERS: Plain, 3-vector, 12-vector, 8-vector, 3×3 matrix, PRNG, Real, Complex
  CODE: PRNG, Uµ-fields, Hµ-fields, ψ-fields, Heatbath, Molecular Dynamics, Correlator, Buffers, Kappa
  SYSTEM
  DEVICE
  OPENCL COMPILER
  TESTS
General Purpose Graphics Processing Unit

CARD                     CHIP           MEMORY {GB}  PEAK DP {GFLOPS}  PEAK BW {GB/s}  CLOCK {MHz}  YEAR
AMD Radeon HD 5870       Cypress        1            544               154             850          2009
AMD Radeon HD 7970       Tahiti         3            947               264             925          2012
AMD FirePro S10000       Tahiti Pro GL  2×3 - 6      2×1480            2×240           825          2012
AMD FirePro S9050        Tahiti         12           806               264             900          2014
AMD FirePro S9150        Hawaii         16           2530              320             900          2014
AMD FirePro S9170        Grenada        32           2620              320             930          2015
AMD FirePro S9300        Capsaicin      2×4 (HBM)    868               2×512           850          2016
NVIDIA GeForce GTX 680   Kepler         2 - 4        129               192             1006         2012
NVIDIA Tesla K40         Kepler         12           1680              288             745          2013
NVIDIA Tesla K80         Kepler         2×12         2912              2×240           560          2014

Arithmetic intensity: ρ = (number of FLOPs) / (number of bytes to read and write)
  Wilson fermions:    ρ(D̸) ≈ 0.57
  Staggered fermions: ρ(D_KS) ≈ 0.35

«FLOPS do not count.» – Clark
CPU vs GPU code

saxpy: φ = α·ψ1 + ψ2

On CPU:
void saxpy(const spinor *x, const spinor *y,
           const hmc_complex *alpha, spinor *out)
{
    for (unsigned int index = 0; index < VTOT; index += 1) {
        const spinor tmp = spinor_times_complex(x[index], alpha);
        out[index] = spinor_acc(y[index], tmp);
    }
}
On GPU:
__kernel void saxpy(__global const spinor *x, __global const spinor *y,
                    __global const hmc_complex *alpha, __global spinor *out)
{
    int id = get_global_id(0);
    int gs = get_global_size(0);
    for (unsigned int idMem = id; idMem < SPINORFIELDSIZE_MEM; idMem += gs) {
        const spinor tmp = spinor_times_complex(x[idMem], alpha);
        out[idMem] = spinor_acc(y[idMem], tmp);
    }
}
Crucial concepts

Robert C. Martin (2009), «Clean Code»
Kent Beck (2002), «Test Driven Development: By Example»

Test each single part of the code on its own.
Unit tests are implemented using the Boost and CMake unit test frameworks.
Regression tests for the OpenCL parts are absolutely mandatory.
LQCD functions are local → analytic results to test against can be calculated.
Avoid dependence of the tests on specific environments.

Time spent at present: Testing 33%, Developing 33%, Refactoring 25%, Optimisation 8%
[Timeline 2011-2018: more development in the past, more refactoring planned ahead]
The Dirac operator kernel

[Plot: performance of the Wilson /D kernel — achieved memory bandwidth (GB/s) and GFLOPS (up to ~150) vs lattice size, from 16³×8 up to 48³×12]
[Plot: performance of the staggered D_KS kernel — achieved memory bandwidth (GB/s) and GFLOPS (up to ~100) vs lattice size, same range]
[Secondary axes: achieved fraction of peak bandwidth ⟨BW⟩/BW_max, 50–100%]

Benchmarked devices and peak memory bandwidths: AMD Radeon HD 7970 (264 GB/s), AMD Radeon HD 5870 (154 GB/s), NVIDIA Tesla K40 (288 GB/s), AMD FirePro S10000 (240 GB/s), AMD FirePro S9150 (320 GB/s)
CL2QCD vs tmLQCD (in 2013)

[Bar chart: execution time in s for the three setups — A: 6459, 6469, 2178; B: 3458, 3410, 1021; C: 1602, 1592, 553 — comparing AMD FirePro S10000 (2012), AMD Radeon HD 5870 (2009), and tmLQCD on 2 AMD Opteron 6172 (2010)]

Setup | mπ [MeV]
A     | 260
B     | 310
C     | 520

β = 3.9, κ_c = 0.160856, N_τ = 8, N_σ = 24, at maximal twist

Runs done on LOEWE-CSC
tmLQCD has been run on a whole node (24 cores)
Price per flop for the GPUs is much lower than for the CPUs
1 GPU ≈ 4 CPUs

M. Bach et al., «Twisted-Mass Lattice QCD using OpenCL», http://pos.sissa.it/archive/conferences/187/032/LATTICE%202013_032.pdf
Multi-GPU scaling

CL2QCD can use multiple GPUs within the same node
At the moment, the lattice can be split only along the temporal direction
Tested on SANAM (AMD FirePro S10000, in 2013), 4 GPUs per node

[Plot: strong scaling — solver performance (GFLOPS) vs number of GPUs (1–4) with the total lattice size kept constant: 32³×12, 48³×16, 32³×64, 24³×128]
[Plot: weak scaling — solver performance (GFLOPS) vs number of GPUs (1–4) with the local lattice size kept constant: 32³×16, 24³×32]
Plausible future work

Roadmap 2016–2018:
• Arbitrary parallelization direction
• Pure Wilson RHMC
• Staggered pion mass
• Clover term
• Multiple pseudofermions
• Physics analytic tests
• Smearing?
• Disentangle all packages + work on CMake
• Parallelization in >1 direction
• BaHaMAS
• Cleaner and easier-to-develop code

https://github.com/CL2QCD/cl2qcd