31
Applications of Deep Learning to Nuclear Fusion Research Diogo R. Ferreira IST, University of Lisbon [email protected]

Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Applications of Deep Learningto Nuclear Fusion Research

Diogo R. Ferreira

IST, University of Lisbon

[email protected]

Page 2: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

The Sun

© NASA/SDO

Page 3: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Nuclear Fusion Reaction

© Universe Today

Page 4: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

The Tokamak

© EUROfusion

Page 5: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

JET(JointEuropeanTorus)nearOxford,UK

Page 6: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

JET(JointEuropeanTorus)nearOxford,UK

© EUROfusion

Page 7: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

JET Diagnostics

© EUROfusion

Page 8: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Tomography at JET

© EUROfusion

Page 9: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Tomography at JET

• Camera signals• sampling rate: 5 kHz

• window average of 5 ms (25 samples)

• 5 kHz / 25 = 200 Hz

• Tomographic reconstructions• pulse duration: ~30 sec

• 30 sec × 200 Hz = 6000 reconstructions/pulse

• in practice, only a few reconstructions per pulse

• Time per reconstruction• ~1h on average

• 6000 × 1h = 250 days

Page 10: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Deep Learning

LeCun et al, Proc. of the IEEE 86, 2278 (1998)

Krizhevsky et al, Adv. Neural Inf. Proc. Sys. 25, 1097 (2012)

• Convolutional Neural Networks (CNNs)

Page 11: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Deep Learning

56

7500

Transposedconvolution

Dense

7500

Dense Reshape

5

20

25

15

30

Strideof 25

60

100

50

20

20

5

Strideof 2

5

5

Strideof 2

5

120

200

20Transposedconvolution

Transposedconvolution

Strideof 1

5

5

120

200

Transposedconvolution

• Inverse of a CNN ~ "deconvolutional" neural network

Page 12: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Training

• Dataset• pulses 80128 to 92504 (2011-2016)

• 25584 sample reconstructions

• 90% for training (23025), 10% for validation (2559)

• Training• accelerated gradient descent (Adam)

• learning rate: 10-4

• batch size: 307 (307 * 75 = 23025)

• 75 batches = 75 updates/epoch

Page 13: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Training

NVIDIA Tesla P100

total:3530 epochs20 hours

best:epoch 1765val_loss 0.008556 MW m-3

Page 14: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Video Demo

Page 15: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Plasma Disruptions

• Disruptions are a major problem in tokamaks

Page 16: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Raw Camera Signals

Page 17: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Raw Camera Signals

Page 18: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Deep Learning

• Recurrent Neural Networks (RNNs)

© Chris Olah

Simple RNN

LSTM(Long

short-termmemory)

Page 19: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Disruption Prediction

56

200

32

98

64

47

128

21

LSTM

128

time(1s)

channels

Conv1D +MaxPooling

Conv1D +MaxPooling

Conv1D +MaxPooling

features

steps

1

1

Page 20: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Disruption Prediction

• Time-to-disruption (regression)• last layer is Dense(1) with no activation

• loss function is mean absolute error (mae)

• use only disruptive pulses

• Probability of disruption (classification)• last layer is Dense(1) with sigmoid activation

• loss function is binary cross-entropy

• use both disruptive and non-disruptive pulses

56

200

32

98

64

47

128

21

LSTM

128

time(1s)

channels

Conv1D +MaxPooling

Conv1D +MaxPooling

Conv1D +MaxPooling

features

steps

1

1

Page 21: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Training

• Time-to-disruption (ttd)• 1683 disruptive pulses

• 90% for training, 10% for validation

• X: draw random samples from each pulse

• y: (disruption time) – (latest sample time)

• Probability of disruption (prd)• 9798 pulses (17% disruptive)

• 90% for training, 10% for validation

• X: draw random samples from each pulse

• y: 1 if pulse contains disruption, 0 otherwise

sampletime to

disruption

Page 22: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Training

ttd (mean absolute error)NVIDIA Tesla P100total: best:1026 epochs epoch 51313 hours val_loss 1.986 s

prd (binary cross-entropy)NVIDIA Tesla P100total: best:1118 epochs epoch 55914 hours val_loss 0.202

Page 23: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

low ttdhigh prd

disruption

Page 24: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

low ttdlow prd

no disruption

Page 25: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

high ttdhigh prd

orhigh ttdlow prd

no disruption

Page 26: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

early signs

Page 27: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

early signs

Page 28: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

false alarms(false positives)

Page 29: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Results

missed alarms(false negatives)

very rare

Page 30: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

56

200

32

98

64

47

128

21

LSTM

128

time(1s)

channels

Conv1D +MaxPooling

Conv1D +MaxPooling

Conv1D +MaxPooling

features

steps

1

1

Conclusion

• Deep learning and the analysis of fusion data• replacing compute-intensive tasks (e.g. tomography)

• support for tokamak operation (e.g. disruption prediction)

• both post-processing and real-time processing

56

7500

Transposedconvolution

Dense

7500

Dense Reshape

5

20

25

15

30

Strideof 25

60

100

50

20

20

5

Strideof 2

5

5

Strideof 2

5

120

200

20Transposedconvolution

Transposedconvolution

Strideof 1

5

5

120

200

Transposedconvolution

CNN for plasma tomography RNN for disruption prediction

Page 31: Applications of Deep Learning to Nuclear Fusion …web.ist.utl.pt/diogo.ferreira/talks/ferreira18...•Deep learning and the analysis of fusion data •replacing compute-intensive

Pointers

• Source code• Plasma tomography

• https://github.com/diogoff/plasma-tomography

• Disruption prediction• https://github.com/diogoff/plasma-disruptions

• More info• Full-pulse Tomographic Reconstruction with Deep Neural Networks

• https://arxiv.org/abs/1802.02242

• Artificial intelligence helps accelerate progress toward efficient fusion reactions• https://www.pppl.gov/news/2017/12/artificial-intelligence-helps-accelerate-progress-toward-

efficient-fusion-reactions

• Princeton Team Using AI for Fusion Up for Global Impact Award• https://blogs.nvidia.com/blog/2018/03/05/ai-deep-learning-global-impact-awards-princeton/