21
Duc Hoang – Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST – Final Presentation 1 August 2019 Inferring Convolutional Neural Networks’ accuracy from its architectural characterizations

Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Duc Hoang – Rhodes CollegeSupervisor: Dr. Gabriel N. PerdueSIST – Final Presentation1 August 2019

Inferring Convolutional Neural Networks’ accuracy from its architectural characterizations

Page 2: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

I. MINERvA experimenta. Neutrinos and MINERvA detectorb. Vertex finding and hadron multiplicity problem

Outline

8/01/2019 Duc Hoang | Inferring Convolutional Neural Network’s accuracy2

IV. Summary & Outlook

III. Inferring CNNs’ accuracy before training timea. Architectural characterizationsb. Predicting CNNs’ accuracy based on characterizations – why

is it useful?

II. Deep Learninga. Deep Neural Networksb. Convolutional Neural Networks (CNNs)c. CNN’s design difficulties

Page 3: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Neutrinos

8/5/19 Duc Hoang | Inferring Convolutional Neural Network’s accuracy3

Neutrons

Protons

Electrons

U UD

QuarksProton

• Neutrinos are fundamental.• They are electrically neutral "partners" of the familiar charged

leptons (e.g., electrons).• They are very light,• They very rarely interact with other particles

Page 4: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

MINERvA

8/5/194

Beam Direction (+Z)

Target 1Target 2

Water TargetTarget 3

Target 4Target 5

He Target

• Nuclear effects with a variety of target materials ranging from Helium to Lead.

• Fine-grained resolution for excellent kinematic measurements.

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 5: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Vertex Finding and Hadron Multiplicity problem

8/5/195

True vertex

plane number →

strip

num

ber

5 meters

Hadron Showers

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 6: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• Fully-connected architecture• Each input multiplied by a

weight.• Weighted values are

summed, Bias is added.• Non-linear activation

function is applied • Trained by varying the

parameters to minimize a loss function (quantifies how many mistakes the network makes)

Deep Neural Networks

8/5/196 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 7: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• Similar concept to Deep Neural Networks, but highly effective for imageinputs, and modern neutrino detectors are imaging detectors.

Convolutional Neural Networks (CNNs)

8/5/197 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

X

U

V

Page 8: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• There is no universal CNN design for every tasks.• And designing an appropriate structure/architecture of CNN

takes a lot of time and effort even for the experts.• There is no systematic way to design CNNs: mainly rely on

human intuition and random/grid search.• Computationally expensive to train a CNN model.

Difficulty

8/5/19 Presenter | Presentation Title or Meeting Title8

Page 9: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

I. Propose a systematic language to characterize CNN’s architecture, and demonstrate that they can be predictive of a CNN’s accuracy.

II. Suggest architectural changes to CNNs for different physics tasks (vertex finding and hadron multiplicity)

Objectives

8/5/199

Examples of architectural attributes we extracted (32 in total): Ø Number of convolutional

layers.Ø Number of rectified linear

unit (ReLU) activated convolutional layers.

Ø Average depth

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 10: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Method

8/5/19 Presenter | Presentation Title or Meeting Title10

Architectural characterizations CNN performance (accuracy)

Machine Learning models(classification and regression)

Important architecture of CNN for physics task

Interpret the models

Page 11: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Ø Divide data set into broken and healthy networks.

Ø Use Random Forest and Extremely Randomized Tree to predict each CNN’s class.

Classification

8/5/1911 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Broken Healthy

Page 12: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• Machine Learning models perform significantly better than random guessing (50% when there is no class overflow):

Classification results

8/5/1912

Model Average 5-fold cross-validation scores Accuracy on test set

Random Forest 70.3 ± 0.006 % 70.6 %

Extremely Randomized Tree 70.2 ± 0.003 % 70.5 %

• Were also able to extract important features to suggest architectural changes to CNNs. More details in paper.

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 13: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• Fitted an non-linear Ordinary Least Square model on just the healthy networks.

Regression on just healthy networks

8/5/1913 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 14: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• Limitations: Still not enough parameters to characterize the detailed relationship between CNN’s architecture and its performance. ▶ Planning to extend attribute set in the future.

Regression results

8/5/1914

• Models fitted on two populations of vertex finding CNNs:

Population 𝑹𝟐 of OLS Number of Healthy CNNs

First 0.426 49276

Second 0.295 21415

Combined 0.961 70691

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 15: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Ø Proposed a systematic language to characterize convolutional neural networks architecture.

Ø Successfully demonstrated that we can use those parameters to predict whether a network is “good”.

Ø There are limitations to predict the exact accuracy, but initial results are promising. Extension of the attributes set might help in the future.

Ø One of the early studies about relationship between CNN’s architecture and its performance.

Ø More details in our up-coming paper.

Outlook & Summary

8/5/1915 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 16: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

§ Special thanks to my supervisor, Dr. Gabriel N. Perdue.

§ To members of MINERvA’s machine learning group for their mentorship and useful conversations:

§ Anushree§ Steven (Oak Ridge)§ Nafis§ Luis

§ To members of SIST committee and my mentor group and other awesome interns:

§ Laura Fields § Sandra Charles§ Judy Nunez

Acknowledgements

8/5/1916

§ Alexander Martinez§ Raul Campos § Matthew Alvarez

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 17: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

References

8/5/1917

• Neural networks and Convolutional Networks visualization: Source

• Random Forest: Source

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 18: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Back-ups

8/5/1918 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 19: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

K-folds cross validation

8/5/1919 Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 20: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

• Different deep learning problems require different network architecture.

• However, selecting an appropriate architecture for CNNs is usually done by human intuition or random search

• If we have a way to uniformly characterize a network architecture, then it would be particularly useful.

Architectural characterizations of CNNs

8/5/1920

Good for the task?

?

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

Page 21: Inferring Convolutional Neural Networks’ accuracy from its ...Duc Hoang –Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST –Final Presentation 1 August 2019 Inferring Convolutional

Random Forest and Extremely Randomized Tree

8/5/19 Presenter | Presentation Title or Meeting Title21