
ORNL is managed by UT-Battelle for the US Department of Energy

Adapting DL to New Data: An Evolutionary Algorithm for Optimizing Deep Networks

Steven R. Young

Research Scientist

Oak Ridge National Laboratory


Overview

• Deep Learning in Science

• Challenges

• Tools

• Next Steps


Deep Learning for Science Applications

Commercial Applications (Object Recognition, Face Recognition): state-of-the-art results

Characteristics:

• Data is easy to collect

• Inexpensive labels

Science Applications: Challenging New Domains (Material Science, High Energy Physics)

Characteristics:

• Data is difficult to collect

• Few labels available


Problem: Adaptability Challenge

• Premise: For every data set, there exists a corresponding neural network that performs ideally with that data

• What’s the ideal neural network architecture (i.e., hyper-parameters) for a particular data set?

• Widely-used approach: intuition

1. Pick some deep learning software (Caffe, Torch, Theano, etc.)

2. Design a set of parameters that defines your deep learning network

3. Try it on your data

4. If it doesn’t work as well as you want, go back to step 2 and try again (a minimal sketch of this loop follows below).
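
A minimal sketch of this guess-and-check workflow, assuming a stubbed-out training call (train_and_evaluate and the hyper-parameter names are hypothetical placeholders, not part of any particular framework):

```python
import random

def train_and_evaluate(params):
    # Placeholder: stands in for a full Caffe/Torch/Theano training run that
    # returns validation accuracy. Here it returns a random score so the
    # sketch executes end to end.
    return random.random()

# Step 2: hand-design a set of hyper-parameters
params = {"num_conv_layers": 2, "kernel_size": 5, "learning_rate": 0.01}

# Steps 3-4: try it; if it is not good enough, tweak by hand and repeat
best = 0.0
for attempt in range(5):
    accuracy = train_and_evaluate(params)
    best = max(best, accuracy)
    if accuracy >= 0.90:
        break
    params["learning_rate"] *= 0.5  # the "intuition" step: guess a new value
print(f"best accuracy after manual search: {best:.2f}")
```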


The Challenge

[Diagrams: a "deep learning toolbox" of building blocks — input, convolutional, pooling, and fully connected layers — that can be assembled into many candidate networks, together with training hyper-parameters such as learning rate, batch size, momentum, and weight decay. A second diagram shows one candidate assembled from the toolbox (input, two convolutional layers, output) with its hyper-parameters still to be chosen.]

Hyper-parameter Selection

• Manual search (guess and check)

– Requires domain knowledge

• Grid search

– Number of trials grows exponentially with the dimensionality of the hyper-parameter space

– Doesn’t exploit the low effective dimensionality of the problem (typically only a few hyper-parameters matter for a given data set)

• Random search

– By itself, not adaptive (makes no use of information from previous experiments); a toy comparison of grid and random search is sketched below
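
To see why grid search scales poorly while random search at least stays within a fixed budget, here is a toy comparison (the hyper-parameter space and the score function are made up for illustration):

```python
import itertools
import random

# Hypothetical search space: 4 hyper-parameters with 4 candidate values each
space = {
    "learning_rate": [0.1, 0.01, 0.001, 0.0001],
    "batch_size": [32, 64, 128, 256],
    "kernel_size": [3, 5, 7, 9],
    "num_filters": [16, 32, 64, 128],
}

def score(config):
    # Placeholder for a full training run; returns a random fitness here.
    return random.random()

# Grid search: the number of trials grows exponentially with dimensionality
grid = [dict(zip(space, values)) for values in itertools.product(*space.values())]
print(len(grid))  # 4**4 = 256 full training runs

# Random search: fixed budget, but makes no use of earlier results
budget = 20
trials = [{k: random.choice(v) for k, v in space.items()} for _ in range(budget)]
best = max(trials, key=score)
print(best)
```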


What can we do with Titan?

18,688 GPUs


MENNDL: Multi-node Evolutionary Neural Networks for Deep Learning

• Evolutionary algorithm as a solution for searching the hyper-parameter space of deep learning networks (a high-level sketch of the loop appears after this list)

– Focus on Convolutional Neural Networks

– Evolve only the topology with the EA; each candidate is trained with a typical SGD process

– Generally: provide scalability and adaptability across many data sets and compute platforms

• Leverage more GPUs; ORNL’s Titan has 18k GPUs

– Next generation, Summit, will have increased GPU capability

• Provide the ability to apply DL to new datasets quickly

– Climate science, material science, physics, etc.
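
A high-level sketch of the evolutionary loop, under assumed operators (truncation selection and single-parameter mutation); this is an illustration of the approach, not the actual MENNDL implementation:

```python
import random

def random_genome():
    # One gene set per layer; each layer gets a type and a few parameters.
    return [{"type": random.choice(["conv", "pool"]),
             "kernel_size": random.choice([3, 5, 7]),
             "num_filters": random.choice([16, 32, 64])}
            for _ in range(random.randint(2, 6))]

def fitness(genome):
    # Placeholder: in MENNDL this is a full SGD training run on a worker node,
    # with validation accuracy used as the fitness value.
    return random.random()

def mutate(genome):
    # Copy the parent and perturb one parameter of one randomly chosen layer.
    child = [dict(layer) for layer in genome]
    random.choice(child)["kernel_size"] = random.choice([3, 5, 7, 9])
    return child

population = [random_genome() for _ in range(16)]
for generation in range(10):
    ranked = sorted(population, key=fitness, reverse=True)
    parents = ranked[: len(ranked) // 2]                 # keep the fitter half
    children = [mutate(random.choice(parents)) for _ in parents]
    population = parents + children                      # next generation
```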


Designing the Genetic Code

• Goal: facilitate exploration of the complete network definition

• Each population member is a network whose genome consists of sets of genes

– A fixed-width set of genes corresponds to one layer (a hypothetical sketch of this layout follows the diagram below)

• Layers contain multiple distinct parameters

– Layer types are restricted by section

• Feature extraction and classification

• Only minor guided design is imposed on the network; otherwise the encoding attempts to encompass all layer types

[Diagram: a population is a group of networks; each individual network’s genome contains feature layers and classification layers, each with its own parameters.]
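
One way to picture the genome is as fixed-width gene sets, one per layer, split into feature-extraction and classification sections; the field names below are assumptions for illustration, not MENNDL's actual encoding:

```python
# Hypothetical genome layout: a fixed-width gene set per layer, with allowed
# layer types restricted by section (feature extraction vs. classification).
FEATURE_LAYER_TYPES = ["convolutional", "pooling"]
CLASSIFICATION_LAYER_TYPES = ["fully_connected"]

genome = {
    "feature_layers": [
        {"type": "convolutional", "kernel_size": 5, "num_filters": 32},
        {"type": "pooling",       "kernel_size": 2, "num_filters": 0},  # unused slot
    ],
    "classification_layers": [
        {"type": "fully_connected", "num_units": 256},
        {"type": "fully_connected", "num_units": 10},
    ],
}

# A population is simply a group of such genomes, one per candidate network.
population = [genome]
```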


MENNDL: Communication

[Diagram: the genetic algorithm master holds the population of network hyper-parameter sets. Each worker (one per node) receives a network’s parameters over MPI, trains the model, and returns its predictions and performance metrics; the master uses accuracy as the fitness metric.]
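
Below is a minimal master/worker sketch of this communication pattern using mpi4py; the evaluation function is a stub and the message contents are a simplification of what MENNDL actually exchanges:

```python
# Run with, e.g.: mpirun -np 4 python sketch.py  (script name is hypothetical)
import random
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()

def evaluate(network_params):
    # Placeholder for training the network and computing validation accuracy.
    return random.random()

if rank == 0:
    # Master: runs the genetic algorithm and hands one network to each worker.
    population = [{"kernel_size": k} for k in (3, 5, 7)] * size
    for worker in range(1, size):
        comm.send(population[worker - 1], dest=worker, tag=0)
    fitnesses = [comm.recv(source=worker, tag=1) for worker in range(1, size)]
    print("fitness (accuracy) per worker:", fitnesses)
else:
    # Worker (one per node): receive parameters, train, return the metric.
    params = comm.recv(source=0, tag=0)
    comm.send(evaluate(params), dest=0, tag=1)
```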


Hyper-parameter Values vs Performance

• Currently testing and evaluating the latest code, which varies all possible parameters (e.g., number of layers, layer types, etc.)

• Using just 4 nodes

• Accuracy of the evolved networks improved from 27% to 65%


MINERvA (a neutrino scattering experiment at Fermilab)


MINERvA Vertex Segment Classification

Goal: Classify which segment the vertex is located in.

Challenge: Events can have very different characteristics.


Benefit of Parallelization

[Plot: MINERvA dataset — results from 12-hour, 6-hour, and 2-hour runs, illustrating the benefit of parallelization.]


Unusual layers

• Second convolution layer has a kernel size of 29

• Followed by a MAX pooling layer with a kernel size of 19
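
For a sense of scale, such a layer pair could be written as follows in PyTorch; this is an illustrative snippet only, the channel counts and input size are assumptions, and the original networks were not necessarily built with this framework:

```python
import torch
import torch.nn as nn

# Illustration of the evolved "unusual" layers: a convolution with a very large
# 29x29 kernel followed by 19x19 max pooling.
unusual_block = nn.Sequential(
    nn.Conv2d(in_channels=32, out_channels=64, kernel_size=29),  # channels assumed
    nn.ReLU(),
    nn.MaxPool2d(kernel_size=19),
)

x = torch.randn(1, 32, 127, 94)   # input size chosen so the kernels fit; assumed
y = unusual_block(x)
print(y.shape)                    # torch.Size([1, 64, 5, 3])
```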


Unusual Layers (limited training examples)


Current Status

• Scaled to 15,000 nodes of Titan

• 460,000 Networks evaluated in 24 hours

• Expanding to more complex topologies

• Evaluating on a wide range of science datasets


Acknowledgements

• Gabriel Perdue (FNAL) and Sohini Upadhyay (University of Chicago)

• Adam Terwilliger (Grand Valley State University) and David Isele (University of Pennsylvania)

• Robert Patton, Seung-Hwan Lim, Thomas Karnowski, and Derek Rose (ORNL)


Questions
