Apache SystemML Class “I predict what you will do next summer.” Summer 2016

Apache SystemML 2016 Summer class primer by Berthold Reinwald

Class Description

• Goal
  • Teach scalable machine learning with Apache SystemML
  • Attract potential contributors
• Audience
  • Initially summer interns, but with the goal of developing/folding it into a university class
• Duration: ~16 hours
• Content
  • Development of scalable machine learning algorithms
  • SystemML usage and hands-on exercises
  • Advanced SystemML internals
• Office hours
  • At Adlab: Thursday, 4-5pm (may be expanded as demanded)

Outline

1. SystemML Primer
2. Machine Learning Algorithms
3. Advanced SystemML Internals

SystemML Primer

• Goal
  • Teach enough DML, SystemML usage, and Spark for people to be able to write and run SystemML algorithms on Spark and understand its execution.
• Content
  • DML syntax
  • SystemML usage
  • Some Spark

Machine Learning Algorithms

• Descriptive Statistics, Data Preparation, and Train/Test/Cross-Validation
• Regression
• Classification
• Clustering & Matrix Factorization
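
As a rough illustration of the Train/Test/Cross-Validation topic, the sketch below splits row indices into k folds. It is written in plain Python rather than DML, and the function name `k_fold_splits` and its parameters are hypothetical, chosen only for this example; it is not taken from the class material or from SystemML.

```python
import random


def k_fold_splits(n_rows, k, seed=0):
    """Yield (train_idx, test_idx) index lists for k-fold cross-validation.

    Hypothetical helper for illustration; not part of SystemML.
    """
    idx = list(range(n_rows))
    random.Random(seed).shuffle(idx)
    # Fold i takes every k-th shuffled index, starting at position i.
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        # All rows that are not in the held-out fold form the training set.
        train = [j for f_no, fold in enumerate(folds) if f_no != i for j in fold]
        yield train, test


splits = list(k_fold_splits(n_rows=10, k=5))
for train, test in splits:
    print(len(train), len(test))
```

Each row lands in exactly one test fold, so every observation is used for evaluation once and for training in the remaining folds.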

Each session/chosen algorithm has a similar structure:

• Possible applications
• Math / alternatives / discussion
• DML formulation
• Data generation
• Hands-on exercises
  • Performance
  • Accuracy
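
The session structure above (data generation, an algorithm formulation, then performance and accuracy checks) can be sketched end to end on a toy problem. The example below uses plain Python instead of DML and a one-variable least-squares fit as a stand-in algorithm; all function names and parameter values are assumptions made for illustration, not the exercises used in the class.

```python
import math
import random
import time


def generate_data(n, slope, intercept, noise, seed=0):
    """Data generation: synthetic points around a known line."""
    rng = random.Random(seed)
    xs = [rng.uniform(0.0, 10.0) for _ in range(n)]
    ys = [slope * x + intercept + rng.gauss(0.0, noise) for x in xs]
    return xs, ys


def fit_line(xs, ys):
    """Formulation: closed-form least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(xs, ys, slope, intercept):
    """Accuracy: root mean squared error of the fitted line."""
    total = sum((slope * x + intercept - y) ** 2 for x, y in zip(xs, ys))
    return math.sqrt(total / len(xs))


xs, ys = generate_data(n=1000, slope=2.0, intercept=1.0, noise=0.5)

# Performance: wall-clock time of the fit only.
start = time.perf_counter()
slope, intercept = fit_line(xs, ys)
elapsed = time.perf_counter() - start

error = rmse(xs, ys, slope, intercept)
print(f"slope={slope:.3f} intercept={intercept:.3f} rmse={error:.3f} time={elapsed:.6f}s")
```

Because the data come from a known line, the recovered coefficients and the error can be compared against the generating parameters, which is the point of pairing data generation with an accuracy check.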

Advanced SystemML Internals

• Architecture
• Compiler
  • Rewrites
  • Optimizer
• Runtime
  • Buffer pool
  • Storage
  • Advanced operators
• Spark backend
• Performance debugging

| S#/Date | Category | Title | Content | Instructor |
|---|---|---|---|---|
| S1, 6/21: 9-12am, R: G1-404 | SystemML Primer | Scalable Machine Learning with Apache SystemML | Intro ML; DML; SystemML usage; Architecture | Berthold Reinwald, Nakul Jindal |
| S2, 6/27: 4-6pm, R: | ML Algs | Data Prep, Descriptive Statistics, and Train/Test/Cross-validation | Math; DML; Data-gen; Hands-on; Perf & Accuracy | Faraz Makari Manshadi |
| S3, 7/5: 4-6, R: | ML Algs | Regression | Linear, log., GLM, Cox, Time series; CG method; DML; Data-gen; Hands-on; Perf & Accuracy | Alexandre Evfimievski |
| S4, 7/11: 4-6, R: | ML Algs | Classificat. | Naïve Bayes, SVM, dec Tree, RF; DML; Data-gen; Hands-on; Perf & Accuracy | Prithvi Sen |
| S5, 7/18 4-6, R: | ML Algs | Clustering & Matrix Factorization | (kMeans, mf, ALS, PCA, …); DML; Datagen; Hands-on; Perf & Accuracy | Alexandre Evfimievski, Prithvi Sen |
| S6, 7/25 4-6pm, R: | SystemML Internals | Apache SystemML Architect. | Architecture; Hops/Lops; CP/Cluster | Berthold Reinwald, Niketan Pansare |
| S7, 8/1 4-6pm, R: | SystemML Internals | Apache SystemML Optimizer | Rewrites; Optimizer; Cost model | Matthias Boehm, Arvind Surve |
| S8, 8/8 4-6pm, R: | SystemML Internals | Apache SystemML Runtime | Buffer pool; Storage; Spark backend; Matrix block lib; Performance debugging | Matthias Boehm, Arvind Surve |