arvind-surve
Class Description
• Goal
  • Teach scalable machine learning with Apache SystemML
  • Attract potential contributors
• Audience
  • Initially summer interns, but goal of developing/folding into University class
• Duration: ~16 hours
• Content
  • Development of scalable machine learning algorithms
  • SystemML usage and hands-on exercises
  • Advanced SystemML internals
• Office hours
  • At Adlab: Thursday, 4-5pm (may be expanded as demanded)
SystemML Primer
• Goal
  • Teach enough DML, SystemML usage, and Spark for people to be able to write and run SystemML algorithms on Spark and understand their execution.
• Content
  • DML syntax
  • SystemML usage
  • Some Spark
Machine Learning Algorithms
• Descriptive Statistics, Data Preparation, and Train/Test/Cross-Validation
• Regression
• Classification
• Clustering & Matrix Factorization
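As a rough illustration of the Train/Test/Cross-Validation topic, the following is a minimal sketch of k-fold splitting in plain Python. It is not DML and not taken from the course material; the function names `k_fold_indices` and `train_test_splits` are hypothetical, chosen only for this example.

```python
import random


def k_fold_indices(n_rows, k, seed=0):
    """Shuffle row indices 0..n_rows-1 and split them into k nearly equal folds."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index, starting at offset i.
    return [indices[i::k] for i in range(k)]


def train_test_splits(n_rows, k, seed=0):
    """Yield (train, test) index lists, using each fold exactly once as the test set."""
    folds = k_fold_indices(n_rows, k, seed)
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


if __name__ == "__main__":
    for train, test in train_test_splits(n_rows=10, k=5):
        print(len(train), len(test))
```

Each row lands in exactly one test fold, so every observation is used for evaluation once and for training k-1 times.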
Each session/chosen algorithm has a similar structure:
• Possible Applications
• Math/Alternatives/Discussion
• DML formulation
• Data generation
• Hands-on exercises
  • Performance
  • Accuracy
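The session structure above (data generation, formulation, then an accuracy check) can be sketched for one of the listed topics, linear regression solved with the conjugate gradient (CG) method. This is a plain-Python illustration under stated assumptions, not the DML formulation used in the class: the helper names, the synthetic data, and the noise level are all hypothetical.

```python
import random


def generate_data(n_rows, true_weights, noise=0.01, seed=0):
    """Data generation: random features, labels from a known linear model plus noise."""
    rng = random.Random(seed)
    X = [[rng.uniform(-1.0, 1.0) for _ in true_weights] for _ in range(n_rows)]
    y = [sum(w * x for w, x in zip(true_weights, row)) + rng.gauss(0.0, noise)
         for row in X]
    return X, y


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def matvec(X, v):
    """Compute X v."""
    return [dot(row, v) for row in X]


def t_matvec(X, v):
    """Compute X^T v without materializing the transpose."""
    return [sum(X[i][j] * v[i] for i in range(len(X))) for j in range(len(X[0]))]


def linreg_cg(X, y, max_iter=100, tol=1e-12):
    """Solve the normal equations (X^T X) w = X^T y by conjugate gradient."""
    w = [0.0] * len(X[0])
    r = t_matvec(X, y)          # residual of the normal equations at w = 0
    p = r[:]                    # first search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if rs_old <= tol:       # squared residual norm small enough: stop
            break
        Ap = t_matvec(X, matvec(X, p))
        alpha = rs_old / dot(p, Ap)
        w = [wi + alpha * pi for wi, pi in zip(w, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return w


def mean_squared_error(X, y, w):
    """Accuracy check: average squared difference between predictions and labels."""
    return sum((yi - pi) ** 2 for yi, pi in zip(y, matvec(X, w))) / len(y)


if __name__ == "__main__":
    true_w = [2.0, -1.0, 0.5]
    X, y = generate_data(200, true_w)
    w = linreg_cg(X, y)
    print("weights:", w)
    print("mse:", mean_squared_error(X, y, w))
```

Generating data from known weights makes the accuracy exercise straightforward, since the recovered weights can be compared directly against the ones used to produce the labels.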
Advanced SystemML Internals
• Architecture
• Compiler
  • Rewrites
  • Optimizer
• Runtime
  • Buffer pool
  • Storage
  • Advanced Operators
• Spark Backend
• Performance debugging
S1, 6/21, 9-12am, R: G1-404
• Category: SystemML Primer
• Title: Scalable Machine Learning with Apache SystemML
• Content: Intro ML, DML, SystemML usage, Architecture
• Instructor: Berthold Reinwald, Nakul Jindal

S2, 6/27, 4-6pm, R:
• Category: ML Algs
• Title: Data Prep, Descriptive Statistics, and Train/Test/Cross-validation
• Content: Math, DML, Data-gen, Hands-on, Perf & Accuracy
• Instructor: Faraz Makari Manshadi

S3, 7/5, 4-6, R:
• Category: ML Algs
• Title: Regression
• Content: Linear, log., GLM, Cox, Time series; CG method; DML, Data-gen, Hands-on, Perf & Accuracy
• Instructor: Alexandre Evfimievski

S4, 7/11, 4-6, R:
• Category: ML Algs
• Title: Classification
• Content: Naïve Bayes, SVM, dec. tree, RF; DML, Data-gen, Hands-on, Perf & Accuracy
• Instructor: Prithvi Sen

S5, 7/18, 4-6, R:
• Category: ML Algs
• Title: Clustering & Matrix Factorization
• Content: kMeans, mf, ALS, PCA, …; DML, Data-gen, Hands-on, Perf & Accuracy
• Instructor: Alexandre Evfimievski, Prithvi Sen

S6, 7/25, 4-6pm, R:
• Category: SystemML Internals
• Title: Apache SystemML Architecture
• Content: Architecture, Hops/Lops, CP/Cluster
• Instructor: Berthold Reinwald, Niketan Pansare

S7, 8/1, 4-6pm, R:
• Category: SystemML Internals
• Title: Apache SystemML Optimizer
• Content: Rewrites, Optimizer, Cost model
• Instructor: Matthias Boehm, Arvind Surve

S8, 8/8, 4-6pm, R:
• Category: SystemML Internals
• Title: Apache SystemML Runtime
• Content: Buffer pool, Storage, Spark backend, Matrix block lib, Performance debugging
• Instructor: Matthias Boehm, Arvind Surve