Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential

Anton Slutsky, Lead Data Scientist, EPAM Systems

Hadoop + Mahout

Confidential

Agenda

Machine Learning vs. Statistics

Types of Machine Learning

Machine Learning Applications

Machine Learning and Data

Obligatory Big Data Slide

Hadoop

Apache Mahout

Why Hadoop + Mahout?

Machine Learning Applications

Machine Learning Applications

Hadoop + Mahout Algorithm

Get data into Hadoop
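
The slide gives only the step name, so what follows is a minimal sketch and not the presenter's method. It assumes the data starts as a local file and is copied into HDFS with the standard `hdfs dfs` shell. The helper name `hdfs_put_commands` and the example paths are hypothetical.

```python
def hdfs_put_commands(local_path, hdfs_dir):
    """Build the shell commands that copy one local file into an HDFS directory.

    Hypothetical helper: it only assembles argument lists and runs nothing.
    """
    return [
        # Create the target directory (and any missing parents) in HDFS.
        ["hdfs", "dfs", "-mkdir", "-p", hdfs_dir],
        # Copy the local file into that directory.
        ["hdfs", "dfs", "-put", local_path, hdfs_dir],
    ]
```

On a machine with a configured Hadoop client, each list can be passed to `subprocess.run(argv, check=True)`; for example, `hdfs_put_commands("reviews.txt", "/user/demo/raw")` yields the two commands for an assumed file and target directory.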

Convert data into Mahout format

Mahout format – Sequence File
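
A SequenceFile is Hadoop's binary file format for key/value records. The slide does not say what kind of data is converted, so this sketch assumes a directory of plain-text documents already in HDFS. For that case Mahout's `seqdirectory` command writes one record per document, and `seqdumper` prints the records for inspection. The helper names and paths are hypothetical.

```python
def seqdirectory_command(input_dir, output_dir, charset="UTF-8"):
    """Build the Mahout command that turns a directory of text files
    into SequenceFiles (assumed input: one document per file)."""
    return ["mahout", "seqdirectory",
            "-i", input_dir,    # directory of raw text documents
            "-o", output_dir,   # where the SequenceFiles are written
            "-c", charset]      # character encoding of the input files


def seqdumper_command(seqfile_path):
    """Build the Mahout command that prints a SequenceFile's key/value pairs."""
    return ["mahout", "seqdumper", "-i", seqfile_path]
```

As with any command built this way, the lists are only assembled here; running them requires a Mahout installation on the path, for instance via `subprocess.run(argv, check=True)`.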

Learn model from Data