5

Click here to load reader

10 Pitfalls in Data Science - Data Science Meetup Kick-Off - Feb 2014

Embed Size (px)

Citation preview

Page 1: 10 Pitfalls in Data Science - Data Science Meetup Kick-Off - Feb 2014

10 Pitfalls in Data Science

Szilárd Pafka, PhDChief Scientist, Epoch

LA Machine Learning MeetupData Science TrackFeb 2014

Page 2: 10 Pitfalls in Data Science - Data Science Meetup Kick-Off - Feb 2014

About me

Page 3: 10 Pitfalls in Data Science - Data Science Meetup Kick-Off - Feb 2014

Data Science

Page 4: 10 Pitfalls in Data Science - Data Science Meetup Kick-Off - Feb 2014

(Some) Pitfalls

● DS = IT project

● DS isolated from business

● Restricted access to data

● Not enough EDA/cleaning

● Data leakage

● Overfitting

● Optimizing wrong metric

● Skip model validation

● Too complex to deploy

● Poor communication

Page 5: 10 Pitfalls in Data Science - Data Science Meetup Kick-Off - Feb 2014

Contact

[email removed from slideshare]

www.linkedin.com/in/szilard