7
MovieMetR Predicting the prospects of movies

About_Moviemetr

Embed Size (px)

Citation preview

MovieMetR

Predicting the prospects of movies

What  do  I  watch?  

A  Documentary?  

A  Drama?  

Which  one??  

Comedy?  

Which  is  good????  Which  is  a  waste  of  money??????  

•  Data  manipula;on:  Pandas  

•  Machine  Learning:  scikit-­‐learn  

•  Storing  Data:  SQLite  

•  Data  scraping  from  raw  HTML  

•  Beau;fulSoup  and  Regular  expressions  

•  Using  RoJen  Tomato  API  

•  CSS/HTML:  Bootstrap  

•  Visualiza;on:  D3.js  •  Querying  Data:  

SQLite  •  Python  to  HTML:  

Flask  

Procedure  •  Training  Set:  Movies  from  2013  •  Features  – Actors,  Director  – Genre,  Distributor,  Release  Dates  (Major  Holidays?)  

•  Issues:    – Curse  of  Dimensionality!  – Scaling  

•  Linear  Regression  with  L2  Regulariza;on  

How  does  the  site  look  like?  

Comparing  our  ra;ngs  with  RoJentomato  User  Ra;ngs  

See  the  app  at:  hJp://moviemetr.herokuapp.com    

Source  code  at:  hJps://github.com/souravc83/

MovieMetR    

What  do  I  watch?  

A  Documentary?  

A  Drama?  

Which  one??  

Comedy?  

I  can  chew  on  those  numbers  and  spit  out  a  decision.  J  

1.4  

30.2   24.2  

25.3  

And  Finally:  ProTip