Exploring Data Science: Trends in Data Science, November 2016. Randall Shane, PhD, randall.shane@icloud.com


Exploring Data Science

Trends in Data Science November 2016

Randall Shane, PhD randall.shane@icloud.com


PLEASE NOTE: (1) Light on code this month!

(2) Images and some text have been borrowed from the interwebs. Apologies if I did not credit. Thanks for the info; no $$ were made, but please be comforted by the fact that you’re making the world a smarter place!!

…DISCLAIMERS…


Machine Learning

Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed.

Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data.

source: whatis.techtarget.com/definition/machine-learning
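To make "learning without being explicitly programmed" concrete, here is a minimal sketch of a perceptron that picks up logical OR and AND purely from labeled examples. Nothing here comes from the slides: the function names, the learning rate, and the epoch count are all illustrative assumptions, and this is one simple way to show the idea, not the definition of machine learning.

```python
def train_perceptron(samples, epochs=20, lr=1.0):
    """Learn weights and a bias from (inputs, label) pairs with labels 0 or 1."""
    n_inputs = len(samples[0][0])
    weights = [0.0] * n_inputs
    bias = 0.0
    for _ in range(epochs):
        for inputs, label in samples:
            activation = sum(w * x for w, x in zip(weights, inputs)) + bias
            prediction = 1 if activation > 0 else 0
            # Nudge the model only when it gets an example wrong.
            error = label - prediction
            weights = [w + lr * error * x for w, x in zip(weights, inputs)]
            bias += lr * error
    return weights, bias


def predict(model, inputs):
    """Apply a trained (weights, bias) pair to one input tuple."""
    weights, bias = model
    activation = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if activation > 0 else 0


# The same code learns two different rules; only the data changes.
OR_EXAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
AND_EXAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

or_model = train_perceptron(OR_EXAMPLES)
and_model = train_perceptron(AND_EXAMPLES)

print([predict(or_model, x) for x, _ in OR_EXAMPLES])
print([predict(and_model, x) for x, _ in AND_EXAMPLES])
```

No rule for OR or AND is written anywhere in the code. Swapping the training data is what changes the behavior.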


Machine Learning Trends

Categories:

Algorithms

Data Storage

Languages


According to KD Nuggets:

Algorithms


According to Quora via Data Science Central:

Those top-10 algorithms for reference:

1. C4.5 (Decision tree)
2. k-Means
3. Support Vector Machines (SVM)
4. Apriori (Association rule learning)
5. Expectation Maximization (EM)
6. PageRank
7. AdaBoost
8. k-Nearest Neighbors (kNN)
9. Naive Bayes
10. Classification and Regression Tree (CART/MART)
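As a taste of one entry on that list, here is a rough sketch of k-Nearest Neighbors (kNN) in plain Python. The toy points, the labels, and the `knn_predict` helper are made up for illustration; a real project would more likely reach for a library implementation.

```python
import math
from collections import Counter


def knn_predict(training, query, k=3):
    """Label a query point by majority vote among its k closest training points."""
    # Sort training points by Euclidean distance to the query and keep the k nearest.
    nearest = sorted(training, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters.
TRAINING = [
    ((1.0, 1.0), "small"), ((1.5, 2.0), "small"), ((2.0, 1.5), "small"),
    ((8.0, 8.0), "large"), ((8.5, 9.0), "large"), ((9.0, 8.5), "large"),
]

print(knn_predict(TRAINING, (1.2, 1.4)))
print(knn_predict(TRAINING, (8.8, 8.1)))
```

There is no training step at all: kNN just stores the examples and does its work at prediction time.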

Algorithms


According to Kaggle:

• Decision Trees
• Naive Bayes
• Least Squares Regression
• Logistic Regression
• Ensemble Methods
• Neural Networks
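For a feel of the simplest item above, here is a small sketch of least squares regression for a single input variable, using the closed-form slope and intercept. The `fit_line` name and the sample data are assumptions for illustration only.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance of x and y divided by variance of x.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data that lies exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```

With noisy data the same formulas give the line that minimizes the sum of squared vertical errors.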

Algorithms


Why do I keep hearing about deep learning and neural networks?

New buzzwords maybe?


The cool kids are all doing it…

Google TensorFlow

IBM Watson

Apple Accelerate


Popularity by AWS:

1. S3 (Simple Storage Service)
2. Glacier (Archival Storage)
3. EBS (Elastic Block Store - persistent)
4. EC2 (Instance Storage - temporary)
5. Storage Gateway
6. RDS (Relational)
7. DynamoDB (NoSQL)
8. SQS (Simple Queue Service)
9. ElastiCache (Caching Service)
10. Redshift (BI)

Data Storage


Popularity by discussion on Stack Overflow:

Data Storage


db-engines.com:

Data Storage

NoSQL Only


Stack Overflow Developer Survey for Data & Math

Languages


According to GitHub

Languages


KD Nuggets - Data Science

Languages


…So, what does it mean?

…IMHO…

1. Storage and applications are trending toward cloud providers

2. Python & R for Data Scientists, JavaScript remains strong

3. Open source products used for innovation & startups

4. Established products for established

5. Special-purpose technologies are popular in niches


Sources

• whatis.techtarget.com/definition/machine-learning
• http://www.kdnuggets.com/images/top-10-algorithms-data-scientists-used.jpg
• https://www.quora.com/What-are-the-top-10-data-mining-or-machine-learning-algorithms
• http://www.datasciencecentral.com/profiles/blogs/top-10-machine-learning-algorithms
• http://www.kdnuggets.com/2016/08/10-algorithms-machine-learning-engineers.html
• http://www.kdnuggets.com/2015/12/harasymiv-lessons-kaggle-machine-learning.html
• http://playground.tensorflow.org
• http://www.thegeekstuff.com/2016/02/aws-storage-and-db/
• http://db-engines.com/en/ranking
• http://stackoverflow.com/questions/1270321/a-full-list-of-all-the-new-popular-databases-and-their-uses
• http://stackoverflow.com/research/developer-survey-2016
• http://www.techworm.net/2016/09/top-10-popular-programming-languages-github.html
• https://github.com/blog/2047-language-trends-on-github
• http://www.kdnuggets.com/2016/06/big-data-science-deep-learning-software-associations.html


Thank you for coming!

If you have additional questions, please feel free to reach out:

randall.shane@icloud.com @RandallShanePhD