14
CSCI 568A Discussion 01: Data Mining

CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

CSCI 568ADiscussion 01: Data Mining

Page 3: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

DATAWHAT?

Page 4: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large
Page 5: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large
Page 6: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large
Page 7: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

What Data Mining Isn’t

• Crawling / harvesting / screen scraping

• Querying (fishing)

• Collecting

• Drinking

Page 8: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

Statistics

AI / ML

(big) Databases

data mining sandwich

Page 9: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

“The process of discovering [useful] patterns in large amounts of data.”

Fry, B. Visualizing Data. 2008.

Page 10: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large
Page 11: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large
Page 12: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

milk cereal diapers beer

1 1 0 0

1 1 0 0

1 1 1 1

0 0 1 1

What patterns do we see?

Page 13: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

6 Core Topics

• Data & Big Data

• Classification

• Association Analysis

• Clustering

• Anomaly Detection

• Data Visualization & Interaction

Page 14: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

Homework

• Project 1

• Reading 1