Data Mining and Text Mining in Educational Research

Page 1: Data Mining and Text Mining in Educational Research

Data Mining in Education: Social Media + Text

Qiang Hao

[email protected]

http://tobeneo.com

Page 2: Data Mining and Text Mining in Educational Research

Goals

• What is Data Mining?

• What tools / knowledge do you need to do Data Mining?

• What is the basic process of Data Mining?

Page 3: Data Mining and Text Mining in Educational Research

Questions Answered by Data Mining

• Can we predict whether an incoming email is spam?

Page 5: Data Mining and Text Mining in Educational Research

Questions Answered by Data Mining

• Can we predict whether an incoming email is spam?

money

you

he

……
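The idea of predicting spam from words such as "money", "you", and "he" can be sketched with a small word-count classifier. The block below is a minimal illustration using a naive Bayes model with add-one smoothing, which is one common choice and not necessarily the method behind the slide. The training messages, the labels, and the function names are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy training data, invented for illustration only.
TRAIN = [
    ("win money now", "spam"),
    ("free money for you", "spam"),
    ("he said the meeting is at noon", "ham"),
    ("can you review the draft he sent", "ham"),
]

def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()

def train(examples):
    """Count words per label and messages per label."""
    word_counts = {}          # label -> Counter of word frequencies
    label_counts = Counter()  # label -> number of messages
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for word in tokenize(text):
            counts[word] += 1
            vocab.add(word)
    return word_counts, label_counts, vocab

def predict(text, model):
    """Return the label with the highest log-probability score."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        score = math.log(label_counts[label] / total)
        n_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothing avoids zero probabilities.
            score += math.log((word_counts[label][word] + 1) / (n_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

model = train(TRAIN)
print(predict("free money", model))
print(predict("he sent the draft", model))
```

With real email data the same structure applies, but the training set would need to be far larger and the tokenization more careful.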

Page 6: Data Mining and Text Mining in Educational Research

Questions Answered by Data Mining

• What is the attitude of people on Twitter towards the presidential candidate Donald Trump?

Page 7: Data Mining and Text Mining in Educational Research

Questions Answered by Data Mining

• What is the attitude of people on Twitter towards the presidential candidate Donald Trump?

#Trump

#DonaldTrump

#GOPTrump
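Selecting tweets by the hashtags above can be sketched as a simple filter. This is only an illustration: the example tweets are invented, the function names are hypothetical, and collecting tweets from Twitter itself is not shown.

```python
import re

# Hashtags from the slide, lowercased so matching is case-insensitive.
TARGET_TAGS = {"#trump", "#donaldtrump", "#goptrump"}

HASHTAG_RE = re.compile(r"#\w+")

def extract_hashtags(tweet):
    """Return all hashtags in a tweet, lowercased."""
    return [tag.lower() for tag in HASHTAG_RE.findall(tweet)]

def is_relevant(tweet):
    """True if the tweet carries at least one target hashtag."""
    return any(tag in TARGET_TAGS for tag in extract_hashtags(tweet))

# Hypothetical tweets, invented for illustration.
tweets = [
    "Watching the debate tonight #Trump #election",
    "Great weather for a run #fitness",
    "Rally coverage starts at 8 #GOPTrump",
]

relevant = [t for t in tweets if is_relevant(t)]
print(len(relevant), "of", len(tweets), "tweets kept")
```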

Page 9: Data Mining and Text Mining in Educational Research

Questions Answered by Data Mining

• What is the attitude of people on Twitter towards the presidential candidate Donald Trump?

a, an, the, is, are, was, were, if …
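Words like these are usually treated as stop words and removed before analysis, since they carry little information about attitude. A minimal sketch, assuming a simple regex tokenizer and using only the stop words listed on the slide (a real list would be much longer):

```python
import re

# Stop words taken from the slide; the function name below is hypothetical.
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "were", "if"}

def remove_stop_words(text):
    """Lowercase, keep alphabetic tokens, and drop stop words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

print(remove_stop_words("The debate was a disaster"))
```

Note that this tokenizer also discards digits and the "#" symbol, which may or may not be what a given study wants.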

Page 11: Data Mining and Text Mining in Educational Research

Questions Answered by Data Mining

• What is the attitude of people on Twitter towards the presidential candidate Donald Trump?

Negative

Neutral

Positive
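One simple way to assign these three labels is to count words from a positive and a negative word list. The sketch below assumes this lexicon-based approach; the slides do not say which method was used, and the tiny word lists and the function name are hypothetical.

```python
# Hypothetical mini-lexicons, invented for illustration.
POSITIVE = {"great", "good", "love", "win"}
NEGATIVE = {"bad", "terrible", "hate", "lose"}

def classify_sentiment(text):
    """Label text by the balance of positive and negative words."""
    tokens = text.lower().split()
    score = sum((t in POSITIVE) - (t in NEGATIVE) for t in tokens)
    if score > 0:
        return "Positive"
    if score < 0:
        return "Negative"
    return "Neutral"

for tweet in ["I love this great plan",
              "what a terrible idea",
              "the rally is at noon"]:
    print(tweet, "->", classify_sentiment(tweet))
```

Word counting ignores negation and sarcasm, so results from a sketch like this should be checked against human-labeled examples.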

Page 12: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

Page 13: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

• What algorithm can score essays as teachers do?

Page 14: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

• What courses should we recommend to students based on their online activities?

Page 15: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

• Does the intervention improve students’ lexical variety in their writing?
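Lexical variety is often measured with the type-token ratio: the number of distinct words divided by the total number of words. A minimal sketch, assuming that measure and a simple regex tokenizer; the two writing samples are invented and the function name is hypothetical.

```python
import re

def type_token_ratio(text):
    """Distinct words divided by total words; 0.0 for empty text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)

# Hypothetical writing samples before and after an intervention.
before = "the cat saw the cat"
after = "the cat watched a small bird"

print(type_token_ratio(before))
print(type_token_ratio(after))
```

The ratio tends to fall as texts get longer, so samples of similar length should be compared.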

Page 16: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

• Are there different patterns in students’ questions; if so, are the patterns related to their academic performance?

Page 17: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

• What sub-topics do students tend to cover when discussing this topic?

Page 18: Data Mining and Text Mining in Educational Research

Educational Questions to Answer by Data Mining

• What predictor is the most important one for whether college students seek help online in their learning?

Page 19: Data Mining and Text Mining in Educational Research

Goals

• What is Data Mining?

Replicable

Reproducible

Automatic

Page 20: Data Mining and Text Mining in Educational Research

Goals

• What is Data Mining?

• What tools / knowledge do you need to do Data Mining?

Page 21: Data Mining and Text Mining in Educational Research

Tools / Knowledge

Page 22: Data Mining and Text Mining in Educational Research

Tools / Knowledge

Carmen Reinhart, Kenneth Rogoff

Thomas Herndon

Page 23: Data Mining and Text Mining in Educational Research

Goals

• What tools / knowledge do you need to do Data Mining?

Expert level of knowledge in statistics

Intermediate level of knowledge in programming

Familiarity with R/Python

Page 24: Data Mining and Text Mining in Educational Research

R for SAS and SPSS Users

Robert A. Muenchen

Goals

Page 25: Data Mining and Text Mining in Educational Research

Hands-On Programming with R

Garrett Grolemund

Goals

Page 26: Data Mining and Text Mining in Educational Research

Goals

• What is Data Mining?

• What tools / knowledge do you need to do Data Mining?

• What is the basic process of Data Mining?

Page 27: Data Mining and Text Mining in Educational Research

Data Collection

Data Cleaning

Data Processing

Data Analysis

Sharing Data and Results

Research Pipeline

Page 28: Data Mining and Text Mining in Educational Research

Data Collection

Page 29: Data Mining and Text Mining in Educational Research

• XML

Data Collection
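Collected data often arrives as XML. The sketch below shows one way to turn XML into a list of records with Python's standard library; the document layout (posts, author, text) is hypothetical and stands in for whatever a real source provides.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML, invented for illustration.
XML_DATA = """
<posts>
  <post id="1"><author>alice</author><text>How do I start with R?</text></post>
  <post id="2"><author>bob</author><text>Try a tutorial first.</text></post>
</posts>
""".strip()

root = ET.fromstring(XML_DATA)

# One dictionary per <post> element.
records = [
    {
        "id": post.get("id"),
        "author": post.findtext("author"),
        "text": post.findtext("text"),
    }
    for post in root.findall("post")
]

for record in records:
    print(record["author"], ":", record["text"])
```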

Page 30: Data Mining and Text Mining in Educational Research

Data Collection

• JSON
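JSON is the other common format for collected data, especially from web APIs. A minimal sketch with Python's standard library follows; the field names are hypothetical and do not reproduce any particular API's response.

```python
import json

# Hypothetical JSON payload, invented for illustration.
JSON_DATA = """
[
  {"id": 1, "user": "alice", "text": "Loving the new course #edtech"},
  {"id": 2, "user": "bob", "text": "Stuck on assignment 3"}
]
"""

records = json.loads(JSON_DATA)

# Pull out just the text field for later cleaning and analysis.
texts = [record["text"] for record in records]
print(texts)
```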

Page 31: Data Mining and Text Mining in Educational Research

Mining the Social Web, 2nd Edition

Matthew A. Russell

Python

Data Collection

Page 32: Data Mining and Text Mining in Educational Research

Data Cleaning

Page 33: Data Mining and Text Mining in Educational Research

Data Processing

Page 36: Data Mining and Text Mining in Educational Research

Data Processing

Text Analysis with R for Students of Literature

Matthew L. Jockers

Page 37: Data Mining and Text Mining in Educational Research

Data Analysis

• Lexical Variety

• Classification

• Clustering Analysis

• Latent Semantic Analysis

• Support Vector Machine

• Sentiment Analysis

• Topic Modeling
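Several of the techniques listed above, such as clustering and latent semantic analysis, commonly start from word-count vectors and a similarity measure between documents. The sketch below shows that starting point only, using bag-of-words counts and cosine similarity; it is not an implementation of any of the listed methods, and the example posts and function names are hypothetical.

```python
import math
from collections import Counter

def bag_of_words(text):
    """Map each lowercase word to its count in the text."""
    return Counter(text.lower().split())

def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors."""
    dot = sum(a[word] * b[word] for word in a if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Hypothetical student posts, invented for illustration.
doc1 = bag_of_words("how do I install R packages")
doc2 = bag_of_words("cannot install R packages on windows")
doc3 = bag_of_words("when is the final exam")

print(cosine_similarity(doc1, doc2))
print(cosine_similarity(doc1, doc3))
```

Posts about the same topic share more words and so score closer to 1, which is the signal a clustering step would group on.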

Page 38: Data Mining and Text Mining in Educational Research

Data Analysis

Renkl, A. (1997). Learning from worked-out examples: A study on individual differences. Cognitive Science, 21(1), 1-29.

Page 39: Data Mining and Text Mining in Educational Research

Data Analysis

An Introduction to Statistical Learning

Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani

Page 40: Data Mining and Text Mining in Educational Research

Sharing Data and Results

• R + knitr + RPubs

• GitHub

Page 41: Data Mining and Text Mining in Educational Research

Sharing Data and Results

• R + knitr + RPubs: http://rpubs.com/neohao/online-help-seeking

Page 42: Data Mining and Text Mining in Educational Research

Sharing Data and Results

• GitHub: https://github.com/Neo-Hao/TwitterHashtagR

Page 43: Data Mining and Text Mining in Educational Research

Sharing Data and Results

Version Control with Git

Jon Loeliger

Page 45: Data Mining and Text Mining in Educational Research

Thanks!