22
Data Analytics for Scanning by Anne Boysen UH Annual gathering 2018

Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Data Analytics for Scanning

by

Anne Boysen

UH Annual gathering 2018

Page 2: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

90% of all data was created in the past 2 years

Data in Zettabytes

Page 3: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Social Science

Strategic Foresight

Data Analytics

Social Statistics

Data Mining

TrendScouting

GenerationalForesight

Page 4: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

1 2

Page 5: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

1 2 ErrorType Error Type

Page 6: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Let’s play Schrödinger's cat!

Page 7: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

?

Probability the last cat is

alive?

Page 8: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Probability 50 out of 100 will

be alive?

Page 9: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Disease or no disease?

Page 10: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Does failure lead to success?

Page 11: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Descriptive Predictive Prescriptive

Surveys

Text Analytics

NLP CNN

Clusters

Association rules

Decision Trees

Neural Networks

Logistic Regression

Linear Programming

ForecastingStructured Data

Unstructured Data

Identifies the likelihood of future outcomes based on data mining, algorithms and machine learning techniques finds the best course of action for a given situation creates a summary of historical data.

Simulation

Stemming

Sentiment Analysis

Page 12: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Judging a surveyrelevance – validity - reliability

•What is being asked?

•Who are asked?

•How are they being asked?

•How is it interpreted?

Page 13: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Relevance

Does it answer what you really need to know?

Page 14: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Do pay for a professional census-representative sample

Page 15: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Text mining marries qualitative and quantitative methods

Page 16: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Decision trees

Page 17: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

1. Determine outliers Novelty and detection algorithms, discriminant analysis, ethnographic approaches, desk research or brainstorm

2. Identify relationships to other variables

Use association rules to see how outliers are tied to more frequently occurring data.

3. Identify emerging clusters and segments

Use co- occurrences to identify clusters or segments. Cluster Analysis (K-means, Hierarchical clusters )

4. Explore target segmentsCapture behavioral and attitudinal data via surveys, web-scraping etc. to examine direction of outlier trend

Predictive Weak Signals Scanning

Page 18: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Multilayer Perceptrons / Neural Networks

Page 19: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

X3 Output

X4

X5

X2

X1

Input Layer Hidden layers Output LayerInput nodes

Page 20: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

Y = ∑ (input* weight) +

biasX3 Output

X4

X5

X2

X1

Input Layer

Activation function

To get non-linear outcomes:

- Logistic/ sigmoid function- Hyperbolic Tangent (tanh)- ReLU

Output LayerInput nodes

Page 21: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

X3 Output

X4

X5

X2

X1

Input Layer Hidden layers Output LayerInput nodes

Page 22: Data Analytics for Scanning - Houston Foresight€¦ · Text Analytics NLP CNN Clusters Association rules Decision Trees Neural Networks Logistic Regression Linear Programming Forecasting

In a time of drastic change it is the (machine) learners who inherit the future. The learned

usually find themselves equipped to live in a world that no longer exists

~ Eric Hoffer