Paranormal stats

Preview:

Citation preview

Para-normal Statistics:Analyzing what doesn't add up.

Steven LembarkWorkhorse Computinglembark@wrkhors.com

Normality

We expect data is normal.

It's what we are trained for.

Chi-Squared, F depend on it.

It's the guts of ANOVA.

Theory guarantees it, sort of.

What is "normal"?

Normal data is:

Parametric

Real

Symmetric

Unimodal

Ab-normal data

Not all data is parametric.

Ab-normal data

Not all data is parametric:

"Bold" + "Tide" / 2 ==

Ab-normal data

Not all data is parametric: Nominal Data

"Bold" + "Tide" / 2 == ??

"Bald" - "Harry" >= 0 ??

Ab-normal data

Not all data is parametric: Ordinal Data

"On a scale of 1 to 5 how would you rate..."

Is the average really 3?

Are differences between ranks unform?

Ab-normal data

Not all data is parametric: Ordinal Data

"On a scale of 1 to 5 how would you rate..."

Is the average really 3?

For different people?

Ab-normal data

Not all data is unimodal, symmetric.

Bi-modal data has higher sample variance.

Positive data is skewed.

Ab-normal data

Counts usually Binomial or Poission.

Binomial: Coin flips.

Poisson: Sample success/failure.

Power of Positive Thinking

Binomial: Count of success from IID experiments.

Mean = np

Variance = npq

Power of Positive Thinking

Poisson: Count of occurrances in sample size n.

Mean = np

Variance = np

Power of Positive Thinking

Curves all positive.

Right tailed.

Binomial has highest power if sample data is binomial.

Result: Smaller n for given Beta.

Kinda normal

Approximations work...

Kinda normal

Approximations work some of the time.

Rule: npq > 5 for binomial approximation.

Goal: Keep mean > 3σ so normal is all positive.

Q: How good an approximation?

A: It depends...

The middle way

Binomial:

n=20, p=0.5

Normal:

µ 10, σ = 2.23

Decent approximation.

Off to one side

Binomial:

n=20, p=0.3

Normal:

µ = 6, σ = 2.0

Drifting negative.

Life on the edge

Binomial:

n=20, p=0.1

Normal:

µ = 2, σ = 1.3

Significant negative.

Neverneverland

Binomial:

n=20, p=0.0013

Normal:

µ = 0.26

σ = 0.16

Heavily negative.

General rule: npq > 5

Small or large p is skewed.

Six-sigma range should be positive.

At that point n > 5 / pq.

For p = 0.0013, n = 3582.

Sample size around 4000?

When we assume we make...

Assuming normal data leaves a less robust conclusion.

Stronger, less robust:

Sensitive to individual datasets.

Not reproducable.

Non-parametric Statistics

Origins in Psychology, Biology, Marketing.

Analyze counts, ranks.

Tests based on discrete distributions.

Common in Quality

Frequency of failures.

QC with No-Go guages.

Variations between batch runs.

Customer feedback.

Example: Safety study

Q: Are departments equally "safe"?

Q: Is a new configuration any "safer"?

Compare sample populations.

What is "safe"?

Fewer reported injurys?

What is P( injury ) per operation?

What is "safe"?

Fewer reported injurys?

What is P( injury ) per operation?

0.5?

0.3?

What is "safe"?

Fewer reported injurys?

What is P( injury ) per operation?

0.5?

0.1?

A whole lot less?

What is "safe"?

Fewer reported injurys?

What is P( injury ) per operation?

0.5?

0.1?

A whole lot less?

N(0.01, 0.01) is heavily negative.

Severe?

Parametric measure of injurys?

Severe?

Parametric ranking of injurys?

( Finger + Thumb ) / 2 == ?

Severe?

Parametric ranking of injurys?

( Finger + Thumb ) / 2 == ?

( Hand + Eye ) == Arm ?

( Hand + Hand ) == 2 * Hand ?

Ordinal Data

Ranked data, not scaled.

Ordinal Data

Ranked data, not scaled.

Hangnail < Finger Tip < Finger < Hand < Arm

"Fuzzy Buckets"

Have p( accident ) from history.

Kolomogrov-Smirnov

Got tonic?

Kolomogrov-Smirnov

Nope, not Vodka.

Like F or ANOVA: Populations are "different".

K-S Test

Compare cumulative data (blue) vs. Expeted (red).

Measure is largest difference (arrow).

K-S for safety

Rank the injurys on relative scale.

Compare counts by bucket.

Cumulative distribution:

accomodates empty cells.

minor mis-catagorization.

A good datum is hard to find,You always get the other kind.

Apologies to Bessie Smith

Sliding-scale questions:

"How would you rate..."

"How well did..."

"How likely are you to..."

A good datum is hard to find,You always get the other kind.

Apologies to Bessie Smith

Reproducability:

Variable skill.

Variable methods.

Variable data handling.

A good datum is hard to find,You always get the other kind.

Apologies to Bessie Smith

Big Data:

Multiple sources.

Multiple populations.

Multiple data standards.

Repeatable Analysis

Variety of NP tests for "messy" data.

Handle protocol, sampling variations.

Robust conclusions with real data.

Summary

Non-parametric data: counts, nominal, ordinal data.

Non-parametric analysis avoids NID assumptions.

Robust analysis of real data.

Even the para-normal.

Questions?

References: N-P

http://www.uta.edu/faculty/sawasthi/Statistics/stnonpar.html

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC153434/

Nice writeups.

References: K-S

http://itl.nist.gov/div898/handbook/eda/section3/eda35g.htm

Exploratory data analysis is worth exploring.

https://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test

As always... really good writeup of the test definition, math.

References: Robust Analysis

https://en.wikipedia.org/wiki/Robust_statistics

https://en.wikipedia.org/wiki/Robust_regression

Decent introductions.Also look up "robust statistics" at nist.gov or "robust statistical analysis" at duckduckgo.

References: This talk

http://slideshare.net/lembark

Along with everything else I've done...