33
Headquarters 300 W. Main Street, Suite 301 Charlottesville, VA 22903 434.973.7673 | fax 434.973.7875 www.elderresearch.com Copyright © 2017 Elder Research, Inc. Office Locations Arlington, VA Linthicum, MD Raleigh, NC Beyond Reliability: Advanced Analytics for Predicting Quality William J. Goodrum, Jr., PhD Elder Research, Inc. [email protected]

Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Headquarters300 W. Main Street, Suite 301Charlottesville, VA 22903434.973.7673 | fax 434.973.7875

www.elderresearch.comCopyright © 2017 Elder Research, Inc.

Office LocationsArlington, VA

Linthicum, MDRaleigh, NC

Beyond Reliability: Advanced Analytics for Predicting QualityWilliam J. Goodrum, Jr., PhD

Elder Research, [email protected]

Page 2: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Who Should Attend?

2

Page 3: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

When Individual Entities Matter

• Context– Natural Gas Wells in Wyoming

• Problem: Well “Freezing”– Costly to send maintenance

into the field to prevent– Lost production during

downtimeWhen should they go?Where should they go?

3

Page 4: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

What do maintenance crews need to know?

• Which well has stopped producing?

• Why has this well stopped producing?

• What is the nature of the failure?

• What service has already been performed?

4

Page 5: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Reliability: A Question of Questions

5

Page 6: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Reliability: A Question of Questions

6

Which question is more important?

What proportion of wells will havean equipment failure in the next180 days?

What is the likelihood thatthis particular well will requiremaintenance in the next 180 days?

Predictive Analytics“Traditional”

2/9

Page 7: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Traditional Paradigms

Kaplan-Meier/Cox Regression Design of Experiments

7

Page 8: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Predictive Analytics Methods

8

Decision Trees

NeuralNetworks

RegressionAnalysis

Random Forests/

Ensembles

Page 9: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

The Strengths of Traditional Methods

• We want to make a decision based on the reliability (or quality) of a population– How many or how much…?

• Possible applications:– Estimating fleet overall performance– Budgeting for field maintenance– Population health management– Life/Long-Term Care portfolio analysis

9

Page 10: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

What do the traditional methods tell us about the wells?Kaplan-Meier/Cox RegressionWe might know what proportion of wells make it to a given life, but we may not know what contributes to the likelihood of failure for a particular well

We might know something about the wells in the center of our DoE, but we will know very little about the performance in extreme cases.

10

Design of Experiments

Page 11: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Predictive Analytics: A Complementary Paradigm

11

Page 12: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Case Study: Connected “Toasters”• Client: Connected “Toaster”

Manufacturer• Goal: use data as an asset

for competitive advantage• Identified Opportunity:

preventative maintenance of “toasters”

• Our Engagement: Third-Party Validation

12

Page 13: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Unpacking the Opportunity

13

RawDataStore

Analysis:Requires

MaintenanceY/N?

TransformData

Page 14: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Elements of the “Toaster” Solution• Need: Select/create

variables related to maintenance and failures

• Analysis Method: Kaplan-Meier– Stratified by:

• Date of Manufacture• Design• …

• Goal: Optimize for Consumer’s Risk of devices

14

Page 15: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Burnt “Toast”: Limitations with Traditional Methods

• Average survival was correct, but particular survival probabilities did not match observations

• Some strata had no failures at all (e.g., newer “toasters”)

• Excessive Stratification àSmall samples!

• Assumptions for missing data grossly overestimated failures

15

Page 16: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Two Questions Again!

Our client’s questionHow many “toasters” can we afford to bring back for maintenance? How many failures in the field can we afford to have?

Their customer’s questionDoes my toaster need maintenance?

16

or

Page 17: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Applying Predictive Analytics to “Toaster” Data

• Problem: Classification– Does this “toaster” require

maintenance: Y/N?• Method: LASSO Regression

– Combined variable selection and prediction

– Mitigate overfit through regularization

• Benefit: also can estimate aggregate Consumer’s Risk

How did we compare?

17

Page 18: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Combining Traditional Methods with Predictive Analytics• Complementary Validation

– Identified similar feature space– Consumer’s Risk still matters!

• Entity Failure Probabilities– Likelihood of failure for individual

“toasters”– Closer to user needs

• Integration of Historical Performance– Historical data for each “toaster”

used to assess model performance

18

Page 19: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Back to Wyoming

19

Page 20: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

The Old Challenges of Found Data

• >1 TB of ugly dataSome Challenges Included:• Difficult integration• Missing/sparse data• Information unavailable until

end of project (e.g., “freeze”)• No information on well

treatment (i.e., methanol pour-down)

20

Page 21: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

More on “Freezing”

• Initially: well is “frozen”• Evolution 1:

– Subsurface Freezing (more costly)

– Above-Ground Freezing• Evolution 2: any downtime

(including scheduled maintenance!)

21

Page 22: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Starting Traditionally

• Initial Analysis: Kaplan-Meier

• Key Insights:– Field-level statistics still matter

for resourcing/budgeting decisions

– Fast and efficient statistics on aggregate well behavior

– Repeat “freeze” 10x more likely after first freeze

22

Page 23: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Finishing with Predictive Analytics• Problem: Classification

– How likely is this well to freeze in the next 180 days?

• Method: Logistic Regression

• Key Insights:– Aggregated entity probabilities

were more accurate than K-M– Significant additional effort

3x improvement over random baseline (at 20% workload)

23

Page 24: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Success?

24

Page 25: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Summary

25

Page 26: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

If the problem looks like this. . .

• Who. . .?

• Which. . .?

• Where. . .?

• When. . .?

26

Page 27: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

. . . then predictive analytics may help like this

• Who. . .?– Prioritization of people for

expert review• Which. . .?

– Highlight products of interest• Where. . .?

– Narrow geographical focus• When. . .?

– Likelihood of event in a given window of time

27

Page 28: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Reliability/Quality: Complementary Decisions

Traditional Methods• Decisions/Actions that Affect

Groups– High-level planning and cost

analysis– Resource forecasting– “Portfolio” analysis

Predictive Analytics• Decisions/Actions that Affect

Entities– Maintenance Recommendations– Prioritization of Investigations/Audits– Resource scheduling

28

Page 29: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

About Our Company

29

Page 30: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

About Elder Research

• Founded in 1995 by Dr. John Elder

• Offices:– Charlottesville (HQ)– Arlington– Baltimore– Raleigh

• Areas of Expertise:– Data Science– Text Mining– Data Infrastructure– Data Visualization

30

Page 31: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

Appendix

31

Page 32: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

What is LASSO?

• Least Absolute Shrinkage and Selection Operator

• Generalized Linear Model (Logistic Regression is related)

• Key Features: – Budget on the sum of coefficients– “Regularization” term

• Result: prevents overfit, and helps select inputs!

32

Page 33: Beyond Reliability: Advanced Analytics for Predicting Quality€¦ · 5/10/2017  · including analytics strategy development, predictive model validation, and predictive model building

About Me

Dr. William Goodrum is a Data Scientist with Elder Research; one of the oldest predictive analytics consultancies in North America. At Elder Research, Dr. Goodrum has led teams of Data Scientists and Software Engineers on a variety of different projects including analytics strategy development, predictive model validation, and predictive model building. These projects have been in industries as diverse as philanthropic development, maritime risk assessment, and connected device maintenance. He is also a frequent contributor to the company blog on analytics and analytics strategy.

Dr. Goodrum holds a B.S. in Mechanical Engineering from the University of Virginia, and a PhD in Engineering from Cambridge University.

33