29
#datapopupseattle How Data Science Builds Better Products Sean McClure, Ph.D Data Scientist, Senior Consultant, ThoughtWorks WorldOfDataSci Thoughtworks

How Data Science Builds Better Products - Data Science Pop-up Seattle

Embed Size (px)

Citation preview

Page 1: How Data Science Builds Better Products - Data Science Pop-up Seattle

#datapopupseattle

How Data Science Builds Better Products

Sean McClure, Ph.DData Scientist, Senior Consultant, ThoughtWorks

WorldOfDataSci Thoughtworks

Page 2: How Data Science Builds Better Products - Data Science Pop-up Seattle

#datapopupseattle

UNSTRUCTUREDData Science POP-UP in Seattle

www.dominodatalab.com

D

Produced by Domino Data Lab

Domino’s enterprise data science platform is used by leading analytical organizations to increase productivity, enable collaboration, and publish

models into production faster.

Page 3: How Data Science Builds Better Products - Data Science Pop-up Seattle

How Data ScienceBuilds Better Products

Sean McClure Data Scientist, ThoughtWorks

Page 4: How Data Science Builds Better Products - Data Science Pop-up Seattle

Products are Built to Quickly Test Ideas

lightweight end-to-end imperfect

Why Do We Build Products?

data value

product

early stages

Page 5: How Data Science Builds Better Products - Data Science Pop-up Seattle

learning is the key to building better products

world -> data -> discovery -> pivot

productproduct

productdiscovery

Page 6: How Data Science Builds Better Products - Data Science Pop-up Seattle

TRADITIONAL “DISCOVERY”

the right decisions

Page 7: How Data Science Builds Better Products - Data Science Pop-up Seattle

the right decisions

the right decisionsTRADITIONAL “DISCOVERY”

Page 8: How Data Science Builds Better Products - Data Science Pop-up Seattle

the right decisionsTRADITIONAL “DISCOVERY”

Page 9: How Data Science Builds Better Products - Data Science Pop-up Seattle

the right decisionsTRADITIONAL “DISCOVERY”

Page 10: How Data Science Builds Better Products - Data Science Pop-up Seattle

the right decisionsTRADITIONAL “DISCOVERY”

Page 11: How Data Science Builds Better Products - Data Science Pop-up Seattle

the right decisions

?unrealistic

TRADITIONAL “DISCOVERY”

Page 12: How Data Science Builds Better Products - Data Science Pop-up Seattle

data science

the right decisions

+understands strategyunderstands data

BETTER DISCOVERY

Page 13: How Data Science Builds Better Products - Data Science Pop-up Seattle

Count-controlled loops Condition-controlled loops Collection-controlled loops Infinite loops Restart loop Generators

Early exit from loops Loop variants and invariants Loop system cross-references Structured non-local control flow Conditions Exceptions

Loops Flow Control structures If-then-(else) Case and switch Coroutines Continuations

STANDARD SOFTWAREWhat’s Wrong With the Usual Approach?

All functionality is locked in place

Page 14: How Data Science Builds Better Products - Data Science Pop-up Seattle

STANDARD SOFTWARE

software environment

Page 15: How Data Science Builds Better Products - Data Science Pop-up Seattle

STANDARD SOFTWARE

software environment

Page 16: How Data Science Builds Better Products - Data Science Pop-up Seattle

STANDARD SOFTWARE

software

environment

Page 17: How Data Science Builds Better Products - Data Science Pop-up Seattle

Learning algorithms

Model Validation

Model Performance

Data visualizationOperationalizing Models

Scientific computing libraries

Data cleansing

Data preparation

Probability and statistics

Loops Flow Control structures If-then-(else) Case and switch Coroutines Continuations

Count-controlled loops Condition-controlled loops Collection-controlled loops Infinite loops Restart loop Generators

Early exit from loops Loop variants and invariants Loop system cross-references Structured non-local control flow Conditions Exceptions

ADAPTIVE SOFTWAREWhat is the New Approach?

unlocked

Page 18: How Data Science Builds Better Products - Data Science Pop-up Seattle

ADAPTIVE SOFTWARE

software environment

Page 19: How Data Science Builds Better Products - Data Science Pop-up Seattle

ADAPTIVE SOFTWARE

software environment

Page 20: How Data Science Builds Better Products - Data Science Pop-up Seattle

ADAPTIVE SOFTWARE

software environment

Page 21: How Data Science Builds Better Products - Data Science Pop-up Seattle

“rapid and flexible response to change”“continuous improvement”

post-developmentdevelopment

ability to pivot

ability to pivot

Page 22: How Data Science Builds Better Products - Data Science Pop-up Seattle

How Do You Put the Brain in the Box?

Page 23: How Data Science Builds Better Products - Data Science Pop-up Seattle

Successful Data Products

• establish early benchmarks • understand true validation • build sophistication via iteration • provide APIs to model results • get continuous exposure to domain experience • design product experiments

Need to utilize technology choices that allow for building data products successfully

Page 24: How Data Science Builds Better Products - Data Science Pop-up Seattle

Search Engine Marketing - Recommendation• Increasing CTR? • Decreasing CPC? • Call volume trends • Percentage of Good Call trends. • Page Position • Visits vs Cost Per Visit • Impressions vs CTR graph. • Breakdown of CVT types • Click-to-call • Daily Budget Spend • Top 5 KWs vs Previous Good Cycle • Budget distribution • Impressions per publisher • Revenue per publisher • Page position per publisher • Review for Negative KWs • Review for Partner site issues • Review for OAT • Check Category page • Impression Share • Are the ads approved and running? • Below 1st Page Bid KWs • Quality Score • Is it loading? • Are all numbers replacing correctly? • Out of Area Traffic • High Spend – Low Revenue. • Super Low CTRs

making decisions

Page 25: How Data Science Builds Better Products - Data Science Pop-up Seattle

Data Product

Hadoop Cluster

Databases

DB DataProducer

Queue

Reporting Data

Operational Data

rl_op

rl_keyword

rl_report

HDFS

Flume

Data Core CPI Data Mart

Campaign

Creative

Publishers

Proxy Logs Call Logs

CPI

Admin Console

Others

Others

SqoopCPI

SpaceRaw

Normalized

Core Jobs

CPI Jobs

Search Engine Marketing - Recommendation

Page 26: How Data Science Builds Better Products - Data Science Pop-up Seattle

Designing APIs around

Model Results

HealthcarePrediction Engine

Page 27: How Data Science Builds Better Products - Data Science Pop-up Seattle

#datapopupseattle

@datapopup #datapopupseattle

Page 28: How Data Science Builds Better Products - Data Science Pop-up Seattle

#datapopupseattle

Thank You To Our Sponsors

Page 29: How Data Science Builds Better Products - Data Science Pop-up Seattle

THANK YOU