The Tester’s Dashboard: Release Decision Support


Industry track paper presented at ISSRE 2010, San Jose, November 2, 2010. Combining reliability, coverage, and an information-theoretic metric provides better feedback for release decisions.


Robert V. Binder, System Verification Associates, LLC (rbinder@ieee.org)

Peter B. Lakey, Cognitive Concepts, Inc. (peterlakey@sbcglobal.net)

Overview

• Complementary metrics for release decision support
  – Model-based testing
    • Operational profile
    • Model coverage metrics
  – Reliability Demonstration Chart
  – Relative Proximity
• Case Study
• Observations

Release Decision Support

Model-based Reliability Estimation

• Test suites must be

– Proportional to operational profile

– Sequentially feasible

– Input feasible

• Approach

– Markov model

– Monte Carlo simulation

– Post-run analytics (see the sketch below)
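The approach above pairs a Markov usage model with Monte Carlo simulation so that generated test suites are proportional to the operational profile and sequentially feasible by construction. A minimal Python sketch; the usage model, its states, and its transition probabilities are hypothetical values invented for illustration, not taken from the paper:

```python
import random

# Hypothetical Markov usage model: each state maps to its outgoing
# (next_state, probability) pairs. Numbers are illustrative only.
USAGE_MODEL = {
    "Start":    [("Login", 1.0)],
    "Login":    [("Browse", 0.8), ("Exit", 0.2)],
    "Browse":   [("Browse", 0.5), ("Checkout", 0.3), ("Exit", 0.2)],
    "Checkout": [("Exit", 1.0)],
    "Exit":     [],
}

def simulate_run(model, start="Start", rng=random):
    """Walk the usage model once and return the visited state sequence.

    Sampling each step from the model's probabilities keeps a large batch
    of runs proportional to the operational profile, and every generated
    sequence is sequentially feasible because it follows model transitions.
    """
    path = [start]
    state = start
    while model[state]:
        next_states, weights = zip(*model[state])
        state = rng.choices(next_states, weights=weights, k=1)[0]
        path.append(state)
    return path

# Monte Carlo: generate 10,000 abstract test sequences, then hand them to
# post-run analytics (coverage, RDC, relative proximity).
suite = [simulate_run(USAGE_MODEL) for _ in range(10_000)]
print(len(suite), "runs; example:", suite[0])
```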

Model Coverage Metrics

[Figure: a usage-profile trigger activates a latent defect (fault) in the software system's process space and data space; the result is observed as a system failure.]

• % States Reached

• % State-Transitions Reached (see the sketch below)
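Both percentages can be computed directly from simulated runs. A small sketch, assuming each run is recorded as its visited-state sequence and the model uses the same hypothetical state-to-transitions mapping as the earlier simulation example:

```python
def model_coverage(model, runs):
    """Return (% states reached, % state-transitions reached) for a suite.

    model -- {state: [(next_state, probability), ...]} usage model
    runs  -- list of visited-state sequences produced by the test suite
    """
    all_states = set(model)
    all_transitions = {(s, t) for s, outs in model.items() for t, _ in outs}
    hit_states = {s for run in runs for s in run}
    hit_transitions = {pair for run in runs for pair in zip(run, run[1:])}
    return (100.0 * len(hit_states & all_states) / len(all_states),
            100.0 * len(hit_transitions & all_transitions) / len(all_transitions))

# Illustrative check with a toy model and two recorded runs.
toy = {"a": [("b", 1.0)], "b": [("a", 0.5), ("c", 0.5)], "c": []}
print(model_coverage(toy, [["a", "b", "c"], ["a", "b", "a"]]))  # (100.0, 100.0)
```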

Reliability Demonstration Chart

• Sequential Sampling

• Risk-Adjusted

• Musa equations (see the boundary sketch below)

http://sourceforge.net/projects/rdc/
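The chart's accept/continue/reject regions come from sequential sampling. As a hedged sketch, the function below uses one standard sequential probability ratio test formulation for a Poisson failure process, with discrimination ratio, supplier risk, and consumer risk as inputs; treat the exact thresholds and the default parameter values as assumptions for illustration, not the paper's figures.

```python
import math

def rdc_decision(failures, normalized_units, gamma=2.0, alpha=0.1, beta=0.1):
    """Classify a point on a Reliability Demonstration Chart.

    failures         -- number of failures observed so far
    normalized_units -- usage so far multiplied by the failure intensity objective
    gamma            -- discrimination ratio (unacceptable / objective intensity)
    alpha, beta      -- supplier risk and consumer risk

    Returns "accept", "reject", or "continue" per a sequential probability
    ratio test for a Poisson failure process.
    """
    upper = math.log((1 - beta) / alpha)   # cross this: reject
    lower = math.log(beta / (1 - alpha))   # cross this: accept
    llr = failures * math.log(gamma) - (gamma - 1) * normalized_units
    if llr >= upper:
        return "reject"
    if llr <= lower:
        return "accept"
    return "continue"

# Example: 2 failures after 6 normalized units of testing.
print(rdc_decision(2, 6.0))  # -> "accept" (enough low-failure usage accrued)
```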

Relative Proximity

• Kullback-Leibler Distance (KLD)
  – Information theoretic
  – Characterizes the difference in variation between the expected message population E and the actual sample A as "relative entropy"
• Relative Proximity
  – The KLD math doesn't work unless failures are modeled (the actuals must sum to 1.0)
  – Assume the target failure rate is an aggregate
  – Allocate the failure rate in proportion to each operation

KLD = Σᵢ Aᵢ · log₂(Aᵢ / Eᵢ)
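A minimal sketch of the formula as it is applied on the following slides, where A and E hold per-cell counts (operation by pass/fail) rather than normalized probabilities, and cells with a zero actual count contribute nothing:

```python
import math

def kl_distance(actual, expected):
    """KLD = sum over i of A_i * log2(A_i / E_i); zero actuals contribute 0."""
    return sum(a * math.log2(a / e) for a, e in zip(actual, expected) if a > 0)

print(round(kl_distance([10], [2]), 3))  # single cell: 10 * log2(5) = 23.219
```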

Profile Explicit Failure Modes

• Assume a maximum acceptable failure intensity of 1 in 10,000

Operation   Mode   Standard Profile   Explicit Failure Profile   Expected Number (10,000 tests)
A           Pass   0.7                0.6993                     6993
B           Pass   0.2                0.1998                     1998
C           Pass   0.1                0.0999                     999
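One way to reproduce the explicit failure profile in the table above: split each operation's standard-profile probability into pass and fail shares, allocating an aggregate failure probability in proportion to the profile. The aggregate value of 0.001 used below is inferred from the table itself (it yields the expected counts 6993/7, 1998/2, 999/1 in 10,000 tests); this is an illustrative sketch, not the authors' tooling.

```python
def explicit_failure_profile(profile, aggregate_failure_prob, n_tests=10_000):
    """Split each operation's probability into pass/fail shares.

    profile                -- {operation: standard profile probability}
    aggregate_failure_prob -- total failure probability allocated proportionally
    Returns {(operation, mode): (probability, expected count in n_tests)}.
    """
    out = {}
    for op, p in profile.items():
        p_fail = p * aggregate_failure_prob
        p_pass = p - p_fail
        out[(op, "Pass")] = (p_pass, round(p_pass * n_tests))
        out[(op, "Fail")] = (p_fail, round(p_fail * n_tests))
    return out

# Standard profile from the slide: A 0.7, B 0.2, C 0.1.
print(explicit_failure_profile({"A": 0.7, "B": 0.2, "C": 0.1}, 0.001))
# -> A Pass 0.6993 (6993), A Fail 0.0007 (7), B Pass 0.1998 (1998), ...
```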

Profile Explicit Failure Modes

• Relative Proximity indicates how far observed failure rates are from the expected failure rates
• Many possible per-operation failure rates, with better or worse fidelity to the expected profile
• The RDC is based on the aggregate FIO (failure intensity objective), so it is not sensitive to per-operation variance

Two example outcomes are compared against the same expected counts:

Operation   Mode   Expected   Actual (1)   KL Distance (1)   Actual (2)   KL Distance (2)
A           Pass   6993       7000         10.104            6990         -4.327
A           Fail   7          0            0.000             10           5.146
B           Pass   1998       1990         -11.518           2000         2.887
B           Fail   2          10           23.219            0            0.000
C           Pass   999        980          -27.149           994          -7.195
C           Fail   1          20           86.439            6            15.510
Total              10000      10000        81.094            10000        12.020
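The aggregate KL distances in the last row of the table can be reproduced from the per-cell counts with the count-based formula from the Relative Proximity slide; a small self-contained check:

```python
import math

def kl_distance(actual, expected):
    # Count-based KLD; cells with a zero actual count contribute 0 by convention.
    return sum(a * math.log2(a / e) for a, e in zip(actual, expected) if a > 0)

# Cell order: A Pass, A Fail, B Pass, B Fail, C Pass, C Fail.
expected = [6993, 7, 1998, 2, 999, 1]
actual_1 = [7000, 0, 1990, 10, 980, 20]   # first "Actual" column
actual_2 = [6990, 10, 2000, 0, 994, 6]    # second "Actual" column

print(round(kl_distance(actual_1, expected), 3))  # 81.094 (matches the table)
print(round(kl_distance(actual_2, expected), 3))  # 12.02  (table: 12.020)
```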

Case Studies

• Stochastic Models

• Assumed Failure Rates

• Word Processing Application

• Ground-Based Midcourse Missile Defense

GBMD Test Run, 0-100

GBMD Test Run, 1K, 5K

GBMD Test Run, 10K

GBMD Relative Proximity Trend

[Chart: relative proximity plotted against number of tests (10 to 10,000, log scale); data labels read 1484.00, 418.20, 67.90, 12.00, and 6.30.]

Observations

• Model coverage indicates minimal sufficiency
  – Wouldn't release without all state-transition pairs covered
  – Stochastic generation can take a long time to achieve this
  – Cover with the N+ strategy first
• RDC assumes a "flat" profile
  – With sequential constraints, it may be optimistic
  – Its strength is explicit risk adjustment
• Relative Proximity indicates whether operation-specific failure intensity is as expected (or not)

Q & A
