Cristian Cadar, Peter Boonstoppel, Dawson Engler RWset: Attacking Path Explosion in Constraint-Based...

Cristian Cadar, Peter Boonstoppel, Dawson Engler

RWset: Attacking Path Explosion in Constraint-Based Test Generation

TACAS 2008, Budapest, Hungary ETAPS 2008

Goal: generate inputs that explore (ideally) all paths of a program

Run program on symbolic input, whose initial value is anything

At conditionals that use symbolic inputs, fork execution and follow both paths:

On true branch, add constraint that condition is true On false, that it is not

When a path terminates, generate a test case by solving the constraints on that path

Constraint-Based Test Generation

• EGT / EXE / KLEE• DART [Godefroid/Klarlund/Sen]• CUTE [Sen et al.]• SAGE, Pex [Godefroid et al.]• Vigilante [Castro et al ]• BitScope [Song et al.]

• RWset applicable to any of these

Constraint-Based Test Generation

Effective bug-finding tool File system code

ext2, ext3, JFS

Networking applications bpf, udhcpd

Library code PCRE, Pintos

Device drivers Minix

EXE Results

Scalability challenge

• Exponential space!– Relatively small number of interesting paths: e.g., those that achieve

maximum branch coverage

• Mixed symbolic/concrete execution (EXE/DART)• Search heuristics

– Best First Search (EXE)– Generational Search (SAGE)

• Symbolic execution + random testing (Hybrid CUTE)• Caching function summaries (SMART)• Demand-Driven Compositional Symbolic Execution (Pex)

RWset (read-write set) analysis

• Determine whether continuing to execute the current program path will explore new states

• Only a value observed by the program can determine the execution of new program states

{data, arg1, arg2} = unconstrained

flag = 0;

if (arg1 > 100) flag = 1;

if (arg2 > 100) flag = 1;

process(data, flag);

flag = 0

flag = 0;

if (arg1 > 100) flag = 1;

if (arg2 > 100) flag = 1;

arg1 > 100

flag = 0

flag = 0;

if (arg1 > 100) flag = 1;

if (arg2 > 100) flag = 1;

true: arg1 > 100 false: arg1 100

. . . . . .

arg1 > 100

arg2 > 100

flag = 1

flag = 0

flag = 0;

if (arg1 > 100) flag = 1;

if (arg2 > 100) flag = 1;

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg2 100

process(data, 1)

flag = 0;

if (arg1 > 100) flag = 1;

if (arg2 > 100) flag = 1;

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg2 100

process(data, 1) process(data, 1)

flag = 0;

if (arg1 > 100) flag = 1;

if (arg2 > 100) flag = 1;

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg2 100

process(data, 1) process(data, 1)

If arg1, arg2 not read

Write-set analysis• Look at values written in the past• State abstraction in model checking• Memory state = write set up to current progr. pt.

– Concrete writes: concr loc = concr val– Symbolic writes: constraint(sym loc)

• Program point P, two paths w/ same write-set – prune the second one

Write-sets

• Precision: reason at the byte-level• Minimize write-set size

1. Overwrites

2. Dead locations

3. Alpha renaming

• Complicated in the symbolic domain

a[i] = 17; a[j] = 20;

• Cannot discard all constraints

on dead locations

Read-set analysis

• Key idea: can ignore from write-set writes to locations that are never read again– Definition of read driven by goal to achieve

high branch coverage– Location read if can hit a new branch by

changing its value

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 =

True-True path

arg2 =

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 =

True-True path

arg2 = flag = 0

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 > 100

True-True path

arg2 = flag = 0

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 > 100

True-True path

arg2 = flag = 1

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 > 100

True-True path

arg2 > 100

flag = 1

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 > 100

True-True path

arg2 > 100

flag = 1

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

arg1 > 100

True-True path

arg2 > 100

flag = 1

data =

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

True-True path

flag = 1

data =

arg1, arg2 not read

arg1 > 100

arg2 > 100

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

True-False path

arg1 > 100

True-True path

arg2 > 100

flag = 1

data = arg1 > 100

arg2 100

flag = 1

data =

arg1, arg2 not read

RWset RWset

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

True-False path

arg1 > 100

True-True path

arg2 > 100

flag = 1

data = arg1 > 100

arg2 100

flag = 1

data =

arg1, arg2 not read

RWset RWset

arg1 > 100

arg2 > 100

flag = 1

flag = 0

true: arg2 > 100

flag = 1

false: arg1 100

process(data, 1)

True-False path

arg1 > 100

True-True path

arg2 > 100

flag = 1

data = arg1 > 100

arg2 100

flag = 1

data =

arg1, arg2 not read

RWset RWset

Implementation

Execution path Global cache

(W1, R1) (W2, R2)

…(Wn, Rn)

RW-Set(P)

Implementation

(W1, R1) (W2, R2)

…(Wn, Rn)Write-set Hit!

Emit test case i.(WP = Wi)

RW-Set(P)

Implementation

Read-set Hit!i.(WP Ri = Wi Ri)

Add (WP,Ri) to RW-Set(P)

(W1, R1) (W2, R2)

…(Wn, Rn)(WP, Ri)Emit test case

RW-Set(P)

Implementation

(W1, R1) (W2, R2)

…(Wn, Rn)(WP, _)

Add (WP,_) to RW-Set(P)

RW-Set(P)

Evaluation

• Medium-sized open source benchmarks– bpf, udhcpd, expat, tcpdump, pcre

• Minix 3 device drivers– lance, pci, sb16

Medium-sized apps

• bpf: Berkeley Packet Filter• expat: XML parsing library• pcre: Perl compatible reg exp library• tcpdump: tool for printing packet headers• udhcpd: a DHCPD server

Medium-sized apps

• Ran base EXE on each app for 30 minutes– Except 30,000 test cases for PCRE

• Recorded number of branches hit• Reran in RWset mode until we reached

the same branch coverage

Medium-sized apps

Percentage tests needed by RWset EXE

100 100 100 100 100

bpf expat pcre tcpdump udhcpd

Base EXE RW-set EXE

Non-redundant states

tcpdump

udhcpd

Test cases

Test cases Test cases

Base EXE RWset EXE

Performance

• Dry run for the medium-sized apps: compute write-sets and read-sets but do no pruning

Device drivers

• Hard to check w/o the physical device or outside of the kernel

• Focused on three MINIX 3 drivers:– lance: AMD lance ethernet card driver– pci: PCI bus driver– sb16: Sound Blaster 16 driver

Minix 3 device drivers

• Structured around a main dispatch loop

while (1) { /* receive message from other processes, the kernel, or the hardware */ message = read_message(); process(message); }

while (1) { /* receive message from other processes, the kernel, or the hardware */ message = read_symbolic_data(); process(message); }

for (k=0; k < n; k++) { /* receive message from other processes, the kernel, or the hardware */ message = read_symbolic_data(); process(message); }

Minix drivers - evaluation

• Three versions of EXE:– Base – Write-set– RWset

• Fixed the # of iterations in each version– mainly to have the base version terminate

• Ran each version for an hour– recorded branch coverage + non-redundant states

Branch coverage (pci)

RWsetWrite-set

Branch coverage (lance)

RWsetWrite-set

Branch coverage (sb16)

Non-redundant states

RWset DFS

RWsetBase

Base EXE RWset EXE

Bugs in device drivers

Summary

• RWset analysis – efficient way to prune large numbers of

redundant paths– only values observed by the program trigger the

execution of new paths

• Evaluated on Minix drivers and medium size applications– big reduction in the number of paths explored

Questions?

Cristian Cadar, Peter Boonstoppel, Dawson Engler RWset: Attacking Path Explosion in Constraint-Based...

Documents

RWset: Attacking Path Explosion in Constraint-Based Test Generation

Proc. TACAS 2015, (c) Springer - SoSy-Lab · Proc. TACAS 2015, (c) Springer SoftwareVeriﬁcationandVeriﬁableWitnesses (ReportonSV-COMP2015) DirkBeyer UniversityofPassau,Germany

Annual Report 2014/15 Season - mermaidslsc.org.au · 2016. 11. 21. · Summer SurfGirl . Johanna Boonstoppel. Chaplain . Dale Penman. Surf Swim Coordinator . Graham Reid. Administrator

Multi-core symbolic bisimulation minimisation · Int J Softw Tools Technol Transfer (2018) 20:157–177 TACAS 2016

Combined Satisﬁability Modulo Parametric Theorieshomepage.cs.uiowa.edu/~tinelli/talks/Intel-07.pdfCombined Satisﬁability Modulo Parametric Theories . TACAS’07, 2007. S. Krstic

LNCS 4963 - RWset: Attacking Path Explosion in Constraint ... · RWset: Attacking Path Explosion in Constraint-Based Test Generation Peter Boonstoppel, Cristian Cadar, and Dawson

avantssar.eu The AVANTSSAR › events › fosad13 › slides › AVANTSSAR-fosad13.pdfthe Construction and Analysis of Systems (TACAS) 2012, LNCS 7214, p. 267-282. • David von Oheimb

SATELLITE EVENTS HAV Heap Analysis and Veriﬁcationjj/Etaps/flyer-ETAPS.pdftics, algorithms, data structures and methodologies can be discussed and explored. In doing so, TACAS aims

Welcome [ ] · PDF fileWelcome To the 32nd Annual International Festival ... 7:20 PM Turkish 12:45 PM Filipino 7:40 PM Lebanese 12:50 PM Chinese -TACAS 8:00 PM Chinese - CAFA 1:00

ML for ML: Learning Cost Semantics by Experimentankushd/docs/TACAS17_Pres.pdf · Jan Hoffmann (CMU) TACAS 2017 1 Machine Learning ML for ML: Learning Cost Semantics by Experiment

Online and Compositional Learning of Controllers with ...people.cs.aau.dk/~muniz/LMMST-TACAS-16.pdf · Online and Compositional Learning of Controllers with Application to Floor Heating

System Specification, Verification, and Synthesis CS 4830 ... › ~stavros › 01-intro.pdf• TACAS: Tools and Algorithms for the Construction and Analysis of Systems ... Temporal

Learning One-Clock Timed Automata · 2020-07-08 · TACAS Evaluation Artifact 2020 Accepted Learning One-Clock Timed Automata Jie An1(), Mingshuai Chen2,3 4, Bohua Zhan3,4, Naijun

Symbolic Execution and Model Checking for Testing · Symbolic Execution • JPF– SE [TACAS’03,’07] – Extension to JPF that enables automated test case generation – Symbolic

RWset: Attacking Path Explosion in Constraint-Based …cristic/talks/rwset-tacas-2008.pdf · Cristian Cadar, Peter Boonstoppel, Dawson Engler RWset: Attacking Path Explosion in Constraint-Based

Many-core on-the-fly model checking of safety properties ... · Int J Softw Tools Technol Transfer (2016) 18:169–185 DOI 10.1007/s10009-015-0379-9 TACAS 2014 Many-core on-the-ﬂy

Property-Guided Shape Analysis S.Itzhaky, T.Reps, M.Sagiv, A.Thakur and T.Weiss Slides by Tomer Weiss Submitted to TACAS 2014

Software Verification with Validation of Results€¦ · Proc. TACAS 2017, c Springer SoftwareVeriﬁcationwithValidationofResults (ReportonSV-COMP2017) DirkBeyer LMUMunich,Germany

RWset: Attacking Path Explosion in Constraint-Based Test ...€¦ · maximum branch coverage • Mixed symbolic/concrete execution (EXE/DART) • Search heuristics – Best First

OUR COMPANY · Client: Inmobiliaria Las Tacas S.A. Puntilla de Villarica, Los Lagos Region Client: Inmobiliaria Mogulemo Ltda. ADVICE AND EXPERTISE We develop specialized advice and