Delta Debugging

Once we have reproduced a program failure, we must find out what is relevant:

What does the failure actually depend on. We should simplify test cases to make debugging easier so we can identify what faults depend on, and make test cases easier to communicate.

To simplify…

Use binary search to cut a test case in half and iterate
We can automate this
- Using a binary search, throw away half the input and see if the output is wrong, if not go back to the previous state and discard the other half of the input. Repeat until we have simplified our complex input to our minimally simplified input
- If both halves pass, instead of a binary, we can divide our input into many subsets (our deltas), each subset having a small change. This increases the change of finding the failing input subset but is slower
- In general, start with few and large changes and increment to more and smaller changes
Let $R$ be the set of possible inputs
$r_{P} \in R$ corresponds to an input that passes
$r_{F} \in R$ corresponds to an input that fails
We let R denote the set of all possible inputs
We can go from one input r1 to another input r2 by a series of changes
A change $δ$ is mapping $R \to R$ which takes one input and changes it to another input
A change $δ$ can be decomposed to a number of elementary changes $δ_{1}, δ_{2}, \dots, δ_{n}$ where $δ$ is the composition of each sub-delta function in a left to right order (we apply $δ_{i}$ to $r$ then apply $δ_{j}$ to that output)

In summary

We have an input without failure $r_{P}$
We have an input with failure $r_{F}$
We have a set of changes $C_{F} = {δ_{1}, δ_{2}, \dots, δ_{n}}$ such that $r_{f} = (δ_{1} \circ δ_{2} \circ \dots \circ δ_{n}) (r_{P})$
Each subset c of $C_{F}$ is a test case

Given a test case c we should like to know if the input generated by applying changes in c to $r_{P}$ causes the same failure as $r_{F}$ .

We define the function $P o w erse t (c_{F}) = {P, F, ?}$ such that, given $c = {δ_{1}, δ_{2}, \dots, δ_{n}} \subseteq c_{F}$ , $t es t (c) = F$ iff $(δ_{1} \circ δ_{2} \circ \dots \circ δ_{n}) (r_{P})$ is a failing input

Hint

Once again, the whole point of this is to minimize our test cases so they are not so big

We want to find the smallest test case c such that $t es t (c) = F$ , a failing test case is called the global minimum of $c_{F}$ if for all $c^{'} \subseteq c_{F}$ , $∣ c^{'} ∣ < ∣ c ∣ ⟹ t es t (c^{'}) \neq = F$ .

The global minimum is the smallest set of changes which will make the program fail.

Finding the global minimum may require performing an exponential number of tests. Instead of looking for this global minimum, we can search for a 1-minimal input which is a set of changes that cause the failure but removing any change causes the failure to go away.

A failing test case $c \subseteq c_{F}$ is called a local minimum of $c_{F}$ if: for all $c^{'} \subset c, t es t (c^{'}) \neq = F$ . A failing test case $c \subseteq c_{F}$ is n-minimal if for all $c^{'} \subset c, ∣ c ∣ - ∣ c^{'} ∣ \leq n ⟹ t es t (c^{'}) \neq = F$ and it is 1-minimal if for all $δ_{i} \in c, t es t (c - δ_{i}) \neq = F$ .

The main trade off here is minimality guarantees vs computational costs. 1 minimality is the sweet spot.

To find a 1-minimal subset of c:

If for all changes in the test case, if you remove a a change (try this on each change) and the failure disappears, then c is 1-minimal
Otherwise, if removing the element still causes a failure, we found a smaller subset, recurse

Runtime

In the worst case, we remove one element from the set per iteration, after trying every other element:

Work is potentially $N + (N - 1) + (N - 2) + \dots$ This is $O (N^{2})$

It is silly to remove one element at a time, we can try to Divide and Conquer by dividing the change set in 2 initially and increase the number of subsets if we can’t make progress.

Minimization Algorithm

This delta debugging algorithm finds a 1-minimal test case: The idea is:

Partition the set $c_{F}$ to $Δ_{1}, Δ_{2}, \dots, Δ_{n}$ roughly equal chunks
$Δ_{1}, Δ_{2}, \dots, Δ_{n}$ are pairwise disjoint and $c_{F} = Δ_{1} \cup Δ_{2} \cup \dots \cup Δ_{n}$
The complement of $Δ_{i}$ is defined as $\nabla_{i} = c_{F} - Δ_{i}$
Start with n = 2
Test each test case defined by each partition and its complement
Reduces the test case if a smaller failure inducing set is found, otherwise it refines the partition Steps

Start with n=2 and $Δ$ test set
Test each $Δ_{1}, Δ_{2}, \dots, Δ_{n}$ and each $\nabla 1, \nabla_{2}, \dots, \nabla_{n}$
There are three possible outcomes:
1. Some $Δ_{i}$ causes failures: go to step 1 with $Δ = Δ_{i}$ and n = 2
2. Some $\nabla_{i}$ causes failures: Go to step 1 with $Δ = \nabla_{i}$ and n = n-1
3. No test causes failure: If granularity can be refined, go to step 1 with n = n*2, otherwise you found the 1 minimal set The worst case here is still quadratic but single failures converge in log N time.

🤖 Dan Huynh

Recent Notes

Dan Huynh

Linearity

CAP Theorem

Causality

Quorum Reads and Writes

Explorer

Delta Debugging

Minimization Algorithm

Graph View

Recent Notes

Dan Huynh

Linearity

CAP Theorem

Backlinks