
Foundations for Verification of Continuous Programs with Caesar

Philipp Schroer
Caesar Developer

The paper "Foundations for Deductive Verification of Continuous Probabilistic Programs: From Lebesgue to Riemann and Back" by Kevin Batz, Joost-Pieter Katoen, Francesca Randone, and Tobias Winkler was recently published at OOPSLA 2025.

Before this work, Caesar could only verify discrete probabilistic programs, i.e. programs that sample exclusively from discrete distributions. This paper lays the formal foundations for verifying probabilistic programs that sample from continuous distributions, with support for loops and conditioning. One challenge is to integrate the integrals for expected values that arise from continuous distributions into Caesar's deductive verification framework. The key idea is to soundly under- or over-approximate these integrals via Riemann sums. In addition to theoretical results such as convergence and completeness of the approach, the paper also provides case studies of continuous probabilistic programs that are encoded in HeyVL and verified with Caesar.

In this post:

  1. Encoding Riemann Sums in HeyVL
  2. Tortoise-Hare Race Example
  3. Beyond The Continuous Uniform Distribution

Encoding Riemann Sums in HeyVL

The HeyVL encoding of the Riemann sum approximation is simple. Consider sampling the variable x from the continuous uniform distribution over the interval [0,1].

The original sampling interval [0,1] is divided into N subintervals, where N has to be a chosen integer literal. The encoding then looks as follows:

var x: UReal
var j: UInt = unif(0, N - 1)
cohavoc x
coassume ?!(j / N <= x && x <= (j+1) / N)

We sample a random integer j from the discrete uniform distribution over {0, 1, ..., N-1} using the built-in unif distribution. The sampled integer j then selects the subinterval I = [j/N, (j+1)/N]. To over-approximate the expected value on the subinterval I, we nondeterministically select a value x ∈ I that maximizes the expected value. This is done using the cohavoc and coassume statements.
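In other words (this is just the explanation above written as a formula, not notation from the paper or from Caesar's documentation), the last two statements replace the current post-expectation f by its supremum over all values of x in the selected subinterval:

$$\sup_{x \in [j/N,\, (j+1)/N]} f(x)$$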

Increasing the parameter N leads to a more precise approximation of the expected value, but incurs a slowdown in verification time.

The above encodes the Riemann sum over-approximation of the expected value of a function f when sampling uniformly from the continuous interval [0,1]. Formally:

$$\int_0^1 f(x) \, dx \;\leq\; \sum_{j=0}^{N-1} \frac{1}{N} \cdot \sup_{x \in [j/N,\, (j+1)/N]} f(x)$$
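As a quick sanity check (our own example, not from the paper), take f(x) = x, whose exact expected value under uniform sampling from [0,1] is 1/2. The over-approximation evaluates to

$$\sum_{j=0}^{N-1} \frac{1}{N} \cdot \sup_{x \in [j/N,\, (j+1)/N]} x \;=\; \sum_{j=0}^{N-1} \frac{1}{N} \cdot \frac{j+1}{N} \;=\; \frac{N+1}{2N},$$

which is 0.5625 for N = 8 and converges to the exact value 1/2 as N grows.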

A dual version of the encoding can be used to obtain an under-approximation of the expected value: simply use havoc and assume statements instead of cohavoc and coassume, and use ?(...) instead of ?!(...).
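Concretely, the under-approximating version of the encoding from above would read as follows (our sketch, obtained by exactly the substitution just described):

var x: UReal
var j: UInt = unif(0, N - 1)
havoc x
assume ?(j / N <= x && x <= (j+1) / N)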

A more detailed explanation can be found in Section 9.1 of the paper.

Tortoise-Hare Race Example

Example 9.2.3 from the paper models a race between a tortoise and a hare. As long as the hare has not overtaken the tortoise, the hare flips a fair coin to decide whether to move or not. If the hare decides to move, it samples a distance uniformly at random from [0, 10]. The tortoise always moves exactly one step. The following HeyVL code encodes a proof that $\mathrm{wp}\llbracket\texttt{tortoise\_hare}\rrbracket(\texttt{count}) \leq 3.38 \cdot (t - h + 2)$, where h and t are the initial positions of the hare and the tortoise, respectively. Here, we chose the parameter N = 8 for the Riemann sum approximation.

coproc tortoise_hare(init_h: UReal, init_t: UReal) -> (count: UReal)
    pre 3.38*((init_t - init_h) + 2)
    post count
{
    var h: UReal = init_h
    var t: UReal = init_t

    count = 0

    @invariant(ite(h <= t, count + 3.38*((t-h) + 2), count))
    while h <= t {
        var choice: Bool = flip(0.5)
        if choice {

            // -------------------------------------------
            // Over-approximating the continuous sampling:
            // inc = unif[0,1]
            //
            var inc: UReal
            var N: UInt = 8
            var j: UInt = unif(0, 7) // discrete sampling
            cohavoc inc
            coassume ?!(j / N <= inc && inc <= (j+1) / N)
            // -------------------------------------------

            inc = 10 * inc
            h = h + inc
        } else {}

        t = t + 1
        count = count + 1
    }
}

Section 9.3 of the paper contains a detailed experimental evaluation with more examples. An associated artifact is available on Zenodo.

Beyond The Continuous Uniform Distribution

In the paper, only a statement for sampling from the continuous uniform distribution is provided. However, as Remark 1 of the paper notes, this does not limit expressiveness: via inverse transform sampling, samples from any continuous distribution can be obtained from uniform samples. This paper by Marcin Szymczak and Joost-Pieter Katoen contains examples of how to sample from, e.g., the normal distribution.
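As a standard textbook illustration (not an example from either paper): if U is uniformly distributed on [0,1] and F is the cumulative distribution function of the target distribution, then F⁻¹(U) is distributed according to F. For the exponential distribution with rate λ, for instance,

$$X \;=\; F^{-1}(U) \;=\; -\frac{\ln(1-U)}{\lambda} \;\sim\; \mathrm{Exp}(\lambda).$$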