Polar is a static analyzer for probabilistic while-loops. Classical loops without probabilities are special cases of probabilistic loops. Therefore, Polar can analyze classical loops as well. Its central functionality is the computation of closed-form formulas for variables within a loop. Consider this simple example:
a,b = 0, 1
while true:
a, b = b, a + b
end
This non-probabilistic loop calculates all Fibonacci numbers indefinitely. Although it never terminates,
Polar can still analyze it. For variable a
, Polar computes the following formula:
a = 0, 1, -sqrt(5)*(1/2 - sqrt(5)/2)**n/5 + sqrt(5)*(1/2 + sqrt(5)/2)**n/5
This means a starts at 0
, then 1
, with all subsequent values succinctly represented in the formula.
This formula provides the n
th Fibonacci number for any number of loop iterations n
.
Beyond computing formulas for loop variables, Polar offers additional functionalities. For example, it can compute invariants for loops, which are relationships between loop variables that remain true before and after every iteration. For the Fibonacci example, Polar provides the following invariant:
a**4 + 2*a**3*b - a**2*b**2 - 2*a*b**3 + b**4 - 1 = 0
Polar also handles probabilistic loops. Here's another example:
x,y = 0, 0
while true:
x = x + 1 {1/2} x - 1
y = y + 1 {1/2} y - 1
end
This loop represents two symmetric random walks. Both x
and y
start at 0
and are independently incremented or
decremented by 1
with a probability of 1/2
. Polar provides formulas for moments such as the expected values
of these variables:
E(x) = 0
E(y) = 0
Both expected values remain 0
.
However, Polar can also calculate higher moments, like variances, which vary in this example:
c2(x) = n
c2(y) = n
The variances (second central moments) of both x
and y
follow the formula n
.
Therefore, the number of loop iterations n
affects the variances of these variables.
Polar computes invariants for this probabilistic loop as well,
focusing on moments of program variables rather than their values.
For example, it computes the following invariants for the expected values and variances of x
and y
:
E(y) = 0
c2(x) - c2(y) = 0
E(x) = 0
Based on the functionality to compute closed-form formulas for moments of loop variables, Polar provides many more functionalities to statically analyze such loops. You can use Polar through its CLI interface or integrate it into your Python files (see usage).
To install Polar locally, you can perform the following steps.
-
Ensure you have python (version ≥ 3.9) and pip installed. If not, install them as preferred.
-
Clone the repository:
git clone [email protected]:probing-lab/polar.git
cd polar
- Create a virtual environment in the
.venv
directory:
pip install --user virtualenv
python -m venv .venv
- Activate the virtual environment:
source .venv/bin/activate
- Install the required dependencies:
pip install -r requirements.txt
The problems Polar can solve are uncomputable for arbitrary probabilistic loops. This means that an algorithm and a tool capable of handling any loop do not exist. Consequently, Polar imposes certain restrictions on the loops it can analyze. However, if an input loop meets these restrictions, Polar is guaranteed to be able to analyze it, although the process can be time-consuming. In many cases, though, Polar is quite efficient. The loops for Polar are written in a custom language. Here's an example:
x,y = 1,0
while true:
c1 = Bernoulli(1/2)
c2 = Bernoulli(1/2)
if c1 + c2 < 2:
y = y + 1 {1/2} y - 2 {1/3} y
g = Normal(y,1)
x = x + g**2
end
end
Generally, the input loops for Polar consist of a section for initial variable assignments followed by a while loop.
Standard arithmetic is allowed in variable assignments, using Python syntax.
Additionally, you can draw from common probability distributions such as Bernoulli
and Normal
.
Polar also supports expressions for probabilistic choice. For instance, y
is assigned y+1
with probability 1/2
,
y-2
with probability 1/3
, and to y
otherwise. The use of if
statements, if-elif-else
statements,
and loop guards are also possible.
For Polar to analyze input loops, they must adhere to the following restrictions:
- All variables in if-conditions and the loop guard must only assume finitely many values.
- All probabilities and distribution parameters must be constant
- Non-linear variable dependencies must be acylcic.
Ad restriction 1:
In the example, both c1
and c2
are drawn from Bernoulli distributions and hence are only 0
or 1
.
It is acceptable to use them in conditions and guards. However, y
cannot be used in conditions and guards, as
it could potentially assume any natural number.
Ad restriction 2:
For some distributions, such as Normal
, it is permissible to use non-constant distribution parameters.
Ad restriction 3:
The restriction forbids variables v1
, ... vk
, where v1
depends on v2
, v2
on v3
, ...,
vk
depends on v1
, with at least one of these dependencies being non-linear.
In the example x
depends non-linearly on g
(through g**2
in the assignment of x
).
However, since g
does not depend on x
this non-linear dependency is acceptable.
There is technically one more restriction: programs can only draw from distributions for which all moments exist. However, the syntax for the input loops only allows such distributions.
If your input program satisfies these restrictions, Polar theoretically guarantees its analyzability. Dropping any these restrictions leads to uncomputability or serious hardness barriers. For more details on the hardness barriers see your publication Strong Invariants Are Hard.
For more details on the syntax of input programs and Polar itself see your publication This Is the Moment for Probabilistic Loops.
Polar's core functionality lies in computing closed-form formulas for variables within a (probabilistic) loop.
These formulas, parameterized by the number of loop iterations n
, provide the value of a given program variable after n
iterations.
In probabilistic loops, instead of direct values, the formulas describe the moments of program variables.
For example, to compute the expected value of the variable x
in a probabilistic loop located in documentation/loops/loop.prob
,
execute the following command:
python polar.py documentation/loops/loop.prob --goals "E(x)"
In this case, Polar provides the following closed-form formula for the expected value of x
:
E(x) = 1; n**3/256 + 133*n**2/256 + 205*n/128 + 1
This means that the expected value of x
is initially 1
and subsequent values are succinctly represented
by the formula parameterized by the number of loop iterations n
.
The parameter goals
accepts expected value of monomials of program variables, for instance "E(x**2)"
, "E(t*x**2)"
, and so on.
To compute the variance use "c2(x)"
.
For the kth central moment (where k is a natural number), use "ck(x)"
.
To compute polynomial invariants among the expected values and variances of x
and y
, include all these moments
in the goal
parameter and additionally pass the invariant
parameter:
python polar.py documentation/loops/loop.prob --goals "E(x)" "E(y)" "c2(x)" "c2(y)" --invariants
Polar will then output formulas for all the moments specified in the goals
parameter, along with the following three invariants:
E(x) - 1024*c2(y)**3/658503 - 2128*c2(y)**2/7569 - 205*c2(y)/174 - 1 = 0
E(y) + 8*c2(y)/87 = 0
c2(x) - 6864896*c2(y)**5/4984209207 - 22626304*c2(y)**4/171869283 - 1252660*c2(y)**3/1975509 - 291097*c2(y)**2/181656 - 131467*c2(y)/33408 = 0
These are not just any invariants; they form a basis for all polynomial invariants among the moments specified in goals
.
For more details on why these constitute a basis, refer to moment invariant ideals in the paper Strong Invariants Are Hard.
To compute invariants for non-probabilistic loops, such as the loop that calculates Fibonacci numbers, omit the goals
parameter:
python polar.py documentation/loops/fibonacci.prob --invariants
In this case, Polar provides the following invariant:
a**4 + 2*a**3*b - a**2*b**2 - 2*a*b**3 + b**4 - 1 = 0
This invariant is a basis for all polynomial invariants among the program variables a
and b
. As a
and b
represent two consecutive Fibonacci numbers in this example, the invariant encapsulates all polynomial identities
that hold between two consecutive Fibonacci numbers.
For a list of all parameters supported by Polar run:
python polar.py --help
- Computing closed-form formulas
- Computing invariant ideals
- Parameters & Sensitivity Analysis
- Loop Guards & Moments after termination
- Unsolvable Loop Analysis
You can run the automatic test suite with:
python -m unittest
When contributing to Polar, please run the following command after cloning the repository:
pre-commit install
The command installs a few hooks that execute before every commit and enforces some code style guides.
If you are writing a scientific paper and want to cite Polar, please cite the following publication:
@article{DBLP:journals/pacmpl/MoosbruggerSBK22,
author = {Marcel Moosbrugger and
Miroslav Stankovic and
Ezio Bartocci and
Laura Kov{\'{a}}cs},
title = {This is the moment for probabilistic loops},
journal = {Proc. {ACM} Program. Lang.},
volume = {6},
number = {{OOPSLA2}},
pages = {1497--1525},
year = {2022},
url = {https://doi.org/10.1145/3563341},
doi = {10.1145/3563341},
timestamp = {Mon, 05 Dec 2022 13:35:13 +0100},
biburl = {https://dblp.org/rec/journals/pacmpl/MoosbruggerSBK22.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}