PSI is a tool for performing exact inference on probabilistic programs. Given a probabilistic program, the tool produces an expression for the joint posterior distribution of the expressions computed by the program.
The PSI solver is described in the following research papers (see also http://psisolver.org):
- CAV 2016: "PSI: Exact Symbolic Inference for Probabilistic Programs" (original publication)
- PLDI 2020: "λPSI: Exact Inference for Higher-order Probabilistic Programs" (extension to higher-order inference)
PSI does not currently try to make any guarantees on backwards compatibility.
If your project depends on PSI, write down the exact commit hash to ensure others can reproduce your results.
The commit of HEAD at build time is displayed when running ./psi --help
.
-
Run
dependencies-release.sh
to download the LDC D compiler and extract the archive. -
Run
build-release.sh
to build PSI.
$ ./dependencies-release.sh && ./build-release.sh
PSI is written in the D programming language. D compilers are available at http://dlang.org/download.html.
./build.sh
will use DMD inside the PSI directory if it exists, otherwise it will attempt to use DMD from your path.
./build.sh
creates a debug build.
./build-release
creates a release build.
Build with docker build . -t psi
, then launch the container with docker run -it psi
.
You can share folders between your file system and the container using the -v
option: docker run -it psi -v host/path:container/path
.
You can also install psi using homebrew. See the below link for more information: https://github.com/bvoq/homebrew-bvoq/
The build instructions given here are for GNU/Linux. PSI also works on other platforms. Feel free to write a pull request with working build scripts for your favourite platform.
cd test
./runtests
Run ./psi example.psi
, where example.psi
is a file containing the source code for the model.
The next section ("Quick language guide") briefly introduces the most important language features.
Run ./psi --help
to display information about supported command-line options.
$ cat example.psi
def main(){
a := gauss(0,1);
b := gauss(0,1);
return if a > b { a } else { b };
}
$ psi example.psi
p(r) = (d/dx)⁻¹[e^(-x²)](r·⅟√̅2)·⅟e^(1/2·r²)·⅟π·√̅2
$ psi example.psi --mathematica --cdf
p[r1_] := 1/Pi*Sqrt[Pi]*(Erf[1/2^(1/2)*r1]+1)/2^2
$ cat coin_bias_small.psi
def main(){
p := uniform(0,1);
x1 := flip(p);
observe(x1==1);
x2 := flip(p);
observe(x2==0);
x3 := flip(p);
observe(x3==1);
return p;
}
$ psi coin_bias_small.psi --plot
p(p) = (-12·p³+12·p²)·[-1+p≤0]·[-p≤0]
plotting... (PDF)
command:
set pointsize 20
set xlabel "p"
set ylabel "density"
set samples 500, 500
set isosample 200
unset key
plot [-10:10] (-1+x<=0)*(-30*x+30)*(-x<=0)*x**4
PSI operates on files with the *.psi extension.
Such a file contains one or multiple procedure definitions:
def procedureName(param1,param2,..., paramn){
statements
}
There should be a procedure of name main
, denoting the program entry and exit point.
Function bodies consist of a sequence of statements, possibly referencing the variables param1,param2,...,paramn.
-
variable := expression;
introduces a new variable of namevariable
-
variable = expression;
assigns to an existing variable.+=
,-=
,*=
,/=
,div=
,%=
,^=
,&&=
,||=
, ... are also supported. -
if expression { statements } else { statements }
is a conditional.else { statements }
is optional -
repeat expression { statements }
repeats the given statements a number of times given by the expression. The expression should be an integer constant. -
for variable in [start..end){ statement }
is a for-loop over a range. Start and end should be integer constants. If the exact number of iterations is not known ahead of time, an if statement can be nested within the for loop in order to cut off some of the iterations. -
while expression { statement }
is a while-loop. Currently, while loops are only supported with the --dp switch and they must have bounded running time. -
observe(expression);
conditions the joint probability distribution of the program variables on the event that the expression is not equal to zero Note: The probability that the expression is not equal to zero should be positive (enforced by PSI during normalization). -
cobserve(expression,value);
conditions the joint probability distribution of the program variables on the event that the given expression equals the given value. The probability that the variable is equal to the expression must be zero, but the probability density at that point must be positive. Note: This is not currently enforced automatically by PSI, therefore some care is required when using this primitive. -
assert(expression);
puts the program in the error state in case the expression is equal to zero (PSI will then report the probability of error). -
return expression;
terminates the procedure with the given result. If no return statement is provided at the end of a procedure, return (); is automatically inserted.
a := readCSV(filename);
where filename is a string literal (such as "data.csv"), reads the given file containing a comma-separated list of values into a read-only arraya
. Indexing into such arrays requires deterministic integer indices.
-
The following operators are supported and denote the corresponding operations on real numbers with standard C operator precedence (where applicable):
expression + expression
,expression - expression
,expression * expression
,expression / expression
,expression div expression
,expression ^ expression
,expression | expression
,expression ⊕ expression
,expression & expression
-expression
expression < expression
,expression > expression
,expression <= expression
,expression >= expression
,expression == expression
,expression != expression
(div
is integer division. a div b
is the same as floor(a/b)
. ⊕
is bitwise xor. The non-unicode syntax is a xorb b
.)
-
There are logical operators
!
,&&
and||
, also with standard C operator precedence. (Warning: with the--dp
switch,&&
and||
do not currently use short-circuit evaluation.) -
Conditional expressions are supported:
if expression { expression } else { expression }
. Only one branch is evaluated. -
Expressions can be parenthesized in order to change order of precedence, e.g.
a*(b+c)
.
Primitive distributions can be sampled arbitrarily within expressions. They are exposed as procedures with special names. If arguments do not conform to the constraints of the distribution, the program enters the error state.
-
gauss(mean,variance)
samples from the Gaussian distribution with the given mean and non-negative variance. -
uniform(a,b)
samples an uniform real number between a and b, with a<=b. -
bernoulli(p)
is 1 with probability p and 0 with probability 1-p, where 0<=p<=1. -
uniformInt(a,b)
samples an uniform integer between a and b. There should be at least one integer n with a<=n<=b. -
categorical(p)
is i with probability p[i] for i in [0..p.length). p should be an array of real numbers of positive length. -
exponential(rate)
samples from the Exponential distribution with the given positive rate (inverse scale). -
beta(alpha,beta)
samples from the Beta distribution with the given positive shape parameters. -
gamma(shape,rate)
samples from the Gamma distribution with the given shape and rate. -
laplace(location,scale)
samples from the Laplace distribution with the given location and positive scale. -
cauchy(location,scale)
samples from the Cauchy distribution with the given location and positive scale. -
pareto(shape,scale)
samples from the Pareto distribution with the given shape and scale which should be positive real numbers. -
studentT(degrees_of_freedom)
samples from the StudentT distribution with the given positive degrees of freedom parameter. -
weibull(scale,shape)
samples from the Weibull distribution with the given positive scale and shape parameters. -
rayleigh(sigma_squared)
samples from the Rayleigh distribution with the given non-negative squared scale parameter. -
binomial(n,p)
samples from the Binomial distribution with the given number of trials and success probability. -
negBinomial(r,p)
samples from the Negative Binomial distribution with the given number of failures and success probability. -
geometric(p)
samples from the Geometric distribution with the given success probability. Returns the number of failures before the first success. -
poisson(mean)
samples from the Poisson distribution with the given positive mean. (At this time, expect less-than optimal simplification from PSI for expressions generated by programs sampling from this distribution.)
floor
,ceil
,exp
,log
,abs
.
-
sampleFrom("(variable1,...,variablen;parameter1,...,parameterm) => pdf",argument1,...,argumentm): type
This primitive samples an n-tuple of values from the given probability density function. The density function can depend both on the values and a set of parameters. To specify the pdf, the mathematical operators+-*/^
, the constante
, the dirac deltadelta(expr)
, absolute values|expression|
and Iverson brackets of the shape[expression<=expression]
,[expression<expression]
,[expression=expression]
and[expression != expression]
can be used, among others.Example:
x := sampleFrom("(x;y) => [-y<=x]*[x<=y]*x^2",y); (x,y,z) := sampleFrom("(x,y,z;w) => [-1<=x]*[x<=1]*[-1<=y]*[y<=1]*|x|*|y|*delta(z-w)",w);
Note: PSI does not currently verify that the given density expression is in fact a generalized probability density, so some care is required when using this primitive. Furthermore, PSI will trust any type annotation on sampleFrom. The default type is ℝ.
-
Marginal(expression): Distribution[type]
Returns the Marginal distribution of the given expressions, conditioned on the execution path in the current function. sample(Marginal(expression)) creates an independent value with the same marginal distribution as expression. -
Expectation(expression)
Shortcut forexpectation(Marginal(expression))
Note: PSI does not currently verify that the expectation of the given expression in fact exists, so some care is required when using this primitive. (e.g.Expectation(gauss(0,1)/gauss(0,1))
does not converge.)
def swapped(a,b){
return (b,a);
}
def main(){
(z,w) = swapped(1,2);
return ((z,w),swapped(z,w)); // p(r₁,r₂) = δ_r₁[(2,1)]·δ_r₂[(1,2)]
}
x := [1,2,3,4,5,6]; // declare array
return x[uniformInt(0,x.length-1)]; // index randomly into array
y := ([] : R[]); // declare empty array of real numbers
z := x ~ [1]; // y is the concatenation of x with [1]. ~= is also supported
v := array(uniformInt(1,3),[1,2,3]); // declare array of arrays of random size, initialized with [1,2,3] at all indices
w := x[2..x.length-1]; // slice array (w is [3,4,5])
Note: Verifying that array indices and lengths are integers within the array bounds can be costly. Use --noboundscheck to disable bounds checking at the cost of undefined results in case a bounds violation actually occurs with positive probability.
The length of an array a
can be obtained using the expression a.length
.
PSI functions can be used as first-class values:
def const(x: R){
def retX(y: R){
return x;
}
return retX;
}
Lambda syntax can be used to create anonymous function values:
def const(x:R){
return (y: R){ return x; }
}
Function bodies that return an expression can be abbreviated using =>
syntax:
def const(x:R)=>(y:R)=>x;
PSI function declarations and lambdas can have an arbitrary number of argument lists:
def const(x:R)(y:R)=>x;
Square argument lists introduce generic type parameters:
def const[a](x:a)[b](y:b)=>x;
def main(){
return const[R](3)[R x R x R](1,2,3);
}
Generic type parameters are deduced automatically when the square argument list is omitted:
def const[a](x:a)[b](y:b)=>x;
def main(){
return const(3)(1,2,3);
}
def id[a](x:a) => x;
def main(){
return (id(2), id(id)(1,2)); // p(r₁,r₂) = δ[-r₁+2]·δ_r₂[(1,2)]
}
PSI supports the built-in functions:
infer[a](f: 1 -> a): Distribution[a]
sample[a](p: Distribution[a]): a
expectation(p: Distribution[R]): R
errorPr(p: Distribution[R]): R
The function infer
performs nested inference on the given program f
and returns the distribution of its results. (This is precisely what PSI does, but with infer
it can be used arbitrarily within a PSI program.)
Note: For technical reasons that will be resolved soon, the implementation of error tracking in the presence of nested inference is very slow. Use the --nocheck
flag to disable error tracking and improve performance, at the cost of getting undefined results if an error actually occurs with positive probability.
The function sample
samples a value from the given distribution.
The function expectation
computes the expectation of the given distribution.
Note: PSI does not currently verify that the expectation of the expression in fact exists, so some care is required when using this primitive. (e.g. expectation(infer(()=>gauss(0,1)/gauss(0,1)))
does not converge.)
The function errorPr
computes the probability of error for the given distribution.
Any expression can be annotated with a type using the (expression : Type)
annotation.
A type can be R
(real numbers), a Type[]
, the type of a tuple Type1 x Type2 x ... x Typen
, Distribution[a]
, a function type, or the name of a custom data type (see below).
The empty tuple has type 1
.
One can annotate method return types:
def main(): R x R{
return (0,1);
}
All built-in functions (except operators) are implemented in terms of the sampleFrom
special expression.
Their implementations can be found in library/prelude.psi
(or library/prelude-nocheck.psi
if PSI is called with --nocheck
).
By default, PSI will attempt to read the prelude library from a library
folder in the same location as the binary. (This can be overridden by changing the default import path in psi.d
.)
Both library/prelude.psi
and library/prelude-nocheck.psi
are baked into the PSI binary at build time, to be used as a fallback.
E.g.
dat Foo{
x : R;
y : R[];
def Foo(x: R, y: R[]){ // constructor
this.x = x;
this.y = y;
}
def method(){
y[1]+=y[0];
y[0]+=x;
return (x,y[0]);
}
}
def main(){
x := Foo(1,[1,2,3]);
return x.method(); // p(r₁) = δ_r₁[(1,2)]
}
This section gives a quick overview of the source code.
psi.d:
Contains the program entry point, interprets the command line arguments and calls the other components.
expression.d, type.d, declaration.d:
Contains the declaration of source language AST nodes.
lexer.d, parser.d:
Lexer and parser for the source language. (Note: Contains some cruft as it was quickly adapted from a parser for the D programming language.)
semantic_.d:
Semantic analyzer, performs type checking and annotates the AST generated by the parser.
backend.d:
Defines the backend interface.
symbolic.d:
Symbolic backend. Translates the annotated AST to distribution expressions.
distrib.d:
Contains the declaration of generalized probability density functions for the primitive distributions supported by PSI. Furthermore, this file contains the class Distribution which encapsulates a probability distribution over program variables with an additional error state.
dexpr.d:
Contains the implementation of the intermediate language of distribution expressions and most of the simplification code.
dparse.d:
Parser for distribution expressions. (Used for debugging and the sampleFrom primitive, aims to invert the default toString method for distribution expressions, while also allowing different notations).
differentiation.d:
Symbolic differentiation engine.
integration.d:
Symbolic integration engine.
summation.d:
Symbolically evaluates sum expressions.
limits.d, asymptotics.d:
Limit evaluation engine.
terminal.d:
Simple colored terminal output.
hashtable.d:
Hash table implementation with optimization for small tables.
help.d:
Implements the --help
option and a few other options.
util.d:
Utility functions not available in the standard library (not everything in this file is used by PSI).
Add a *.psi
file into the test
directory.
You are encouraged to cite the CAV 2016 and PLDI 2020 publications using the following BibTex entries if you use PSI for academic research.
@inproceedings{gehr2016psi,
title={{PSI}: Exact Symbolic Inference for Probabilistic Programs},
author={Gehr, Timon and Misailovic, Sasa and Vechev, Martin},
booktitle={International Conference on Computer Aided Verification},
pages={62--83},
year={2016},
organization={Springer}
}
@inproceedings{gehr2020lpsi,
title = {$\lambda${PSI}: Exact Inference for Higher-Order Probabilistic Programs},
author = {Gehr, Timon and Steffen, Samuel and Vechev, Martin},
doi = {10.1145/3385412.3386006},
booktitle = {Proceedings of the 41st {ACM} {SIGPLAN} {Conference} on {Programming} {Language} {Design} and {Implementation}},
publisher = {ACM},
month = jun,
year = {2020}
}