Dustin Lennon

Dustin Lennon

Applied Scientist

statistics theory probability integral transform proof

Probability Integral Transform, A Proof

This post elucidates the connection between the generalized inverse, the cdf, the quantile function, and the uniform distribution.

Dustin Lennon
September 2018
September 2018

\[ \DeclareMathOperator*{\min}{min} \newcommand{disteq}{ \overset{D}{=} } \]

This work was originally published as an Inferentialist blog post.



The probability integral transform is a fundamental concept in statistics that connects the cumulative distribution function, the quantile function, and the uniform distribution. We motivate the need for a generalized inverse of the CDF and prove the result in this context.



Suppose \(U\) is the uniform distribution on \([0,1]\) , and \(F\) is the cumulative distribution of the random variable, \(Y\) , i.e.

\[ \begin{gather*} P\left( Y \leq y \right) = F(y) \end{gather*} \]

and define the inverse cdf — the quantile function — as

\[ F^{-1}(u) = \min \left\{ y : F(y) \geq u \right\} \]

Figure 1: a cumulative density function
Figure 1: a cumulative density function

Recall that probability distributions — cdfs — are right continuous and monotonically increasing. However, as the example shows, they may also have flat sections and discontinuities. Indeed, the flat sections motivate the definition of the inverse cdf as a minimum. Thus, in the example, while \(F\) maps every point in the interval \([0,1]\) to \(0.5\), the preimage of the same interval is the single point, \(0\).

The Main Result

The Main Result

The key result is:

\[ P\left(F^{-1}(U) \leq y \right) = F(y) \]

That is, \(F^{-1}(U)\) and \(Y\) have the same distribution.


The proof isn’t particularly complicated, but it relies on two identities that follow from the definition of the inverse cdf.


\[ u \leq F(F^{-1}(u)) \]

This follows from the definition as \(F^{-1}(u)\) is the smallest value of \(y\) for which \(F(y) \geq u\).


\[ F^{-1}(F(y)) \leq y \]

Using the definition, write the left hand side, with a change of index, as

\[ \begin{align*} F^{-1}(F(y)) & = \min \left\{ z : F(z) \geq F(y) \right \} \\ & \leq y \end{align*} \]

The inequality follows from the following argument. The flat section of the example shows that it is possible to have \(z\) less than \(y\). However, right-continuity implies that \(z\) may never exceed \(y\).

The main result is a statement about probabilities. As such, we can proceed by showing the following:

\[ F^{-1}(U(\omega)) \leq y \iff U(\omega) \leq F(y) \]

In what follows, let \(u = U(\omega)\).

The forward implication

We first show

\[ F^{-1}(u) \leq y \implies u \leq F(y) \]

Suppose \(F^{-1}(u) \leq y\).

Then, since \(F\) is monotonic, we can apply it to both sides of the inequality without issue:

\[ F(F^{-1}(u)) \leq F(y) \]

Combining this with the first of the above identities,

\[ u \leq F(F^{-1}(u)) \leq F(y) \]


\[ F^{-1}(u) \leq y \implies u \leq F(y) \]

The reverse implication

The argument is similar. Note that \(F^{-1}\) is also monotonic. Thus, \(u \leq F(y)\) implies

\[ \begin{align*} F^{-1}(u) & \leq F^{-1}(F(y)) \\ & \leq y \end{align*} \]

and the result follows after applying the second of the above identities:

\[ u \leq F(y) \implies F^{-1}(u) \leq y \]