ConceptComplete

Random Variables and Distributions - Key Properties

The distribution of a random variable encodes all probabilistic information about it. Understanding key properties of distributions is essential for both theoretical and applied work.

Properties of CDFs

The cumulative distribution function FX(x)=P(X≀x)F_X(x) = P(X \leq x) has several important properties that follow from the axioms of probability.

Theorem: For any random variable XX:

  1. 0≀FX(x)≀10 \leq F_X(x) \leq 1 for all xx
  2. If x1<x2x_1 < x_2, then FX(x1)≀FX(x2)F_X(x_1) \leq F_X(x_2) (monotonicity)
  3. lim⁑xβ†’βˆ’βˆžFX(x)=0\lim_{x \to -\infty} F_X(x) = 0 and lim⁑xβ†’βˆžFX(x)=1\lim_{x \to \infty} F_X(x) = 1
  4. FXF_X is right-continuous: lim⁑hβ†’0+FX(x+h)=FX(x)\lim_{h \to 0^+} F_X(x+h) = F_X(x)

From the CDF, we can compute various probabilities: P(X>x)=1βˆ’FX(x)P(X > x) = 1 - F_X(x) P(X<x)=lim⁑hβ†’0+FX(xβˆ’h)=FX(xβˆ’)P(X < x) = \lim_{h \to 0^+} F_X(x-h) = F_X(x^-) P(X=x)=FX(x)βˆ’FX(xβˆ’)=jumpΒ atΒ xP(X = x) = F_X(x) - F_X(x^-) = \text{jump at } x

Example

For a discrete random variable with PMF pXp_X: FX(x)=βˆ‘k:k≀xpX(k)F_X(x) = \sum_{k: k \leq x} p_X(k)

The CDF is a step function with jumps of size pX(k)p_X(k) at each value kk where pX(k)>0p_X(k) > 0.

Properties of PDFs

For continuous random variables with PDF fXf_X:

Normalization: βˆ«βˆ’βˆžβˆžfX(x) dx=1\int_{-\infty}^{\infty} f_X(x) \, dx = 1

Non-negativity: fX(x)β‰₯0f_X(x) \geq 0 for all xx

Relationship to CDF: FX(x)=βˆ«βˆ’βˆžxfX(t) dtF_X(x) = \int_{-\infty}^x f_X(t) \, dt and fX(x)=ddxFX(x)f_X(x) = \frac{d}{dx} F_X(x) (where the derivative exists)

Key Insight: Unlike probabilities, fX(x)f_X(x) can exceed 1. It represents probability density, not probability. Only integrals of fXf_X give probabilities.

Example

The PDF fX(x)=2xf_X(x) = 2x for 0≀x≀10 \leq x \leq 1 (and 0 elsewhere) has fX(0.9)=1.8>1f_X(0.9) = 1.8 > 1, which is perfectly valid. The total integral is: ∫012x dx=[x2]01=1\int_0^1 2x \, dx = [x^2]_0^1 = 1 βœ“

Transformations of Random Variables

Theorem

If XX is a continuous random variable with PDF fXf_X and Y=g(X)Y = g(X) where gg is strictly monotonic and differentiable, then YY has PDF: fY(y)=fX(gβˆ’1(y))∣ddygβˆ’1(y)∣f_Y(y) = f_X(g^{-1}(y)) \left|\frac{d}{dy} g^{-1}(y)\right|

For discrete random variables, if Y=g(X)Y = g(X): pY(y)=βˆ‘x:g(x)=ypX(x)p_Y(y) = \sum_{x: g(x) = y} p_X(x)

Example

If X∼Uniform(0,1)X \sim \text{Uniform}(0,1) and Y=βˆ’ln⁑XY = -\ln X, then Y∼Exponential(1)Y \sim \text{Exponential}(1).

Starting with fX(x)=1f_X(x) = 1 for 0<x<10 < x < 1:

  • Inverse: x=eβˆ’yx = e^{-y}, so dxdy=βˆ’eβˆ’y\frac{dx}{dy} = -e^{-y}
  • PDF of YY: fY(y)=1β‹…βˆ£βˆ’eβˆ’y∣=eβˆ’yf_Y(y) = 1 \cdot |-e^{-y}| = e^{-y} for y>0y > 0

Quantile Function

Definition

The quantile function (inverse CDF) is: FXβˆ’1(p)=inf⁑{x:FX(x)β‰₯p}F_X^{-1}(p) = \inf\{x: F_X(x) \geq p\}

for 0<p<10 < p < 1. The value FXβˆ’1(p)F_X^{-1}(p) is the pp-th quantile.

Special quantiles:

  • Median: FXβˆ’1(0.5)F_X^{-1}(0.5)
  • First quartile: FXβˆ’1(0.25)F_X^{-1}(0.25)
  • Third quartile: FXβˆ’1(0.75)F_X^{-1}(0.75)
Remark

The quantile function inverts the CDF and is fundamental in generating random samples. The probability integral transform states that if U∼Uniform(0,1)U \sim \text{Uniform}(0,1), then FXβˆ’1(U)F_X^{-1}(U) has distribution FXF_X.