ConceptComplete

Random Variables and Distributions - Core Definitions

Random variables provide the bridge between abstract probability spaces and numerical analysis. They transform outcomes into numbers, enabling us to perform calculations and comparisons.

Random Variables

Definition

A random variable is a function X:ΩRX: \Omega \to \mathbb{R} that assigns a real number to each outcome in the sample space. Formally, XX must be measurable: for every real number xx, the set {ωΩ:X(ω)x}\{\omega \in \Omega: X(\omega) \leq x\} must be an event (i.e., belong to F\mathcal{F}).

Random variables are denoted by capital letters (X,Y,ZX, Y, Z) while their values are denoted by lowercase letters (x,y,zx, y, z).

Example

Coin Tosses: Toss a fair coin three times. Define XX = number of heads.

  • Sample space: Ω={HHH,HHT,HTH,HTT,THH,THT,TTH,TTT}\Omega = \{HHH, HHT, HTH, HTT, THH, THT, TTH, TTT\}
  • X(HHH)=3,X(HHT)=2,X(HTH)=2,,X(TTT)=0X(HHH) = 3, X(HHT) = 2, X(HTH) = 2, \ldots, X(TTT) = 0
  • XX takes values in {0,1,2,3}\{0, 1, 2, 3\}

Cumulative Distribution Function

Definition

The cumulative distribution function (CDF) of a random variable XX is: FX(x)=P(Xx)=P({ωΩ:X(ω)x})F_X(x) = P(X \leq x) = P(\{\omega \in \Omega: X(\omega) \leq x\})

for all xRx \in \mathbb{R}.

The CDF completely characterizes the distribution of XX. It satisfies three properties:

  1. Monotonicity: FXF_X is non-decreasing
  2. Limits: limxFX(x)=0\lim_{x \to -\infty} F_X(x) = 0 and limxFX(x)=1\lim_{x \to \infty} F_X(x) = 1
  3. Right-continuity: limh0+FX(x+h)=FX(x)\lim_{h \to 0^+} F_X(x+h) = F_X(x)

From the CDF, we can compute probabilities of intervals: P(a<Xb)=FX(b)FX(a)P(a < X \leq b) = F_X(b) - F_X(a)

Discrete vs. Continuous Random Variables

Definition

A random variable XX is discrete if it takes on countably many values. Its distribution is characterized by a probability mass function (PMF): pX(x)=P(X=x)p_X(x) = P(X = x)

A random variable XX is continuous if its CDF is continuous and differentiable almost everywhere. Its distribution is characterized by a probability density function (PDF): fX(x)=ddxFX(x)f_X(x) = \frac{d}{dx} F_X(x)

satisfying: P(aXb)=abfX(x)dxP(a \leq X \leq b) = \int_a^b f_X(x) \, dx

For continuous random variables, P(X=x)=0P(X = x) = 0 for all xx, so we focus on probabilities of intervals.

Example

Discrete: Rolling a fair die, X{1,2,3,4,5,6}X \in \{1,2,3,4,5,6\} with pX(k)=1/6p_X(k) = 1/6 for each kk.

Continuous: Uniform distribution on [0,1][0,1] has PDF fX(x)=1f_X(x) = 1 for 0x10 \leq x \leq 1 (and 0 elsewhere).

Remark

The distinction between discrete and continuous random variables parallels the distinction between counting and measuring. Discrete variables count outcomes (number of successes, number of arrivals), while continuous variables measure quantities (time, weight, distance).