Random Variables and Distributions - Core Definitions
Random variables provide the bridge between abstract probability spaces and numerical analysis. They transform outcomes into numbers, enabling us to perform calculations and comparisons.
Random Variables
A random variable is a function that assigns a real number to each outcome in the sample space. Formally, must be measurable: for every real number , the set must be an event (i.e., belong to ).
Random variables are denoted by capital letters () while their values are denoted by lowercase letters ().
Coin Tosses: Toss a fair coin three times. Define = number of heads.
- Sample space:
- takes values in
Cumulative Distribution Function
The cumulative distribution function (CDF) of a random variable is:
for all .
The CDF completely characterizes the distribution of . It satisfies three properties:
- Monotonicity: is non-decreasing
- Limits: and
- Right-continuity:
From the CDF, we can compute probabilities of intervals:
Discrete vs. Continuous Random Variables
A random variable is discrete if it takes on countably many values. Its distribution is characterized by a probability mass function (PMF):
A random variable is continuous if its CDF is continuous and differentiable almost everywhere. Its distribution is characterized by a probability density function (PDF):
satisfying:
For continuous random variables, for all , so we focus on probabilities of intervals.
Discrete: Rolling a fair die, with for each .
Continuous: Uniform distribution on has PDF for (and 0 elsewhere).
The distinction between discrete and continuous random variables parallels the distinction between counting and measuring. Discrete variables count outcomes (number of successes, number of arrivals), while continuous variables measure quantities (time, weight, distance).