The Central Limit Theorem
The Central Limit Theorem (CLT) is arguably the most important result in probability theory, explaining why the normal distribution appears so frequently in nature and providing the theoretical foundation for statistical inference.
Statement
A sequence of random variables converges in distribution to a random variable , written , if for every at which is continuous, where and are the cumulative distribution functions.
Let be independent and identically distributed (i.i.d.) random variables with mean and finite variance . Let . Then the standardized sample mean converges in distribution to the standard normal: Equivalently, for all , where is the standard normal CDF.
Examples
Let be the outcome of rolling a fair die: , . For rolls, is approximately normal with mean and standard deviation . The probability of the average exceeding is approximately:
For with trials: is the sample proportion, and This gives the approximate confidence interval .
The CLT is remarkable because it requires no assumption about the shape of the distribution — only that the mean and variance exist. Whether the are discrete, continuous, skewed, or multimodal, the average approaches normality. This universality explains the prevalence of the bell curve in real-world data: any quantity that is the sum of many small independent effects will be approximately normal.