ConceptComplete

Concentration Inequalities

Concentration inequalities show that random variables are tightly clustered around their expected values. These tools are indispensable in probabilistic combinatorics, allowing one to go beyond expectation arguments and establish sharp thresholds.


Chernoff-Type Bounds

Definition7.7Chernoff Bound

Let X=i=1nXiX = \sum_{i=1}^n X_i where XiX_i are independent Bernoulli random variables with Pr[Xi=1]=pi\Pr[X_i = 1] = p_i. Let μ=E[X]=pi\mu = \mathbb{E}[X] = \sum p_i. Then for δ>0\delta > 0: Pr[X(1+δ)μ](eδ(1+δ)(1+δ))μ,\Pr[X \geq (1 + \delta)\mu] \leq \left(\frac{e^\delta}{(1+\delta)^{(1+\delta)}}\right)^\mu, and for 0<δ<10 < \delta < 1: Pr[Xμδμ]2exp(δ2μ3).\Pr[|X - \mu| \geq \delta\mu] \leq 2\exp\left(-\frac{\delta^2 \mu}{3}\right).

ExampleDegree Concentration in $G(n, p)$

In G(n,p)G(n, p) with p=d/np = d/n and d=ω(logn)d = \omega(\log n), the degree of each vertex is concentrated around dd. By the Chernoff bound, Pr[deg(v)d>εd]2eε2d/3\Pr[|\deg(v) - d| > \varepsilon d] \leq 2e^{-\varepsilon^2 d / 3}. A union bound over all nn vertices shows that with high probability, all degrees are (1±ε)d(1 \pm \varepsilon)d.


Martingale Methods

Definition7.8Azuma-Hoeffding Inequality

Let X0,X1,,XnX_0, X_1, \ldots, X_n be a martingale with XiXi1ci|X_i - X_{i-1}| \leq c_i almost surely. Then for t>0t > 0: Pr[XnX0t]2exp(t22i=1nci2).\Pr[|X_n - X_0| \geq t] \leq 2\exp\left(-\frac{t^2}{2\sum_{i=1}^n c_i^2}\right).

Definition7.9Method of Bounded Differences

If f:Ω1××ΩnRf: \Omega_1 \times \cdots \times \Omega_n \to \mathbb{R} satisfies the Lipschitz condition f(x)f(x)ck|f(x) - f(x')| \leq c_k whenever xx and xx' differ only in coordinate kk, and X1,,XnX_1, \ldots, X_n are independent random variables, then: Pr[f(X1,,Xn)E[f]t]2exp(2t2ck2).\Pr[|f(X_1, \ldots, X_n) - \mathbb{E}[f]| \geq t] \leq 2\exp\left(-\frac{2t^2}{\sum c_k^2}\right).

ExampleChromatic Number Concentration

The chromatic number χ(G(n,1/2))\chi(G(n, 1/2)) can change by at most 1 when a single vertex is added or removed. By the method of bounded differences with ci=1c_i = 1: Pr[χE[χ]t]2e2t2/n.\Pr[|\chi - \mathbb{E}[\chi]| \geq t] \leq 2e^{-2t^2/n}. This shows χ(G(n,1/2))\chi(G(n, 1/2)) is concentrated in an interval of width O(n)O(\sqrt{n}). Much sharper results are known: Shamir and Spencer showed concentration in O(n/logn)O(\sqrt{n/\log n}), and the true answer is believed to be O(1)O(1).

RemarkTalagrand's Inequality

Talagrand's concentration inequality provides much stronger bounds when the function ff depends on the variables in a "smooth" way. For product spaces, if ff is Lipschitz and certifiable, then ff is concentrated within O(E[f])O(\sqrt{\mathbb{E}[f]}) of its median, a far tighter bound than Azuma-Hoeffding.