ProofComplete

Proof of Cauchy-Schwarz Inequality

We present three proofs of the Cauchy--Schwarz inequality, each offering a different perspective: the quadratic form argument, the projection argument, and the Lagrange identity approach.


Statement

Theorem6.5Cauchy-Schwarz Inequality

For all vectors u,vu, v in an inner product space VV:

u,v2u,uv,v.|\langle u, v \rangle|^2 \leq \langle u, u \rangle \cdot \langle v, v \rangle.

Equality holds if and only if uu and vv are linearly dependent.


Proof 1: The quadratic form argument

ProofProof via non-negative quadratic form

Case 1: If v=0v = 0, both sides are 00 and the inequality holds with equality.

Case 2: Assume v0v \neq 0. For any scalar tRt \in \mathbb{R} (or tCt \in \mathbb{C}), consider the vector utvu - tv. By positive definiteness:

0utv,utv=u,utu,vtˉv,u+t2v,v.0 \leq \langle u - tv, u - tv \rangle = \langle u, u \rangle - t\langle u, v \rangle - \bar{t}\langle v, u \rangle + |t|^2 \langle v, v \rangle.

Real case: Set t=u,vv,vt = \frac{\langle u, v \rangle}{\langle v, v \rangle} (the optimal tt minimizing the quadratic):

0u,u2u,v2v,v+u,v2v,v=u,uu,v2v,v.0 \leq \langle u, u \rangle - 2 \frac{\langle u, v \rangle^2}{\langle v, v \rangle} + \frac{\langle u, v \rangle^2}{\langle v, v \rangle} = \langle u, u \rangle - \frac{\langle u, v \rangle^2}{\langle v, v \rangle}.

Rearranging: u,v2u,uv,v\langle u, v \rangle^2 \leq \langle u, u \rangle \cdot \langle v, v \rangle.

Complex case: Set t=u,vv,vt = \frac{\langle u, v \rangle}{\langle v, v \rangle}:

0u,uu,vu,vv,v=u,uu,v2v,v.0 \leq \langle u, u \rangle - \frac{\langle u, v \rangle \overline{\langle u, v \rangle}}{\langle v, v \rangle} = \langle u, u \rangle - \frac{|\langle u, v \rangle|^2}{\langle v, v \rangle}.

Rearranging: u,v2u,uv,v|\langle u, v \rangle|^2 \leq \langle u, u \rangle \cdot \langle v, v \rangle.

Equality: We have equality iff utv,utv=0\langle u - tv, u - tv \rangle = 0, iff utv=0u - tv = 0, iff u=tvu = tv. \blacksquare

ExampleVisualizing the quadratic proof in R^2

u=(3,1)u = (3, 1), v=(1,2)v = (1, 2). v,v=5\langle v, v \rangle = 5, u,v=5\langle u, v \rangle = 5.

t=5/5=1t = 5/5 = 1, utv=(3,1)(1,2)=(2,1)u - tv = (3, 1) - (1, 2) = (2, -1), utv2=5\|u - tv\|^2 = 5.

u2u,v2/v2=1025/5=50\|u\|^2 - |\langle u, v \rangle|^2 / \|v\|^2 = 10 - 25/5 = 5 \geq 0 ✓.

If instead u=(2,4)=2vu = (2, 4) = 2v: t=10/5=2t = 10/5 = 2, utv=0u - tv = 0, equality holds.

ExampleComplex case verification

u=(1,i)u = (1, i), v=(1,1)v = (1, 1) in C2\mathbb{C}^2.

u,v=11+i1=1+i\langle u, v \rangle = 1 \cdot 1 + i \cdot 1 = 1 + i, u,v2=2|\langle u, v \rangle|^2 = 2.

u,u=1+1=2\langle u, u \rangle = 1 + 1 = 2, v,v=2\langle v, v \rangle = 2. Product: 44.

242 \leq 4 ✓. Strict inequality because uu and vv are linearly independent (no scalar tt with (1,i)=t(1,1)(1, i) = t(1, 1)).


Proof 2: Via orthogonal projection

ProofProof using projection

If v=0v = 0, the result is trivial. Assume v0v \neq 0.

Decompose uu orthogonally with respect to vv:

u=projv(u)+(uprojv(u)),where projv(u)=u,vv,vv.u = \operatorname{proj}_v(u) + (u - \operatorname{proj}_v(u)), \quad \text{where } \operatorname{proj}_v(u) = \frac{\langle u, v \rangle}{\langle v, v \rangle} v.

By the Pythagorean theorem (since the two components are orthogonal):

u2=projv(u)2+uprojv(u)2.\|u\|^2 = \|\operatorname{proj}_v(u)\|^2 + \|u - \operatorname{proj}_v(u)\|^2.

Since uprojv(u)20\|u - \operatorname{proj}_v(u)\|^2 \geq 0:

u2projv(u)2=u,v2v2.\|u\|^2 \geq \|\operatorname{proj}_v(u)\|^2 = \frac{|\langle u, v \rangle|^2}{\|v\|^2}.

Multiplying both sides by v2\|v\|^2: u2v2u,v2\|u\|^2 \|v\|^2 \geq |\langle u, v \rangle|^2.

Equality iff u=projv(u)u = \operatorname{proj}_v(u), iff uu is a scalar multiple of vv. \blacksquare

ExampleProjection proof visualized

u=(4,3)u = (4, 3), v=(1,0)v = (1, 0).

projv(u)=4(1,0)=(4,0)\operatorname{proj}_v(u) = 4 \cdot (1, 0) = (4, 0). Residual: (0,3)(0, 3).

u2=25=16+9=proj2+residual2\|u\|^2 = 25 = 16 + 9 = \|\operatorname{proj}\|^2 + \|\text{residual}\|^2.

Cauchy--Schwarz: u,v2=16251=25|\langle u, v \rangle|^2 = 16 \leq 25 \cdot 1 = 25 ✓. The gap is residual2=9\|\text{residual}\|^2 = 9.

ExampleProjection proof for functions

f(x)=xf(x) = x, g(x)=1g(x) = 1 on [0,1][0, 1].

projg(f)=f,gg2g=1/211=12\operatorname{proj}_g(f) = \frac{\langle f, g \rangle}{\|g\|^2} g = \frac{1/2}{1} \cdot 1 = \frac{1}{2}.

f2=1/3f,g2/g2=1/4\|f\|^2 = 1/3 \geq |\langle f, g \rangle|^2 / \|g\|^2 = 1/4 ✓.

The residual is f1/2=x1/2f - 1/2 = x - 1/2, with x1/22=1/31/4=1/12\|x - 1/2\|^2 = 1/3 - 1/4 = 1/12.


Proof 3: Lagrange identity (for R^n)

ProofProof via Lagrange identity (real case, R^n)

For u,vRnu, v \in \mathbb{R}^n, the Lagrange identity states:

u2v2(uv)2=1i<jn(uivjujvi)2.\|u\|^2 \|v\|^2 - (u \cdot v)^2 = \sum_{1 \leq i < j \leq n} (u_i v_j - u_j v_i)^2.

The right side is a sum of squares, hence 0\geq 0. This gives u2v2(uv)2\|u\|^2 \|v\|^2 \geq (u \cdot v)^2.

Equality holds iff all uivjujvi=0u_i v_j - u_j v_i = 0 for i<ji < j, iff all 2×22 \times 2 minors of (u1unv1vn)\begin{pmatrix} u_1 & \cdots & u_n \\ v_1 & \cdots & v_n \end{pmatrix} vanish, iff uu and vv are proportional. \blacksquare

ExampleLagrange identity in R^2

u=(a,b)u = (a, b), v=(c,d)v = (c, d):

u2v2(uv)2=(a2+b2)(c2+d2)(ac+bd)2=(adbc)2\|u\|^2\|v\|^2 - (u \cdot v)^2 = (a^2 + b^2)(c^2 + d^2) - (ac + bd)^2 = (ad - bc)^2.

For u=(1,2)u = (1, 2), v=(3,4)v = (3, 4): (1+4)(9+16)(3+8)2=125121=4=(46)2=4(1 + 4)(9 + 16) - (3 + 8)^2 = 125 - 121 = 4 = (4 - 6)^2 = 4 ✓.

ExampleLagrange identity in R^3

u=(1,0,0)u = (1, 0, 0), v=(0,1,0)v = (0, 1, 0):

u2v2(uv)2=110=1\|u\|^2\|v\|^2 - (u \cdot v)^2 = 1 \cdot 1 - 0 = 1.

(u1v2u2v1)2+(u1v3u3v1)2+(u2v3u3v2)2=1+0+0=1(u_1 v_2 - u_2 v_1)^2 + (u_1 v_3 - u_3 v_1)^2 + (u_2 v_3 - u_3 v_2)^2 = 1 + 0 + 0 = 1 ✓.

Note: in R3\mathbb{R}^3, u2v2(uv)2=u×v2\|u\|^2\|v\|^2 - (u \cdot v)^2 = \|u \times v\|^2, where u×vu \times v is the cross product.


Consequences derived

ExampleDeriving the triangle inequality

u+v2=u2+2Reu,v+v2u2+2uv+v2=(u+v)2\|u + v\|^2 = \|u\|^2 + 2\operatorname{Re}\langle u, v \rangle + \|v\|^2 \leq \|u\|^2 + 2\|u\|\|v\| + \|v\|^2 = (\|u\| + \|v\|)^2.

The key step uses Reu,vu,vuv\operatorname{Re}\langle u, v \rangle \leq |\langle u, v \rangle| \leq \|u\|\|v\| (Cauchy--Schwarz).

ExampleDeriving Bessel's inequality

For an orthonormal set {e1,,ek}\{e_1, \ldots, e_k\} and any vv, let w=viv,eieiw = v - \sum_i \langle v, e_i \rangle e_i. Then wejw \perp e_j for all jj, and:

0w2=v2iv,ei20 \leq \|w\|^2 = \|v\|^2 - \sum_i |\langle v, e_i \rangle|^2.

This is Bessel's inequality, which is a multi-dimensional Cauchy--Schwarz.

ExampleAngle is well-defined

Cauchy--Schwarz guarantees 1u,vuv1-1 \leq \frac{\langle u, v \rangle}{\|u\|\|v\|} \leq 1 for real inner product spaces, so θ=arccos(u,vuv)[0,π]\theta = \arccos\left(\frac{\langle u, v \rangle}{\|u\|\|v\|}\right) \in [0, \pi] is well-defined.

Without Cauchy--Schwarz, the argument of arccos\arccos could exceed [1,1][-1, 1], making the angle undefined.


Comparison of proofs

RemarkThree perspectives on Cauchy-Schwarz
  • Quadratic form proof: The most general, works in any inner product space (real or complex, finite or infinite-dimensional). Shows that the inequality is equivalent to non-negativity of a specific scalar quadratic in tt.

  • Projection proof: Geometrically intuitive -- the inequality says the projection of uu onto vv is at most as long as uu itself. The "gap" uprojv(u)2\|u - \operatorname{proj}_v(u)\|^2 measures how far uu is from being a multiple of vv.

  • Lagrange identity proof: Algebraically explicit (for Rn\mathbb{R}^n only), showing the "defect" u2v2(uv)2=(uivjujvi)2\|u\|^2\|v\|^2 - (u \cdot v)^2 = \sum(u_iv_j - u_jv_i)^2 as a sum of squares. In R3\mathbb{R}^3, this equals u×v2\|u \times v\|^2, connecting to the cross product.


Historical note

RemarkHistory of the inequality

The inequality is named after Augustin-Louis Cauchy (who proved the finite sum version in 1821), Viktor Bunyakovsky (who proved the integral version in 1859), and Hermann Amandus Schwarz (who independently proved the integral version in 1884). In some traditions it is called the Cauchy--Bunyakovsky--Schwarz inequality or CBS inequality.

The inequality appears in every branch of mathematics:

  • Analysis: the foundation of LpL^p space theory.
  • Probability: bounds on covariance and correlation.
  • Physics: uncertainty principles in quantum mechanics.
  • Geometry: the angle between vectors, curvature bounds.
  • Combinatorics: counting arguments via the second moment method.

Extended examples

ExampleUncertainty principle from Cauchy-Schwarz

In quantum mechanics, observables A,BA, B act on a Hilbert space H\mathcal{H}. For a state ψ|\psi\rangle, the uncertainty principle:

ΔAΔB12[A,B]\Delta A \cdot \Delta B \geq \frac{1}{2}|\langle [A, B] \rangle|

follows from Cauchy--Schwarz applied to u=(AA)ψ|u\rangle = (A - \langle A \rangle)|\psi\rangle and v=(BB)ψ|v\rangle = (B - \langle B \rangle)|\psi\rangle.

For position XX and momentum PP with [X,P]=i[X, P] = i\hbar: ΔXΔP/2\Delta X \cdot \Delta P \geq \hbar/2.

ExampleCauchy-Schwarz in number theory

For real sequences a1,,ana_1, \ldots, a_n and b1,,bnb_1, \ldots, b_n:

(aibi)2(ai2)(bi2).\left(\sum a_i b_i\right)^2 \leq \left(\sum a_i^2\right)\left(\sum b_i^2\right).

Setting ai=1a_i = 1 and bi=d(i)b_i = d(i) (number of divisors of ii): (i=1nd(i))2ni=1nd(i)2\left(\sum_{i=1}^n d(i)\right)^2 \leq n \sum_{i=1}^n d(i)^2. Since d(i)nlogn\sum d(i) \sim n \log n, this gives d(i)2n(logn)2\sum d(i)^2 \gg n(\log n)^2.

ExampleGeneralization: Holder's inequality

Cauchy--Schwarz is the p=q=2p = q = 2 case of Holder's inequality: for 1p+1q=1\frac{1}{p} + \frac{1}{q} = 1,

aibi(aip)1/p(biq)1/q.\sum |a_i b_i| \leq \left(\sum |a_i|^p\right)^{1/p} \left(\sum |b_i|^q\right)^{1/q}.

The p=q=2p = q = 2 case is exactly Cauchy--Schwarz.


Summary

RemarkThe cornerstone inequality

The Cauchy--Schwarz inequality is the single most consequential inequality in inner product space theory. From it flow:

  • The triangle inequality (hence metric space structure).
  • Bessel's inequality (finite energy of Fourier coefficients).
  • The well-definedness of angles.
  • The optimality of orthogonal projections.
  • Uncertainty principles in physics.

Its proof, via the non-negativity of utv2\|u - tv\|^2, is a masterclass in the power of positive definiteness.