ProofComplete

Proof of the Large Sieve Inequality

The large sieve inequality is a fundamental tool in analytic number theory, bounding exponential sums over well-spaced frequencies. It underlies the Bombieri-Vinogradov theorem and many results on the distribution of primes.


Statement

Theorem6.3Large Sieve Inequality

Let aM+1,…,aM+Na_{M+1}, \ldots, a_{M+N} be complex numbers and let Ξ±1,…,Ξ±R∈R\alpha_1, \ldots, \alpha_R \in \mathbb{R} satisfy βˆ₯Ξ±rβˆ’Ξ±sβˆ₯β‰₯Ξ΄>0\|\alpha_r - \alpha_s\| \geq \delta > 0 for rβ‰ sr \neq s (where βˆ₯β‹…βˆ₯\|\cdot\| denotes distance to the nearest integer). Then: βˆ‘r=1Rβˆ£βˆ‘n=M+1M+Nane2Ο€inΞ±r∣2≀(Nβˆ’1+Ξ΄βˆ’1)βˆ‘n=M+1M+N∣an∣2\sum_{r=1}^R \left|\sum_{n=M+1}^{M+N} a_n e^{2\pi i n\alpha_r}\right|^2 \leq (N - 1 + \delta^{-1}) \sum_{n=M+1}^{M+N} |a_n|^2


Proof

Proof

Method (Selberg-Montgomery-Vaughan). Define S(Ξ±)=βˆ‘n=M+1M+Nane2Ο€inΞ±S(\alpha) = \sum_{n=M+1}^{M+N} a_n e^{2\pi i n\alpha}.

We seek a trigonometric polynomial T(Ξ±)=βˆ‘βˆ£kβˆ£β‰€Kcke2Ο€ikΞ±T(\alpha) = \sum_{|k| \leq K} c_k e^{2\pi i k\alpha} with K=βŒŠΞ΄βˆ’1βŒ‹K = \lfloor \delta^{-1} \rfloor such that T(Ξ±r)=1T(\alpha_r) = 1 for all rr and ∫01∣T(Ξ±)∣2 dΞ±\int_0^1 |T(\alpha)|^2\, d\alpha is minimized.

By duality, the large sieve is equivalent to: there exist b1,…,bRb_1, \ldots, b_R with βˆ‘βˆ£br∣2≀1\sum|b_r|^2 \leq 1 such that βˆ₯βˆ‘rbre2Ο€inΞ±rβˆ₯β„“22≀N+Ξ΄βˆ’1βˆ’1\left\|\sum_r b_r e^{2\pi i n\alpha_r}\right\|_{\ell^2}^2 \leq N + \delta^{-1} - 1.

Direct proof via Beurling-Selberg functions. Consider the Hermitian form βˆ‘r,sBrszrzΛ‰s\sum_{r,s} B_{rs} z_r \bar{z}_s where Brs=βˆ‘n=M+1M+Ne2Ο€in(Ξ±rβˆ’Ξ±s)B_{rs} = \sum_{n=M+1}^{M+N} e^{2\pi i n(\alpha_r - \alpha_s)}. The large sieve is equivalent to showing the operator norm βˆ₯Bβˆ₯≀N+Ξ΄βˆ’1βˆ’1\|B\| \leq N + \delta^{-1} - 1.

For the diagonal: Brr=NB_{rr} = N. For off-diagonal terms, ∣Brs∣=∣sin⁑(Ο€N(Ξ±rβˆ’Ξ±s))/sin⁑(Ο€(Ξ±rβˆ’Ξ±s))∣|B_{rs}| = |\sin(\pi N(\alpha_r - \alpha_s))/\sin(\pi(\alpha_r - \alpha_s))|.

Hilbert-type inequality. The key estimate: βˆ‘rβ‰ s∣zr∣∣zs∣∣sin⁑(Ο€(Ξ±rβˆ’Ξ±s))βˆ£β‰€(Ξ΄βˆ’1βˆ’1)βˆ‘βˆ£zr∣2\sum_{r \neq s} \frac{|z_r||z_s|}{|\sin(\pi(\alpha_r - \alpha_s))|} \leq (\delta^{-1} - 1)\sum|z_r|^2 when βˆ₯Ξ±rβˆ’Ξ±sβˆ₯β‰₯Ξ΄\|\alpha_r - \alpha_s\| \geq \delta. This follows because the quadratic form βˆ‘rβ‰ szrzΛ‰s/sin⁑(Ο€(Ξ±rβˆ’Ξ±s))\sum_{r \neq s} z_r\bar{z}_s / \sin(\pi(\alpha_r - \alpha_s)) is bounded by (Ξ΄βˆ’1βˆ’1)(\delta^{-1} - 1) times the identity, which can be proved using the Beurling-Selberg majorant/minorant for the characteristic function of an interval.

Combining: βˆ‘r∣S(Ξ±r)∣2=βˆ‘r,sBrszrzΛ‰s≀(N+Ξ΄βˆ’1βˆ’1)βˆ‘βˆ£an∣2\sum_r |S(\alpha_r)|^2 = \sum_{r,s} B_{rs} z_r \bar{z}_s \leq (N + \delta^{-1} - 1) \sum |a_n|^2 where we identified zr=βˆ‘ane2Ο€inΞ±rz_r = \sum a_n e^{2\pi i n\alpha_r}... more precisely, expanding βˆ‘r∣S(Ξ±r)∣2\sum_r|S(\alpha_r)|^2 and using the bound on the Hermitian form.

Alternatively (Montgomery-Vaughan): βˆ‘r∣S(Ξ±r)∣2β‰€βˆ‘n∣an∣2βˆ‘rβˆ£βˆ‘βˆ£kβˆ£β‰€Kck(r)e2Ο€ikn∣\sum_r |S(\alpha_r)|^2 \leq \sum_n |a_n|^2 \sum_r \left|\sum_{|k|\leq K} c_k^{(r)} e^{2\pi ikn}\right| where ck(r)c_k^{(r)} are chosen optimally, leading to the stated bound with constant Nβˆ’1+Ξ΄βˆ’1N - 1 + \delta^{-1}. β–‘\square

β– 

ExampleLarge Sieve for Characters

Specializing to Dirichlet characters: Ξ±r=a/q\alpha_r = a/q for Farey fractions with q≀Qq \leq Q. The spacing Ξ΄=1/Q2\delta = 1/Q^2 (Farey fractions are spaced at least 1/Q21/Q^2 apart). The character form of the large sieve: βˆ‘q≀QqΟ•(q)βˆ‘Ο‡β€Šmodβ€Šqβˆ—βˆ£βˆ‘n=M+1M+NanΟ‡(n)∣2≀(N+Q2)βˆ‘βˆ£an∣2\sum_{q \leq Q} \frac{q}{\phi(q)} \sum_{\chi \bmod q}^* \left|\sum_{n=M+1}^{M+N} a_n \chi(n)\right|^2 \leq (N + Q^2) \sum |a_n|^2.

RemarkSharpness

The constant Nβˆ’1+Ξ΄βˆ’1N - 1 + \delta^{-1} is the best possible (it cannot be replaced by max⁑(N,Ξ΄βˆ’1)\max(N, \delta^{-1}) in general). The extremal case is achieved by an arithmetic progression of frequencies Ξ±r=rΞ΄\alpha_r = r\delta.