ProofComplete

Proof of the Gershgorin Circle Theorem (Detailed)

The Gershgorin theorem provides a simple yet powerful method for eigenvalue localization. We present the full proof including the component-counting refinement and column variation.


Statement

Theorem6.3Gershgorin Theorem with Refinements

Let A=(aij)Cn×nA = (a_{ij}) \in \mathbb{C}^{n \times n}. Define row discs DiR={z:zaiiRi}D_i^R = \{z : |z - a_{ii}| \leq R_i\} with Ri=jiaijR_i = \sum_{j\neq i}|a_{ij}|, and column discs DjC={z:zajjCj}D_j^C = \{z : |z - a_{jj}| \leq C_j\} with Cj=ijaijC_j = \sum_{i\neq j}|a_{ij}|. Then:

  1. Every eigenvalue lies in iDiR\bigcup_i D_i^R and also in jDjC\bigcup_j D_j^C.
  2. Every eigenvalue lies in i(DiRDiC)\bigcup_i (D_i^R \cap D_i^C) is not true in general, but every eigenvalue lies in the intersection (iDiR)(jDjC)\left(\bigcup_i D_i^R\right) \cap \left(\bigcup_j D_j^C\right).
  3. If a union of mm discs is disjoint from the remaining nmn - m discs, it contains exactly mm eigenvalues.

Proof

Proof

Row discs. Let (λ,x)(\lambda, x) be an eigenpair with x=xp=1\|x\|_\infty = |x_p| = 1. From Ax=λxAx = \lambda x: japjxj=λxp\sum_j a_{pj}x_j = \lambda x_p, giving (λapp)xp=jpapjxj(\lambda - a_{pp})x_p = \sum_{j \neq p} a_{pj}x_j. Thus λappjpapjxj/xpRp|\lambda - a_{pp}| \leq \sum_{j \neq p}|a_{pj}||x_j|/|x_p| \leq R_p.

Column discs. Since λ\lambda is also an eigenvalue of ATA^T (same characteristic polynomial), applying the row-disc result to ATA^T gives λDjC\lambda \in D_j^C for some jj, proving statement (2).

Component counting (detailed). Define A(t)=(1t)D+tAA(t) = (1-t)D + tA for t[0,1]t \in [0,1] where D=diag(A)D = \mathrm{diag}(A). The off-diagonal entries of A(t)A(t) are taijt \cdot a_{ij} for iji \neq j, so the Gershgorin radii for A(t)A(t) are Ri(t)=tRiR_i(t) = t R_i.

The eigenvalues λ1(t),,λn(t)\lambda_1(t), \ldots, \lambda_n(t) of A(t)A(t) are continuous functions of tt (they are roots of det(A(t)λI)=0\det(A(t) - \lambda I) = 0, a polynomial in λ\lambda with continuous coefficients).

At t=0t = 0: λi(0)=aii\lambda_i(0) = a_{ii} and Di(0)={aii}D_i(0) = \{a_{ii}\}. Exactly one eigenvalue starts in each disc (assuming distinct diagonal entries; the general case follows by a limiting argument).

Suppose that for all t[0,1]t \in [0, 1], the discs Di1(t),,Dim(t)D_{i_1}(t), \ldots, D_{i_m}(t) form a connected component Ω(t)\Omega(t) separated from Dim+1(t),,Din(t)D_{i_{m+1}}(t), \ldots, D_{i_n}(t) by a positive gap. By continuity, no eigenvalue path λj(t)\lambda_j(t) can jump from Ω(t)\Omega(t) to the complement (or vice versa), since that would require passing through the gap where no disc exists. Therefore, the number of eigenvalues in Ω(t)\Omega(t) is constant in tt. At t=0t = 0, exactly mm eigenvalues (ai1,,aima_{i_1}, \ldots, a_{i_m}) lie in Ω(0)\Omega(0), so exactly mm eigenvalues lie in Ω(1)\Omega(1).

Remark on distinct diagonals. If some aii=ajja_{ii} = a_{jj}, perturb the diagonal entries by ϵ\epsilon to make them distinct, apply the theorem, then take ϵ0\epsilon \to 0. The disc boundaries move continuously, preserving the count by a limiting argument. \square


RemarkTightness and Extensions

Gershgorin's theorem is tight: for every configuration of discs satisfying the connectivity hypothesis, there exists a matrix realizing any permissible eigenvalue distribution. Extensions include the Brauer ovals of Cassini (zaiizajjRiRj|z - a_{ii}||z - a_{jj}| \leq R_i R_j for each pair i,ji,j), which can give tighter localization, and the Brualdi regions based on directed graph structure of the matrix.

ExampleBlock Gershgorin Theorem

The Gershgorin theorem extends to block matrices: if A=(Aij)A = (A_{ij}) with square diagonal blocks, then every eigenvalue lies in i{z:σmin(AiizI)jiAij}\bigcup_i \{z : \sigma_{\min}(A_{ii} - zI) \leq \sum_{j \neq i}\|A_{ij}\|\}. For 2×22 \times 2 blocks, this uses the smallest singular value instead of the simple modulus, giving tighter localization for matrices with natural block structure.