Mathematics Lecture Notes

Theorem6.2Convergence of the Shifted QR Algorithm

Let $A \in \mathbb{R}^{n \times n}$ be a symmetric matrix in tridiagonal form with nonzero subdiagonal entries. The QR algorithm with Wilkinson shift produces a sequence $A_k$ such that the bottom subdiagonal entry $\beta_k = a_{n,n-1}^{(k)}$ satisfies $|\beta_{k+1}| \leq c |\beta_k|^3$ for a constant $c$ depending on the eigenvalue gap. In particular, the convergence is globally and at least cubically convergent: the bottom eigenvalue is isolated in $O(\log\log(1/\varepsilon))$ steps to achieve accuracy $\varepsilon$ .

Proof

Setup. Consider the tridiagonal matrix $T_k$ with bottom-right $2 \times 2$ block $\begin{pmatrix} \alpha_{n-1} & \beta \\ \beta & \alpha_n \end{pmatrix}$ . The Wilkinson shift $\sigma$ is the eigenvalue of this block closer to $\alpha_n$ .

Key estimate. Write $\delta = (\alpha_{n-1} - \alpha_n)/2$ and $\mu = \alpha_n - \sigma = -\delta + \mathrm{sign}(\delta)\sqrt{\delta^2 + \beta^2}$ . Then $|\mu| \leq \beta^2 / (2|\delta|)$ when $|\delta| \gg |\beta|$ , and $|\mu| \leq |\beta|$ always.

One QR step. After the shifted QR step $T_k - \sigma I = QR$ , $T_{k+1} = RQ + \sigma I$ , the new subdiagonal entry satisfies $|\beta'| = |\beta| \prod_{j=1}^{n-2} \frac{|\lambda_j - \sigma|}{|d_j|}$ where $d_j$ are related to the QR factorization. The Wilkinson shift guarantees that $(T_k - \sigma I)$ has its smallest singular value at the bottom, causing $|\beta'|/|\beta|$ to be small.

Cubic convergence. For symmetric tridiagonal matrices, Wilkinson proved that $|\beta_{k+1}| = O(|\beta_k|^3)$ using the identity relating the shift quality to the subdiagonal entry. The key insight is that $|\alpha_n - \sigma_k| = O(\beta_k^2)$ (the shift approximates the eigenvalue quadratically well), and this quadratic approximation combined with the inverse-iteration-like nature of shifted QR yields cubic convergence of $\beta_k \to 0$ .

Global convergence. The monotonicity $\sum_i \beta_i^2$ is non-increasing under QR steps (related to the Wilkinson monotonicity theorem) ensures global convergence. Once $|\beta_k|$ is small enough for the cubic estimate to dominate, convergence is rapid. $\square$

■

ExampleOverall Complexity of Symmetric Eigenvalue Problem

For an $n \times n$ symmetric matrix:

Tridiagonalization via Householder: $\frac{4}{3}n^3$ flops
QR iterations with Wilkinson shift: $\sim 2n$ total QR steps (cubic convergence means $\sim 2$ steps per eigenvalue), each costing $O(n)$ , total $O(n^2)$
Total: $\frac{4}{3}n^3 + O(n^2) \approx \frac{4}{3}n^3$ flops for all eigenvalues

For eigenvalues and eigenvectors: $\frac{4}{3}n^3 + O(n^3) = O(n^3)$ (accumulating orthogonal transformations dominates).

RemarkNon-Symmetric Case

For non-symmetric matrices, the QR algorithm (with double shifts) converges in practice but lacks a rigorous global convergence proof. The Hessenberg QR step costs $O(n^2)$ and typically requires $O(n)$ total iterations. Exceptional shifts are used when convergence stalls. The total cost is $O(n^3)$ for computing all eigenvalues of a general $n \times n$ matrix.

Math Notes

QR Algorithm Convergence Theorem

Proof Sketch