TheoremComplete

The Cayley-Hamilton Theorem

The Cayley-Hamilton theorem states that every square matrix satisfies its own characteristic equation. This remarkable result connects matrices to their characteristic polynomials in a fundamental way.

TheoremCayley-Hamilton Theorem

Let AA be an n×nn \times n matrix with characteristic polynomial: pA(λ)=det(AλI)=c0+c1λ++cn1λn1+cnλnp_A(\lambda) = \det(A - \lambda I) = c_0 + c_1\lambda + \cdots + c_{n-1}\lambda^{n-1} + c_n\lambda^n

Then AA satisfies its characteristic equation: pA(A)=c0I+c1A++cn1An1+cnAn=0p_A(A) = c_0I + c_1A + \cdots + c_{n-1}A^{n-1} + c_nA^n = 0

That is, substituting matrix AA for variable λ\lambda in pA(λ)p_A(\lambda) yields the zero matrix.

This theorem implies that any power AkA^k for knk \geq n can be expressed as a linear combination of I,A,A2,,An1I, A, A^2, \ldots, A^{n-1}. The space of polynomials in AA has dimension at most nn.

ExampleVerifying Cayley-Hamilton

Let A=[2112]A = \begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix}.

Characteristic polynomial: pA(λ)=det(AλI)=(2λ)21=λ24λ+3p_A(\lambda) = \det(A - \lambda I) = (2-\lambda)^2 - 1 = \lambda^2 - 4\lambda + 3

Cayley-Hamilton predicts: A24A+3I=0A^2 - 4A + 3I = 0

Check: A2=[5445]A^2 = \begin{bmatrix} 5 & 4 \\ 4 & 5 \end{bmatrix}, 4A=[8448]4A = \begin{bmatrix} 8 & 4 \\ 4 & 8 \end{bmatrix}, 3I=[3003]3I = \begin{bmatrix} 3 & 0 \\ 0 & 3 \end{bmatrix}

A24A+3I=[58+344+044+058+3]=[0000]A^2 - 4A + 3I = \begin{bmatrix} 5-8+3 & 4-4+0 \\ 4-4+0 & 5-8+3 \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}

TheoremMinimal Polynomial

For matrix AA, the minimal polynomial mA(λ)m_A(\lambda) is the monic polynomial of smallest degree such that mA(A)=0m_A(A) = 0.

Properties:

  1. mA(λ)m_A(\lambda) divides every polynomial q(λ)q(\lambda) with q(A)=0q(A) = 0
  2. mA(λ)m_A(\lambda) divides the characteristic polynomial pA(λ)p_A(\lambda)
  3. mA(λ)m_A(\lambda) and pA(λ)p_A(\lambda) have the same roots (eigenvalues)
  4. deg(mA)n\deg(m_A) \leq n
  5. AA is diagonalizable if and only if mA(λ)m_A(\lambda) factors into distinct linear factors
TheoremApplications of Cayley-Hamilton

Computing Matrix Inverse: If AA is invertible and pA(λ)=det(AλI)=c0+c1λ++λnp_A(\lambda) = \det(A - \lambda I) = c_0 + c_1\lambda + \cdots + \lambda^n, then: c0I+c1A++cn1An1+An=0c_0I + c_1A + \cdots + c_{n-1}A^{n-1} + A^n = 0

Since det(A)=(1)nc00\det(A) = (-1)^n c_0 \neq 0, we have c00c_0 \neq 0, so: A1=1c0(c1I+c2A++cn1An2+An1)A^{-1} = -\frac{1}{c_0}(c_1I + c_2A + \cdots + c_{n-1}A^{n-2} + A^{n-1})

Computing Matrix Powers: To find AkA^k for large kk, use Cayley-Hamilton to reduce to degree <n< n.

ExampleUsing Cayley-Hamilton for Powers

For A=[1101]A = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix}, the characteristic polynomial is pA(λ)=(λ1)2=λ22λ+1p_A(\lambda) = (\lambda-1)^2 = \lambda^2 - 2\lambda + 1.

By Cayley-Hamilton: A22A+I=0A^2 - 2A + I = 0, so A2=2AIA^2 = 2A - I.

Then: A3=AA2=A(2AI)=2A2A=2(2AI)A=3A2IA^3 = A \cdot A^2 = A(2A - I) = 2A^2 - A = 2(2A - I) - A = 3A - 2I

Generally: An=nA(n1)IA^n = nA - (n-1)I for n1n \geq 1.

Remark

The Cayley-Hamilton theorem has profound implications: it shows that the algebraic structure of a matrix is completely captured by a polynomial equation. This connection between linear algebra and polynomial algebra underlies much of matrix theory, from Jordan form to matrix functions.