ConceptComplete

Characteristic Polynomial

The characteristic polynomial of a matrix AA encodes all eigenvalue information in a single polynomial. Its roots are the eigenvalues, its coefficients are symmetric functions of the eigenvalues, and it is invariant under similarity -- making it one of the most important invariants in linear algebra.


Definition

Definition5.5Characteristic polynomial

Let AMn×n(F)A \in M_{n \times n}(F). The characteristic polynomial of AA is:

pA(λ)=det(AλI).p_A(\lambda) = \det(A - \lambda I).

This is a polynomial of degree nn in λ\lambda with leading coefficient (1)n(-1)^n:

pA(λ)=(1)nλn+(1)n1tr(A)λn1++det(A).p_A(\lambda) = (-1)^n \lambda^n + (-1)^{n-1} \operatorname{tr}(A) \lambda^{n-1} + \cdots + \det(A).

The eigenvalues of AA are exactly the roots of pA(λ)=0p_A(\lambda) = 0.

RemarkSign convention

Some authors define pA(λ)=det(λIA)p_A(\lambda) = \det(\lambda I - A) instead, which gives a monic polynomial (leading coefficient 11). In this convention, pA(λ)=λntr(A)λn1++(1)ndet(A)p_A(\lambda) = \lambda^n - \operatorname{tr}(A)\lambda^{n-1} + \cdots + (-1)^n \det(A). We will follow the convention det(AλI)\det(A - \lambda I) but note when the monic form is used.


Computing characteristic polynomials

ExampleGeneral 2x2 matrix

For A=(abcd)A = \begin{pmatrix} a & b \\ c & d \end{pmatrix}:

pA(λ)=det(aλbcdλ)=(aλ)(dλ)bc=λ2(a+d)λ+(adbc).p_A(\lambda) = \det \begin{pmatrix} a - \lambda & b \\ c & d - \lambda \end{pmatrix} = (a - \lambda)(d - \lambda) - bc = \lambda^2 - (a+d)\lambda + (ad - bc).

So pA(λ)=λ2tr(A)λ+det(A)p_A(\lambda) = \lambda^2 - \operatorname{tr}(A)\lambda + \det(A), and the eigenvalues are:

λ=tr(A)±tr(A)24det(A)2.\lambda = \frac{\operatorname{tr}(A) \pm \sqrt{\operatorname{tr}(A)^2 - 4\det(A)}}{2}.

ExampleA specific 3x3 matrix

Let A=(210031002)A = \begin{pmatrix} 2 & 1 & 0 \\ 0 & 3 & 1 \\ 0 & 0 & 2 \end{pmatrix}. Since AA is upper triangular:

pA(λ)=(2λ)(3λ)(2λ)=(λ2)2(λ3).p_A(\lambda) = (2 - \lambda)(3 - \lambda)(2 - \lambda) = -({\lambda - 2})^2(\lambda - 3).

Eigenvalues: λ=2\lambda = 2 (algebraic multiplicity 22) and λ=3\lambda = 3 (algebraic multiplicity 11).

ExampleCompanion matrix

The companion matrix of the polynomial p(x)=xn+an1xn1++a1x+a0p(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 is:

C=(000a0100a1010a2001an1).C = \begin{pmatrix} 0 & 0 & \cdots & 0 & -a_0 \\ 1 & 0 & \cdots & 0 & -a_1 \\ 0 & 1 & \cdots & 0 & -a_2 \\ \vdots & & \ddots & & \vdots \\ 0 & 0 & \cdots & 1 & -a_{n-1} \end{pmatrix}.

The characteristic polynomial of CC (in the monic convention) is exactly p(x)p(x). This proves that every monic polynomial is the characteristic polynomial of some matrix.

For p(x)=x36x2+11x6=(x1)(x2)(x3)p(x) = x^3 - 6x^2 + 11x - 6 = (x-1)(x-2)(x-3): the companion matrix C=(0061011016)C = \begin{pmatrix} 0 & 0 & 6 \\ 1 & 0 & -11 \\ 0 & 1 & 6 \end{pmatrix} has eigenvalues 1,2,31, 2, 3.


Coefficients of the characteristic polynomial

Theorem5.1Coefficients and symmetric functions

Let AMn×n(F)A \in M_{n \times n}(F) with eigenvalues λ1,,λn\lambda_1, \ldots, \lambda_n (over the algebraic closure). In the monic convention det(λIA)=i=1n(λλi)\det(\lambda I - A) = \prod_{i=1}^n (\lambda - \lambda_i), the coefficients are the elementary symmetric polynomials:

det(λIA)=λne1λn1+e2λn2+(1)nen,\det(\lambda I - A) = \lambda^n - e_1 \lambda^{n-1} + e_2 \lambda^{n-2} - \cdots + (-1)^n e_n,

where:

  • e1=iλi=tr(A)e_1 = \sum_i \lambda_i = \operatorname{tr}(A),
  • e2=i<jλiλje_2 = \sum_{i < j} \lambda_i \lambda_j,
  • ek=i1<<ikλi1λike_k = \sum_{i_1 < \cdots < i_k} \lambda_{i_1} \cdots \lambda_{i_k},
  • en=λ1λn=det(A)e_n = \lambda_1 \cdots \lambda_n = \det(A).
ExampleSymmetric functions for a 3x3 matrix

Let AA have eigenvalues 1,2,41, 2, 4. Then:

  • e1=1+2+4=7=tr(A)e_1 = 1 + 2 + 4 = 7 = \operatorname{tr}(A).
  • e2=12+14+24=14e_2 = 1 \cdot 2 + 1 \cdot 4 + 2 \cdot 4 = 14.
  • e3=124=8=det(A)e_3 = 1 \cdot 2 \cdot 4 = 8 = \det(A).

Characteristic polynomial (monic): λ37λ2+14λ8=(λ1)(λ2)(λ4)\lambda^3 - 7\lambda^2 + 14\lambda - 8 = (\lambda - 1)(\lambda - 2)(\lambda - 4).

ExampleThe coefficient e_2 as sum of 2x2 minors

The coefficient e2e_2 can be computed directly from AA as the sum of all 2×22 \times 2 principal minors:

e2=i<jdet(aiiaijajiajj).e_2 = \sum_{i < j} \det \begin{pmatrix} a_{ii} & a_{ij} \\ a_{ji} & a_{jj} \end{pmatrix}.

For A=(123045006)A = \begin{pmatrix} 1 & 2 & 3 \\ 0 & 4 & 5 \\ 0 & 0 & 6 \end{pmatrix}: e2=det(1204)+det(1306)+det(4506)=4+6+24=34e_2 = \det\begin{pmatrix}1&2\\0&4\end{pmatrix} + \det\begin{pmatrix}1&3\\0&6\end{pmatrix} + \det\begin{pmatrix}4&5\\0&6\end{pmatrix} = 4 + 6 + 24 = 34.

Check: eigenvalues 1,4,61, 4, 6, so e2=14+16+46=4+6+24=34e_2 = 1 \cdot 4 + 1 \cdot 6 + 4 \cdot 6 = 4 + 6 + 24 = 34.


Similarity invariance

Theorem5.2Characteristic polynomial is a similarity invariant

If B=P1APB = P^{-1}AP for some invertible PP, then pB(λ)=pA(λ)p_B(\lambda) = p_A(\lambda).

In particular, similar matrices have the same eigenvalues (with the same algebraic multiplicities), the same trace, and the same determinant.

ProofProof of similarity invariance

det(BλI)=det(P1APλI)=det(P1(AλI)P)=det(P1)det(AλI)det(P)=det(AλI)\det(B - \lambda I) = \det(P^{-1}AP - \lambda I) = \det(P^{-1}(A - \lambda I)P) = \det(P^{-1}) \det(A - \lambda I) \det(P) = \det(A - \lambda I).

ExampleSimilar matrices share characteristic polynomial

Let A=(0110)A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} and P=(11ii)P = \begin{pmatrix} 1 & 1 \\ i & -i \end{pmatrix} (over C\mathbb{C}). Then P1AP=(i00i)=BP^{-1}AP = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} = B.

Both have p(λ)=λ2+1p(\lambda) = \lambda^2 + 1, eigenvalues ±i\pm i.

ExampleSame eigenvalues but not similar

A=(1101)A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} and B=(1001)=IB = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = I both have characteristic polynomial (λ1)2(\lambda - 1)^2 and eigenvalues {1,1}\{1, 1\}.

But AIA \neq I while B=IB = I, and since P1IP=IP^{-1}IP = I for all PP, the matrices AA and BB are not similar. The characteristic polynomial alone does not determine the similarity class.


Characteristic polynomial of a linear transformation

Definition5.6Characteristic polynomial of a linear map

For a linear transformation T:VVT: V \to V on a finite-dimensional vector space, the characteristic polynomial is defined as pT(λ)=det([T]BλI)p_T(\lambda) = \det([T]_\mathcal{B} - \lambda I) for any basis B\mathcal{B} of VV.

This is well-defined (independent of the basis choice) by the similarity invariance theorem.

ExampleDifferentiation operator on polynomials

Let V={p(x):degp2}V = \{p(x) : \deg p \leq 2\} with basis {1,x,x2}\{1, x, x^2\}, and T=d/dxT = d/dx.

The matrix of TT: T(1)=0T(1) = 0, T(x)=1T(x) = 1, T(x2)=2xT(x^2) = 2x, so [T]=(010002000)[T] = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 2 \\ 0 & 0 & 0 \end{pmatrix}.

Characteristic polynomial: pT(λ)=(λ)3=λ3p_T(\lambda) = (-\lambda)^3 = -\lambda^3. Only eigenvalue: λ=0\lambda = 0 with ma=3m_a = 3. The eigenspace is kerT=span{1}\ker T = \operatorname{span}\{1\}, so mg=1m_g = 1. This confirms TT is nilpotent (T3=0T^3 = 0).

ExampleShift operator

On V=R3V = \mathbb{R}^3, the right shift operator S(a,b,c)=(0,a,b)S(a, b, c) = (0, a, b) has matrix [S]=(000100010)[S] = \begin{pmatrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}.

Characteristic polynomial: λ3-\lambda^3. Eigenvalue λ=0\lambda = 0 with ma=3m_a = 3, mg=dimkerS=1m_g = \dim\ker S = 1 (kernel is span{(0,0,1)}\operatorname{span}\{(0, 0, 1)\}).


Factoring characteristic polynomials

ExampleFactoring over Q using rational root theorem

A=(120012201)A = \begin{pmatrix} 1 & 2 & 0 \\ 0 & 1 & 2 \\ 2 & 0 & 1 \end{pmatrix}.

Characteristic polynomial (monic): λ33λ2+3λ+7\lambda^3 - 3\lambda^2 + 3\lambda + 7... let us compute carefully.

det(λIA)=det(λ1200λ1220λ1)\det(\lambda I - A) = \det \begin{pmatrix} \lambda - 1 & -2 & 0 \\ 0 & \lambda - 1 & -2 \\ -2 & 0 & \lambda - 1 \end{pmatrix}.

Expanding along the first row: (λ1)[(λ1)20]+2[04]=(λ1)38(\lambda - 1)[(\lambda-1)^2 - 0] + 2[0 - 4] = (\lambda-1)^3 - 8.

So p(λ)=(λ1)38p(\lambda) = (\lambda - 1)^3 - 8. Setting u=λ1u = \lambda - 1: u3=8u^3 = 8, so u=2u = 2, giving λ=3\lambda = 3. The other roots (over C\mathbb{C}) are u=2ω,2ω2u = 2\omega, 2\omega^2 where ω=e2πi/3\omega = e^{2\pi i/3}.

Over R\mathbb{R}: only λ=3\lambda = 3 is a real eigenvalue.

ExampleIrreducible quadratic factor

A=(010100002)A = \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 2 \end{pmatrix}.

pA(λ)=det(λ101λ0002λ)=(2λ)(λ2+1)p_A(\lambda) = \det \begin{pmatrix} -\lambda & 1 & 0 \\ -1 & -\lambda & 0 \\ 0 & 0 & 2 - \lambda \end{pmatrix} = (2 - \lambda)(\lambda^2 + 1).

Over R\mathbb{R}: eigenvalue λ=2\lambda = 2 only. Over C\mathbb{C}: eigenvalues 2,i,i2, i, -i.

The factor λ2+1\lambda^2 + 1 is irreducible over R\mathbb{R} but splits as (λi)(λ+i)(\lambda - i)(\lambda + i) over C\mathbb{C}.

ExampleRepeated roots and their meaning

A=(542452222)A = \begin{pmatrix} 5 & 4 & 2 \\ 4 & 5 & 2 \\ 2 & 2 & 2 \end{pmatrix}.

tr(A)=12\operatorname{tr}(A) = 12, and one can compute pA(λ)=λ3+12λ221λ+10p_A(\lambda) = -\lambda^3 + 12\lambda^2 - 21\lambda + 10, or in monic form: λ312λ2+21λ10=(λ1)2(λ10)\lambda^3 - 12\lambda^2 + 21\lambda - 10 = (\lambda - 1)^2(\lambda - 10).

Eigenvalues: λ=1\lambda = 1 (ma=2m_a = 2) and λ=10\lambda = 10 (ma=1m_a = 1).


The discriminant

Definition5.7Discriminant of the characteristic polynomial

For a 2×22 \times 2 matrix AA with pA(λ)=λ2tr(A)λ+det(A)p_A(\lambda) = \lambda^2 - \operatorname{tr}(A)\lambda + \det(A), the discriminant is:

Δ=tr(A)24det(A).\Delta = \operatorname{tr}(A)^2 - 4\det(A).

  • Δ>0\Delta > 0: two distinct real eigenvalues.
  • Δ=0\Delta = 0: one repeated real eigenvalue.
  • Δ<0\Delta < 0: two complex conjugate eigenvalues (no real eigenvalues).
ExampleThree cases of the discriminant
  • A=(3101)A = \begin{pmatrix} 3 & 1 \\ 0 & 1 \end{pmatrix}: Δ=1612=4>0\Delta = 16 - 12 = 4 > 0. Eigenvalues: 3,13, 1.
  • A=(2102)A = \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}: Δ=1616=0\Delta = 16 - 16 = 0. Eigenvalue: 22 (repeated).
  • A=(0110)A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}: Δ=04=4<0\Delta = 0 - 4 = -4 < 0. Eigenvalues: ±i\pm i.

Summary

RemarkRole of the characteristic polynomial

The characteristic polynomial pA(λ)=det(AλI)p_A(\lambda) = \det(A - \lambda I) is the bridge between matrix algebra and polynomial algebra:

  • Its roots are the eigenvalues.
  • Its coefficients encode symmetric functions of eigenvalues (trace, determinant, etc.).
  • It is similarity-invariant, so it belongs to the linear map, not just the matrix.
  • It factors over C\mathbb{C} into linear factors (Fundamental Theorem of Algebra), guaranteeing that every complex matrix has at least one eigenvalue.
  • The Cayley--Hamilton theorem states that pA(A)=0p_A(A) = 0: every matrix satisfies its own characteristic polynomial.