ProofComplete

Proof of the Rank-Nullity Theorem

We prove: if T:VWT : V \to W is linear and VV is finite-dimensional, then dimV=rank(T)+nullity(T)\dim V = \text{rank}(T) + \text{nullity}(T).


Main proof

Proof

Let dimV=n\dim V = n, nullity(T)=k\text{nullity}(T) = k, and let {u1,,uk}\{u_1, \ldots, u_k\} be a basis for ker(T)\ker(T).

By the basis extension theorem, extend to a basis for VV:

{u1,,uk,w1,,wnk}\{u_1, \ldots, u_k, w_1, \ldots, w_{n-k}\}

is a basis for VV, where nk=dimVnullity(T)n - k = \dim V - \text{nullity}(T).

Claim: {T(w1),,T(wnk)}\{T(w_1), \ldots, T(w_{n-k})\} is a basis for im(T)\text{im}(T).

Spanning: Let yim(T)y \in \text{im}(T). Then y=T(v)y = T(v) for some vVv \in V. Write v=a1u1++akuk+b1w1++bnkwnkv = a_1 u_1 + \cdots + a_k u_k + b_1 w_1 + \cdots + b_{n-k} w_{n-k}. Then

y=T(v)=a1T(u1)++akT(uk)+b1T(w1)++bnkT(wnk).y = T(v) = a_1 T(u_1) + \cdots + a_k T(u_k) + b_1 T(w_1) + \cdots + b_{n-k} T(w_{n-k}).

Since uiker(T)u_i \in \ker(T), T(ui)=0T(u_i) = 0. So y=b1T(w1)++bnkT(wnk)y = b_1 T(w_1) + \cdots + b_{n-k} T(w_{n-k}).

Independence: Suppose c1T(w1)++cnkT(wnk)=0c_1 T(w_1) + \cdots + c_{n-k} T(w_{n-k}) = 0. Then

T(c1w1++cnkwnk)=0,T(c_1 w_1 + \cdots + c_{n-k} w_{n-k}) = 0,

so c1w1++cnkwnkker(T)c_1 w_1 + \cdots + c_{n-k} w_{n-k} \in \ker(T). Write this as a linear combination of the kernel basis:

c1w1++cnkwnk=d1u1++dkuk.c_1 w_1 + \cdots + c_{n-k} w_{n-k} = d_1 u_1 + \cdots + d_k u_k.

Rearranging: d1u1++dkukc1w1cnkwnk=0d_1 u_1 + \cdots + d_k u_k - c_1 w_1 - \cdots - c_{n-k} w_{n-k} = 0. Since {u1,,uk,w1,,wnk}\{u_1, \ldots, u_k, w_1, \ldots, w_{n-k}\} is a basis for VV (hence linearly independent), all coefficients are zero. In particular, c1==cnk=0c_1 = \cdots = c_{n-k} = 0.

Therefore {T(w1),,T(wnk)}\{T(w_1), \ldots, T(w_{n-k})\} is a basis for im(T)\text{im}(T), so

rank(T)=nk=dimVnullity(T).\text{rank}(T) = n - k = \dim V - \text{nullity}(T).

Rearranging: dimV=rank(T)+nullity(T)\dim V = \text{rank}(T) + \text{nullity}(T).


Worked examples illustrating the proof

ExampleTrace: concrete illustration

T=tr:M2×2(R)RT = \text{tr} : M_{2 \times 2}(\mathbb{R}) \to \mathbb{R}, dimV=4\dim V = 4.

Step 1: ker(T)={Aa11+a22=0}\ker(T) = \{A \mid a_{11} + a_{22} = 0\}. Basis for kernel:

u1=(1001),  u2=(0100),  u3=(0010).u_1 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix},\; u_2 = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix},\; u_3 = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}.

So k=3k = 3.

Step 2: Extend to a basis of M2×2(R)M_{2 \times 2}(\mathbb{R}) by adding w1=(1000)w_1 = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}.

Step 3: {T(w1)}={1}\{T(w_1)\} = \{1\} is a basis for im(T)=R\text{im}(T) = \mathbb{R}.

So rank =1=43= 1 = 4 - 3. Check: 4=1+34 = 1 + 3.

ExampleDerivative: concrete illustration

D:P2(R)P1(R)D : P_2(\mathbb{R}) \to P_1(\mathbb{R}), dimV=3\dim V = 3.

ker(D)=span{1}\ker(D) = \text{span}\{1\}, k=1k = 1.

Extend: {1,x,x2}\{1, x, x^2\} is a basis for P2(R)P_2(\mathbb{R}), with u1=1u_1 = 1, w1=xw_1 = x, w2=x2w_2 = x^2.

{D(x),D(x2)}={1,2x}\{D(x), D(x^2)\} = \{1, 2x\} is a basis for P1(R)=im(D)P_1(\mathbb{R}) = \text{im}(D).

rank =2=31= 2 = 3 - 1. Check: 3=2+13 = 2 + 1.

ExampleProjection: concrete illustration

T:R3R2T : \mathbb{R}^3 \to \mathbb{R}^2, T(x,y,z)=(x,y)T(x, y, z) = (x, y), dimV=3\dim V = 3.

ker(T)=span{e3}\ker(T) = \text{span}\{e_3\}, k=1k = 1.

Extend: {e3,e1,e2}\{e_3, e_1, e_2\} is a basis for R3\mathbb{R}^3.

{T(e1),T(e2)}={(1,0),(0,1)}\{T(e_1), T(e_2)\} = \{(1,0), (0,1)\} is a basis for R2=im(T)\mathbb{R}^2 = \text{im}(T).

rank =2=31= 2 = 3 - 1. Check: 3=2+13 = 2 + 1.

ExampleInjection: trivial kernel

T:R2R3T : \mathbb{R}^2 \to \mathbb{R}^3, T(x,y)=(x,y,0)T(x, y) = (x, y, 0), dimV=2\dim V = 2.

ker(T)={0}\ker(T) = \{0\}, k=0k = 0. No kernel basis to extend.

{T(e1),T(e2)}={(1,0,0),(0,1,0)}\{T(e_1), T(e_2)\} = \{(1,0,0), (0,1,0)\} is a basis for im(T)\text{im}(T).

rank =2=20= 2 = 2 - 0. Check: 2=2+02 = 2 + 0.

ExampleMatrix with nontrivial kernel

A=(1224)A = \begin{pmatrix} 1 & 2 \\ 2 & 4 \end{pmatrix}, TA:R2R2T_A : \mathbb{R}^2 \to \mathbb{R}^2, dimV=2\dim V = 2.

ker(TA)=span{(2,1)}\ker(T_A) = \text{span}\{(-2, 1)\}, k=1k = 1.

Extend: {(2,1),(1,0)}\{(-2, 1), (1, 0)\} is a basis for R2\mathbb{R}^2.

{TA(1,0)}={(1,2)}\{T_A(1, 0)\} = \{(1, 2)\} is a basis for im(TA)=span{(1,2)}\text{im}(T_A) = \text{span}\{(1, 2)\} (the column space).

rank =1=21= 1 = 2 - 1. Check: 2=1+12 = 1 + 1.

ExampleStructure of the proof

The proof strategy is beautifully simple:

  1. Find a basis for ker(T)\ker(T) (size kk).
  2. Extend to a basis for all of VV (total size nn, added nkn - k vectors).
  3. Show the images of the added vectors form a basis for im(T)\text{im}(T).

The kernel vectors contribute nothing to the image (they map to 00), while the remaining vectors map to an independent spanning set.

ExampleShort exact sequence viewpoint

The Rank-Nullity Theorem says the sequence

0ker(T)ιVTim(T)00 \to \ker(T) \xrightarrow{\iota} V \xrightarrow{T} \text{im}(T) \to 0

is a short exact sequence of vector spaces. The dimension formula dimV=dimker(T)+dimim(T)\dim V = \dim \ker(T) + \dim \text{im}(T) is the statement that dimensions are additive across short exact sequences. This generalizes to the theory of exact sequences in abelian categories.

ExampleQuotient space viewpoint

The first isomorphism theorem gives V/ker(T)im(T)V/\ker(T) \cong \text{im}(T). Therefore:

dimV=dimker(T)+dim(V/ker(T))=dimker(T)+dimim(T).\dim V = \dim \ker(T) + \dim(V/\ker(T)) = \dim \ker(T) + \dim \text{im}(T).

This is perhaps the most conceptual proof of the Rank-Nullity Theorem.

ExampleApplication: existence of solutions

Does T(x,y,z)=(x+y,y+z,x+z)T(x, y, z) = (x + y, y + z, x + z) satisfy T(v)=(1,2,3)T(v) = (1, 2, 3) for some vv?

rank(T)\text{rank}(T): the matrix is (110011101)\begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 1 \\ 1 & 0 & 1 \end{pmatrix} with det=20\det = 2 \neq 0. So rank =3=dimR3= 3 = \dim \mathbb{R}^3, meaning TT is surjective. Yes, a solution exists (and it is unique since nullity =0= 0).

ExampleRow rank equals column rank

For a matrix AMm×n(F)A \in M_{m \times n}(F), the Rank-Nullity Theorem applied to TAT_A gives rank(A)=n(A) = n - nullity(A)(A). Here rank(A)(A) is the column rank. A separate argument shows row rank = column rank, giving a single notion of "rank."

ExampleLinear algebra pigeonhole

If T:F5F3T : F^5 \to F^3 is linear, then nullity(T)53=2(T) \geq 5 - 3 = 2. So ker(T)\ker(T) is at least 2-dimensional. This "pigeonhole principle for linear maps" is a consequence of Rank-Nullity.

ExampleSummary for linear systems

For Ax=bAx = b with AMm×n(F)A \in M_{m \times n}(F):

  • Existence: bC(A)b \in C(A) iff rank(A)=(A) = rank(Ab)(A | b).
  • Uniqueness: the solution is unique iff nullity(A)=0(A) = 0, i.e., rank(A)=n(A) = n.
  • Number of free variables: nn - rank(A)(A).

RemarkSummary

The Rank-Nullity Theorem is the fundamental balance equation of linear algebra: the "input dimension" splits into "information preserved" (rank) plus "information lost" (nullity). This principle permeates all of linear algebra and extends to exact sequences in homological algebra.