TheoremComplete

Inverse Function Theorem (Statement and Proof Sketch)

The Inverse Function Theorem is proved using the Banach Fixed-Point Theorem. The key idea: solving f(x)=yf(\mathbf{x}) = \mathbf{y} is equivalent to finding a fixed point of an auxiliary map T(x)=x+A1(yf(x))T(\mathbf{x}) = \mathbf{x} + A^{-1}(\mathbf{y} - f(\mathbf{x})), where A=Jf(a)A = Jf(\mathbf{a}). Near a\mathbf{a}, this map is a contraction, guaranteeing a unique solution.


Statement

Theorem10.1Inverse Function Theorem

Let f:URnRnf : U \subseteq \mathbb{R}^n \to \mathbb{R}^n be continuously differentiable on an open set UU, and let aU\mathbf{a} \in U with detJf(a)0\det Jf(\mathbf{a}) \neq 0. Then there exist neighborhoods VV of a\mathbf{a} and WW of f(a)f(\mathbf{a}) such that:

  1. f:VWf : V \to W is a bijection.
  2. g=f1:WVg = f^{-1} : W \to V is continuously differentiable.
  3. Jg(y)=[Jf(g(y))]1Jg(\mathbf{y}) = [Jf(g(\mathbf{y}))]^{-1} for yW\mathbf{y} \in W.

Proof Sketch

Proof

Step 1: Setup. Let A=Jf(a)A = Jf(\mathbf{a}), which is invertible. For y\mathbf{y} near f(a)f(\mathbf{a}), define

Ty(x)=x+A1(yf(x)).T_{\mathbf{y}}(\mathbf{x}) = \mathbf{x} + A^{-1}(\mathbf{y} - f(\mathbf{x})).

Note that f(x)=yf(\mathbf{x}) = \mathbf{y} iff Ty(x)=xT_{\mathbf{y}}(\mathbf{x}) = \mathbf{x} (i.e., x\mathbf{x} is a fixed point of TyT_{\mathbf{y}}).

Step 2: TyT_{\mathbf{y}} is a contraction near a\mathbf{a}. The derivative of TyT_{\mathbf{y}} is

DTy(x)=IA1Jf(x).DT_{\mathbf{y}}(\mathbf{x}) = I - A^{-1} Jf(\mathbf{x}).

At x=a\mathbf{x} = \mathbf{a}, DTy(a)=IA1A=0DT_{\mathbf{y}}(\mathbf{a}) = I - A^{-1} A = 0. By continuity of JfJf, for x\mathbf{x} near a\mathbf{a},

DTy(x)<1/2.\|DT_{\mathbf{y}}(\mathbf{x})\| < 1/2.

By the Mean Value Theorem, TyT_{\mathbf{y}} is a contraction on a ball around a\mathbf{a} with contraction constant λ<1\lambda < 1.

Step 3: Existence and uniqueness of inverse. By the Banach Fixed-Point Theorem, TyT_{\mathbf{y}} has a unique fixed point x\mathbf{x} in a neighborhood VV of a\mathbf{a}. Define g(y)=xg(\mathbf{y}) = \mathbf{x}. Then f(g(y))=yf(g(\mathbf{y})) = \mathbf{y}, so g=f1g = f^{-1}.

Step 4: gg is continuously differentiable. Differentiating f(g(y))=yf(g(\mathbf{y})) = \mathbf{y} using the chain rule,

Jf(g(y))Jg(y)=I,Jf(g(\mathbf{y})) \cdot Jg(\mathbf{y}) = I,

so Jg(y)=[Jf(g(y))]1Jg(\mathbf{y}) = [Jf(g(\mathbf{y}))]^{-1}. Since JfJf is continuous and gg is continuous, JgJg is continuous.


Summary

The Inverse Function Theorem:

  • Proved via Banach Fixed-Point Theorem (contraction mapping).
  • Local invertibility when Jacobian is nonsingular.
  • Inverse is smooth with derivative (f1)=(f)1(f^{-1})' = (f')^{-1}.

See Inverse Function Theorem concept and Implicit Function Theorem.