ConceptComplete

The Chain Rule and Directional Derivatives

The multivariable chain rule generalizes the single-variable version to compositions of multivariable functions, while directional derivatives measure rates of change along arbitrary directions.


The Multivariable Chain Rule

Theorem8.2Multivariable Chain Rule

Let g:RmRn\mathbf{g} : \mathbb{R}^m \to \mathbb{R}^n be differentiable at a\mathbf{a} and f:RnRpf : \mathbb{R}^n \to \mathbb{R}^p be differentiable at g(a)\mathbf{g}(\mathbf{a}). Then the composition fgf \circ \mathbf{g} is differentiable at a\mathbf{a} and D(fg)(a)=Df(g(a))Dg(a)D(f \circ \mathbf{g})(\mathbf{a}) = Df(\mathbf{g}(\mathbf{a})) \circ D\mathbf{g}(\mathbf{a}) In matrix form, this is the product of Jacobian matrices: Jfg(a)=Jf(g(a))Jg(a)J_{f \circ g}(\mathbf{a}) = J_f(\mathbf{g}(\mathbf{a})) \cdot J_g(\mathbf{a}).

ExampleChain rule with parametric curves

If f(x,y)f(x, y) is differentiable and x=x(t)x = x(t), y=y(t)y = y(t) are differentiable functions of tt, then ddtf(x(t),y(t))=fxdxdt+fydydt\frac{d}{dt}f(x(t), y(t)) = \frac{\partial f}{\partial x}\frac{dx}{dt} + \frac{\partial f}{\partial y}\frac{dy}{dt} For instance, if f(x,y)=x2yf(x,y) = x^2 y and x=costx = \cos t, y=sinty = \sin t, then ddtf=2xy(sint)+x2(cost)=2costsin2t+cos3t\frac{d}{dt}f = 2xy(-\sin t) + x^2(\cos t) = -2\cos t \sin^2 t + \cos^3 t.


Directional Derivatives

Definition

The directional derivative of f:RnRf : \mathbb{R}^n \to \mathbb{R} at a\mathbf{a} in the direction of a unit vector u\mathbf{u} is Duf(a)=limh0f(a+hu)f(a)h=f(a)uD_\mathbf{u} f(\mathbf{a}) = \lim_{h \to 0} \frac{f(\mathbf{a} + h\mathbf{u}) - f(\mathbf{a})}{h} = \nabla f(\mathbf{a}) \cdot \mathbf{u} when ff is differentiable at a\mathbf{a}. The maximum value of DufD_\mathbf{u} f is f(a)\|\nabla f(\mathbf{a})\|, achieved when u=f(a)/f(a)\mathbf{u} = \nabla f(\mathbf{a}) / \|\nabla f(\mathbf{a})\|.


The Jacobian Matrix

Definition

For a function f:RnRm\mathbf{f} : \mathbb{R}^n \to \mathbb{R}^m with components f1,,fmf_1, \ldots, f_m, the Jacobian matrix at a\mathbf{a} is Jf(a)=(f1x1f1xnfmx1fmxn)J_\mathbf{f}(\mathbf{a}) = \begin{pmatrix} \frac{\partial f_1}{\partial x_1} & \cdots & \frac{\partial f_1}{\partial x_n} \\ \vdots & \ddots & \vdots \\ \frac{\partial f_m}{\partial x_1} & \cdots & \frac{\partial f_m}{\partial x_n} \end{pmatrix} The Jacobian matrix represents the total derivative Df(a)D\mathbf{f}(\mathbf{a}) as a linear map RnRm\mathbb{R}^n \to \mathbb{R}^m.

RemarkGeometric meaning of the Jacobian

The Jacobian matrix describes how the function f\mathbf{f} locally distorts space near a\mathbf{a}. Its determinant (when m=nm = n) measures the factor by which volumes are scaled, which explains the role of the Jacobian determinant in the change of variables formula for multiple integrals.