Multivariable Chain Rule
The Chain Rule for multivariable functions states that the derivative of a composition is the composition of derivatives (matrix multiplication of Jacobians). This fundamental result enables computing derivatives of complex functions and is essential for differential geometry, machine learning (backpropagation), and optimization.
Statement
Theorem9.1Chain Rule
Let be differentiable at and be differentiable at . Then is differentiable at , and
In terms of Jacobian matrices,
Applications
ExampleChain rule in polar coordinates
Let and . Then
Similarly, .
ExampleBackpropagation in neural networks
Neural networks compute . The chain rule gives
the basis for backpropagation (gradient descent training).
Summary
The multivariable chain rule:
- Derivative of composition = composition of derivatives (matrix multiplication).
- Essential for computing derivatives in complex systems.
- Applications: coordinate transformations, optimization, machine learning.
See Total Derivative and Inverse Function Theorem.