The chain rule is a method to compute the derivative of the functional composition of two or more functions.
If a function depends on a variable , which in turn depends on another variable , that is , then the rate of change of with respect to can be computed as the rate of change of with respect to multiplied by the rate of change of with respect to .
The method is called the "chain rule" because it can be applied sequentially to as many functions as are nested inside one another. For example, if is a function of which is in turn a function of , which is in turn a function of , that is
the derivative of with respect to is given by
- and so on.
A useful mnemonic is to think of the differentials as individual entities that can be canceled algebraically, such as
However, keep in mind that this trick comes about through a clever choice of notation rather than through actual algebraic cancellation.
The chain rule has broad applications in physics, chemistry, and engineering, as well as being used to study related rates in many disciplines. The chain rule can also be generalized to multiple variables in cases where the nested functions depend on more than one variable.
Examples
Example I
Suppose that a mountain climber ascends at a rate of . The temperature is lower at higher elevations; suppose the rate by which it decreases is per kilometer. To calculate the decrease in air temperature per hour that the climber experiences, one multiplies by , to obtain . This calculation is a typical chain rule application.
Example II
Consider the function . It follows from the chain rule that
|
Function to differentiate
|
|
Define as inside function
|
|
Express in terms of
|
|
Express chain rule applicable here
|
|
Substitute in and
|
|
Compute derivatives with power rule
|
|
Substitute back in terms of
|
|
Simplify.
|
Example III
In order to differentiate the trigonometric function
one can write:
|
Function to differentiate
|
|
Define as inside function
|
|
Express in terms of
|
|
Express chain rule applicable here
|
|
Substitute in and
|
|
Evaluate derivatives
|
|
Substitute in terms of .
|
Example IV: absolute value
The chain rule can be used to differentiate , the absolute value function:
|
Function to differentiate
|
|
Equivalent function
|
|
Define as inside function
|
|
Express in terms of
|
|
Express chain rule applicable here
|
|
Substitute in and
|
|
Compute derivatives with power rule
|
|
Substitute back in terms of
|
|
Simplify
|
|
Express as absolute value.
|
Example V: three nested functions
The method is called the "chain rule" because it can be applied sequentially to as many functions as are nested inside one another. For example, if , sequential application of the chain rule yields the derivative as follows (we make use of the fact that , which will be proved in a later section):
|
Original (outermost) function
|
|
Define as innermost function
|
|
as middle function
|
|
Express chain rule applicable here
|
|
Differentiate f(g)
|
|
Differentiate
|
|
Differentiate
|
|
Substitute into chain rule.
|
Proof of the chain rule
Suppose is a function of which is a function of (it is assumed that is differentiable at and , and is differentiable at .
To prove the chain rule we use the definition of the derivative.
We now multiply by and perform some algebraic manipulation.
Note that as approaches , also approaches . So taking the limit as of a function as approaches is the same as taking its limit as approaches . Thus
So we have
Resources
Licensing
Content obtained and/or adapted from: