Chain Rule

From Department of Mathematics at UTSA
Jump to navigation Jump to search

The chain rule is a method to compute the derivative of the functional composition of two or more functions.

If a function depends on a variable , which in turn depends on another variable , that is , then the rate of change of with respect to can be computed as the rate of change of with respect to multiplied by the rate of change of with respect to .

Chain Rule

If a function is composed to two differentiable functions and , so that , then is differentiable and,

The method is called the "chain rule" because it can be applied sequentially to as many functions as are nested inside one another. For example, if is a function of which is in turn a function of , which is in turn a function of , that is

the derivative of with respect to is given by

and so on.

A useful mnemonic is to think of the differentials as individual entities that can be canceled algebraically, such as

However, keep in mind that this trick comes about through a clever choice of notation rather than through actual algebraic cancellation.

The chain rule has broad applications in physics, chemistry, and engineering, as well as being used to study related rates in many disciplines. The chain rule can also be generalized to multiple variables in cases where the nested functions depend on more than one variable.

Examples

Example I

Suppose that a mountain climber ascends at a rate of . The temperature is lower at higher elevations; suppose the rate by which it decreases is per kilometer. To calculate the decrease in air temperature per hour that the climber experiences, one multiplies by , to obtain . This calculation is a typical chain rule application.

Example II

Consider the function . It follows from the chain rule that

Function to differentiate
Define as inside function
Express in terms of
Express chain rule applicable here
Substitute in and
Compute derivatives with power rule
Substitute back in terms of
Simplify.

Example III

In order to differentiate the trigonometric function

one can write:

Function to differentiate
Define as inside function
Express in terms of
Express chain rule applicable here
Substitute in and
Evaluate derivatives
Substitute in terms of .

Example IV: absolute value

The chain rule can be used to differentiate , the absolute value function:

Function to differentiate
Equivalent function
Define as inside function
Express in terms of
Express chain rule applicable here
Substitute in and
Compute derivatives with power rule
Substitute back in terms of
Simplify
Express as absolute value.

Example V: three nested functions

The method is called the "chain rule" because it can be applied sequentially to as many functions as are nested inside one another. For example, if , sequential application of the chain rule yields the derivative as follows (we make use of the fact that , which will be proved in a later section):

Original (outermost) function
Define as innermost function
as middle function
Express chain rule applicable here
Differentiate f(g)
Differentiate
Differentiate
Substitute into chain rule.

Proof of the chain rule

Suppose is a function of which is a function of (it is assumed that is differentiable at and , and is differentiable at . To prove the chain rule we use the definition of the derivative.

We now multiply by and perform some algebraic manipulation.

Note that as approaches , also approaches . So taking the limit as of a function as approaches is the same as taking its limit as approaches . Thus

So we have


Resources

Licensing

Content obtained and/or adapted from: