All Topics  
Chain rule

 

   Email Print
   Bookmark   Link






 

Chain rule



 
 
In calculus
Calculus

Calculus is a branch of mathematics that includes the study of limit , derivatives, integrals, and infinite series, and constitutes a major part of modern university education....
, the chain rule is a formula
Formula

In mathematics and in the sciences, a formula is a concise way of expressing information symbolically , or a general relationship between quantities....
 for the derivative
Derivative

In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
 of the composite of two functions
Function (mathematics)

The mathematical concept of a function expresses dependence between two quantities, one of which is known and the other which is produced. A function associates a single output to each input element drawn from a fixed Set , such as the real numbers , although different inputs may have the same output....
.

In intuitive terms, if a variable
Variable

A variable is a symbol that stands for a value that may vary; the term usually occurs in opposition to constant, which is a symbol for a non-varying value, i.e....
, y, depends on a second variable, u, which in turn depends on a third variable, x, then the rate of change
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
 of y with respect to x can be computed
Computation

Computation is a general term for any type of information processing. This includes phenomena ranging from human thinking to calculations with a more narrow meaning....
 as the rate of change of y with respect to u multiplied by
Multiplication

Multiplication is the Operation of scaling one number by another. It is one of the four basic operations in elementary arithmetic .Multiplication is defined for Natural number in terms of repeated addition; for example, 4 multiplied by 3 can be calculated by adding 3 copies of 4 together:...
 the rate of change of u with respect to x.






Discussion
Ask a question about 'Chain rule'
Start a new discussion about 'Chain rule'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In calculus
Calculus

Calculus is a branch of mathematics that includes the study of limit , derivatives, integrals, and infinite series, and constitutes a major part of modern university education....
, the chain rule is a formula
Formula

In mathematics and in the sciences, a formula is a concise way of expressing information symbolically , or a general relationship between quantities....
 for the derivative
Derivative

In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
 of the composite of two functions
Function (mathematics)

The mathematical concept of a function expresses dependence between two quantities, one of which is known and the other which is produced. A function associates a single output to each input element drawn from a fixed Set , such as the real numbers , although different inputs may have the same output....
.

In intuitive terms, if a variable
Variable

A variable is a symbol that stands for a value that may vary; the term usually occurs in opposition to constant, which is a symbol for a non-varying value, i.e....
, y, depends on a second variable, u, which in turn depends on a third variable, x, then the rate of change
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
 of y with respect to x can be computed
Computation

Computation is a general term for any type of information processing. This includes phenomena ranging from human thinking to calculations with a more narrow meaning....
 as the rate of change of y with respect to u multiplied by
Multiplication

Multiplication is the Operation of scaling one number by another. It is one of the four basic operations in elementary arithmetic .Multiplication is defined for Natural number in terms of repeated addition; for example, 4 multiplied by 3 can be calculated by adding 3 copies of 4 together:...
 the rate of change of u with respect to x. Schematically,

Informal discussion

For an explanation of notation used in this section, see Function composition
Function composition

In mathematics, a composite function represents the application of one function to the results of another. For instance, the functions and can be composed by first computing a f and then applying a function g to the output of f....
.
The chain rule states that, under appropriate conditions,

which in short form is written as

Alternatively, in the Leibniz notation
Leibniz notation

In calculus, Leibniz's notation, named in honor of the 17th-century Germany philosophy and mathematics Gottfried Leibniz, was originally the use of expressions such as dx and dy and to represent "infinitely small" increments of quantities x and y, just as ?x and ?y represent finite increments of x and y respe...
, the chain rule is

In integration
Integral

Integration is an important concept in mathematics, specifically in the field of calculus and, more broadly, mathematical analysis. Given a function ƒ of a Real number variable x and an interval [ab] of the real line, the integral...
, the counterpart to the chain rule is the substitution rule.

Theorem

The chain rule in one variable may be stated more completely as follows. Let g be a real-valued function on (a,b) which is differentiable
Derivative

In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
 at c ∈ (a,b); and f a real-valued function defined on an interval I containing the range of g and g(c) as an interior point. If f is differentiable at g(c), then
  • is differentiable at x = c, and

Examples


Example I

Suppose that a mountain climber ascends at a rate of 0.5 kilometers per hour. The temperature
Temperature

In physics, temperature is a physical property of a Physical system that underlies the common notions of hot and cold; something that feels hotter generally has the greater temperature....
 is lower at higher elevations; suppose the rate by which it decreases is 6 °C per kilometer. To calculate the decrease in air temperature per unit time that the climber experiences, one multiplies 6 °C per kilometer by 0.5 kilometer per hour, to obtain 3 °C per hour. This calculation is a typical chain rule application.

Example II

Consider the function f(x) = (x2 + 1)3. Since f(x) = h(g(x)) where g(x) = U = x2 + 1 and h(U) = U3 it follows from the chain rule that

    
  


In order to differentiate the trigonometric function one can write f(x) = h(g(x)) with h(x) = sin x and g(x) = x2. The chain rule then yields since h′(g(x)) = cos(x2) and g′(x) = 2x.

Example III

Differentiate arctan(sin x).

Thus, by the chain rule,

and in particular,

Chain rule for several variables

The chain rule works for functions of more than one variable. Consider the function z = f(x, y) where x = g(t) and y = h(t), and g(t) and h(t) are differentiable with respect to t, then

Suppose that each argument of z = f(u, v) is a two-variable function such that u = h(x, y) and v = g(x, y), and that these functions are all differentiable. Then the chain rule would look like:

If we considered above as a vector function, we can use vector notation to write the above equivalently as the dot product
Dot product

In mathematics, the dot product, also known as the scalar product, is an operation which takes two vector over the real numbers R and returns a real-valued scalar quantity....
 of the gradient
Gradient

In vector calculus, the gradient of a scalar field is a vector field which points in the direction of the greatest rate of increase of the scalar field, and whose magnitude is the greatest rate of change....
 of f and a derivative of :

More generally, for functions of vectors to vectors, the chain rule says that the Jacobian
Jacobian

In vector calculus, the Jacobian is shorthand for either the Jacobian matrix or its determinant, the Jacobian determinant.In algebraic geometry the Jacobian of a algebraic curve means the Jacobian variety: a group variety associated to the curve, in which the curve can be embedded....
 matrix of a composite function is the product of the Jacobian matrices of the two functions:

Proof of the chain rule

Let f and g be functions and let x be a number such that f is differentiable at g(x) and g is differentiable at x. Then by the definition of differentiability,

where ε(δ) → 0 as δ → 0. Similarly, where η(α) → 0 as α → 0. Define also that

Now

  
  


where Observe that as δ → 0, αδ / δg′(x) and αδ → 0, and thus η(αδ) → 0. It follows that

The fundamental chain rule

The chain rule is a fundamental property of all definitions of derivative and is therefore valid in much more general contexts. For instance, if E, F and G are Banach space
Banach space

In mathematics, Banach spaces are one of the central objects of study in functional analysis. They are topological vector spaces that have many interesting properties associated with them....
s (which includes Euclidean space
Euclidean space

Around 300 Before Christ, the Ancient Greece mathematician Euclid undertook a study of relationships among distances and angles, first in a plane and then in space....
) and f : EF and g : FG are functions, and if x is an element of E such that f is differentiable at x and g is differentiable at f(x), then the derivative (the Fréchet derivative
Fréchet derivative

In mathematics, the Fr?chet derivative is a derivative defined on Banach spaces. Named after Maurice Fr?chet, it is commonly used to formalize the concept of the functional derivative used widely in mathematical analysis, especially functional analysis....
) of the composition g o f at the point x is given by

Note that the derivatives here are linear maps
Linear transformation

In mathematics, a linear map is a function between two vector spaces that preserves the operations of vector addition and scalar multiplication....
 and not numbers. If the linear maps are represented as matrices
Matrix (mathematics)

In mathematics, a matrix is a rectangular array of numbers, as shown at the right. In addition to a number of elementary, entrywise operations such as matrix addition a key notion is matrix multiplication....
 (namely Jacobian
Jacobian

In vector calculus, the Jacobian is shorthand for either the Jacobian matrix or its determinant, the Jacobian determinant.In algebraic geometry the Jacobian of a algebraic curve means the Jacobian variety: a group variety associated to the curve, in which the curve can be embedded....
s), the composition on the right hand side turns into a matrix multiplication.

A particularly clear formulation of the chain rule can be achieved in the most general setting: let M, N and P be Ck manifold
Manifold

In mathematics, more specifically topology, a manifold is a topological space in which every point has a neighborhood which "resembles" Euclidean space....
s (or even Banach-manifolds) and let

f : MN and g : NP


be differentiable maps. The derivative of f, denoted by df, is then a map from the tangent bundle
Tangent bundle

In mathematics, the tangent bundle of a differentiable manifold M, denoted by T or just TM, is the disjoint unionThe disjoint union assures that for any two points x1 and x2 of manifold M the tangent spaces T1 and T2 have no common vector....
 of M to the tangent bundle of N, and we may write

In this way, the formation of derivatives and tangent bundles is seen as a functor
Functor

In category theory, a branch of mathematics, a functor is a special type of mapping between categories. Functors can be thought of as morphisms in the category of small categories....
 on the category
Category theory

In mathematics, category theory deals in an abstract way with mathematical structures and relationships between them: it abstracts from set s and function s to objects linked in diagrams by morphisms or arrows....
 of C manifolds with C maps as morphisms.

Tensors and the chain rule

See tensor field
Tensor field

In mathematics, physics and engineering, a tensor field is a very general concept of variable geometric quantity. It is used in differential geometry and the theory of manifolds, in algebraic geometry, in general relativity, in the analysis of stress and strain tensor in materials, and in numerous applications in the physical sciences and en...
 for an advanced explanation of the fundamental role the chain rule plays in the geometric nature of tensor
Tensor

A tensor is an object which extends the notion of Scalar , Vector , and Matrix . The term has slightly different meanings in mathematics and physics....
s.

Higher derivatives

Faŕ di Bruno's formula
Faŕ di Bruno's formula

Fa? di Bruno's formula is an identity in mathematics generalizing the chain rule to higher derivatives, named in honor of Francesco Fa? di Bruno , who was a military officer, a mathematician, and a priest, and was beatification by the Pope a century after his death....
 generalizes the chain rule to higher derivatives. The first few derivatives are

See also

  • Inverse chain rule
  • Triple product rule
    Triple product rule

    The triple product rule, known variously as the cyclic chain rule, cyclic relation, or Leonhard Euler chain rule, is a formula which relates partial derivative of three interdependent variables....
  • Derivative
    Derivative

    In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
  • Leibniz integral rule
    Leibniz integral rule

    In mathematics, Leibniz's rule for differentiation under the integral sign, named after Gottfried Leibniz, tells us that if we have an integral of the form...
  • Leibniz rule (generalized product rule)
    Leibniz rule (generalized product rule)

    In calculus, the Leibniz rule, named after Gottfried Leibniz, generalizes the product rule. It states that if f and g are n-times differentiable functions, then the nth derivative of the product fg is given by...


External links