![Total derivative](https://www.english.nina.az/image-resize/1600/900/web/wikipedia.jpg)
This article includes a list of general references, but it lacks sufficient corresponding inline citations.(July 2013) |
In mathematics, the total derivative of a function f at a point is the best linear approximation near this point of the function with respect to its arguments. Unlike partial derivatives, the total derivative approximates the function with respect to all of its arguments, not just a single one. In many situations, this is the same as considering all partial derivatives simultaneously. The term "total derivative" is primarily used when f is a function of several variables, because when f is a function of a single variable, the total derivative is the same as the ordinary derivative of the function.: 198–203
The total derivative as a linear map
Let be an open subset. Then a function
is said to be (totally) differentiable at a point
if there exists a linear transformation
such that
The linear map is called the (total) derivative or (total) differential of
at
. Other notations for the total derivative include
and
. A function is (totally) differentiable if its total derivative exists at every point in its domain.
Conceptually, the definition of the total derivative expresses the idea that is the best linear approximation to
at the point
. This can be made precise by quantifying the error in the linear approximation determined by
. To do so, write
where equals the error in the approximation. To say that the derivative of
at
is
is equivalent to the statement
where is little-o notation and indicates that
is much smaller than
as
. The total derivative
is the unique linear transformation for which the error term is this small, and this is the sense in which it is the best linear approximation to
.
The function is differentiable if and only if each of its components
is differentiable, so when studying total derivatives, it is often possible to work one coordinate at a time in the codomain. However, the same is not true of the coordinates in the domain. It is true that if
is differentiable at
, then each partial derivative
exists at
. The converse does not hold: it can happen that all of the partial derivatives of
at
exist, but
is not differentiable at
. This means that the function is very "rough" at
, to such an extreme that its behavior cannot be adequately described by its behavior in the coordinate directions. When
is not so rough, this cannot happen. More precisely, if all the partial derivatives of
at
exist and are continuous in a neighborhood of
, then
is differentiable at
. When this happens, then in addition, the total derivative of
is the linear transformation corresponding to the Jacobian matrix of partial derivatives at that point.
The total derivative as a differential form
When the function under consideration is real-valued, the total derivative can be recast using differential forms. For example, suppose that is a differentiable function of variables
. The total derivative of
at
may be written in terms of its Jacobian matrix, which in this instance is a row matrix:
The linear approximation property of the total derivative implies that if
is a small vector (where the denotes transpose, so that this vector is a column vector), then
Heuristically, this suggests that if are infinitesimal increments in the coordinate directions, then
In fact, the notion of the infinitesimal, which is merely symbolic here, can be equipped with extensive mathematical structure. Techniques, such as the theory of differential forms, effectively give analytical and algebraic descriptions of objects like infinitesimal increments, . For instance,
may be inscribed as a linear functional on the vector space
. Evaluating
at a vector
in
measures how much
"points" in the
th coordinate direction. The total derivative
is a linear combination of linear functionals and hence is itself a linear functional. The evaluation
measures how much
points in the direction determined by
at
, and this direction is the gradient. This point of view makes the total derivative an instance of the exterior derivative.
Suppose now that is a vector-valued function, that is,
. In this case, the components
of
are real-valued functions, so they have associated differential forms
. The total derivative
amalgamates these forms into a single object and is therefore an instance of a vector-valued differential form.
The chain rule for total derivatives
The chain rule has a particularly elegant statement in terms of total derivatives. It says that, for two functions and
, the total derivative of the composite function
at
satisfies
If the total derivatives of and
are identified with their Jacobian matrices, then the composite on the right-hand side is simply matrix multiplication. This is enormously useful in applications, as it makes it possible to account for essentially arbitrary dependencies among the arguments of a composite function.
Example: Differentiation with direct dependencies
Suppose that f is a function of two variables, x and y. If these two variables are independent, so that the domain of f is , then the behavior of f may be understood in terms of its partial derivatives in the x and y directions. However, in some situations, x and y may be dependent. For example, it might happen that f is constrained to a curve
. In this case, we are actually interested in the behavior of the composite function
. The partial derivative of f with respect to x does not give the true rate of change of f with respect to changing x because changing x necessarily changes y. However, the chain rule for the total derivative takes such dependencies into account. Write
. Then, the chain rule says
By expressing the total derivative using Jacobian matrices, this becomes:
Suppressing the evaluation at for legibility, we may also write this as
This gives a straightforward formula for the derivative of in terms of the partial derivatives of
and the derivative of
.
For example, suppose
The rate of change of f with respect to x is usually the partial derivative of f with respect to x; in this case,
However, if y depends on x, the partial derivative does not give the true rate of change of f as x changes because the partial derivative assumes that y is fixed. Suppose we are constrained to the line
Then
and the total derivative of f with respect to x is
which we see is not equal to the partial derivative . Instead of immediately substituting for y in terms of x, however, we can also use the chain rule as above:
Example: Differentiation with indirect dependencies
While one can often perform substitutions to eliminate indirect dependencies, the chain rule provides for a more efficient and general technique. Suppose is a function of time
and
variables
which themselves depend on time. Then, the time derivative of
is
The chain rule expresses this derivative in terms of the partial derivatives of and the time derivatives of the functions
:
This expression is often used in physics for a gauge transformation of the Lagrangian, as two Lagrangians that differ only by the total time derivative of a function of time and the generalized coordinates lead to the same equations of motion. An interesting example concerns the resolution of causality concerning the Wheeler–Feynman time-symmetric theory. The operator in brackets (in the final expression above) is also called the total derivative operator (with respect to
).
For example, the total derivative of is
Here there is no term since
itself does not depend on the independent variable
directly.
Total differential equation
A total differential equation is a differential equation expressed in terms of total derivatives. Since the exterior derivative is coordinate-free, in a sense that can be given a technical meaning, such equations are intrinsic and geometric.
Application to equation systems
In economics, it is common for the total derivative to arise in the context of a system of equations.: pp. 217–220 For example, a simple supply-demand system might specify the quantity q of a product demanded as a function D of its price p and consumers' income I, the latter being an exogenous variable, and might specify the quantity supplied by producers as a function S of its price and two exogenous resource cost variables r and w. The resulting system of equations
determines the market equilibrium values of the variables p and q. The total derivative of p with respect to r, for example, gives the sign and magnitude of the reaction of the market price to the exogenous variable r. In the indicated system, there are a total of six possible total derivatives, also known in this context as comparative static derivatives: dp / dr, dp / dw, dp / dI, dq / dr, dq / dw, and dq / dI. The total derivatives are found by totally differentiating the system of equations, dividing through by, say dr, treating dq / dr and dp / dr as the unknowns, setting dI = dw = 0, and solving the two totally differentiated equations simultaneously, typically by using Cramer's rule.
See also
- Directional derivative – Instantaneous rate of change of the function
- Fréchet derivative – Derivative defined on normed spaces - generalization of the total derivative
- Gateaux derivative – Generalization of the concept of directional derivative
- Generalizations of the derivative – Fundamental construction of differential calculus
- Gradient#Total derivative – Multivariate derivative (mathematics)
References
- A. D. Polyanin and V. F. Zaitsev, Handbook of Exact Solutions for Ordinary Differential Equations (2nd edition), Chapman & Hall/CRC Press, Boca Raton, 2003. ISBN 1-58488-297-2
- From thesaurus.maths.org total derivative
External links
- Weisstein, Eric W. "Total Derivative". MathWorld.
- Ronald D. Kriz (2007) Envisioning total derivatives of scalar functions of two dimensions using raised surfaces and tangent planes from Virginia Tech
This article includes a list of general references but it lacks sufficient corresponding inline citations Please help to improve this article by introducing more precise citations July 2013 Learn how and when to remove this message In mathematics the total derivative of a function f at a point is the best linear approximation near this point of the function with respect to its arguments Unlike partial derivatives the total derivative approximates the function with respect to all of its arguments not just a single one In many situations this is the same as considering all partial derivatives simultaneously The term total derivative is primarily used when f is a function of several variables because when f is a function of a single variable the total derivative is the same as the ordinary derivative of the function 198 203 The total derivative as a linear mapLet U Rn displaystyle U subseteq mathbb R n be an open subset Then a function f U Rm displaystyle f U to mathbb R m is said to be totally differentiable at a point a U displaystyle a in U if there exists a linear transformation dfa Rn Rm displaystyle df a mathbb R n to mathbb R m such that limx a f x f a dfa x a x a 0 displaystyle lim x to a frac f x f a df a x a x a 0 The linear map dfa displaystyle df a is called the total derivative or total differential of f displaystyle f at a displaystyle a Other notations for the total derivative include Daf displaystyle D a f and Df a displaystyle Df a A function is totally differentiable if its total derivative exists at every point in its domain Conceptually the definition of the total derivative expresses the idea that dfa displaystyle df a is the best linear approximation to f displaystyle f at the point a displaystyle a This can be made precise by quantifying the error in the linear approximation determined by dfa displaystyle df a To do so write f a h f a dfa h e h displaystyle f a h f a df a h varepsilon h where e h displaystyle varepsilon h equals the error in the approximation To say that the derivative of f displaystyle f at a displaystyle a is dfa displaystyle df a is equivalent to the statement e h o h displaystyle varepsilon h o lVert h rVert where o displaystyle o is little o notation and indicates that e h displaystyle varepsilon h is much smaller than h displaystyle lVert h rVert as h 0 displaystyle h to 0 The total derivative dfa displaystyle df a is the unique linear transformation for which the error term is this small and this is the sense in which it is the best linear approximation to f displaystyle f The function f displaystyle f is differentiable if and only if each of its components fi U R displaystyle f i colon U to mathbb R is differentiable so when studying total derivatives it is often possible to work one coordinate at a time in the codomain However the same is not true of the coordinates in the domain It is true that if f displaystyle f is differentiable at a displaystyle a then each partial derivative f xi displaystyle partial f partial x i exists at a displaystyle a The converse does not hold it can happen that all of the partial derivatives of f displaystyle f at a displaystyle a exist but f displaystyle f is not differentiable at a displaystyle a This means that the function is very rough at a displaystyle a to such an extreme that its behavior cannot be adequately described by its behavior in the coordinate directions When f displaystyle f is not so rough this cannot happen More precisely if all the partial derivatives of f displaystyle f at a displaystyle a exist and are continuous in a neighborhood of a displaystyle a then f displaystyle f is differentiable at a displaystyle a When this happens then in addition the total derivative of f displaystyle f is the linear transformation corresponding to the Jacobian matrix of partial derivatives at that point The total derivative as a differential formWhen the function under consideration is real valued the total derivative can be recast using differential forms For example suppose that f Rn R displaystyle f colon mathbb R n to mathbb R is a differentiable function of variables x1 xn displaystyle x 1 ldots x n The total derivative of f displaystyle f at a displaystyle a may be written in terms of its Jacobian matrix which in this instance is a row matrix Dfa f x1 a f xn a displaystyle Df a begin bmatrix frac partial f partial x 1 a amp cdots amp frac partial f partial x n a end bmatrix The linear approximation property of the total derivative implies that if Dx Dx1 Dxn T displaystyle Delta x begin bmatrix Delta x 1 amp cdots amp Delta x n end bmatrix mathsf T is a small vector where the T displaystyle mathsf T denotes transpose so that this vector is a column vector then f a Dx f a Dfa Dx i 1n f xi a Dxi displaystyle f a Delta x f a approx Df a cdot Delta x sum i 1 n frac partial f partial x i a cdot Delta x i Heuristically this suggests that if dx1 dxn displaystyle dx 1 ldots dx n are infinitesimal increments in the coordinate directions then dfa i 1n f xi a dxi displaystyle df a sum i 1 n frac partial f partial x i a cdot dx i In fact the notion of the infinitesimal which is merely symbolic here can be equipped with extensive mathematical structure Techniques such as the theory of differential forms effectively give analytical and algebraic descriptions of objects like infinitesimal increments dxi displaystyle dx i For instance dxi displaystyle dx i may be inscribed as a linear functional on the vector space Rn displaystyle mathbb R n Evaluating dxi displaystyle dx i at a vector h displaystyle h in Rn displaystyle mathbb R n measures how much h displaystyle h points in the i displaystyle i th coordinate direction The total derivative dfa displaystyle df a is a linear combination of linear functionals and hence is itself a linear functional The evaluation dfa h displaystyle df a h measures how much f displaystyle f points in the direction determined by h displaystyle h at a displaystyle a and this direction is the gradient This point of view makes the total derivative an instance of the exterior derivative Suppose now that f displaystyle f is a vector valued function that is f Rn Rm displaystyle f colon mathbb R n to mathbb R m In this case the components fi displaystyle f i of f displaystyle f are real valued functions so they have associated differential forms dfi displaystyle df i The total derivative df displaystyle df amalgamates these forms into a single object and is therefore an instance of a vector valued differential form The chain rule for total derivativesThe chain rule has a particularly elegant statement in terms of total derivatives It says that for two functions f displaystyle f and g displaystyle g the total derivative of the composite function f g displaystyle f circ g at a displaystyle a satisfies d f g a dfg a dga displaystyle d f circ g a df g a cdot dg a If the total derivatives of f displaystyle f and g displaystyle g are identified with their Jacobian matrices then the composite on the right hand side is simply matrix multiplication This is enormously useful in applications as it makes it possible to account for essentially arbitrary dependencies among the arguments of a composite function Example Differentiation with direct dependencies Suppose that f is a function of two variables x and y If these two variables are independent so that the domain of f is R2 displaystyle mathbb R 2 then the behavior of f may be understood in terms of its partial derivatives in the x and y directions However in some situations x and y may be dependent For example it might happen that f is constrained to a curve y y x displaystyle y y x In this case we are actually interested in the behavior of the composite function f x y x displaystyle f x y x The partial derivative of f with respect to x does not give the true rate of change of f with respect to changing x because changing x necessarily changes y However the chain rule for the total derivative takes such dependencies into account Write g x x y x displaystyle gamma x x y x Then the chain rule says d f g x0 df x0 y x0 dgx0 displaystyle d f circ gamma x 0 df x 0 y x 0 cdot d gamma x 0 By expressing the total derivative using Jacobian matrices this becomes df x y x dx x0 f x x0 y x0 dxdx x0 f y x0 y x0 dydx x0 displaystyle frac df x y x dx x 0 frac partial f partial x x 0 y x 0 cdot frac dx dx x 0 frac partial f partial y x 0 y x 0 cdot frac dy dx x 0 Suppressing the evaluation at x0 displaystyle x 0 for legibility we may also write this as df x y x dx f xdxdx f ydydx displaystyle frac df x y x dx frac partial f partial x frac dx dx frac partial f partial y frac dy dx This gives a straightforward formula for the derivative of f x y x displaystyle f x y x in terms of the partial derivatives of f displaystyle f and the derivative of y x displaystyle y x For example suppose f x y xy displaystyle f x y xy The rate of change of f with respect to x is usually the partial derivative of f with respect to x in this case f x y displaystyle frac partial f partial x y However if y depends on x the partial derivative does not give the true rate of change of f as x changes because the partial derivative assumes that y is fixed Suppose we are constrained to the line y x displaystyle y x Then f x y f x x x2 displaystyle f x y f x x x 2 and the total derivative of f with respect to x is dfdx 2x displaystyle frac df dx 2x which we see is not equal to the partial derivative f x displaystyle partial f partial x Instead of immediately substituting for y in terms of x however we can also use the chain rule as above dfdx f x f ydydx y x 1 x y 2x displaystyle frac df dx frac partial f partial x frac partial f partial y frac dy dx y x cdot 1 x y 2x Example Differentiation with indirect dependencies While one can often perform substitutions to eliminate indirect dependencies the chain rule provides for a more efficient and general technique Suppose L t x1 xn displaystyle L t x 1 dots x n is a function of time t displaystyle t and n displaystyle n variables xi displaystyle x i which themselves depend on time Then the time derivative of L displaystyle L is dLdt ddtL t x1 t xn t displaystyle frac dL dt frac d dt L bigl t x 1 t ldots x n t bigr The chain rule expresses this derivative in terms of the partial derivatives of L displaystyle L and the time derivatives of the functions xi displaystyle x i dLdt L t i 1n L xidxidt t i 1ndxidt xi L displaystyle frac dL dt frac partial L partial t sum i 1 n frac partial L partial x i frac dx i dt biggl frac partial partial t sum i 1 n frac dx i dt frac partial partial x i biggr L This expression is often used in physics for a gauge transformation of the Lagrangian as two Lagrangians that differ only by the total time derivative of a function of time and the n displaystyle n generalized coordinates lead to the same equations of motion An interesting example concerns the resolution of causality concerning the Wheeler Feynman time symmetric theory The operator in brackets in the final expression above is also called the total derivative operator with respect to t displaystyle t For example the total derivative of f x t y t displaystyle f x t y t is dfdt f xdxdt f ydydt displaystyle frac df dt partial f over partial x dx over dt partial f over partial y dy over dt Here there is no f t displaystyle partial f partial t term since f displaystyle f itself does not depend on the independent variable t displaystyle t directly Total differential equationA total differential equation is a differential equation expressed in terms of total derivatives Since the exterior derivative is coordinate free in a sense that can be given a technical meaning such equations are intrinsic and geometric Application to equation systemsIn economics it is common for the total derivative to arise in the context of a system of equations pp 217 220 For example a simple supply demand system might specify the quantity q of a product demanded as a function D of its price p and consumers income I the latter being an exogenous variable and might specify the quantity supplied by producers as a function S of its price and two exogenous resource cost variables r and w The resulting system of equations q D p I displaystyle q D p I q S p r w displaystyle q S p r w determines the market equilibrium values of the variables p and q The total derivative dp dr displaystyle dp dr of p with respect to r for example gives the sign and magnitude of the reaction of the market price to the exogenous variable r In the indicated system there are a total of six possible total derivatives also known in this context as comparative static derivatives dp dr dp dw dp dI dq dr dq dw and dq dI The total derivatives are found by totally differentiating the system of equations dividing through by say dr treating dq dr and dp dr as the unknowns setting dI dw 0 and solving the two totally differentiated equations simultaneously typically by using Cramer s rule See alsoDirectional derivative Instantaneous rate of change of the function Frechet derivative Derivative defined on normed spaces generalization of the total derivative Gateaux derivative Generalization of the concept of directional derivative Generalizations of the derivative Fundamental construction of differential calculus Gradient Total derivative Multivariate derivative mathematics ReferencesChiang Alpha C 1984 Fundamental Methods of Mathematical Economics Third ed McGraw Hill ISBN 0 07 010813 7 Abraham Ralph Marsden J E Ratiu Tudor 2012 Manifolds Tensor Analysis and Applications Springer Science amp Business Media p 78 ISBN 9781461210290 A D Polyanin and V F Zaitsev Handbook of Exact Solutions for Ordinary Differential Equations 2nd edition Chapman amp Hall CRC Press Boca Raton 2003 ISBN 1 58488 297 2 From thesaurus maths org total derivativeExternal linksWeisstein Eric W Total Derivative MathWorld Ronald D Kriz 2007 Envisioning total derivatives of scalar functions of two dimensions using raised surfaces and tangent planes from Virginia Tech