![Derivative](https://www.english.nina.az/wikipedia/image/aHR0cHM6Ly91cGxvYWQud2lraW1lZGlhLm9yZy93aWtpcGVkaWEvY29tbW9ucy90aHVtYi8wLzBmL1RhbmdlbnRfdG9fYV9jdXJ2ZS5zdmcvMTYwMHB4LVRhbmdlbnRfdG9fYV9jdXJ2ZS5zdmcucG5n.png )
In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is the slope of the tangent line to the graph of the function at that point. The tangent line is the best linear approximation of the function near that input value. For this reason, the derivative is often described as the instantaneous rate of change, the ratio of the instantaneous change in the dependent variable to that of the independent variable. The process of finding a derivative is called differentiation.
There are multiple different notations for differentiation, two of the most commonly used being Leibniz notation and prime notation. Leibniz notation, named after Gottfried Wilhelm Leibniz, is represented as the ratio of two differentials, whereas prime notation is written by adding a prime mark. Higher order notations represent repeated differentiation, and they are usually denoted in Leibniz notation by adding superscripts to the differentials, and in prime notation by adding additional prime marks. The higher order derivatives can be applied in physics; for example, while the first derivative of the position of a moving object with respect to time is the object's velocity, how the position changes as time advances, the second derivative is the object's acceleration, how the velocity changes as time advances.
Derivatives can be generalized to functions of several real variables. In this generalization, the derivative is reinterpreted as a linear transformation whose graph is (after an appropriate translation) the best linear approximation to the graph of the original function. The Jacobian matrix is the matrix that represents this linear transformation with respect to the basis given by the choice of independent and dependent variables. It can be calculated in terms of the partial derivatives with respect to the independent variables. For a real-valued function of several variables, the Jacobian matrix reduces to the gradient vector.
Definition
As a limit
A function of a real variable is differentiable at a point
of its domain, if its domain contains an open interval containing
, and the limit
exists. This means that, for every positive real number
, there exists a positive real number
such that, for every
such that
and
then
is defined, and
where the vertical bars denote the absolute value. This is an example of the (ε, δ)-definition of limit.
If the function is differentiable at
, that is if the limit
exists, then this limit is called the derivative of
at
. Multiple notations for the derivative exist. The derivative of
at
can be denoted
, read as "
prime of
"; or it can be denoted
, read as "the derivative of
with respect to
at
" or "
by (or over)
at
". See § Notation below. If
is a function that has a derivative at every point in its domain, then a function can be defined by mapping every point
to the value of the derivative of
at
. This function is written
and is called the derivative function or the derivative of
. The function
sometimes has a derivative at most, but not all, points of its domain. The function whose value at
equals
whenever
is defined and elsewhere is undefined is also called the derivative of
. It is still a function, but its domain may be smaller than the domain of
.
For example, let be the squaring function:
. Then the quotient in the definition of the derivative is
The division in the last step is valid as long as
. The closer
is to
, the closer this expression becomes to the value
. The limit exists, and for every input
the limit is
. So, the derivative of the squaring function is the doubling function:
.
The ratio in the definition of the derivative is the slope of the line through two points on the graph of the function , specifically the points
and
. As
is made smaller, these points grow closer together, and the slope of this line approaches the limiting value, the slope of the tangent to the graph of
at
. In other words, the derivative is the slope of the tangent.
Using infinitesimals
One way to think of the derivative is as the ratio of an infinitesimal change in the output of the function
to an infinitesimal change in its input. In order to make this intuition rigorous, a system of rules for manipulating infinitesimal quantities is required. The system of hyperreal numbers is a way of treating infinite and infinitesimal quantities. The hyperreals are an extension of the real numbers that contain numbers greater than anything of the form
for any finite number of terms. Such numbers are infinite, and their reciprocals are infinitesimals. The application of hyperreal numbers to the foundations of calculus is called nonstandard analysis. This provides a way to define the basic concepts of calculus such as the derivative and integral in terms of infinitesimals, thereby giving a precise meaning to the
in the Leibniz notation. Thus, the derivative of
becomes
for an arbitrary infinitesimal
, where
denotes the standard part function, which "rounds off" each finite hyperreal to the nearest real. Taking the squaring function
as an example again,
Continuity and differentiability
If is differentiable at
, then
must also be continuous at
. As an example, choose a point
and let
be the step function that returns the value 1 for all
less than
, and returns a different value 10 for all
greater than or equal to
. The function
cannot have a derivative at
. If
is negative, then
is on the low part of the step, so the secant line from
to
is very steep; as
tends to zero, the slope tends to infinity. If
is positive, then
is on the high part of the step, so the secant line from
to
has slope zero. Consequently, the secant lines do not approach any single slope, so the limit of the difference quotient does not exist. However, even if a function is continuous at a point, it may not be differentiable there. For example, the absolute value function given by
is continuous at
, but it is not differentiable there. If
is positive, then the slope of the secant line from 0 to
is one; if
is negative, then the slope of the secant line from
to
is
. This can be seen graphically as a "kink" or a "cusp" in the graph at
. Even a function with a smooth graph is not differentiable at a point where its tangent is vertical: For instance, the function given by
is not differentiable at
. In summary, a function that has a derivative is continuous, but there are continuous functions that do not have a derivative.
Most functions that occur in practice have derivatives at all points or almost every point. Early in the history of calculus, many mathematicians assumed that a continuous function was differentiable at most points. Under mild conditions (for example, if the function is a monotone or a Lipschitz function), this is true. However, in 1872, Weierstrass found the first example of a function that is continuous everywhere but differentiable nowhere. This example is now known as the Weierstrass function. In 1931, Stefan Banach proved that the set of functions that have a derivative at some point is a meager set in the space of all continuous functions. Informally, this means that hardly any random continuous functions have a derivative at even one point.
Notation
One common way of writing the derivative of a function is Leibniz notation, introduced by Gottfried Wilhelm Leibniz in 1675, which denotes a derivative as the quotient of two differentials, such as and
. It is still commonly used when the equation
is viewed as a functional relationship between dependent and independent variables. The first derivative is denoted by
, read as "the derivative of
with respect to
". This derivative can alternately be treated as the application of a differential operator to a function,
Higher derivatives are expressed using the notation
for the
-th derivative of
. These are abbreviations for multiple applications of the derivative operator; for example,
Unlike some alternatives, Leibniz notation involves explicit specification of the variable for differentiation, in the denominator, which removes ambiguity when working with multiple interrelated quantities. The derivative of a composed function can be expressed using the chain rule: if
and
then
Another common notation for differentiation is by using the prime mark in the symbol of a function . This notation, due to Joseph-Louis Lagrange, is now known as prime notation. The first derivative is written as
, read as "
prime of
, or
, read as "
prime". Similarly, the second and the third derivatives can be written as
and
, respectively. For denoting the number of higher derivatives beyond this point, some authors use Roman numerals in superscript, whereas others place the number in parentheses, such as
or
. The latter notation generalizes to yield the notation
for the
th derivative of
.
In Newton's notation or the dot notation, a dot is placed over a symbol to represent a time derivative. If is a function of
, then the first and second derivatives can be written as
and
, respectively. This notation is used exclusively for derivatives with respect to time or arc length. It is typically used in differential equations in physics and differential geometry. However, the dot notation becomes unmanageable for high-order derivatives (of order 4 or more) and cannot deal with multiple independent variables.
Another notation is D-notation, which represents the differential operator by the symbol . The first derivative is written
and higher derivatives are written with a superscript, so the
-th derivative is
. This notation is sometimes called Euler notation, although it seems that Leonhard Euler did not use it, and the notation was introduced by Louis François Antoine Arbogast. To indicate a partial derivative, the variable differentiated by is indicated with a subscript, for example given the function
, its partial derivative with respect to
can be written
or
. Higher partial derivatives can be indicated by superscripts or multiple subscripts, e.g.
and
.
Rules of computation
In principle, the derivative of a function can be computed from the definition by considering the difference quotient and computing its limit. Once the derivatives of a few simple functions are known, the derivatives of other functions are more easily computed using rules for obtaining derivatives of more complicated functions from simpler ones. This process of finding a derivative is known as differentiation.
Rules for basic functions
The following are the rules for the derivatives of the most common basic functions. Here, is a real number, and
is the base of the natural logarithm, approximately 2.71828.
- Derivatives of powers:
- Functions of exponential, natural logarithm, and logarithm with general base:
, for
, for
, for
- Trigonometric functions:
- Inverse trigonometric functions:
, for
, for
Rules for combined functions
Given that the and
are the functions. The following are some of the most basic rules for deducing the derivative of functions from derivatives of basic functions.
- Constant rule: if
is constant, then for all
,
- Sum rule:
for all functions
and
and all real numbers
and
.
- Product rule:
for all functions
and
. As a special case, this rule includes the fact
whenever
is a constant because
by the constant rule.
- Quotient rule:
for all functions
and
at all inputs where g ≠ 0.
- Chain rule for composite functions: If
, then
Computation example
The derivative of the function given by is
Here the second term was computed using the chain rule and the third term using the product rule. The known derivatives of the elementary functions
,
,
,
, and
, as well as the constant
, were also used.
Higher-order derivatives
Higher order derivatives are the result of differentiating a function repeatedly. Given that is a differentiable function, the derivative of
is the first derivative, denoted as
. The derivative of
is the second derivative, denoted as
, and the derivative of
is the third derivative, denoted as
. By continuing this process, if it exists, the
th derivative is the derivative of the
th derivative or the derivative of order
. As has been discussed above, the generalization of derivative of a function
may be denoted as
. A function that has
successive derivatives is called
times differentiable. If the
-th derivative is continuous, then the function is said to be of differentiability class
. A function that has infinitely many derivatives is called infinitely differentiable or smooth. Any polynomial function is infinitely differentiable; taking derivatives repeatedly will eventually result in a constant function, and all subsequent derivatives of that function are zero.
One application of higher-order derivatives is in physics. Suppose that a function represents the position of an object at the time. The first derivative of that function is the velocity of an object with respect to time, the second derivative of the function is the acceleration of an object with respect to time, and the third derivative is the jerk.
In other dimensions
Vector-valued functions
A vector-valued function of a real variable sends real numbers to vectors in some vector space
. A vector-valued function can be split up into its coordinate functions
, meaning that
. This includes, for example, parametric curves in
or
. The coordinate functions are real-valued functions, so the above definition of derivative applies to them. The derivative of
is defined to be the vector, called the tangent vector, whose coordinates are the derivatives of the coordinate functions. That is,
if the limit exists. The subtraction in the numerator is the subtraction of vectors, not scalars. If the derivative of
exists for every value of
, then
is another vector-valued function.
Partial derivatives
Functions can depend upon more than one variable. A partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant. Partial derivatives are used in vector calculus and differential geometry. As with ordinary derivatives, multiple notations exist: the partial derivative of a function with respect to the variable
is variously denoted by
among other possibilities. It can be thought of as the rate of change of the function in the -direction. Here ∂ is a rounded d called the partial derivative symbol. To distinguish it from the letter d, ∂ is sometimes pronounced "der", "del", or "partial" instead of "dee". For example, let
, then the partial derivative of function
with respect to both variables
and
are, respectively:
In general, the partial derivative of a function
in the direction
at the point
is defined to be:
This is fundamental for the study of the functions of several real variables. Let be such a real-valued function. If all partial derivatives
with respect to
are defined at the point
, these partial derivatives define the vector
which is called the gradient of
at
. If
is differentiable at every point in some domain, then the gradient is a vector-valued function
that maps the point
to the vector
. Consequently, the gradient determines a vector field.
Directional derivatives
If is a real-valued function on
, then the partial derivatives of
measure its variation in the direction of the coordinate axes. For example, if
is a function of
and
, then its partial derivatives measure the variation in
in the
and
direction. However, they do not directly measure the variation of
in any other direction, such as along the diagonal line
. These are measured using directional derivatives. Given a vector
, then the directional derivative of
in the direction of
at the point
is:
If all the partial derivatives of exist and are continuous at
, then they determine the directional derivative of
in the direction
by the formula:
Total derivative, total differential and Jacobian matrix
When is a function from an open subset of
to
, then the directional derivative of
in a chosen direction is the best linear approximation to
at that point and in that direction. However, when
, no single directional derivative can give a complete picture of the behavior of
. The total derivative gives a complete picture by considering all directions at once. That is, for any vector
starting at
, the linear approximation formula holds:
Similarly with the single-variable derivative,
is chosen so that the error in this approximation is as small as possible. The total derivative of
at
is the unique linear transformation
such that
Here
is a vector in
, so the norm in the denominator is the standard length on
. However,
is a vector in
, and the norm in the numerator is the standard length on
. If
is a vector starting at
, then
is called the pushforward of
by
.
If the total derivative exists at , then all the partial derivatives and directional derivatives of
exist at
, and for all
,
is the directional derivative of
in the direction
. If
is written using coordinate functions, so that
, then the total derivative can be expressed using the partial derivatives as a matrix. This matrix is called the Jacobian matrix of
at
:
Generalizations
The concept of a derivative can be extended to many other settings. The common thread is that the derivative of a function at a point serves as a linear approximation of the function at that point.
- An important generalization of the derivative concerns complex functions of complex variables, such as functions from (a domain in) the complex numbers
to
. The notion of the derivative of such a function is obtained by replacing real variables with complex variables in the definition. If
is identified with
by writing a complex number
as
then a differentiable function from
to
is certainly differentiable as a function from
to
(in the sense that its partial derivatives all exist), but the converse is not true in general: the complex derivative only exists if the real derivative is complex linear and this imposes relations between the partial derivatives called the Cauchy–Riemann equations – see holomorphic functions.
- Another generalization concerns functions between differentiable or smooth manifolds. Intuitively speaking such a manifold
is a space that can be approximated near each point
by a vector space called its tangent space: the prototypical example is a smooth surface in
. The derivative (or differential) of a (differentiable) map
between manifolds, at a point
in
, is then a linear map from the tangent space of
at
to the tangent space of
at
. The derivative function becomes a map between the tangent bundles of
and
. This definition is used in differential geometry.
- Differentiation can also be defined for maps between vector space, such as Banach space, in which those generalizations are the Gateaux derivative and the Fréchet derivative.
- One deficiency of the classical derivative is that very many functions are not differentiable. Nevertheless, there is a way of extending the notion of the derivative so that all continuous functions and many other functions can be differentiated using a concept known as the weak derivative. The idea is to embed the continuous functions in a larger space called the space of distributions and only require that a function is differentiable "on average".
- Properties of the derivative have inspired the introduction and study of many similar objects in algebra and topology; an example is differential algebra. Here, it consists of the derivation of some topics in abstract algebra, such as rings, ideals, field, and so on.
- The discrete equivalent of differentiation is finite differences. The study of differential calculus is unified with the calculus of finite differences in time scale calculus.
- The arithmetic derivative involves the function that is defined for the integers by the prime factorization. This is an analogy with the product rule.
See also
- Covariant derivative
- Derivation
- Exterior derivative
- Functional derivative
- Integral
- Lie derivative
Notes
- Apostol 1967, p. 160; Stewart 2002, pp. 129–130; Strang et al. 2023, p. 224.
- Apostol 1967, p. 160; Stewart 2002, p. 127; Strang et al. 2023, p. 220.
- Gonick 2012, p. 83; Thomas et al. 2014, p. 60.
- Gonick 2012, p. 88; Strang et al. 2023, p. 234.
- Gonick 2012, p. 83; Strang et al. 2023, p. 232.
- Gonick 2012, pp. 77–80.
- Thompson 1998, pp. 34, 104; Stewart 2002, p. 128.
- Thompson 1998, pp. 84–85.
- Keisler 2012, pp. 902–904.
- Keisler 2012, p. 45; Henle & Kleinberg 2003, p. 66.
- Gonick 2012, p. 156; Thomas et al. 2014, p. 114; Strang et al. 2023, p. 237.
- Gonick 2012, p. 149; Thomas et al. 2014, p. 113; Strang et al. 2023, p. 237.
- Gonick 2012, p. 156; Thomas et al. 2014, p. 114; Strang et al. 2023, pp. 237–238.
- Jašek 1922; Jarník 1922; Rychlík 1923.
- David 2018.
- Banach 1931, cited in Hewitt & Stromberg 1965.
- Apostol 1967, p. 172; Cajori 2007, p. 204.
- Moore & Siegel 2013, p. 110.
- Varberg, Purcell & Rigdon 2007, pp. 125–126.
- In the formulation of calculus in terms of limits, various authors have assigned the
symbol various meanings. Some authors such as Varberg, Purcell & Rigdon 2007, p. 119 and Stewart 2002, p. 177 do not assign a meaning to
by itself, but only as part of the symbol
. Others define
as an independent variable, and define
by
. In non-standard analysis
is defined as an infinitesimal. It is also interpreted as the exterior derivative of a function
. See differential (infinitesimal) for further information.
- Schwartzman 1994, p. 171; Cajori 1923, pp. 6–7, 10–12, 21–24.
- Moore & Siegel 2013, p. 110; Goodman 1963, pp. 78–79.
- Varberg, Purcell & Rigdon 2007, pp. 125–126; Cajori 2007, p. 228.
- Choudary & Niculescu 2014, p. 222; Apostol 1967, p. 171.
- Evans 1999, p. 63; Kreyszig 1991, p. 1.
- Cajori 1923.
- Apostol 1967, p. 172; Varberg, Purcell & Rigdon 2007, pp. 125–126.
- Apostol 1967, p. 160.
- Varberg, Purcell & Rigdon 2007. See p. 133 for the power rule, pp. 115–116 for the trigonometric functions, p. 326 for the natural logarithm, pp. 338–339 for exponential with base
, p. 343 for the exponential with base
, p. 344 for the logarithm with base
, and p. 369 for the inverse of trigonometric functions.
- For constant rule and sum rule, see Apostol 1967, pp. 161, 164, respectively. For the product rule, quotient rule, and chain rule, see Varberg, Purcell & Rigdon 2007, pp. 111–112, 119, respectively. For the special case of the product rule, that is, the product of a constant and a function, see Varberg, Purcell & Rigdon 2007, pp. 108–109.
- Apostol 1967, p. 160; Varberg, Purcell & Rigdon 2007, pp. 125–126.
- Warner 1983, p. 5.
- Debnath & Shah 2015, p. 40.
- Carothers 2000, p. 176.
- Stewart 2002, p. 193.
- Stewart 2002, p. 893.
- Stewart 2002, p. 947; Christopher 2013, p. 682.
- Stewart 2002, p. 949.
- Silverman 1989, p. 216; Bhardwaj 2005, See p. 6.4.
- Mathai & Haubold 2017, p. 52.
- Gbur 2011, pp. 36–37.
- Varberg, Purcell & Rigdon 2007, p. 642.
- Guzman 2003, p. 35.
- Davvaz 2023, p. 266.
- Lee 2013, p. 72.
- Davvaz 2023, p. 267.
- Roussos 2014, p. 303.
- Gbur 2011, pp. 261–264.
- Gray, Abbena & Salamon 2006, p. 826.
- Azegami 2020. See p. 209 for the Gateaux derivative, and p. 211 for the Fréchet derivative.
- Funaro 1992, pp. 84–85.
- Kolchin 1973, pp. 58, 126.
- Georgiev 2018, p. 8.
- Barbeau 1961.
References
- Apostol, Tom M. (June 1967), Calculus, Vol. 1: One-Variable Calculus with an Introduction to Linear Algebra, vol. 1 (2nd ed.), Wiley, ISBN 978-0-471-00005-1
- Azegami, Hideyuki (2020), Shape Optimization Problems, Springer Optimization and Its Applications, vol. 164, Springer, doi:10.1007/978-981-15-7618-8, ISBN 978-981-15-7618-8, S2CID 226442409
- Banach, Stefan (1931), "Uber die Baire'sche Kategorie gewisser Funktionenmengen", Studia Math., 3 (3): 174–179, doi:10.4064/sm-3-1-174-179.
- Barbeau, E. J. (1961). "Remarks on an arithmetic derivative". Canadian Mathematical Bulletin. 4 (2): 117–122. doi:10.4153/CMB-1961-013-0. Zbl 0101.03702.
- Bhardwaj, R. S. (2005), Mathematics for Economics & Business (2nd ed.), Excel Books India, ISBN 9788174464507
- Cajori, Florian (1923), "The History of Notations of the Calculus", Annals of Mathematics, 25 (1): 1–46, doi:10.2307/1967725, hdl:2027/mdp.39015017345896, JSTOR 1967725
- Cajori, Florian (2007), A History of Mathematical Notations, vol. 2, Cosimo Classics, ISBN 978-1-60206-713-4
- Carothers, N. L. (2000), Real Analysis, Cambridge University Press
- Choudary, A. D. R.; Niculescu, Constantin P. (2014), Real Analysis on Intervals, Springer India, doi:10.1007/978-81-322-2148-7, ISBN 978-81-322-2148-7
- Christopher, Essex (2013), Calculus: A complete course, Pearson, p. 682, ISBN 9780321781079, OCLC 872345701
- Courant, Richard; John, Fritz (December 22, 1998), Introduction to Calculus and Analysis, Vol. 1, Springer-Verlag, doi:10.1007/978-1-4613-8955-2, ISBN 978-3-540-65058-4
- David, Claire (2018), "Bypassing dynamical systems: A simple way to get the box-counting dimension of the graph of the Weierstrass function", Proceedings of the International Geometry Center, 11 (2), Academy of Sciences of Ukraine: 53–68, arXiv:1711.10349, doi:10.15673/tmgc.v11i2.1028
- Davvaz, Bijan (2023), Vectors and Functions of Several Variables, Springer, doi:10.1007/978-981-99-2935-1, ISBN 978-981-99-2935-1, S2CID 259885793
- Debnath, Lokenath; Shah, Firdous Ahmad (2015), Wavelet Transforms and Their Applications (2nd ed.), Birkhäuser, doi:10.1007/978-0-8176-8418-1, ISBN 978-0-8176-8418-1
- Evans, Lawrence (1999), Partial Differential Equations, American Mathematical Society, ISBN 0-8218-0772-2
- Eves, Howard (January 2, 1990), An Introduction to the History of Mathematics (6th ed.), Brooks Cole, ISBN 978-0-03-029558-4
In mathematics the derivative is a fundamental tool that quantifies the sensitivity to change of a function s output with respect to its input The derivative of a function of a single variable at a chosen input value when it exists is the slope of the tangent line to the graph of the function at that point The tangent line is the best linear approximation of the function near that input value For this reason the derivative is often described as the instantaneous rate of change the ratio of the instantaneous change in the dependent variable to that of the independent variable The process of finding a derivative is called differentiation There are multiple different notations for differentiation two of the most commonly used being Leibniz notation and prime notation Leibniz notation named after Gottfried Wilhelm Leibniz is represented as the ratio of two differentials whereas prime notation is written by adding a prime mark Higher order notations represent repeated differentiation and they are usually denoted in Leibniz notation by adding superscripts to the differentials and in prime notation by adding additional prime marks The higher order derivatives can be applied in physics for example while the first derivative of the position of a moving object with respect to time is the object s velocity how the position changes as time advances the second derivative is the object s acceleration how the velocity changes as time advances Derivatives can be generalized to functions of several real variables In this generalization the derivative is reinterpreted as a linear transformation whose graph is after an appropriate translation the best linear approximation to the graph of the original function The Jacobian matrix is the matrix that represents this linear transformation with respect to the basis given by the choice of independent and dependent variables It can be calculated in terms of the partial derivatives with respect to the independent variables For a real valued function of several variables the Jacobian matrix reduces to the gradient vector DefinitionAs a limit A function of a real variable f x displaystyle f x is differentiable at a point a displaystyle a of its domain if its domain contains an open interval containing a displaystyle a and the limit L limh 0f a h f a h displaystyle L lim h to 0 frac f a h f a h exists This means that for every positive real number e displaystyle varepsilon there exists a positive real number d displaystyle delta such that for every h displaystyle h such that h lt d displaystyle h lt delta and h 0 displaystyle h neq 0 then f a h displaystyle f a h is defined and L f a h f a h lt e displaystyle left L frac f a h f a h right lt varepsilon where the vertical bars denote the absolute value This is an example of the e d definition of limit If the function f displaystyle f is differentiable at a displaystyle a that is if the limit L displaystyle L exists then this limit is called the derivative of f displaystyle f at a displaystyle a Multiple notations for the derivative exist The derivative of f displaystyle f at a displaystyle a can be denoted f a displaystyle f a read as f displaystyle f prime of a displaystyle a or it can be denoted dfdx a displaystyle textstyle frac df dx a read as the derivative of f displaystyle f with respect to x displaystyle x at a displaystyle a or df displaystyle df by or over dx displaystyle dx at a displaystyle a See Notation below If f displaystyle f is a function that has a derivative at every point in its domain then a function can be defined by mapping every point x displaystyle x to the value of the derivative of f displaystyle f at x displaystyle x This function is written f displaystyle f and is called the derivative function or the derivative of f displaystyle f The function f displaystyle f sometimes has a derivative at most but not all points of its domain The function whose value at a displaystyle a equals f a displaystyle f a whenever f a displaystyle f a is defined and elsewhere is undefined is also called the derivative of f displaystyle f It is still a function but its domain may be smaller than the domain of f displaystyle f For example let f displaystyle f be the squaring function f x x2 displaystyle f x x 2 Then the quotient in the definition of the derivative isf a h f a h a h 2 a2h a2 2ah h2 a2h 2a h displaystyle frac f a h f a h frac a h 2 a 2 h frac a 2 2ah h 2 a 2 h 2a h The division in the last step is valid as long as h 0 displaystyle h neq 0 The closer h displaystyle h is to 0 displaystyle 0 the closer this expression becomes to the value 2a displaystyle 2a The limit exists and for every input a displaystyle a the limit is 2a displaystyle 2a So the derivative of the squaring function is the doubling function f x 2x displaystyle f x 2x The graph of a function drawn in black and a tangent line to that graph drawn in red The slope of the tangent line is equal to the derivative of the function at the marked point The derivative at different points of a differentiable function In this case the derivative is equal to sin x2 2x2cos x2 displaystyle sin left x 2 right 2x 2 cos left x 2 right The ratio in the definition of the derivative is the slope of the line through two points on the graph of the function f displaystyle f specifically the points a f a displaystyle a f a and a h f a h displaystyle a h f a h As h displaystyle h is made smaller these points grow closer together and the slope of this line approaches the limiting value the slope of the tangent to the graph of f displaystyle f at a displaystyle a In other words the derivative is the slope of the tangent Using infinitesimals One way to think of the derivative dfdx a textstyle frac df dx a is as the ratio of an infinitesimal change in the output of the function f displaystyle f to an infinitesimal change in its input In order to make this intuition rigorous a system of rules for manipulating infinitesimal quantities is required The system of hyperreal numbers is a way of treating infinite and infinitesimal quantities The hyperreals are an extension of the real numbers that contain numbers greater than anything of the form 1 1 1 displaystyle 1 1 cdots 1 for any finite number of terms Such numbers are infinite and their reciprocals are infinitesimals The application of hyperreal numbers to the foundations of calculus is called nonstandard analysis This provides a way to define the basic concepts of calculus such as the derivative and integral in terms of infinitesimals thereby giving a precise meaning to the d displaystyle d in the Leibniz notation Thus the derivative of f x displaystyle f x becomes f x st f x dx f x dx displaystyle f x operatorname st left frac f x dx f x dx right for an arbitrary infinitesimal dx displaystyle dx where st displaystyle operatorname st denotes the standard part function which rounds off each finite hyperreal to the nearest real Taking the squaring function f x x2 displaystyle f x x 2 as an example again f x st x2 2x dx dx 2 x2dx st 2x dx dx 2dx st 2x dxdx dx 2dx st 2x dx 2x displaystyle begin aligned f x amp operatorname st left frac x 2 2x cdot dx dx 2 x 2 dx right amp operatorname st left frac 2x cdot dx dx 2 dx right amp operatorname st left frac 2x cdot dx dx frac dx 2 dx right amp operatorname st left 2x dx right amp 2x end aligned Continuity and differentiabilityThis function does not have a derivative at the marked point as the function is not continuous there specifically it has a jump discontinuity The absolute value function is continuous but fails to be differentiable at x 0 since the tangent slopes do not approach the same value from the left as they do from the right If f displaystyle f is differentiable at a displaystyle a then f displaystyle f must also be continuous at a displaystyle a As an example choose a point a displaystyle a and let f displaystyle f be the step function that returns the value 1 for all x displaystyle x less than a displaystyle a and returns a different value 10 for all x displaystyle x greater than or equal to a displaystyle a The function f displaystyle f cannot have a derivative at a displaystyle a If h displaystyle h is negative then a h displaystyle a h is on the low part of the step so the secant line from a displaystyle a to a h displaystyle a h is very steep as h displaystyle h tends to zero the slope tends to infinity If h displaystyle h is positive then a h displaystyle a h is on the high part of the step so the secant line from a displaystyle a to a h displaystyle a h has slope zero Consequently the secant lines do not approach any single slope so the limit of the difference quotient does not exist However even if a function is continuous at a point it may not be differentiable there For example the absolute value function given by f x x displaystyle f x x is continuous at x 0 displaystyle x 0 but it is not differentiable there If h displaystyle h is positive then the slope of the secant line from 0 to h displaystyle h is one if h displaystyle h is negative then the slope of the secant line from 0 displaystyle 0 to h displaystyle h is 1 displaystyle 1 This can be seen graphically as a kink or a cusp in the graph at x 0 displaystyle x 0 Even a function with a smooth graph is not differentiable at a point where its tangent is vertical For instance the function given by f x x1 3 displaystyle f x x 1 3 is not differentiable at x 0 displaystyle x 0 In summary a function that has a derivative is continuous but there are continuous functions that do not have a derivative Most functions that occur in practice have derivatives at all points or almost every point Early in the history of calculus many mathematicians assumed that a continuous function was differentiable at most points Under mild conditions for example if the function is a monotone or a Lipschitz function this is true However in 1872 Weierstrass found the first example of a function that is continuous everywhere but differentiable nowhere This example is now known as the Weierstrass function In 1931 Stefan Banach proved that the set of functions that have a derivative at some point is a meager set in the space of all continuous functions Informally this means that hardly any random continuous functions have a derivative at even one point NotationOne common way of writing the derivative of a function is Leibniz notation introduced by Gottfried Wilhelm Leibniz in 1675 which denotes a derivative as the quotient of two differentials such as dy displaystyle dy and dx displaystyle dx It is still commonly used when the equation y f x displaystyle y f x is viewed as a functional relationship between dependent and independent variables The first derivative is denoted by dydx displaystyle textstyle frac dy dx read as the derivative of y displaystyle y with respect to x displaystyle x This derivative can alternately be treated as the application of a differential operator to a function dydx ddxf x textstyle frac dy dx frac d dx f x Higher derivatives are expressed using the notation dnydxn textstyle frac d n y dx n for the n displaystyle n th derivative of y f x displaystyle y f x These are abbreviations for multiple applications of the derivative operator for example d2ydx2 ddx ddxf x textstyle frac d 2 y dx 2 frac d dx Bigl frac d dx f x Bigr Unlike some alternatives Leibniz notation involves explicit specification of the variable for differentiation in the denominator which removes ambiguity when working with multiple interrelated quantities The derivative of a composed function can be expressed using the chain rule if u g x displaystyle u g x and y f g x displaystyle y f g x then dydx dydu dudx textstyle frac dy dx frac dy du cdot frac du dx Another common notation for differentiation is by using the prime mark in the symbol of a function f x displaystyle f x This notation due to Joseph Louis Lagrange is now known as prime notation The first derivative is written as f x displaystyle f x read as f displaystyle f prime of x displaystyle x or y displaystyle y read as y displaystyle y prime Similarly the second and the third derivatives can be written as f displaystyle f and f displaystyle f respectively For denoting the number of higher derivatives beyond this point some authors use Roman numerals in superscript whereas others place the number in parentheses such as fiv displaystyle f mathrm iv or f 4 displaystyle f 4 The latter notation generalizes to yield the notation f n displaystyle f n for the n displaystyle n th derivative of f displaystyle f In Newton s notation or the dot notation a dot is placed over a symbol to represent a time derivative If y displaystyle y is a function of t displaystyle t then the first and second derivatives can be written as y displaystyle dot y and y displaystyle ddot y respectively This notation is used exclusively for derivatives with respect to time or arc length It is typically used in differential equations in physics and differential geometry However the dot notation becomes unmanageable for high order derivatives of order 4 or more and cannot deal with multiple independent variables Another notation is D notation which represents the differential operator by the symbol D displaystyle D The first derivative is written Df x displaystyle Df x and higher derivatives are written with a superscript so the n displaystyle n th derivative is Dnf x displaystyle D n f x This notation is sometimes called Euler notation although it seems that Leonhard Euler did not use it and the notation was introduced by Louis Francois Antoine Arbogast To indicate a partial derivative the variable differentiated by is indicated with a subscript for example given the function u f x y displaystyle u f x y its partial derivative with respect to x displaystyle x can be written Dxu displaystyle D x u or Dxf x y displaystyle D x f x y Higher partial derivatives can be indicated by superscripts or multiple subscripts e g Dxyf x y y xf x y textstyle D xy f x y frac partial partial y Bigl frac partial partial x f x y Bigr and Dx2f x y x xf x y displaystyle textstyle D x 2 f x y frac partial partial x Bigl frac partial partial x f x y Bigr Rules of computationIn principle the derivative of a function can be computed from the definition by considering the difference quotient and computing its limit Once the derivatives of a few simple functions are known the derivatives of other functions are more easily computed using rules for obtaining derivatives of more complicated functions from simpler ones This process of finding a derivative is known as differentiation Rules for basic functions The following are the rules for the derivatives of the most common basic functions Here a displaystyle a is a real number and e displaystyle e is the base of the natural logarithm approximately 2 71828 Derivatives of powers ddxxa axa 1 displaystyle frac d dx x a ax a 1 Functions of exponential natural logarithm and logarithm with general base ddxex ex displaystyle frac d dx e x e x ddxax axln a displaystyle frac d dx a x a x ln a for a gt 0 displaystyle a gt 0 ddxln x 1x displaystyle frac d dx ln x frac 1 x for x gt 0 displaystyle x gt 0 ddxloga x 1xln a displaystyle frac d dx log a x frac 1 x ln a for x a gt 0 displaystyle x a gt 0 Trigonometric functions ddxsin x cos x displaystyle frac d dx sin x cos x ddxcos x sin x displaystyle frac d dx cos x sin x ddxtan x sec2 x 1cos2 x 1 tan2 x displaystyle frac d dx tan x sec 2 x frac 1 cos 2 x 1 tan 2 x Inverse trigonometric functions ddxarcsin x 11 x2 displaystyle frac d dx arcsin x frac 1 sqrt 1 x 2 for 1 lt x lt 1 displaystyle 1 lt x lt 1 ddxarccos x 11 x2 displaystyle frac d dx arccos x frac 1 sqrt 1 x 2 for 1 lt x lt 1 displaystyle 1 lt x lt 1 ddxarctan x 11 x2 displaystyle frac d dx arctan x frac 1 1 x 2 Rules for combined functions Given that the f displaystyle f and g displaystyle g are the functions The following are some of the most basic rules for deducing the derivative of functions from derivatives of basic functions Constant rule if f displaystyle f is constant then for all x displaystyle x f x 0 displaystyle f x 0 Sum rule af bg af bg displaystyle alpha f beta g alpha f beta g for all functions f displaystyle f and g displaystyle g and all real numbers a displaystyle alpha and b displaystyle beta Product rule fg f g fg displaystyle fg f g fg for all functions f displaystyle f and g displaystyle g As a special case this rule includes the fact af af displaystyle alpha f alpha f whenever a displaystyle alpha is a constant because a f 0 f 0 displaystyle alpha f 0 cdot f 0 by the constant rule Quotient rule fg f g fg g2 displaystyle left frac f g right frac f g fg g 2 for all functions f displaystyle f and g displaystyle g at all inputs where g 0 Chain rule for composite functions If f x h g x displaystyle f x h g x then f x h g x g x displaystyle f x h g x cdot g x Computation example The derivative of the function given by f x x4 sin x2 ln x ex 7 displaystyle f x x 4 sin left x 2 right ln x e x 7 is f x 4x 4 1 d x2 dxcos x2 d ln x dxex ln x d ex dx 0 4x3 2xcos x2 1xex ln x ex displaystyle begin aligned f x amp 4x 4 1 frac d left x 2 right dx cos left x 2 right frac d left ln x right dx e x ln x frac d left e x right dx 0 amp 4x 3 2x cos left x 2 right frac 1 x e x ln x e x end aligned Here the second term was computed using the chain rule and the third term using the product rule The known derivatives of the elementary functions x2 displaystyle x 2 x4 displaystyle x 4 sin x displaystyle sin x ln x displaystyle ln x and exp x ex displaystyle exp x e x as well as the constant 7 displaystyle 7 were also used Higher order derivativesHigher order derivatives are the result of differentiating a function repeatedly Given that f displaystyle f is a differentiable function the derivative of f displaystyle f is the first derivative denoted as f displaystyle f The derivative of f displaystyle f is the second derivative denoted as f displaystyle f and the derivative of f displaystyle f is the third derivative denoted as f displaystyle f By continuing this process if it exists the n displaystyle n th derivative is the derivative of the n 1 displaystyle n 1 th derivative or the derivative of order n displaystyle n As has been discussed above the generalization of derivative of a function f displaystyle f may be denoted as f n displaystyle f n A function that has k displaystyle k successive derivatives is called k displaystyle k times differentiable If the k displaystyle k th derivative is continuous then the function is said to be of differentiability class Ck displaystyle C k A function that has infinitely many derivatives is called infinitely differentiable or smooth Any polynomial function is infinitely differentiable taking derivatives repeatedly will eventually result in a constant function and all subsequent derivatives of that function are zero One application of higher order derivatives is in physics Suppose that a function represents the position of an object at the time The first derivative of that function is the velocity of an object with respect to time the second derivative of the function is the acceleration of an object with respect to time and the third derivative is the jerk In other dimensionsVector valued functions A vector valued function y displaystyle mathbf y of a real variable sends real numbers to vectors in some vector space Rn displaystyle mathbb R n A vector valued function can be split up into its coordinate functions y1 t y2 t yn t displaystyle y 1 t y 2 t dots y n t meaning that y y1 t y2 t yn t displaystyle mathbf y y 1 t y 2 t dots y n t This includes for example parametric curves in R2 displaystyle mathbb R 2 or R3 displaystyle mathbb R 3 The coordinate functions are real valued functions so the above definition of derivative applies to them The derivative of y t displaystyle mathbf y t is defined to be the vector called the tangent vector whose coordinates are the derivatives of the coordinate functions That is y t limh 0y t h y t h displaystyle mathbf y t lim h to 0 frac mathbf y t h mathbf y t h if the limit exists The subtraction in the numerator is the subtraction of vectors not scalars If the derivative of y displaystyle mathbf y exists for every value of t displaystyle t then y displaystyle mathbf y is another vector valued function Partial derivatives Functions can depend upon more than one variable A partial derivative of a function of several variables is its derivative with respect to one of those variables with the others held constant Partial derivatives are used in vector calculus and differential geometry As with ordinary derivatives multiple notations exist the partial derivative of a function f x y displaystyle f x y dots with respect to the variable x displaystyle x is variously denoted by fx displaystyle f x fx displaystyle f x xf displaystyle partial x f xf displaystyle frac partial partial x f or f x displaystyle frac partial f partial x among other possibilities It can be thought of as the rate of change of the function in the x displaystyle x direction Here is a rounded d called the partial derivative symbol To distinguish it from the letter d is sometimes pronounced der del or partial instead of dee For example let f x y x2 xy y2 displaystyle f x y x 2 xy y 2 then the partial derivative of function f displaystyle f with respect to both variables x displaystyle x and y displaystyle y are respectively f x 2x y f y x 2y displaystyle frac partial f partial x 2x y qquad frac partial f partial y x 2y In general the partial derivative of a function f x1 xn displaystyle f x 1 dots x n in the direction xi displaystyle x i at the point a1 an displaystyle a 1 dots a n is defined to be f xi a1 an limh 0f a1 ai h an f a1 ai an h displaystyle frac partial f partial x i a 1 ldots a n lim h to 0 frac f a 1 ldots a i h ldots a n f a 1 ldots a i ldots a n h This is fundamental for the study of the functions of several real variables Let f x1 xn displaystyle f x 1 dots x n be such a real valued function If all partial derivatives f displaystyle f with respect to xj displaystyle x j are defined at the point a1 an displaystyle a 1 dots a n these partial derivatives define the vector f a1 an f x1 a1 an f xn a1 an displaystyle nabla f a 1 ldots a n left frac partial f partial x 1 a 1 ldots a n ldots frac partial f partial x n a 1 ldots a n right which is called the gradient of f displaystyle f at a displaystyle a If f displaystyle f is differentiable at every point in some domain then the gradient is a vector valued function f displaystyle nabla f that maps the point a1 an displaystyle a 1 dots a n to the vector f a1 an displaystyle nabla f a 1 dots a n Consequently the gradient determines a vector field Directional derivatives If f displaystyle f is a real valued function on Rn displaystyle mathbb R n then the partial derivatives of f displaystyle f measure its variation in the direction of the coordinate axes For example if f displaystyle f is a function of x displaystyle x and y displaystyle y then its partial derivatives measure the variation in f displaystyle f in the x displaystyle x and y displaystyle y direction However they do not directly measure the variation of f displaystyle f in any other direction such as along the diagonal line y x displaystyle y x These are measured using directional derivatives Given a vector v v1 vn displaystyle mathbf v v 1 ldots v n then the directional derivative of f displaystyle f in the direction of v displaystyle mathbf v at the point x displaystyle mathbf x is Dvf x limh 0f x hv f x h displaystyle D mathbf v f mathbf x lim h rightarrow 0 frac f mathbf x h mathbf v f mathbf x h If all the partial derivatives of f displaystyle f exist and are continuous at x displaystyle mathbf x then they determine the directional derivative of f displaystyle f in the direction v displaystyle mathbf v by the formula Dvf x j 1nvj f xj displaystyle D mathbf v f mathbf x sum j 1 n v j frac partial f partial x j Total derivative total differential and Jacobian matrix When f displaystyle f is a function from an open subset of Rn displaystyle mathbb R n to Rm displaystyle mathbb R m then the directional derivative of f displaystyle f in a chosen direction is the best linear approximation to f displaystyle f at that point and in that direction However when n gt 1 displaystyle n gt 1 no single directional derivative can give a complete picture of the behavior of f displaystyle f The total derivative gives a complete picture by considering all directions at once That is for any vector v displaystyle mathbf v starting at a displaystyle mathbf a the linear approximation formula holds f a v f a f a v displaystyle f mathbf a mathbf v approx f mathbf a f mathbf a mathbf v Similarly with the single variable derivative f a displaystyle f mathbf a is chosen so that the error in this approximation is as small as possible The total derivative of f displaystyle f at a displaystyle mathbf a is the unique linear transformation f a Rn Rm displaystyle f mathbf a colon mathbb R n to mathbb R m such thatlimh 0 f a h f a f a h h 0 displaystyle lim mathbf h to 0 frac lVert f mathbf a mathbf h f mathbf a f mathbf a mathbf h rVert lVert mathbf h rVert 0 Here h displaystyle mathbf h is a vector in Rn displaystyle mathbb R n so the norm in the denominator is the standard length on Rn displaystyle mathbb R n However f a h displaystyle f mathbf a mathbf h is a vector in Rm displaystyle mathbb R m and the norm in the numerator is the standard length on Rm displaystyle mathbb R m If v displaystyle v is a vector starting at a displaystyle a then f a v displaystyle f mathbf a mathbf v is called the pushforward of v displaystyle mathbf v by f displaystyle f If the total derivative exists at a displaystyle mathbf a then all the partial derivatives and directional derivatives of f displaystyle f exist at a displaystyle mathbf a and for all v displaystyle mathbf v f a v displaystyle f mathbf a mathbf v is the directional derivative of f displaystyle f in the direction v displaystyle mathbf v If f displaystyle f is written using coordinate functions so that f f1 f2 fm displaystyle f f 1 f 2 dots f m then the total derivative can be expressed using the partial derivatives as a matrix This matrix is called the Jacobian matrix of f displaystyle f at a displaystyle mathbf a f a Jaca fi xj ij displaystyle f mathbf a operatorname Jac mathbf a left frac partial f i partial x j right ij GeneralizationsThe concept of a derivative can be extended to many other settings The common thread is that the derivative of a function at a point serves as a linear approximation of the function at that point An important generalization of the derivative concerns complex functions of complex variables such as functions from a domain in the complex numbers C displaystyle mathbb C to C displaystyle mathbb C The notion of the derivative of such a function is obtained by replacing real variables with complex variables in the definition If C displaystyle mathbb C is identified with R2 displaystyle mathbb R 2 by writing a complex number z displaystyle z as x iy displaystyle x iy then a differentiable function from C displaystyle mathbb C to C displaystyle mathbb C is certainly differentiable as a function from R2 displaystyle mathbb R 2 to R2 displaystyle mathbb R 2 in the sense that its partial derivatives all exist but the converse is not true in general the complex derivative only exists if the real derivative is complex linear and this imposes relations between the partial derivatives called the Cauchy Riemann equations see holomorphic functions Another generalization concerns functions between differentiable or smooth manifolds Intuitively speaking such a manifold M displaystyle M is a space that can be approximated near each point x displaystyle x by a vector space called its tangent space the prototypical example is a smooth surface in R3 displaystyle mathbb R 3 The derivative or differential of a differentiable map f M N displaystyle f M to N between manifolds at a point x displaystyle x in M displaystyle M is then a linear map from the tangent space of M displaystyle M at x displaystyle x to the tangent space of N displaystyle N at f x displaystyle f x The derivative function becomes a map between the tangent bundles of M displaystyle M and N displaystyle N This definition is used in differential geometry Differentiation can also be defined for maps between vector space such as Banach space in which those generalizations are the Gateaux derivative and the Frechet derivative One deficiency of the classical derivative is that very many functions are not differentiable Nevertheless there is a way of extending the notion of the derivative so that all continuous functions and many other functions can be differentiated using a concept known as the weak derivative The idea is to embed the continuous functions in a larger space called the space of distributions and only require that a function is differentiable on average Properties of the derivative have inspired the introduction and study of many similar objects in algebra and topology an example is differential algebra Here it consists of the derivation of some topics in abstract algebra such as rings ideals field and so on The discrete equivalent of differentiation is finite differences The study of differential calculus is unified with the calculus of finite differences in time scale calculus The arithmetic derivative involves the function that is defined for the integers by the prime factorization This is an analogy with the product rule See alsoCovariant derivative Derivation Exterior derivative Functional derivative Integral Lie derivativeNotesApostol 1967 p 160 Stewart 2002 pp 129 130 Strang et al 2023 p 224 Apostol 1967 p 160 Stewart 2002 p 127 Strang et al 2023 p 220 Gonick 2012 p 83 Thomas et al 2014 p 60 Gonick 2012 p 88 Strang et al 2023 p 234 Gonick 2012 p 83 Strang et al 2023 p 232 Gonick 2012 pp 77 80 Thompson 1998 pp 34 104 Stewart 2002 p 128 Thompson 1998 pp 84 85 Keisler 2012 pp 902 904 Keisler 2012 p 45 Henle amp Kleinberg 2003 p 66 Gonick 2012 p 156 Thomas et al 2014 p 114 Strang et al 2023 p 237 Gonick 2012 p 149 Thomas et al 2014 p 113 Strang et al 2023 p 237 Gonick 2012 p 156 Thomas et al 2014 p 114 Strang et al 2023 pp 237 238 Jasek 1922 Jarnik 1922 Rychlik 1923 David 2018 Banach 1931 cited in Hewitt amp Stromberg 1965 Apostol 1967 p 172 Cajori 2007 p 204 Moore amp Siegel 2013 p 110 Varberg Purcell amp Rigdon 2007 pp 125 126 In the formulation of calculus in terms of limits various authors have assigned the du displaystyle du symbol various meanings Some authors such as Varberg Purcell amp Rigdon 2007 p 119 and Stewart 2002 p 177 do not assign a meaning to du displaystyle du by itself but only as part of the symbol dudx textstyle frac du dx Others define dx displaystyle dx as an independent variable and define du displaystyle du by du dxf x displaystyle textstyle du dxf x In non standard analysis du displaystyle du is defined as an infinitesimal It is also interpreted as the exterior derivative of a function u displaystyle u See differential infinitesimal for further information Schwartzman 1994 p 171 Cajori 1923 pp 6 7 10 12 21 24 Moore amp Siegel 2013 p 110 Goodman 1963 pp 78 79 Varberg Purcell amp Rigdon 2007 pp 125 126 Cajori 2007 p 228 Choudary amp Niculescu 2014 p 222 Apostol 1967 p 171 Evans 1999 p 63 Kreyszig 1991 p 1 Cajori 1923 Apostol 1967 p 172 Varberg Purcell amp Rigdon 2007 pp 125 126 Apostol 1967 p 160 Varberg Purcell amp Rigdon 2007 See p 133 for the power rule pp 115 116 for the trigonometric functions p 326 for the natural logarithm pp 338 339 for exponential with base e displaystyle e p 343 for the exponential with base a displaystyle a p 344 for the logarithm with base a displaystyle a and p 369 for the inverse of trigonometric functions For constant rule and sum rule see Apostol 1967 pp 161 164 respectively For the product rule quotient rule and chain rule see Varberg Purcell amp Rigdon 2007 pp 111 112 119 respectively For the special case of the product rule that is the product of a constant and a function see Varberg Purcell amp Rigdon 2007 pp 108 109 Apostol 1967 p 160 Varberg Purcell amp Rigdon 2007 pp 125 126 Warner 1983 p 5 Debnath amp Shah 2015 p 40 Carothers 2000 p 176 Stewart 2002 p 193 Stewart 2002 p 893 Stewart 2002 p 947 Christopher 2013 p 682 Stewart 2002 p 949 Silverman 1989 p 216 Bhardwaj 2005 See p 6 4 Mathai amp Haubold 2017 p 52 Gbur 2011 pp 36 37 Varberg Purcell amp Rigdon 2007 p 642 Guzman 2003 p 35 Davvaz 2023 p 266 Lee 2013 p 72 Davvaz 2023 p 267 Roussos 2014 p 303 Gbur 2011 pp 261 264 Gray Abbena amp Salamon 2006 p 826 Azegami 2020 See p 209 for the Gateaux derivative and p 211 for the Frechet derivative Funaro 1992 pp 84 85 Kolchin 1973 pp 58 126 Georgiev 2018 p 8 Barbeau 1961 ReferencesApostol Tom M June 1967 Calculus Vol 1 One Variable Calculus with an Introduction to Linear Algebra vol 1 2nd ed Wiley ISBN 978 0 471 00005 1 Azegami Hideyuki 2020 Shape Optimization Problems Springer Optimization and Its Applications vol 164 Springer doi 10 1007 978 981 15 7618 8 ISBN 978 981 15 7618 8 S2CID 226442409 Banach Stefan 1931 Uber die Baire sche Kategorie gewisser Funktionenmengen Studia Math 3 3 174 179 doi 10 4064 sm 3 1 174 179 Barbeau E J 1961 Remarks on an arithmetic derivative Canadian Mathematical Bulletin 4 2 117 122 doi 10 4153 CMB 1961 013 0 Zbl 0101 03702 Bhardwaj R S 2005 Mathematics for Economics amp Business 2nd ed Excel Books India ISBN 9788174464507 Cajori Florian 1923 The History of Notations of the Calculus Annals of Mathematics 25 1 1 46 doi 10 2307 1967725 hdl 2027 mdp 39015017345896 JSTOR 1967725 Cajori Florian 2007 A History of Mathematical Notations vol 2 Cosimo Classics ISBN 978 1 60206 713 4 Carothers N L 2000 Real Analysis Cambridge University Press Choudary A D R Niculescu Constantin P 2014 Real Analysis on Intervals Springer India doi 10 1007 978 81 322 2148 7 ISBN 978 81 322 2148 7 Christopher Essex 2013 Calculus A complete course Pearson p 682 ISBN 9780321781079 OCLC 872345701 Courant Richard John Fritz December 22 1998 Introduction to Calculus and Analysis Vol 1 Springer Verlag doi 10 1007 978 1 4613 8955 2 ISBN 978 3 540 65058 4 David Claire 2018 Bypassing dynamical systems A simple way to get the box counting dimension of the graph of the Weierstrass function Proceedings of the International Geometry Center 11 2 Academy of Sciences of Ukraine 53 68 arXiv 1711 10349 doi 10 15673 tmgc v11i2 1028 Davvaz Bijan 2023 Vectors and Functions of Several Variables Springer doi 10 1007 978 981 99 2935 1 ISBN 978 981 99 2935 1 S2CID 259885793 Debnath Lokenath Shah Firdous Ahmad 2015 Wavelet Transforms and Their Applications 2nd ed Birkhauser doi 10 1007 978 0 8176 8418 1 ISBN 978 0 8176 8418 1 Evans Lawrence 1999 Partial Differential Equations American Mathematical Society ISBN 0 8218 0772 2 Eves Howard January 2 1990 An Introduction to the History of Mathematics 6th ed Brooks Cole ISBN 978 0 03 029558 4