![Indicator function](https://www.english.nina.az/wikipedia/image/aHR0cHM6Ly91cGxvYWQud2lraW1lZGlhLm9yZy93aWtpcGVkaWEvY29tbW9ucy90aHVtYi9mL2Y1L0luZGljYXRvcl9mdW5jdGlvbl9pbGx1c3RyYXRpb24ucG5nLzE2MDBweC1JbmRpY2F0b3JfZnVuY3Rpb25faWxsdXN0cmF0aW9uLnBuZw==.png )
This article includes a list of general references, but it lacks sufficient corresponding inline citations.(December 2009) |
In mathematics, an indicator function or a characteristic function of a subset of a set is a function that maps elements of the subset to one, and all other elements to zero. That is, if A is a subset of some set X, then if and otherwise, where is a common notation for the indicator function. Other common notations are and
![image](https://www.english.nina.az/wikipedia/image/aHR0cHM6Ly93d3cuZW5nbGlzaC5uaW5hLmF6L3dpa2lwZWRpYS9pbWFnZS9hSFIwY0hNNkx5OTFjR3h2WVdRdWQybHJhVzFsWkdsaExtOXlaeTkzYVd0cGNHVmthV0V2WTI5dGJXOXVjeTkwYUhWdFlpOW1MMlkxTDBsdVpHbGpZWFJ2Y2w5bWRXNWpkR2x2Ymw5cGJHeDFjM1J5WVhScGIyNHVjRzVuTHpJeU1IQjRMVWx1WkdsallYUnZjbDltZFc1amRHbHZibDlwYkd4MWMzUnlZWFJwYjI0dWNHNW4ucG5n.png)
The indicator function of A is the Iverson bracket of the property of belonging to A; that is,
For example, the Dirichlet function is the indicator function of the rational numbers as a subset of the real numbers.
Definition
The indicator function of a subset A of a set X is a function
defined as
The Iverson bracket provides the equivalent notation, or ⟦x ∈ A⟧, to be used instead of
The function is sometimes denoted IA, χA, KA, or even just A.
Notation and terminology
The notation is also used to denote the characteristic function in convex analysis, which is defined as if using the reciprocal of the standard definition of the indicator function.
A related concept in statistics is that of a dummy variable. (This must not be confused with "dummy variables" as that term is usually used in mathematics, also called a bound variable.)
The term "characteristic function" has an unrelated meaning in classic probability theory. For this reason, traditional probabilists use the term indicator function for the function defined here almost exclusively, while mathematicians in other fields are more likely to use the term characteristic function to describe the function that indicates membership in a set.
In fuzzy logic and modern many-valued logic, predicates are the characteristic functions of a probability distribution. That is, the strict true/false valuation of the predicate is replaced by a quantity interpreted as the degree of truth.
Basic properties
The indicator or characteristic function of a subset A of some set X maps elements of X to the codomain .
This mapping is surjective only when A is a non-empty proper subset of X. If then
By a similar argument, if
then
If and
are two subsets of
then
and the indicator function of the complement of i.e.
is:
More generally, suppose is a collection of subsets of X. For any
is clearly a product of 0s and 1s. This product has the value 1 at precisely those that belong to none of the sets
and is 0 otherwise. That is
Expanding the product on the left hand side,
where is the cardinality of F. This is one form of the principle of inclusion-exclusion.
As suggested by the previous example, the indicator function is a useful notational device in combinatorics. The notation is used in other places as well, for instance in probability theory: if X is a probability space with probability measure and A is a measurable set, then
becomes a random variable whose expected value is equal to the probability of A:
This identity is used in a simple proof of Markov's inequality.
In many cases, such as order theory, the inverse of the indicator function may be defined. This is commonly called the generalized Möbius function, as a generalization of the inverse of the indicator function in elementary number theory, the Möbius function. (See paragraph below about the use of the inverse in classical recursion theory.)
Mean, variance and covariance
Given a probability space with
the indicator random variable
is defined by
if
otherwise
- Mean
(also called "Fundamental Bridge").
- Variance
- Covariance
Characteristic function in recursion theory, Gödel's and Kleene's representing function
Kurt Gödel described the representing function in his 1934 paper "On undecidable propositions of formal mathematical systems" (the "¬" indicates logical inversion, i.e. "NOT"):: 42
There shall correspond to each class or relation R a representing function
if
and
if
Kleene offers up the same definition in the context of the primitive recursive functions as a function φ of a predicate P takes on values 0 if the predicate is true and 1 if the predicate is false.
For example, because the product of characteristic functions whenever any one of the functions equals 0, it plays the role of logical OR: IF
OR
OR ... OR
THEN their product is 0. What appears to the modern reader as the representing function's logical inversion, i.e. the representing function is 0 when the function R is "true" or satisfied", plays a useful role in Kleene's definition of the logical functions OR, AND, and IMPLY,: 228 the bounded-: 228 and unbounded-: 279 ff mu operators and the CASE function.: 229
Characteristic function in fuzzy set theory
In classical mathematics, characteristic functions of sets only take values 1 (members) or 0 (non-members). In fuzzy set theory, characteristic functions are generalized to take value in the real unit interval [0, 1], or more generally, in some algebra or structure (usually required to be at least a poset or lattice). Such generalized characteristic functions are more usually called membership functions, and the corresponding "sets" are called fuzzy sets. Fuzzy sets model the gradual change in the membership degree seen in many real-world predicates like "tall", "warm", etc.
Smoothness
In general, the indicator function of a set is not smooth; it is continuous if and only if its support is a connected component. In the algebraic geometry of finite fields, however, every affine variety admits a (Zariski) continuous indicator function. Given a finite set of functions let
be their vanishing locus. Then, the function
acts as an indicator function for
. If
then
, otherwise, for some
, we have
, which implies that
, hence
.
Although indicator functions are not smooth, they admit weak derivatives. For example, consider Heaviside step function The distributional derivative of the Heaviside step function is equal to the Dirac delta function, i.e.
and similarly the distributional derivative of
is
Thus the derivative of the Heaviside step function can be seen as the inward normal derivative at the boundary of the domain given by the positive half-line. In higher dimensions, the derivative naturally generalises to the inward normal derivative, while the Heaviside step function naturally generalises to the indicator function of some domain D. The surface of D will be denoted by S. Proceeding, it can be derived that the inward normal derivative of the indicator gives rise to a 'surface delta function', which can be indicated by :
where n is the outward normal of the surface S. This 'surface delta function' has the following property:
By setting the function f equal to one, it follows that the inward normal derivative of the indicator integrates to the numerical value of the surface area S.
See also
- Dirac measure
- Laplacian of the indicator
- Dirac delta
- Extension (predicate logic)
- Free variables and bound variables
- Heaviside step function
- Identity function
- Iverson bracket
- Kronecker delta, a function that can be viewed as an indicator for the identity relation
- Macaulay brackets
- Multiset
- Membership function
- Simple function
- Dummy variable (statistics)
- Statistical classification
- Zero-one loss function
- Subobject classifier, a related concept from topos theory.
Notes
- The Greek letter χ appears because it is the initial letter of the Greek word χαρακτήρ, which is the ultimate origin of the word characteristic.
- The set of all indicator functions on X can be identified with
the power set of X. Consequently, both sets are sometimes denoted by
This is a special case (
) of the notation
for the set of all functions
References
- Davis, Martin, ed. (1965). The Undecidable. New York, NY: Raven Press Books. pp. 41–74.
- Kleene, Stephen (1971) [1952]. Introduction to Metamathematics (Sixth reprint, with corrections ed.). Netherlands: Wolters-Noordhoff Publishing and North Holland Publishing Company. p. 227.
- Serre. Course in Arithmetic. p. 5.
- Lange, Rutger-Jan (2012). "Potential theory, path integrals and the Laplacian of the indicator". Journal of High Energy Physics. 2012 (11): 29–30. arXiv:1302.0864. Bibcode:2012JHEP...11..032L. doi:10.1007/JHEP11(2012)032. S2CID 56188533.
Sources
- Folland, G.B. (1999). Real Analysis: Modern Techniques and Their Applications (Second ed.). John Wiley & Sons, Inc. ISBN 978-0-471-31716-6.
- Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L.; Stein, Clifford (2001). "Section 5.2: Indicator random variables". Introduction to Algorithms (Second ed.). MIT Press and McGraw-Hill. pp. 94–99. ISBN 978-0-262-03293-3.
- Davis, Martin, ed. (1965). The Undecidable. New York, NY: Raven Press Books.
- Kleene, Stephen (1971) [1952]. Introduction to Metamathematics (Sixth reprint, with corrections ed.). Netherlands: Wolters-Noordhoff Publishing and North Holland Publishing Company.
- Boolos, George; Burgess, John P.; Jeffrey, Richard C. (2002). Computability and Logic. Cambridge UK: Cambridge University Press. ISBN 978-0-521-00758-0.
- Zadeh, L.A. (June 1965). "Fuzzy sets". Information and Control. 8 (3). San Diego: 338–353. doi:10.1016/S0019-9958(65)90241-X. ISSN 0019-9958. Zbl 0139.24606. Wikidata Q25938993.
- Goguen, Joseph (1967). "L-fuzzy sets". Journal of Mathematical Analysis and Applications. 18 (1): 145–174. doi:10.1016/0022-247X(67)90189-8. hdl:10338.dmlcz/103980.
This article includes a list of general references but it lacks sufficient corresponding inline citations Please help to improve this article by introducing more precise citations December 2009 Learn how and when to remove this message In mathematics an indicator function or a characteristic function of a subset of a set is a function that maps elements of the subset to one and all other elements to zero That is if A is a subset of some set X then 1A x 1 displaystyle mathbf 1 A x 1 if x A displaystyle x in A and 1A x 0 displaystyle mathbf 1 A x 0 otherwise where 1A displaystyle mathbf 1 A is a common notation for the indicator function Other common notations are IA displaystyle I A and xA displaystyle chi A A three dimensional plot of an indicator function shown over a square two dimensional domain set X the raised portion overlays those two dimensional points which are members of the indicated subset A The indicator function of A is the Iverson bracket of the property of belonging to A that is 1A x x A displaystyle mathbf 1 A x x in A For example the Dirichlet function is the indicator function of the rational numbers as a subset of the real numbers DefinitionThe indicator function of a subset A of a set X is a function 1A X 0 1 displaystyle mathbf 1 A colon X to 0 1 defined as The Iverson bracket provides the equivalent notation x A displaystyle x in A or x A to be used instead of 1A x displaystyle mathbf 1 A x The function 1A displaystyle mathbf 1 A is sometimes denoted IA xA KA or even just A Notation and terminologyThe notation xA displaystyle chi A is also used to denote the characteristic function in convex analysis which is defined as if using the reciprocal of the standard definition of the indicator function A related concept in statistics is that of a dummy variable This must not be confused with dummy variables as that term is usually used in mathematics also called a bound variable The term characteristic function has an unrelated meaning in classic probability theory For this reason traditional probabilists use the term indicator function for the function defined here almost exclusively while mathematicians in other fields are more likely to use the term characteristic function to describe the function that indicates membership in a set In fuzzy logic and modern many valued logic predicates are the characteristic functions of a probability distribution That is the strict true false valuation of the predicate is replaced by a quantity interpreted as the degree of truth Basic propertiesThe indicator or characteristic function of a subset A of some set X maps elements of X to the codomain 0 1 displaystyle 0 1 This mapping is surjective only when A is a non empty proper subset of X If A X displaystyle A X then 1A 1 displaystyle mathbf 1 A equiv 1 By a similar argument if A displaystyle A emptyset then 1A 0 displaystyle mathbf 1 A equiv 0 If A displaystyle A and B displaystyle B are two subsets of X displaystyle X then 1A B min 1A 1B 1A 1B 1A B max 1A 1B 1A 1B 1A 1B displaystyle begin aligned mathbf 1 A cap B amp min mathbf 1 A mathbf 1 B mathbf 1 A cdot mathbf 1 B mathbf 1 A cup B amp max mathbf 1 A mathbf 1 B mathbf 1 A mathbf 1 B mathbf 1 A cdot mathbf 1 B end aligned and the indicator function of the complement of A displaystyle A i e AC displaystyle A C is 1A 1 1A displaystyle mathbf 1 A complement 1 mathbf 1 A More generally suppose A1 An displaystyle A 1 dotsc A n is a collection of subsets of X For any x X displaystyle x in X k I 1 1Ak x displaystyle prod k in I 1 mathbf 1 A k x is clearly a product of 0 s and 1 s This product has the value 1 at precisely those x X displaystyle x in X that belong to none of the sets Ak displaystyle A k and is 0 otherwise That is k I 1 1Ak 1X kAk 1 1 kAk displaystyle prod k in I 1 mathbf 1 A k mathbf 1 X bigcup k A k 1 mathbf 1 bigcup k A k Expanding the product on the left hand side 1 kAk 1 F 1 2 n 1 F 1 FAk F 1 2 n 1 F 11 FAk displaystyle mathbf 1 bigcup k A k 1 sum F subseteq 1 2 dotsc n 1 F mathbf 1 bigcap F A k sum emptyset neq F subseteq 1 2 dotsc n 1 F 1 mathbf 1 bigcap F A k where F displaystyle F is the cardinality of F This is one form of the principle of inclusion exclusion As suggested by the previous example the indicator function is a useful notational device in combinatorics The notation is used in other places as well for instance in probability theory if X is a probability space with probability measure P displaystyle operatorname P and A is a measurable set then 1A displaystyle mathbf 1 A becomes a random variable whose expected value is equal to the probability of A E 1A X1A x dP AdP P A displaystyle operatorname E mathbf 1 A int X mathbf 1 A x d operatorname P int A d operatorname P operatorname P A This identity is used in a simple proof of Markov s inequality In many cases such as order theory the inverse of the indicator function may be defined This is commonly called the generalized Mobius function as a generalization of the inverse of the indicator function in elementary number theory the Mobius function See paragraph below about the use of the inverse in classical recursion theory Mean variance and covarianceGiven a probability space W F P displaystyle textstyle Omega mathcal F operatorname P with A F displaystyle A in mathcal F the indicator random variable 1A W R displaystyle mathbf 1 A colon Omega rightarrow mathbb R is defined by 1A w 1 displaystyle mathbf 1 A omega 1 if w A displaystyle omega in A otherwise 1A w 0 displaystyle mathbf 1 A omega 0 Mean E 1A w P A displaystyle operatorname E mathbf 1 A omega operatorname P A also called Fundamental Bridge Variance Var 1A w P A 1 P A displaystyle operatorname Var mathbf 1 A omega operatorname P A 1 operatorname P A Covariance Cov 1A w 1B w P A B P A P B displaystyle operatorname Cov mathbf 1 A omega mathbf 1 B omega operatorname P A cap B operatorname P A operatorname P B Characteristic function in recursion theory Godel s and Kleene s representing functionKurt Godel described the representing function in his 1934 paper On undecidable propositions of formal mathematical systems the indicates logical inversion i e NOT 42 There shall correspond to each class or relation R a representing function ϕ x1 xn 0 displaystyle phi x 1 ldots x n 0 if R x1 xn displaystyle R x 1 ldots x n and ϕ x1 xn 1 displaystyle phi x 1 ldots x n 1 if R x1 xn displaystyle neg R x 1 ldots x n Kleene offers up the same definition in the context of the primitive recursive functions as a function f of a predicate P takes on values 0 if the predicate is true and 1 if the predicate is false For example because the product of characteristic functions ϕ1 ϕ2 ϕn 0 displaystyle phi 1 phi 2 cdots phi n 0 whenever any one of the functions equals 0 it plays the role of logical OR IF ϕ1 0 displaystyle phi 1 0 OR ϕ2 0 displaystyle phi 2 0 OR OR ϕn 0 displaystyle phi n 0 THEN their product is 0 What appears to the modern reader as the representing function s logical inversion i e the representing function is 0 when the function R is true or satisfied plays a useful role in Kleene s definition of the logical functions OR AND and IMPLY 228 the bounded 228 and unbounded 279 ff mu operators and the CASE function 229 Characteristic function in fuzzy set theoryIn classical mathematics characteristic functions of sets only take values 1 members or 0 non members In fuzzy set theory characteristic functions are generalized to take value in the real unit interval 0 1 or more generally in some algebra or structure usually required to be at least a poset or lattice Such generalized characteristic functions are more usually called membership functions and the corresponding sets are called fuzzy sets Fuzzy sets model the gradual change in the membership degree seen in many real world predicates like tall warm etc SmoothnessIn general the indicator function of a set is not smooth it is continuous if and only if its support is a connected component In the algebraic geometry of finite fields however every affine variety admits a Zariski continuous indicator function Given a finite set of functions fa Fq x1 xn displaystyle f alpha in mathbb F q x 1 ldots x n let V x Fqn fa x 0 displaystyle V left x in mathbb F q n f alpha x 0 right be their vanishing locus Then the function P x 1 fa x q 1 textstyle P x prod left 1 f alpha x q 1 right acts as an indicator function for V displaystyle V If x V displaystyle x in V then P x 1 displaystyle P x 1 otherwise for some fa displaystyle f alpha we have fa x 0 displaystyle f alpha x neq 0 which implies that fa x q 1 1 displaystyle f alpha x q 1 1 hence P x 0 displaystyle P x 0 Although indicator functions are not smooth they admit weak derivatives For example consider Heaviside step function H x 1x gt 0 displaystyle H x mathbf 1 x gt 0 The distributional derivative of the Heaviside step function is equal to the Dirac delta function i e dH x dx d x displaystyle frac dH x dx delta x and similarly the distributional derivative of G x 1x lt 0 displaystyle G x mathbf 1 x lt 0 is dG x dx d x displaystyle frac dG x dx delta x Thus the derivative of the Heaviside step function can be seen as the inward normal derivative at the boundary of the domain given by the positive half line In higher dimensions the derivative naturally generalises to the inward normal derivative while the Heaviside step function naturally generalises to the indicator function of some domain D The surface of D will be denoted by S Proceeding it can be derived that the inward normal derivative of the indicator gives rise to a surface delta function which can be indicated by dS x displaystyle delta S mathbf x dS x nx x1x D displaystyle delta S mathbf x mathbf n x cdot nabla x mathbf 1 mathbf x in D where n is the outward normal of the surface S This surface delta function has the following property Rnf x nx x1x Ddnx Sf b dn 1b displaystyle int mathbb R n f mathbf x mathbf n x cdot nabla x mathbf 1 mathbf x in D d n mathbf x oint S f mathbf beta d n 1 mathbf beta By setting the function f equal to one it follows that the inward normal derivative of the indicator integrates to the numerical value of the surface area S See alsoDirac measure Laplacian of the indicator Dirac delta Extension predicate logic Free variables and bound variables Heaviside step function Identity function Iverson bracket Kronecker delta a function that can be viewed as an indicator for the identity relation Macaulay brackets Multiset Membership function Simple function Dummy variable statistics Statistical classification Zero one loss function Subobject classifier a related concept from topos theory NotesThe Greek letter x appears because it is the initial letter of the Greek word xarakthr which is the ultimate origin of the word characteristic The set of all indicator functions on X can be identified with P X displaystyle mathcal P X the power set of X Consequently both sets are sometimes denoted by 2X displaystyle 2 X This is a special case Y 0 1 2 displaystyle Y 0 1 2 of the notation YX displaystyle Y X for the set of all functions f X Y displaystyle f X to Y ReferencesDavis Martin ed 1965 The Undecidable New York NY Raven Press Books pp 41 74 Kleene Stephen 1971 1952 Introduction to Metamathematics Sixth reprint with corrections ed Netherlands Wolters Noordhoff Publishing and North Holland Publishing Company p 227 Serre Course in Arithmetic p 5 Lange Rutger Jan 2012 Potential theory path integrals and the Laplacian of the indicator Journal of High Energy Physics 2012 11 29 30 arXiv 1302 0864 Bibcode 2012JHEP 11 032L doi 10 1007 JHEP11 2012 032 S2CID 56188533 SourcesFolland G B 1999 Real Analysis Modern Techniques and Their Applications Second ed John Wiley amp Sons Inc ISBN 978 0 471 31716 6 Cormen Thomas H Leiserson Charles E Rivest Ronald L Stein Clifford 2001 Section 5 2 Indicator random variables Introduction to Algorithms Second ed MIT Press and McGraw Hill pp 94 99 ISBN 978 0 262 03293 3 Davis Martin ed 1965 The Undecidable New York NY Raven Press Books Kleene Stephen 1971 1952 Introduction to Metamathematics Sixth reprint with corrections ed Netherlands Wolters Noordhoff Publishing and North Holland Publishing Company Boolos George Burgess John P Jeffrey Richard C 2002 Computability and Logic Cambridge UK Cambridge University Press ISBN 978 0 521 00758 0 Zadeh L A June 1965 Fuzzy sets Information and Control 8 3 San Diego 338 353 doi 10 1016 S0019 9958 65 90241 X ISSN 0019 9958 Zbl 0139 24606 Wikidata Q25938993 Goguen Joseph 1967 L fuzzy sets Journal of Mathematical Analysis and Applications 18 1 145 174 doi 10 1016 0022 247X 67 90189 8 hdl 10338 dmlcz 103980