Functions:Inverses
In mathematics, an inverse function (or anti-function) is a function that "reverses" another function: if the function f applied to an input x gives a result of y, then applying its inverse function g to y gives the result x, i.e., g(y) = x if and only if f(x) = y. The inverse function of f is also denoted as f⁻¹.
As an example, consider the real-valued function of a real variable given by f(x) = 5x − 7. Thinking of this as a step-by-step procedure (namely, take a number x, multiply it by 5, then subtract 7 from the result), to reverse this and get x back from some output value, say y, we would undo each step in reverse order. In this case, it means to add 7 to y, and then divide the result by 5. In functional notation, this inverse function would be given by
g(y) = (y + 7)/5.
With y = 5x − 7 we have that f(x) = y and g(y) = x.
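The round trip described above can be checked numerically. The following is a minimal Python sketch; the names f and g mirror the text, while the sample values and the use of fractions.Fraction for exact arithmetic are assumptions made for illustration.

```python
from fractions import Fraction

def f(x):
    """Multiply by 5, then subtract 7."""
    return 5 * x - 7

def g(y):
    """Undo f in reverse order: add 7, then divide by 5."""
    return Fraction(y + 7) / 5

# Illustrative sample inputs (an assumption, not from the text).
samples = [Fraction(-3), Fraction(0), Fraction(1, 2), Fraction(4)]

# Applying g after f, or f after g, returns the starting value.
assert all(g(f(x)) == x for x in samples)
assert all(f(g(y)) == y for y in samples)
```

Exact fractions are used so that the equality checks are not affected by floating-point rounding.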
Not all functions have inverse functions. Those that do are called invertible. For a function f: X → Y to have an inverse, it must have the property that for every y in Y, there is exactly one x in X such that f(x) = y. This property ensures that a function g: Y → X exists with the necessary relationship with f.
Definitions
Let f be a function whose domain is the set X, and whose codomain is the set Y. Then f is invertible if there exists a function g with domain Y and codomain X, with the property:
f(x) = y ⇔ g(y) = x.
If f is invertible, then the function g is unique, which means that there is exactly one function g satisfying this property. It also follows that the ranges of g and f equal their respective codomains. The function g is called the inverse of f, and is usually denoted as f⁻¹, a notation introduced by John Frederick William Herschel in 1813.
Stated otherwise, a function, considered as a binary relation, has an inverse if and only if the converse relation is a function on the codomain Y, in which case the converse relation is the inverse function.
Not all functions have an inverse. For a function to have an inverse, each element y ∈ Y must correspond to no more than one x ∈ X; a function f with this property is called one-to-one or an injection. If f⁻¹ is to be a function on Y, then each element y ∈ Y must correspond to some x ∈ X. Functions with this property are called surjections. This property is satisfied by definition if Y is the image of f, but may not hold in a more general context. To be invertible, a function must be both an injection and a surjection. Such functions are called bijections. The inverse of an injection f: X → Y that is not a bijection (that is, not a surjection) is only a partial function on Y, which means that for some y ∈ Y, f⁻¹(y) is undefined. If a function f is invertible, then both it and its inverse function f⁻¹ are bijections.
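For functions between finite sets, the three properties above can be tested directly. The sketch below is illustrative only: the helper names (is_injective, is_surjective, is_bijective) and the sample functions are assumptions, not part of the text.

```python
def is_injective(f, domain):
    """True if no two distinct elements of the domain share an output."""
    inputs = set(domain)
    return len({f(x) for x in inputs}) == len(inputs)

def is_surjective(f, domain, codomain):
    """True if every element of the codomain is the output of some input."""
    return set(codomain) <= {f(x) for x in domain}

def is_bijective(f, domain, codomain):
    """True if f is both injective and surjective."""
    return is_injective(f, domain) and is_surjective(f, domain, codomain)

X = [-2, -1, 0, 1, 2]
square = lambda x: x * x
double = lambda x: 2 * x
shift = lambda x: x + 1

print(is_injective(square, X), is_surjective(square, X, [0, 1, 4]))     # False True
print(is_injective(double, X), is_surjective(double, X, range(-4, 5)))  # True False
print(is_bijective(shift, X, [-1, 0, 1, 2, 3]))                         # True
```

Note that surjectivity depends on the chosen codomain: squaring is surjective onto {0, 1, 4} but would not be surjective onto a larger set.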
Another convention is used in the definition of functions, referred to as the "set-theoretic" or "graph" definition using ordered pairs, which makes the codomain and image of the function the same. Under this convention, all functions are surjective, so bijectivity and injectivity are the same. Authors using this convention may use the phrasing that a function is invertible if and only if it is an injection. The two conventions need not cause confusion, as long as it is remembered that in this alternate convention, the codomain of a function is always taken to be the image of the function.
Example: Squaring and square root functions
The function f: R → [0,∞) given by f(x) = x² is not injective, since each possible result y (except 0) corresponds to two different starting points in R, one positive and one negative, and so this function is not invertible. With this type of function, it is impossible to deduce a (unique) input from its output. Such a function is called non-injective or, in some applications, information-losing.
If the domain of the function is restricted to the nonnegative reals, that is, the function is redefined to be f: [0, ∞) → [0, ∞) with the same rule as before, then the function is bijective and so, invertible. The inverse function here is called the (positive) square root function.
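A short Python sketch of this example, using the standard-library math.sqrt as the positive square root; the function name square and the sample values are illustrative assumptions.

```python
import math

def square(x):
    return x * x

# On all of R, two different inputs give the same output,
# so the input cannot be recovered from the output.
assert square(-3.0) == square(3.0)

# On [0, ∞), the positive square root undoes squaring (up to rounding).
for x in [0.0, 0.5, 2.0, 10.0]:
    assert math.isclose(math.sqrt(square(x)), x)

# Outside the restricted domain, the square root returns the nonnegative
# starting point rather than the original input.
assert math.sqrt(square(-3.0)) == 3.0
```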
Inverses and composition
If f is an invertible function with domain X and codomain Y, then
- f⁻¹(f(x)) = x, for every x ∈ X; and f(f⁻¹(y)) = y, for every y ∈ Y.
Using the composition of functions, we can rewrite this statement as follows:
- f⁻¹ ∘ f = idX and f ∘ f⁻¹ = idY,
where idX is the identity function on the set X; that is, the function that leaves its argument unchanged. In category theory, this statement is used as the definition of an inverse morphism.
Considering function composition helps to understand the notation f⁻¹. Repeatedly composing a function with itself is called iteration. If f is applied n times, starting with the value x, then this is written as fⁿ(x); so f²(x) = f(f(x)), etc. Since f⁻¹(f(x)) = x, composing f⁻¹ and fⁿ yields fⁿ⁻¹, "undoing" the effect of one application of f.
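The relationship between iteration and the inverse can be sketched in Python. The helpers compose and iterate, and the sample function f(x) = 2x + 1, are assumptions chosen for illustration.

```python
def compose(g, f):
    """Return g ∘ f, the function that applies f first and then g."""
    return lambda x: g(f(x))

def iterate(f, n):
    """Return the n-fold composition of f with itself (n ≥ 0)."""
    def f_n(x):
        for _ in range(n):
            x = f(x)
        return x
    return f_n

f = lambda x: 2 * x + 1
# Integer division is exact here because every output of f on an integer is odd.
f_inv = lambda y: (y - 1) // 2

x = 5
# Composing the inverse with three applications of f leaves two applications.
assert compose(f_inv, iterate(f, 3))(x) == iterate(f, 2)(x)
# Zero applications is the identity.
assert iterate(f, 0)(x) == x
```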
Notation
While the notation f⁻¹(x) might be misunderstood, (f(x))⁻¹ certainly denotes the multiplicative inverse of f(x) and has nothing to do with the inverse function of f.
In keeping with the general notation, some English authors use expressions like sin⁻¹(x) to denote the inverse of the sine function applied to x (actually a partial inverse; see below). Other authors feel that this may be confused with the notation for the multiplicative inverse of sin(x), which can be denoted as (sin(x))⁻¹. To avoid any confusion, an inverse trigonometric function is often indicated by the prefix "arc" (for Latin arcus). For instance, the inverse of the sine function is typically called the arcsine function, written as arcsin(x). Similarly, the inverse of a hyperbolic function is indicated by the prefix "ar" (for Latin ārea). For instance, the inverse of the hyperbolic sine function is typically written as arsinh(x). Other inverse special functions are sometimes given the prefix "inv" when the ambiguity of the f⁻¹ notation should be avoided.
Properties
Since a function is a special type of binary relation, many of the properties of an inverse function correspond to properties of converse relations.
Uniqueness
If an inverse function exists for a given function f, then it is unique. This follows since the inverse function must be the converse relation, which is completely determined by f.
Symmetry
There is a symmetry between a function and its inverse. Specifically, if f is an invertible function with domain X and codomain Y, then its inverse f⁻¹ has domain Y and image X, and the inverse of f⁻¹ is the original function f. In symbols, for functions f: X → Y and f⁻¹: Y → X,
- f⁻¹ ∘ f = idX and f ∘ f⁻¹ = idY.
This statement is a consequence of the fact that for f to be invertible it must be bijective. The involutory nature of the inverse can be concisely expressed by
(f⁻¹)⁻¹ = f.
The inverse of a composition of functions is given by
(g ∘ f)⁻¹ = f⁻¹ ∘ g⁻¹.
Notice that the order of g and f has been reversed; to undo f followed by g, we must first undo g, and then undo f.
For example, let f(x) = 3x and let g(x) = x + 5. Then the composition g ∘ f is the function that first multiplies by three and then adds five,
(g ∘ f)(x) = 3x + 5.
To reverse this process, we must first subtract five, and then divide by three,
(g ∘ f)⁻¹(x) = (x − 5)/3.
This is the composition (f⁻¹ ∘ g⁻¹)(x).
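The order reversal can be checked with the same f and g. This is a minimal Python sketch; the variable names (g_after_f, undo, wrong_order) and the use of fractions.Fraction are illustrative assumptions.

```python
from fractions import Fraction

f = lambda x: 3 * x
g = lambda x: x + 5
f_inv = lambda y: Fraction(y) / 3
g_inv = lambda y: y - 5

g_after_f = lambda x: g(f(x))            # first multiply by three, then add five
undo = lambda y: f_inv(g_inv(y))         # first subtract five, then divide by three
wrong_order = lambda y: g_inv(f_inv(y))  # first divide by three, then subtract five

x = 4
y = g_after_f(x)
assert y == 17
assert undo(y) == x          # undoing g first recovers x
assert wrong_order(y) != x   # undoing f first does not
```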
Self-inverses
If X is a set, then the identity function on X is its own inverse:
(idX)⁻¹ = idX.
More generally, a function f : X → X is equal to its own inverse, if and only if the composition f ∘ f is equal to idX. Such a function is called an involution.
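As a small illustration, the condition that f ∘ f is the identity can be tested on a finite sample of inputs. The helper is_involution and the sample functions below are assumptions; passing on a sample does not prove the property on an infinite set.

```python
def is_involution(f, domain):
    """True if applying f twice returns every element of the domain unchanged."""
    return all(f(f(x)) == x for x in domain)

sample = range(-5, 6)
identity = lambda x: x
reflect = lambda x: 1 - x   # 1 - (1 - x) = x
shift = lambda x: x + 1     # applying twice gives x + 2

assert is_involution(identity, sample)
assert is_involution(reflect, sample)
assert not is_involution(shift, sample)
```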
Generalizations
Partial inverses
Even if a function f is not one-to-one, it may be possible to define a partial inverse of f by restricting the domain. For example, the function
f(x) = x²
is not one-to-one, since x² = (−x)². However, the function becomes one-to-one if we restrict to the domain x ≥ 0, in which case
f⁻¹(y) = √y.
(If we instead restrict to the domain x ≤ 0, then the inverse is the negative of the square root of y.) Alternatively, there is no need to restrict the domain if we are content with the inverse being a multivalued function:
f⁻¹(y) = ±√y.
Sometimes, this multivalued inverse is called the full inverse of f, and the portions (such as √y and −√y) are called branches. The most important branch of a multivalued function (e.g. the positive square root) is called the principal branch, and its value at y is called the principal value of f⁻¹(y).
For a continuous function on the real line, one branch is required between each pair of local extrema. For example, the inverse of a cubic function with a local maximum and a local minimum has three branches (see the adjacent picture).
These considerations are particularly important for defining the inverses of trigonometric functions. For example, the sine function is not one-to-one, since
sin(x + 2π) = sin(x)
for every real x (and more generally sin(x + 2πn) = sin(x) for every integer n). However, the sine is one-to-one on the interval [−π/2, π/2], and the corresponding partial inverse is called the arcsine. This is considered the principal branch of the inverse sine, so the principal value of the inverse sine is always between −π/2 and π/2. The following table describes the principal branch of each inverse trigonometric function:
| function | Range of usual principal value |
| --- | --- |
| arcsin | −π/2 ≤ sin⁻¹(x) ≤ π/2 |
| arccos | 0 ≤ cos⁻¹(x) ≤ π |
| arctan | −π/2 < tan⁻¹(x) < π/2 |
| arccot | 0 < cot⁻¹(x) < π |
| arcsec | 0 ≤ sec⁻¹(x) ≤ π |
| arccsc | −π/2 ≤ csc⁻¹(x) ≤ π/2 |
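The principal value of the inverse sine can be observed with Python's standard-library math.asin, which returns results in [−π/2, π/2]. The sample angle 3π/4 is an illustrative assumption.

```python
import math

x = 3 * math.pi / 4           # lies outside [−π/2, π/2]
y = math.sin(x)

principal = math.asin(y)      # principal value of the inverse sine at y
assert -math.pi / 2 <= principal <= math.pi / 2

# The principal value is π/4, not the original angle 3π/4 ...
assert math.isclose(principal, math.pi / 4)
assert not math.isclose(principal, x)

# ... although the original angle is recovered from another branch, π − arcsin(y).
assert math.isclose(math.pi - principal, x)
```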
Left and right inverses
Left and right inverses are not necessarily the same. If g is a left inverse for f, then g may or may not be a right inverse for f; and if g is a right inverse for f, then g is not necessarily a left inverse for f. For example, let f: R → [0, ∞) denote the squaring map, such that f(x) = x² for all x in R, and let g: [0, ∞) → R denote the square root map, such that g(x) = √x for all x ≥ 0. Then f(g(x)) = x for all x in [0, ∞); that is, g is a right inverse to f. However, g is not a left inverse to f, since, e.g., g(f(−1)) = 1 ≠ −1.
Left inverses
If f: X → Y, a left inverse for f (or retraction of f) is a function g: Y → X such that composing f with g from the left gives the identity function:
g ∘ f = idX.
That is, the function g satisfies the rule
- If f(x) = y, then g(y) = x.
Thus, g must equal the inverse of f on the image of f, but may take any values for elements of Y not in the image.
A function f is injective if and only if it has a left inverse or is the empty function.
- If g is a left inverse of f, then f is injective: if f(x) = f(y), then x = g(f(x)) = g(f(y)) = y.
- If f: X→Y is injective, f either is the empty function (X = ∅) or has a left inverse g: Y → X (X ≠ ∅), which can be constructed as follows: for all y ∈ Y, if y is in the image of f (there exists x ∈ X such that f(x) = y), let g(y) = x (x is unique because f is injective); otherwise, let g(y) be an arbitrary element of X. For all x ∈ X, f(x) is in the image of f, so g(f(x)) = x by above, so g is a left inverse of f.
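For finite sets, the construction in the second item above can be sketched in Python. The function name left_inverse, its default parameter (the arbitrary element of X used outside the image), and the sample sets are assumptions made for illustration.

```python
def left_inverse(f, X, Y, default):
    """Build g: Y → X (as a dict) with g[f(x)] == x for an injective f.

    `default` is the arbitrary element of X assigned to every y
    that is not in the image of f.
    """
    on_image = {}
    for x in X:
        y = f(x)
        if y in on_image:
            raise ValueError("f is not injective")
        on_image[y] = x
    return {y: on_image.get(y, default) for y in Y}

X = [0, 1, 2]
Y = range(7)
f = lambda x: 2 * x + 1        # injective, with image {1, 3, 5}

g = left_inverse(f, X, Y, default=0)

# g undoes f on every element of X.
assert all(g[f(x)] == x for x in X)
# g is not a right inverse: 0 is not in the image of f, so f(g[0]) differs from 0.
assert f(g[0]) != 0
```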
In classical mathematics, every injective function f with a nonempty domain necessarily has a left inverse; however, this may fail in constructive mathematics. For instance, a left inverse of the inclusion {0,1} → R of the two-element set in the reals violates indecomposability by giving a retraction of the real line to the set {0,1}.
Right inverses
A right inverse for f (or section of f) is a function h: Y → X such that
f ∘ h = idY.
That is, the function h satisfies the rule
- If h(y) = x, then f(x) = y.
Thus, h(y) may be any of the elements of X that map to y under f.
A function f has a right inverse if and only if it is surjective (though constructing such an inverse in general requires the axiom of choice).
- If h is a right inverse of f, then f is surjective. For all y ∈ Y, there is x = h(y) such that f(x) = f(h(y)) = y.
- If f is surjective, f has a right inverse h, which can be constructed as follows: for all y ∈ Y, there is at least one x ∈ X such that f(x) = y (because f is surjective), so we choose one to be the value of h(y).
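For finite sets, the choice in the construction above can be made by a simple search, so no appeal to the axiom of choice is needed. The sketch below is illustrative: the function name right_inverse, the rule of keeping the first preimage found, and the sample sets are all assumptions.

```python
def right_inverse(f, X, Y):
    """Build h: Y → X (as a dict) with f(h[y]) == y for a surjective f."""
    chosen = {}
    for x in X:
        # Keep the first x found for each output; any other preimage would also work.
        chosen.setdefault(f(x), x)
    if any(y not in chosen for y in Y):
        raise ValueError("f is not surjective onto Y")
    return {y: chosen[y] for y in Y}

X = [-2, -1, 0, 1, 2]
Y = [0, 1, 4]
f = lambda x: x * x            # surjective onto Y, but not injective

h = right_inverse(f, X, Y)

# f undoes h on every element of Y.
assert all(f(h[y]) == y for y in Y)
# h is not a left inverse: f(2) is 4, but h picked -2 as the preimage of 4.
assert h[f(2)] != 2
```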
Two-sided inverses
An inverse that is both a left and right inverse (a two-sided inverse), if it exists, must be unique. In fact, if a function has a left inverse and a right inverse, they are both the same two-sided inverse, so it can be called the inverse.
- If g is a left inverse and h a right inverse of f, then for all y ∈ Y, g(y) = g(f(h(y))) = h(y).
A function has a two-sided inverse if and only if it is bijective.
- A bijective function f is injective, so it has a left inverse (if f is the empty function, f: ∅ → ∅ is its own left inverse). f is surjective, so it has a right inverse. By the above, the left and right inverse are the same.
- If f has a two-sided inverse g, then g is a left inverse and right inverse of f, so f is injective and surjective.
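For a finite function stored as a Python dict, a two-sided inverse can be obtained by swapping keys and values, provided the function is a bijection. The helper name invert and the sample mapping are illustrative assumptions.

```python
def invert(mapping, codomain):
    """Return the inverse of a finite function given as a dict.

    Raises ValueError if the function is not a bijection onto `codomain`.
    """
    inverse = {y: x for x, y in mapping.items()}
    if len(inverse) != len(mapping):
        raise ValueError("not injective: two inputs share an output")
    if set(inverse) != set(codomain):
        raise ValueError("image differs from the given codomain")
    return inverse

f = {1: "a", 2: "b", 3: "c"}
g = invert(f, ["a", "b", "c"])

assert all(g[f[x]] == x for x in f)   # g is a left inverse of f
assert all(f[g[y]] == y for y in g)   # g is a right inverse of f
assert invert(g, [1, 2, 3]) == f      # the inverse of the inverse is f
```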
Preimages
If f: X → Y is any function (not necessarily invertible), the preimage (or inverse image) of an element y ∈ Y is the set of all elements of X that map to y:
f⁻¹({y}) = {x ∈ X : f(x) = y}.
The preimage of y can be thought of as the image of y under the (multivalued) full inverse of the function f.
Similarly, if S is any subset of Y, the preimage of S, denoted f⁻¹(S), is the set of all elements of X that map to S:
f⁻¹(S) = {x ∈ X : f(x) ∈ S}.
For example, take a function f: R → R, where f: x ↦ x². This function is not invertible for reasons discussed in Example: Squaring and square root functions. Yet preimages may be defined for subsets of the codomain:
f⁻¹({1, 4, 9, 16}) = {−4, −3, −2, −1, 1, 2, 3, 4}.
The preimage of a single element y ∈ Y – a singleton set {y} – is sometimes called the fiber of y. When Y is the set of real numbers, it is common to refer to f⁻¹({y}) as a level set.
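Preimages can be computed directly when the domain is finite. In the sketch below, the real line is replaced by a finite window of integers; that substitution, the helper name preimage, and the sample sets are assumptions made for illustration.

```python
def preimage(f, X, S):
    """Return the set of all x in X with f(x) in S."""
    return {x for x in X if f(x) in S}

X = range(-10, 11)             # finite stand-in for the domain
f = lambda x: x * x            # not invertible, but preimages are still defined

assert preimage(f, X, {1, 4, 9, 16}) == {-4, -3, -2, -1, 1, 2, 3, 4}
assert preimage(f, X, {4}) == {-2, 2}     # the fiber of 4
assert preimage(f, X, {-1}) == set()      # nothing maps to −1
```

A preimage can be empty or contain several elements, which is why it is defined as a set rather than as a single value.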
Licensing
Content obtained and/or adapted from:
- Inverse function, Wikipedia under a CC BY-SA license