Change of variables
In mathematics, a change of variables is a basic technique for simplifying problems: the original variables are replaced with new variables that are related to the originals in some specified way. The intent is that, when expressed in the new variables, the problem becomes simpler or equivalent to a better-understood problem.
A very simple example of a useful variable change can be seen in the problem of finding the roots of the sixth-order polynomial equation
$$x^6 - 9x^3 + 8 = 0.$$
Sixth-order polynomial equations generally cannot be solved in terms of radicals (see the Abel-Ruffini theorem). This particular equation, however, may be simplified by defining a new variable $u = x^3$. Substituting this into the polynomial gives
$$u^2 - 9u + 8 = 0,$$
which is just a quadratic equation with solutions
$$u = 1 \quad \text{and} \quad u = 8.$$
The real solutions in terms of the original variable are obtained by substituting $x^3$ back in for $u$:
$$x^3 = 1 \quad \text{and} \quad x^3 = 8 \quad \Rightarrow \quad x = 1 \quad \text{and} \quad x = 2.$$
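The substitution $u = x^3$ can be sketched in a few lines of Python. This is only an illustration for sextics of the form $a x^6 + b x^3 + c = 0$, using $x^6 - 9x^3 + 8 = 0$ as the test case; the function names are made up for this sketch, and only real roots are returned.

```python
import math


def real_cube_root(u):
    """Real cube root that also handles negative inputs."""
    return math.copysign(abs(u) ** (1.0 / 3.0), u)


def solve_quadratic(a, b, c):
    """Real solutions of a*u**2 + b*u + c = 0, in ascending order."""
    disc = b * b - 4 * a * c
    if disc < 0:
        return []
    root = math.sqrt(disc)
    return sorted({(-b - root) / (2 * a), (-b + root) / (2 * a)})


def real_roots_by_substitution(a, b, c):
    """Real roots of a*x**6 + b*x**3 + c = 0 using u = x**3."""
    # Step 1: solve the quadratic in the new variable u.
    u_values = solve_quadratic(a, b, c)
    # Step 2: undo the substitution, x = u**(1/3).
    return [real_cube_root(u) for u in u_values]


roots = real_roots_by_substitution(1, -9, 8)
for x in roots:
    print(x, x**6 - 9 * x**3 + 8)  # residual should be close to zero
```

The remaining four roots of the sextic are complex and would require the complex cube roots of each value of $u$.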
Simple example
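Consider the system of equations
$$xy + x + y = 71$$
$$x^2 y + x y^2 = 880$$
where $x$ and $y$ are positive integers with $x > y$. (Source: 1991 AIME)
Solving this directly is not difficult, but it can become tedious. However, the second equation can be rewritten as $xy(x + y) = 880$. Making the substitution $s = x + y$, $t = xy$ reduces the system to $s + t = 71$, $st = 880$. Solving this gives $(s, t) = (16, 55)$ or $(s, t) = (55, 16)$.
Back-substituting the first ordered pair gives $x + y = 16$, $xy = 55$, which easily gives the solution $(x, y) = (11, 5)$. Back-substituting the second ordered pair gives $x + y = 55$, $xy = 16$, which has no solutions in positive integers. Hence the solution of the system is $(x, y) = (11, 5)$.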
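The two-stage structure of this argument (solve for the new variables, then back-substitute) can be sketched in Python. The sketch assumes the system $xy + x + y = 71$, $x^2 y + x y^2 = 880$ with $s = x + y$ and $t = xy$; the helper names are hypothetical.

```python
import math


def pairs_with_sum_and_product(total, product):
    """Positive-integer pairs (a, b), a >= b, with a + b = total and a * b = product."""
    # a and b are the roots of w**2 - total*w + product = 0.
    disc = total * total - 4 * product
    if disc < 0:
        return []
    root = math.isqrt(disc)
    if root * root != disc or (total + root) % 2 != 0:
        return []  # the roots are not integers
    a, b = (total + root) // 2, (total - root) // 2
    return [(a, b)] if b > 0 else []


def solve_system():
    """Solve xy + x + y = 71, x^2 y + x y^2 = 880 over positive integers, x > y."""
    solutions = []
    # Step 1: s + t = 71 and s * t = 880, so s and t form a sum/product pair.
    for big, small in pairs_with_sum_and_product(71, 880):
        # Either of the two values may play the role of s = x + y.
        for s, t in {(big, small), (small, big)}:
            # Step 2: back-substitute, x + y = s and x * y = t.
            for x, y in pairs_with_sum_and_product(s, t):
                if x > y:
                    solutions.append((x, y))
    return sorted(solutions)


solutions = solve_system()
print(solutions)
```

Each stage only requires solving a quadratic, which is the point of the substitution: the original system is of higher degree in $x$ and $y$.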
Formal introduction
Let $A$, $B$ be smooth manifolds and let $\Phi: A \rightarrow B$ be a $C^r$-diffeomorphism between them, that is: $\Phi$ is an $r$ times continuously differentiable, bijective map from $A$ to $B$ with $r$ times continuously differentiable inverse from $B$ to $A$. Here $r$ may be any natural number (or zero), $\infty$ (smooth) or $\omega$ (analytic).
The map $\Phi$ is called a regular coordinate transformation or regular variable substitution, where regular refers to the $C^r$-ness of $\Phi$. Usually one will write $x = \Phi(y)$ to indicate the replacement of the variable $x$ by the variable $y$ by substituting the value of $\Phi$ in $y$ for every occurrence of $x$.
Other examples
Coordinate transformation
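Some systems can be more easily solved when switching to cylindrical coordinates. Consider for example the equation
$$U(x, y, z) := (x^2 + y^2) \sqrt{1 - \frac{x^2}{x^2 + y^2}} = 0.$$
This may be a potential energy function for some physical problem. If one does not immediately see a solution, one might try the substitution $(x, y, z) = \Phi(r, \theta, z)$ given by $\Phi(r, \theta, z) = (r \cos\theta, r \sin\theta, z)$.
Note that if $\theta$ runs outside a $2\pi$-length interval, for example $[0, 2\pi]$, the map $\Phi$ is no longer bijective. Therefore $\Phi$ should be limited to, for example, $(0, \infty) \times [0, 2\pi) \times (-\infty, \infty)$. Notice how $r = 0$ is excluded, for $\Phi$ is not bijective at the origin ($\theta$ can take any value, the point will be mapped to $(0, 0, z)$). Then, replacing all occurrences of the original variables by the new expressions prescribed by $\Phi$ and using the identity $\sin^2\theta + \cos^2\theta = 1$, we get
$$V(r, \theta, z) = r^2 \sqrt{1 - \frac{r^2 \cos^2\theta}{r^2}} = r^2 \sqrt{1 - \cos^2\theta} = r^2 |\sin\theta|.$$
Now the solutions can be readily found: since $r > 0$, we need $\sin\theta = 0$, so $\theta = 0$ or $\theta = \pi$. Applying the inverse of $\Phi$ shows that this is equivalent to $y = 0$ while $x \neq 0$. Indeed we see that for $y = 0$ the function vanishes, except at the origin.
Note that, had we allowed $r = 0$, the origin would also have been a solution, though it is not a solution to the original problem. Here the bijectivity of $\Phi$ is crucial.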
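A quick numerical check of such a coordinate substitution can be sketched as follows. It assumes the expression $U(x, y, z) = (x^2 + y^2)\sqrt{1 - x^2/(x^2 + y^2)}$ and the cylindrical map $\Phi(r, \theta, z) = (r\cos\theta, r\sin\theta, z)$, and compares $U \circ \Phi$ with the simplified form $r^2 |\sin\theta|$ on a small grid. The function names are chosen for this sketch only.

```python
import math


def U(x, y, z):
    """Original expression in Cartesian coordinates (undefined on the z-axis)."""
    rho2 = x * x + y * y
    # max(...) guards against a tiny negative value caused by rounding.
    return rho2 * math.sqrt(max(0.0, 1.0 - x * x / rho2))


def phi(r, theta, z):
    """Cylindrical-to-Cartesian map, intended for r > 0 and 0 <= theta < 2*pi."""
    return (r * math.cos(theta), r * math.sin(theta), z)


def V(r, theta, z):
    """The same expression after the substitution: r**2 * |sin(theta)|."""
    return r * r * abs(math.sin(theta))


# Compare U(phi(r, theta, z)) with V(r, theta, z) on a small grid with r > 0.
max_error = 0.0
for i in range(1, 6):
    r = 0.5 * i
    for k in range(16):
        theta = 2.0 * math.pi * k / 16
        x, y, z = phi(r, theta, 1.0)
        max_error = max(max_error, abs(U(x, y, z) - V(r, theta, 1.0)))

print(max_error)  # expected to be at the level of floating-point rounding
```

The grid deliberately starts at $r = 0.5$: evaluating `U` at $r = 0$ would divide by zero, which mirrors the exclusion of the origin from the domain of $\Phi$.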
Differentiation
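The chain rule is used to simplify complicated differentiation. For example, to calculate the derivative
$$\frac{d}{dx} \sin(x^2),$$
the variable $x$ may be changed by introducing $u = x^2$. Then, by the chain rule:
$$\frac{d}{dx} = \frac{d}{du} \frac{du}{dx} = \frac{d}{du} \frac{d}{dx}\left(x^2\right) = 2x \frac{d}{du},$$
so that
$$\frac{d}{dx} \sin(x^2) = 2x \frac{d}{du} \sin(u) = 2x \cos(u) = 2x \cos(x^2),$$
where in the very last step $u$ has been replaced with $x^2$.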
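A chain-rule computation of this kind can be checked numerically. The sketch below takes $\sin(x^2)$ with $u = x^2$ as its example and compares the chain-rule result with a central finite difference; the step size `h` is an arbitrary choice for this illustration.

```python
import math


def f(x):
    return math.sin(x * x)


def derivative_by_chain_rule(x):
    """d/dx sin(x^2) computed as (du/dx) * d/du sin(u) with u = x^2."""
    u = x * x        # the substitution u = x^2
    du_dx = 2.0 * x  # derivative of the inner function
    return du_dx * math.cos(u)


def central_difference(func, x, h=1e-6):
    """Numerical derivative, used only as an independent check."""
    return (func(x + h) - func(x - h)) / (2.0 * h)


for x in (0.0, 0.5, 1.0, 2.0):
    print(x, derivative_by_chain_rule(x), central_difference(f, x))
```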
Integration
Difficult integrals may often be evaluated by changing variables; this is enabled by the substitution rule and is analogous to the use of the chain rule above. For multiple integrals, a change of variables introduces the absolute value of the determinant of the Jacobian matrix of the transformation as a factor in the integrand. This is the basis for integrating in polar, cylindrical, and spherical coordinates.
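As a small numerical illustration, the sketch below evaluates $\int_0^2 2x\cos(x^2)\,dx$ directly and again after the substitution $u = x^2$, which turns it into $\int_0^4 \cos u\,du = \sin 4$. It also computes the area of the unit disk in polar coordinates, where the Jacobian determinant contributes the factor $r$. Both examples and the `simpson` helper are assumptions of this sketch, not taken from a particular source.

```python
import math


def simpson(func, a, b, n=1000):
    """Composite Simpson's rule with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


# Original integral in x: integrand 2x*cos(x^2) on [0, 2].
original = simpson(lambda x: 2.0 * x * math.cos(x * x), 0.0, 2.0)

# After u = x^2 (so du = 2x dx) the limits become u(0) = 0 and u(2) = 4.
substituted = simpson(math.cos, 0.0, 4.0)

exact = math.sin(4.0)  # antiderivative of cos(u) evaluated between 0 and 4
print(original, substituted, exact)

# Area of the unit disk in polar coordinates: the Jacobian determinant is r.
disk_area = simpson(
    lambda theta: simpson(lambda r: r, 0.0, 1.0, 10), 0.0, 2.0 * math.pi, 10
)
print(disk_area, math.pi)
```

Note that the limits of integration change together with the variable; forgetting to transform them is a common source of error.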
Differential equations
Variable changes for differentiation and integration are taught in elementary calculus and the steps are rarely carried out in full.
The very broad use of variable changes is apparent when considering differential equations, where the independent variables may be changed using the chain rule, or the dependent variables may be changed, which requires some differentiation to be carried out. Exotic changes, such as the mingling of dependent and independent variables in point and contact transformations, can be very complicated but allow much freedom.
Very often, a general form for a change is substituted into a problem and its parameters are picked along the way to best simplify the problem.
Scaling and shifting
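Probably the simplest change is the scaling and shifting of variables, that is, replacing them with new variables that are "stretched" and "moved" by constant amounts. For an $n$th-order derivative, the change simply results in
$$\frac{d^n y}{dx^n} = \frac{y_\text{scale}}{x_\text{scale}^n} \frac{d^n \hat{y}}{d\hat{x}^n},$$
where
$$x = \hat{x} x_\text{scale} + x_\text{shift}, \qquad y = \hat{y} y_\text{scale} + y_\text{shift}.$$
This may be shown readily through the chain rule and linearity of differentiation. This change is very common in practical applications to get physical parameters out of problems. For example, the boundary value problem
$$\mu \frac{d^2 u}{dy^2} = \frac{dp}{dx}; \qquad u(0) = u(\delta) = 0$$
describes parallel fluid flow between flat solid walls separated by a distance $\delta$; $\mu$ is the viscosity and $dp/dx$ the pressure gradient, both constants. By scaling the variables the problem becomes
$$\frac{d^2 \hat{u}}{d\hat{y}^2} = 1; \qquad \hat{u}(0) = \hat{u}(1) = 0,$$
where
$$y = \hat{y} \delta \qquad \text{and} \qquad u = \hat{u} \frac{\delta^2}{\mu} \frac{dp}{dx}.$$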
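The effect of such a scaling can be checked numerically. The sketch below assumes the channel-flow problem $\mu\, u'' = dp/dx$, $u(0) = u(\delta) = 0$ and the scaling $y = \hat{y}\delta$, $u = \hat{u}\,\delta^2 (dp/dx)/\mu$. It solves the parameter-free problem $\hat{u}'' = 1$, $\hat{u}(0) = \hat{u}(1) = 0$ in closed form, rescales, and verifies the original equation by a finite difference. The parameter values are hypothetical.

```python
def u_hat(y_hat):
    """Solution of the scaled problem u_hat'' = 1 with u_hat(0) = u_hat(1) = 0."""
    return 0.5 * y_hat * (y_hat - 1.0)


def u(y, delta, mu, dpdx):
    """Dimensional velocity recovered by undoing the scaling."""
    y_hat = y / delta               # y = y_hat * delta
    u_scale = delta**2 / mu * dpdx  # u = u_hat * delta^2 / mu * dp/dx
    return u_hat(y_hat) * u_scale


# Hypothetical parameter values, chosen only for this check.
delta, mu, dpdx = 0.02, 1.0e-3, -5.0

# Check mu * u'' = dp/dx with a central second difference at an interior point.
h = delta / 100.0
y_mid = 0.3 * delta
second_derivative = (
    u(y_mid + h, delta, mu, dpdx)
    - 2.0 * u(y_mid, delta, mu, dpdx)
    + u(y_mid - h, delta, mu, dpdx)
) / h**2
residual = mu * second_derivative - dpdx

# Both wall values and the residual should be zero up to rounding.
print(u(0.0, delta, mu, dpdx), u(delta, delta, mu, dpdx), residual)
```

The scaled problem is solved once, with no parameters; any combination of $\delta$, $\mu$ and $dp/dx$ is then handled by the rescaling in `u`.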
Scaling is useful for many reasons. It simplifies analysis both by reducing the number of parameters and by simply making the problem neater. Proper scaling may normalize variables, that is, give them a sensible unitless range such as 0 to 1. Finally, if a problem mandates a numerical solution, fewer parameters mean fewer computations.
Momentum vs. velocity
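Consider a system of equations
$$m \dot{v} = -\frac{\partial H}{\partial x}, \qquad m \dot{x} = \frac{\partial H}{\partial v}$$
for a given function $H(x, v)$. The mass can be eliminated by the (trivial) substitution $\Phi(p) = \frac{1}{m} p$. Clearly this is a bijective map from $\mathbb{R}$ to $\mathbb{R}$. Under the substitution $v = \Phi(p)$ the system becomes
$$\dot{p} = -\frac{\partial H}{\partial x}, \qquad \dot{x} = \frac{\partial H}{\partial p}.$$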
Lagrangian mechanics
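Given a force field $\varphi(t, x, v)$, Newton's equations of motion are
$$m \ddot{x} = \varphi(t, x, v).$$
Lagrange examined how these equations of motion change under an arbitrary substitution of variables $x = \Psi(t, y)$, $v = \frac{\partial \Psi(t, y)}{\partial t} + \frac{\partial \Psi(t, y)}{\partial y} \cdot w$.
He found that the equations
$$\frac{\partial L}{\partial y} = \frac{d}{dt} \frac{\partial L}{\partial w}$$
are equivalent to Newton's equations for the function $L = T - V$, where $T$ is the kinetic and $V$ the potential energy.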
In fact, when the substitution is chosen well (exploiting for example symmetries and constraints of the system) these equations are much easier to solve than Newton's equations in Cartesian coordinates.
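As an illustration that is not part of the derivation above, consider a planar pendulum of length $l$ and mass $m$ in a uniform gravitational field $g$. Choosing the angle $\theta$ from the downward vertical as the new variable builds the constraint of fixed length into the substitution, and Lagrange's equation with $L = T - V$ reduces to a single equation:

```latex
\begin{align*}
x &= l \sin\theta, \qquad y = -l \cos\theta, \\
T &= \tfrac{1}{2} m \left(\dot{x}^2 + \dot{y}^2\right) = \tfrac{1}{2} m l^2 \dot{\theta}^2,
\qquad V = m g y = -m g l \cos\theta, \\
L &= T - V = \tfrac{1}{2} m l^2 \dot{\theta}^2 + m g l \cos\theta, \\
\frac{\partial L}{\partial \theta} &= -m g l \sin\theta,
\qquad \frac{d}{dt} \frac{\partial L}{\partial \dot{\theta}} = m l^2 \ddot{\theta}, \\
\ddot{\theta} &= -\frac{g}{l} \sin\theta .
\end{align*}
```

In Cartesian coordinates the same system would require two equations together with an unknown tension force in the rod.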