Imagine that you had to compute the double integral
| (1) |
We could start to calculate this as follows.
(x2 + y2) dy dx = a mess |
It turns out that this integral would be a lot easier if we could
change variables to polar coordinates. In rectangular
coordinates, as above, the disk is the region D defined by
-7
x
7 and
- ![]()
y![]()
. In polar
coordinates, the disk is the region we'll call D* defined by
0
r
7 and
0![]()
![]()
2
. Hence the region of
integration is simpler to describe using polar coordinates.
Moreover, the integrand x2 + y2 is simple in polar coordinates
because
x2 + y2 = r2. Using polar coordinates, our lives will be a
lot easier because it seems that all we need to do is integrate r2
over the region D* defined by
0
r
7 and
0![]()
![]()
2
.
Unfortunately, it's not quite that easy. We need to account for one
more consequence of changing variables, which is how changing
variables changes area. You may recall that dA stands for the area
of a little bit of the region D. In rectangular coordinates, we
replaced dA by dx dy (or dy dx). The rest of this reading
discusses what dA becomes when we change variables. (As you will
see, in polar coordinates, dA does not becomes
dr d
.)
To see how area gets changed, let's write the change of variables as the function
| T(r, |
(2) |
Do you understand why the disk looks like a rectangle in polar
coordinates (i.e., why the region D* on the left is a
rectangle)? Remember, the disk is described by
0
r
7 and
0![]()
![]()
2
, which is a rectangle when plotted in the
r
-plane.
We can say that
T(r,
) parametrizes D for
(r,
)
in D*. (This uses the same language that we used when parametrizing a
curve. We'll use it again when we talk about parametrizing surfaces.)
We can chop up the region D* into small rectangles of width
r and height
![]()
. The function
T(r,
) maps
each of these small rectangles into "curvy rectangles" in D, as
shown below.
The area of each small rectangle in D* is
r![]()
.
But we don't care about area in D*. The dA in the integral of
equation (1) is based on area in D not area in D*.
So we need to estimate the area of each "curvy rectangle" in D.
It turns out that we can approximate each "curvy rectangle" as a
parallelogram with sides
![]()
r
and
![]()
![]()
. We know the
area of a parallelogram is the magnitude of the
cross product
of the
two vectors spanning the parallelogram. So it looks like we are
finished.
We have to use one little trick since the cross product is defined
only for three-dimensional vectors and
T is two-dimensional.
We view the vectors
and
as three-dimensional vectors
that happen to have zero for their third component. Then, we can take
their cross product. We'll show in lecture that the area of a
"curvy rectangle" simplifies to
For
T given by equation (2), you can calculate
that
det DT(r,
)
= r so that the area
of each "curvy rectangle" is
r
r![]()
. This agrees
with the above picture, since the "curvy rectangles" were larger when
r was larger.
The important point is that
T stretches or shrinks D* when
it maps D* onto D. Consequently, when we convert from area in
D* to area in D, we need to multiply by the absolute value of the
determinant of the matrix of partial derivatives:
det DT(r,
)
.
We now put everything back together. We started off trying to
integrate the function
g(x, y) = x2 + y2 over the region D. If we
use
(x, y) = T(r,
) to change variables, we can instead
integrate the function
g(T(r,
)) = r2 over the region
D*. However, we need to include the factor
det DT(r,
)
= r in dA to account for the
stretching by
T. We can replace dA with
r dr d
.
We end up with the formula
For a general change of variables, we tend to use the variables u and
v (rather than r and
). In this case, if we change
variables by
(x, y) = T(u, v), our integral is
You can see some examples, including more details on the disk example, here.