This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The Integral of Secant and Stereographic Projections of Conic Sections

George Jennings, David Ni, Wai Yan Pong, Serban Raianu

California State University, Dominguez Hills
1000 E Victoria St, Carson, CA 90747
Abstract

We show that the four best-known substitutions used to integrate secant rely on different rational parametrizations of conic sections, coming from stereographic projections. We also investigate other possible explanations for the four substitutions, as well as connections between the buildups leading to them.

1 Introduction

The integral of secant, which competes for the unofficial title of The Most Interesting Integral in the World of Elementary Functions, can be taught in several ways [1, 8, 9, 14, 19].

We start by briefly reviewing some history [15]. The secant was integrated numerically in 1569 by Gerardus Mercator [5] to construct the famous map projection that bears his name. Mercator’s projection is especially useful for navigators because it depicts angles accurately, with zero distortion. Let θ=\theta= longitude and ϕ=\phi= latitude of a point on Earth. On Mercator’s map the xx-coordinate of the corresponding point on the map is the longitude, and its yy-coordinate depends only on the latitude

x=θ,y=y(ϕ).x=\theta,\quad y=y(\phi).

Consider an infinitesimal square on the earth with vertices at (θ,ϕ)(\theta,\phi), (θ+dθ,ϕ)(\theta+d\theta,\phi), (θ,ϕ+dϕ)(\theta,\phi+d\phi), and (θ+dθ,ϕ+dϕ)(\theta+d\theta,\phi+d\phi). Its sides have equal length,

side=ds=rcosϕdθ=rdϕ\text{side}=ds=r\cos\phi\;d\theta=r\;d\phi

where rr is the Earth’s radius. To avoid distorting angles the corresponding points on the map must also form an infinitesimal square. Its horizontal measure is

dx=dθ=dsrcosϕdx=d\theta=\frac{ds}{r\cos\phi}

so its vertical measure must be the same,

dy=dsrcosϕ=rdϕrcosϕ=secϕdϕ.dy=\frac{ds}{r\cos\phi}=\frac{rd\phi}{r\cos\phi}=\sec\phi\;d\phi.

Adding up these infinitesimal distances dydy on the map as one moves away from the equator, the yy-coordinate of a point at latitude LL will be a function of LL, say y=f(L)y=f(L). Then we have

f(L)=0f(L)𝑑y=0Lsecϕdϕ.f(L)=\int_{0}^{f(L)}dy=\int_{0}^{L}\sec\phi\;d\phi.

Taking derivatives shows that f(L)=secLf^{\prime}(L)=\sec L. Note that this gives us a practical reason to integrate the secant function but it does not help us find f(L)f(L). This is done by one of the following four available substitutions described in the next section.

2 The Four Substitutions Used to Integrate Secant

  1. 1.

    The Gregory substitution [4, 10], u=secx+tanxu=\sec x+\tan x. This is the method used in most modern calculus textbooks [19, 9], and is usually justified by a mysterious trick: multiplying and dividing secx\sec x by secx+tanx\sec x+\tan x. Here is an explanation (found independently many times, see [8, 17, 20]) for how to come up with this: integration is about spotting derivatives. Not being able to think of a function whose derivative is secant, we go for the next best thing, i.e. finding a function which at least has a secant in its derivative. It turns out there are only two of them:

    (tanx)\displaystyle(\tan x)^{\prime} =\displaystyle= sec2x\displaystyle\sec^{2}x
    (secx)\displaystyle(\sec x)^{\prime} =\displaystyle= secxtanx\displaystyle\sec x\tan x

    Seeing them nicely lined up like this prompts us to add the two equalities, and by doing so we get: (secx+tanx)=secx(secx+tanx)(\sec x+\tan x)^{\prime}=\sec x(\sec x+\tan x), so

    secxdx=ln|secx+tanx|+C.\int\sec x\,\mathrm{d}x=\ln|\sec x+\tan x|+C. (1)

    This was a conjecture from 1645, based on information from logarithmic tables (and known to many people, including Newton), and it was solved by James Gregory 23 years later [15]. As explained by Turnbull in [22], Gregory actually proves (using only geometry), that

    0αsecθdθ=1/2(secα+tanα)/2duu=log(secα+tanα),\int_{0}^{\alpha}\sec\theta\;\;{\rm d}\theta=\int_{1/\sqrt{2}}^{(\sec\alpha+\tan\alpha)/\sqrt{2}}\;\;\frac{{\rm d}u}{u}=\log(\sec\alpha+\tan\alpha),

    and this is how he arrives at the substitution.

  2. 2.

    The substitution used by Newton’s teacher, Barrow [3], u=sinxu=\sin x. This is set up by multiplying and dividing 1cosx\frac{1}{\cos x} by cosx\cos x, which, as we can see from the explanation of the Gregory substitution in [22], was the first step in Gregory’s chain of simple and double integrals.

  3. 3.

    The Weierstrass [7] substitution, t=tan(x2)t=\tan\left(\frac{x}{2}\right) [12].

  4. 4.

    The modified Weierstrass substitution [14], s=tan(x2+π4)s=\tan\left(\frac{x}{2}+\frac{\pi}{4}\right), which leads to the integral of 1s\frac{1}{s}.

The Barrow and Weierstrass substitutions both end up by using the method of partial fractions to integrate 11u2\frac{1}{1-u^{2}} (or 11t2\frac{1}{1-t^{2}}, respectively). It is believed that Barrow was the first to use partial fractions in integration [15]. In the North American curriculum partial fractions are usually taught after trigonometric integrals [19], but the following well known trick helps us avoid partial fractions in this particular case, so we can teach the integral of secant using any of the four methods at any time. Moreover, the antiderivative is the same as the one obtained when using partial fractions.

du1u2\displaystyle\int\frac{{\rm d}u}{1-u^{2}} =(1u+u)du(1+u)(1u)=12(21+u2u1u2)du\displaystyle=\int\frac{(1-u+u){\rm d}u}{(1+u)(1-u)}=\dfrac{1}{2}\int\left(\frac{2}{1+u}-\frac{-2u}{1-u^{2}}\right)\,{\rm d}u
=12(2ln|1+u|ln|1u2|)+C=12ln|1+u1u|+C\displaystyle=\frac{1}{2}\left(2\ln\left|1+u\right|-\ln\left|1-u^{2}\right|\right)+C=\frac{1}{2}\ln\left|\frac{1+u}{1-u}\right|+C

A similar but sneakier trick produces the partial fraction decomposition by splitting the 1 in the numerator into 1/2+1/21/2+1/2, then adding and subtracting u/2u/2 and separating the fractions.

We would to recall here that the method of partial fractions is a spectacular and surprising application of the arithmetic of the ring of polynomials in one indeterminate with real or complex coefficients to calculus [9, p. 165].

3 Connections Between the Four Substitutions and Other Ideas

We now discuss another idea for integrating secant and a different buildup to the Gregory substitution, both coming from Liouville’s Theorem on integration in finite terms, as proved in [16]. (By the way, the proof makes frequent use of partial fractions.) This is a theorem in differential algebra which explains why functions like ex2e^{x^{2}} or sinx2\sin x^{2} do not have antiderivatives which can be expressed as compositions of elementary functions (which are compositions of the functions we study in calculus). We give the full statement of the theorem, but keep in mind that an easy example of a differential field is the field of rational fractions, and for a rational fraction uu we can think of uu^{\prime} as the derivative of uu regarded as a rational function.

Liouville’s Theorem. Let FF be a differential field of characteristic zero and αF\alpha\in F. If the equation y=αy^{\prime}=\alpha has a solution in some elementary differential extension field of FF having the same subfield of constants, then there are constants c1,,cnFc_{1},\ldots,c_{n}\in F and elements u1,,un,vFu_{1},\ldots,u_{n},v\in F such that

α=i=1nciuiui+v.\alpha=\sum_{i=1}^{n}c_{i}\frac{u_{i}^{\prime}}{u_{i}}+v^{\prime}.

Now, if secx\sec x has an elementary indefinite integral, it would be of the form in Liouville’s Theorem, and so, since we can’t think of a function whose derivative is secant, guessing u(x)u(x) in the simplest case

secx=u(x)u(x)=ddxln|u(x)|\sec x=\frac{u^{\prime}(x)}{u(x)}=\frac{{\rm d}}{{\rm d}x}\ln|u(x)|

would be a reasonable thing to try first. We therefore try to find u(x)u(x) if u(x)cosx=u(x)u^{\prime}(x)\cos x=u(x). Since we are looking for u(x)cosxu^{\prime}(x)\cos x, it makes sense to differentiate h(x)=u(x)cosxh(x)=u(x)\cos x.

h(x)\displaystyle h^{\prime}(x) =\displaystyle= u(x)sinx+u(x)cosx\displaystyle-u(x)\sin x+u^{\prime}(x)\cos x
=\displaystyle= u(x)sinx+u(x)\displaystyle-u(x)\sin x+u(x)
=\displaystyle= h(x)cosxsinx+h(x)cosx\displaystyle-\frac{h(x)}{\cos x}\sin x+\frac{h(x)}{\cos x}
=\displaystyle= h(x)1sinxcosx\displaystyle h(x)\frac{1-\sin x}{\cos x}
=\displaystyle= h(x)cosx1+sinx.\displaystyle h(x)\frac{\cos x}{1+\sin x}.

This gives

ln|h(x)|=ln(1+sinx),\ln|h(x)|=\ln(1+\sin x),

and so

ln|u(x)|=ln|h(x)cosx|=ln|secx+tanx|,\ln|u(x)|=\ln\left|\frac{h(x)}{\cos x}\right|=\ln|\sec x+\tan x|,

which means taking u(x)=secx+tanxu(x)=\sec x+\tan x works.

This also tells us that the buildup to Barrow’s substitution, combined with the avoidance of partial fractions described above, can also lead to Gregory’s substitution. Keep in mind that Barrow’s buildup involved multiplying the top and the bottom by cosx\cos x, then adding and subtracting sinx\sin x in the numerator. If instead we reverse the order of these two actions, i.e. we start by adding and subtracting sinx\sin x on top, then we get:

1cosx=sinxcosx+1sinxcosx=sinxcosx+cosx1+sinx\frac{1}{\cos x}=\frac{\sin x}{\cos x}+\frac{1-\sin x}{\cos x}=\frac{\sin x}{\cos x}+\frac{\cos x}{1+\sin x} (2)

(multiplying and dividing by cosx\cos x is now hidden in the identity

1sinxcosx=cosx1+sinx,\frac{1-\sin x}{\cos x}=\frac{\cos x}{1+\sin x},

this may be seen after dividing both of its sides by 1sinx1-\sin x), and (2) immediately gives (1).

Now we can get yet another buildup leading to Gregory’s substitution. Start with (2), and keep going instead of integrating. We get

1cosx\displaystyle\frac{1}{\cos x} =\displaystyle= sinxcosx+cosx1+sinx\displaystyle\frac{\sin x}{\cos x}+\frac{\cos x}{1+\sin x}
=\displaystyle= tanx+1secx+tanx\displaystyle\tan x+\frac{1}{\sec x+\tan x}
=\displaystyle= tanx(secx+tanx)+(sec2xtan2x)secx+tanx\displaystyle\frac{\tan x(\sec x+\tan x)+(\sec^{2}x-\tan^{2}x)}{\sec x+\tan x}
=\displaystyle= (secx+tanx)secxsecx+tanx.\displaystyle\frac{(\sec x+\tan x)\sec x}{\sec x+\tan x}.

We end this section by noting that we can also derive (2) by adding and subtracting tanx\tan x to secx\sec x instead of sinx\sin x to 1 on top of 1cosx\frac{1}{\cos x} (this can be also independently motivated by the identity sec2xtan2x=1\sec^{2}x-\tan^{2}x=1).

secx=tanx+secxtanx=tanx+1secx+tanx=sinxcosx+cosx1+sinx\sec x=\tan x+\sec x-\tan x=\tan x+\frac{1}{\sec x+\tan x}=\frac{\sin x}{\cos x}+\frac{\cos x}{1+\sin x} (3)

4 The Substitutions Used in the Integral of Secant and Stereographic Projections

The stereographic projection of a curve from a point OO to a line is the function that maps any point PP on the curve to the intersection of that line with the secant OPOP to the curve. The usual projection parallel to a line is a particular case (the point in this case is a point at infinity).

The miraculous fact that the Weierstrass substitution always works, and turns any rational fraction in trigonometric functions into an integrable rational function in tt is a direct consequence of the fact that a circle is a curve of genus 0 (it has no “handles”), which also also means it has a rational parametrization. The Weierstrass substitution t=tan(θ/2)t=\tan(\theta/2) arises naturally from the stereographic projection of the unit circle from (1,0)(-1,0) to the yy-axis. Let

(x,y)=(cosθ,sinθ)(1,0)(x,y)=(\cos\theta,\sin\theta)\neq(-1,0)

be a point on the unit circle with center at the origin. The line connecting (x,y)(x,y) to (1,0)(-1,0) intersects the yy-axis at a point (0,t)(0,t). Elementary geometry (the triangle with vertices (x,y),(1,0),(0,0)(x,y),(-1,0),(0,0) is isosceles) says the angle between this line and the xx-axis is θ/2\theta/2. Stereographic projection of the circle from the point (1,0)(-1,0) to the yy-axis is the function that maps (x,y)(x,y) to (0,t)(0,t).

Refer to caption
Figure 1: A rational parametrization of the circle

The slope of the line is t=y/(x+1)=tan(θ/2)t=y/(x+1)=\tan(\theta/2). Plug y=t(x+1)y=t(x+1) into the equation x2+y21=0x^{2}+y^{2}-1=0, divide by x+1x+1, and solve for xx to obtain

(x,y)=(cosθ,sinθ)=(1t21+t2,2t1+t2),(x,y)=(\cos\theta,\sin\theta)=\left(\frac{1-t^{2}}{1+t^{2}},\frac{2t}{1+t^{2}}\right), (4)

a rational parametrization of the unit circle. Differentiating

y=sinθ=2t1+t2y=\sin\theta=\frac{2t}{1+t^{2}}

and using x=(1t2)/(1+t2)x=(1-t^{2})/(1+t^{2}) one obtains

dy=cosθdθ=2(1t2)(1+t2)2dt=2cosθ1+t2dt\displaystyle dy=\cos\theta\;d\theta=\frac{2(1-t^{2})}{(1+t^{2})^{2}}\;dt=\frac{2\cos\theta}{1+t^{2}}\;dt
hence
dθ=2dt1+t2.\displaystyle d\theta=\frac{2\;dt}{1+t^{2}}.

These substitutions help convert any rational trigonometric differential R(cosθ,sinθ)dθR(\cos\theta,\sin\theta)\;d\theta, where R(x,y)R(x,y) is a quotient of polynomials (including secθdθ\sec\theta d\theta), to a rational differential S(t)dtS(t)\;dt that is sometimes easier to integrate.

We also note (see [18, p. 4]) that this rational parametrization of the circle also produces a recipe for constructing Pythagorean triples of integers X,Y,ZX,Y,Z such that X2+Y2=Z2X^{2}+Y^{2}=Z^{2} (also see [6] for a biography of Pythagoras and [21, Theorem 6.3, p.162] for a purely algebraic description of all primitive — i.e. with no common factors — Pythagorean triples). Let t=a/bt=a/b where aa and b0b\neq 0 are integers. Plug tt into the stereographic projection formula (4) to obtain (x(t),y(t))(x(t),y(t)) satisfying x2+y2=1x^{2}+y^{2}=1 then clear denominators to obtain X2+Y2=Z2X^{2}+Y^{2}=Z^{2} where X,Y,ZX,Y,Z are integers. Conversely, given a Pythagorean triple (X,Y,Z)(X,Y,Z) one may reverse this process to find a rational tt that will produce (X,Y,Z)(X,Y,Z) or an integer multiple of it.

The modified Weierstrass substitution s=tan(θ/2+π/4)s=\tan(\theta/2+\pi/4) also uses a stereographic projection of the unit circle, but from the point (0,1)(0,1) to the xx-axis.

Refer to caption
Figure 2: Another rational parametrization of the circle

Let (cosθ,sinθ)=(x,y)(0,1)(\cos\theta,\sin\theta)=(x,y)\neq(0,1). The line through (x,y)(x,y) and (0,1)(0,1) intersects the xx-axis at (0,s)(0,s), where s=tan(θ/2+π/4)s=\tan(\theta/2+\pi/4). Its slope is 1/s=(y1)/x-1/s=(y-1)/x so x=s(1y)x=s(1-y). Plug that into the equation x2+y21=0x^{2}+y^{2}-1=0, divide by 1y1-y, and solve for yy to obtain

(x,y)=(cosθ,sinθ)=(2ss2+1,s21s2+1),(x,y)=(\cos\theta,\sin\theta)=\left(\frac{2s}{s^{2}+1},\frac{s^{2}-1}{s^{2}+1}\right),

another rational parametrization of the unit circle. As before, computing dy=cosθdθdy=\cos\theta\;d\theta yields a formula for dθd\theta

dθ=2s2+1ds.d\theta=\frac{2}{s^{2}+1}\;ds.

These formulas make integrating

secθdθ=s2+12s2s2+1𝑑s=dss=ln|s|+C\int\sec\theta\;d\theta=\int\frac{s^{2}+1}{2s}\frac{2}{s^{2}+1}\;ds=\int\frac{ds}{s}=\ln\lvert s\rvert+C

even simpler, without partial fractions. Finally, using x2+y2=1x^{2}+y^{2}=1,

s=x1y=x(1+y)1y2=x(1+y)x2=1x+yx=secθ+tanθs=\frac{x}{1-y}=\frac{x(1+y)}{1-y^{2}}=\frac{x(1+y)}{x^{2}}=\frac{1}{x}+\frac{y}{x}=\sec\theta+\tan\theta

produces the formula that appears in the back of your textbook. This shows that the Gregory substitution is basically the same as the modified Weierstrass. Nevertheless, we give a separate geometric argument for the Gregory substitution:

The parametrization of the right branch of the hyperbola x2y2=1x^{2}-y^{2}=1 is (x,y)=(secθ,tanθ)(x,y)=(\sec\theta,\tan\theta). The line at infinity z=0z=0 intersects the homogenized hyperbola x2y2=z2x^{2}-y^{2}=z^{2} at the two points at infinity P=[1:1:0]P_{-}=[1:1:0] and P+=[1:1:0]P_{+}=[1:-1:0]. The stereographic projection of the hyperbola from P+P_{+} to the xx-axis (which is the projection down a line x+y=ux+y=u parallel to the asymptote x+y=0x+y=0) sends the point (x,y)(x,y) on the hyperbola to the point (x+y,0)=(secθ+tanθ,0)(x+y,0)=(\sec\theta+\tan\theta,0), and we choose the parameter to be u=x+y=secθ+tanθu=x+y=\sec\theta+\tan\theta. Solving for the point where the line x+y=ux+y=u intersects the hyperbola, we obtain

(x(u),y(u))=(u2+12u,u212u)=(secθ,tanθ).(x(u),y(u))=\left(\frac{u^{2}+1}{2u},\frac{u^{2}-1}{2u}\right)=(\sec\theta,\tan\theta).

This rational parametrization of the hyperbola does the job.

Refer to caption
Figure 3: A rational parametrization of the hyperbola

We have

du=(secθtanθ+sec2θ)dθ=usecθdθ,\mathrm{d}u=(\sec\theta\tan\theta+\sec^{2}\theta)\mathrm{d}\theta=u\sec\theta\mathrm{d}\theta,

so

duu=secθdθ,\frac{\mathrm{d}u}{u}=\sec\theta\;\mathrm{d}\theta,

and this provides another natural buildup to Gregory’s substitution.

Other rational parametrizations of the hyperbola also work. One possible parameter is vv in the point (v,v)(v,v) which is the stereographic projection from P+P_{+} of the point (x,y)=(secθ,tanθ)(x,y)=(\sec\theta,\tan\theta) on the hyperbola to the asymptote y=xy=x. Another is w=2vw=\sqrt{2v}, which is the distance from (v,v)(v,v) to the origin. Both of these two parameters work just as well as the uu above, because v=u/2v=u/2, and w=u/2w=u/\sqrt{2}. Other choices include the parameters we have used before: t=tan(θ/2)t=\tan(\theta/2) and s=tan(θ/2+π/4)s=\tan(\theta/2+\pi/4), but these end up requiring partial fractions. The reader may wish to work out formulas for parametrizing the hyperbola with these parameters as an exercise.

We now see that all methods for integrating the secant function have something in common: they rely on certain rational parametrizations of the circle or the hyperbola. The figure below puts all these parametrizations together on the same picture. This shows how our previous three pictures are related.

Refer to caption
Figure 4: All previous three rational parametrizations

The relevant points are B(1,0)B(-1,0), D(0,1)D(0,1), C(cosθ,sinθ)C(\cos\theta,\sin\theta), A(secθ,tanθ)A(\sec\theta,\tan\theta), and the parameters are:

  • OT=t=tan(θ/2)OT=t=\tan(\theta/2) (Weierstrass)

  • OS=s=tan(θ/2+π/4)=secθ+tanθOS=s=\tan(\theta/2+\pi/4)=\sec\theta+\tan\theta (modified Weierstrass, Barrow — see (2) or (3), Gregory)

This picture is bursting with interesting geometry problems; see for example [11]. The most striking of these geometric facts is that the stereographic projection of the circle from the point D(0,1)D(0,1) to the xx-axis and the stereographic projection of the hyperbola from the point at infinity P+[1:1:0]P_{+}[1:-1:0] to the xx-axis (recall that the latter is the projection of the hyperbola to the xx-axis parallel to the asymptote x+y=0x+y=0) turn out to be the same if we identify the circle and the hyperbola via corresponding points (AA and CC).

Acknowledgments

We thank Frank Miles for helpful comments.

References