The Integral of Secant and Stereographic Projections of Conic Sections

George Jennings, David Ni, Wai Yan Pong, Serban Raianu

California State University, Dominguez Hills
1000 E Victoria St, Carson, CA 90747

Abstract

We show that the four best-known substitutions used to integrate secant rely on different rational parametrizations of conic sections, coming from stereographic projections. We also investigate other possible explanations for the four substitutions, as well as connections between the buildups leading to them.

1 Introduction

The integral of secant, which competes for the unofficial title of The Most Interesting Integral in the World of Elementary Functions, can be taught in several ways [1, 8, 9, 14, 19].

We start by briefly reviewing some history [15]. The secant was integrated numerically in 1569 by Gerardus Mercator [5] to construct the famous map projection that bears his name. Mercator’s projection is especially useful for navigators because it depicts angles accurately, with zero distortion. Let $\theta=$ longitude and $\phi=$ latitude of a point on Earth. On Mercator’s map the $x$ -coordinate of the corresponding point on the map is the longitude, and its $y$ -coordinate depends only on the latitude

x=\theta,\quad y=y(\phi).

Consider an infinitesimal square on the earth with vertices at $(\theta,\phi)$ , $(\theta+d\theta,\phi)$ , $(\theta,\phi+d\phi)$ , and $(\theta+d\theta,\phi+d\phi)$ . Its sides have equal length,

\text{side}=ds=r\cos\phi\;d\theta=r\;d\phi

where $r$ is the Earth’s radius. To avoid distorting angles the corresponding points on the map must also form an infinitesimal square. Its horizontal measure is

dx=d\theta=\frac{ds}{r\cos\phi}

so its vertical measure must be the same,

dy=\frac{ds}{r\cos\phi}=\frac{rd\phi}{r\cos\phi}=\sec\phi\;d\phi.

Adding up these infinitesimal distances $dy$ on the map as one moves away from the equator, the $y$ -coordinate of a point at latitude $L$ will be a function of $L$ , say $y=f(L)$ . Then we have

f(L)=\int_{0}^{f(L)}dy=\int_{0}^{L}\sec\phi\;d\phi.

Taking derivatives shows that $f^{\prime}(L)=\sec L$ . Note that this gives us a practical reason to integrate the secant function but it does not help us find $f(L)$ . This is done by one of the following four available substitutions described in the next section.

2 The Four Substitutions Used to Integrate Secant

The Gregory substitution [4, 10], $u=\sec x+\tan x$ . This is the method used in most modern calculus textbooks [19, 9], and is usually justified by a mysterious trick: multiplying and dividing $\sec x$ by $\sec x+\tan x$ . Here is an explanation (found independently many times, see [8, 17, 20]) for how to come up with this: integration is about spotting derivatives. Not being able to think of a function whose derivative is secant, we go for the next best thing, i.e. finding a function which at least has a secant in its derivative. It turns out there are only two of them:

	$\displaystyle(\tan x)^{\prime}$	$\displaystyle=$	$\displaystyle\sec^{2}x$
	$\displaystyle(\sec x)^{\prime}$	$\displaystyle=$	$\displaystyle\sec x\tan x$

Seeing them nicely lined up like this prompts us to add the two equalities, and by doing so we get: $(\sec x+\tan x)^{\prime}=\sec x(\sec x+\tan x)$ , so

\int\sec x\,\mathrm{d}x=\ln|\sec x+\tan x|+C.

(1)

This was a conjecture from 1645, based on information from logarithmic tables (and known to many people, including Newton), and it was solved by James Gregory 23 years later [15]. As explained by Turnbull in [22], Gregory actually proves (using only geometry), that

\int_{0}^{\alpha}\sec\theta\;\;{\rm d}\theta=\int_{1/\sqrt{2}}^{(\sec\alpha+\tan\alpha)/\sqrt{2}}\;\;\frac{{\rm d}u}{u}=\log(\sec\alpha+\tan\alpha),

and this is how he arrives at the substitution.

2.

The substitution used by Newton’s teacher, Barrow [3], $u=\sin x$ . This is set up by multiplying and dividing $\frac{1}{\cos x}$ by $\cos x$ , which, as we can see from the explanation of the Gregory substitution in [22], was the first step in Gregory’s chain of simple and double integrals.
3.

The Weierstrass [7] substitution, $t=\tan\left(\frac{x}{2}\right)$ [12].
4.

The modified Weierstrass substitution [14], $s=\tan\left(\frac{x}{2}+\frac{\pi}{4}\right)$ , which leads to the integral of $\frac{1}{s}$ .

The Barrow and Weierstrass substitutions both end up by using the method of partial fractions to integrate $\frac{1}{1-u^{2}}$ (or $\frac{1}{1-t^{2}}$ , respectively). It is believed that Barrow was the first to use partial fractions in integration [15]. In the North American curriculum partial fractions are usually taught after trigonometric integrals [19], but the following well known trick helps us avoid partial fractions in this particular case, so we can teach the integral of secant using any of the four methods at any time. Moreover, the antiderivative is the same as the one obtained when using partial fractions.

	$\displaystyle\int\frac{{\rm d}u}{1-u^{2}}$	$\displaystyle=\int\frac{(1-u+u){\rm d}u}{(1+u)(1-u)}=\dfrac{1}{2}\int\left(\frac{2}{1+u}-\frac{-2u}{1-u^{2}}\right)\,{\rm d}u$
		$\displaystyle=\frac{1}{2}\left(2\ln\left\|1+u\right\|-\ln\left\|1-u^{2}\right\|\right)+C=\frac{1}{2}\ln\left\|\frac{1+u}{1-u}\right\|+C$

A similar but sneakier trick produces the partial fraction decomposition by splitting the 1 in the numerator into $1/2+1/2$ , then adding and subtracting $u/2$ and separating the fractions.

We would to recall here that the method of partial fractions is a spectacular and surprising application of the arithmetic of the ring of polynomials in one indeterminate with real or complex coefficients to calculus [9, p. 165].

3 Connections Between the Four Substitutions and Other Ideas

We now discuss another idea for integrating secant and a different buildup to the Gregory substitution, both coming from Liouville’s Theorem on integration in finite terms, as proved in [16]. (By the way, the proof makes frequent use of partial fractions.) This is a theorem in differential algebra which explains why functions like $e^{x^{2}}$ or $\sin x^{2}$ do not have antiderivatives which can be expressed as compositions of elementary functions (which are compositions of the functions we study in calculus). We give the full statement of the theorem, but keep in mind that an easy example of a differential field is the field of rational fractions, and for a rational fraction $u$ we can think of $u^{\prime}$ as the derivative of $u$ regarded as a rational function.

Liouville’s Theorem. Let $F$ be a differential field of characteristic zero and $\alpha\in F$ . If the equation $y^{\prime}=\alpha$ has a solution in some elementary differential extension field of $F$ having the same subfield of constants, then there are constants $c_{1},\ldots,c_{n}\in F$ and elements $u_{1},\ldots,u_{n},v\in F$ such that

\alpha=\sum_{i=1}^{n}c_{i}\frac{u_{i}^{\prime}}{u_{i}}+v^{\prime}.

Now, if $\sec x$ has an elementary indefinite integral, it would be of the form in Liouville’s Theorem, and so, since we can’t think of a function whose derivative is secant, guessing $u(x)$ in the simplest case

\sec x=\frac{u^{\prime}(x)}{u(x)}=\frac{{\rm d}}{{\rm d}x}\ln|u(x)|

would be a reasonable thing to try first. We therefore try to find $u(x)$ if $u^{\prime}(x)\cos x=u(x)$ . Since we are looking for $u^{\prime}(x)\cos x$ , it makes sense to differentiate $h(x)=u(x)\cos x$ .

$\displaystyle h^{\prime}(x)$	$\displaystyle=$	$\displaystyle-u(x)\sin x+u^{\prime}(x)\cos x$
	$\displaystyle=$	$\displaystyle-u(x)\sin x+u(x)$
	$\displaystyle=$	$\displaystyle-\frac{h(x)}{\cos x}\sin x+\frac{h(x)}{\cos x}$
	$\displaystyle=$	$\displaystyle h(x)\frac{1-\sin x}{\cos x}$
	$\displaystyle=$	$\displaystyle h(x)\frac{\cos x}{1+\sin x}.$

This gives

\ln|h(x)|=\ln(1+\sin x),

and so

\ln|u(x)|=\ln\left|\frac{h(x)}{\cos x}\right|=\ln|\sec x+\tan x|,

which means taking $u(x)=\sec x+\tan x$ works.

This also tells us that the buildup to Barrow’s substitution, combined with the avoidance of partial fractions described above, can also lead to Gregory’s substitution. Keep in mind that Barrow’s buildup involved multiplying the top and the bottom by $\cos x$ , then adding and subtracting $\sin x$ in the numerator. If instead we reverse the order of these two actions, i.e. we start by adding and subtracting $\sin x$ on top, then we get:

\frac{1}{\cos x}=\frac{\sin x}{\cos x}+\frac{1-\sin x}{\cos x}=\frac{\sin x}{\cos x}+\frac{\cos x}{1+\sin x}

(2)

(multiplying and dividing by $\cos x$ is now hidden in the identity

\frac{1-\sin x}{\cos x}=\frac{\cos x}{1+\sin x},

this may be seen after dividing both of its sides by $1-\sin x$ ), and (2) immediately gives (1).

Now we can get yet another buildup leading to Gregory’s substitution. Start with (2), and keep going instead of integrating. We get

$\displaystyle\frac{1}{\cos x}$	$\displaystyle=$	$\displaystyle\frac{\sin x}{\cos x}+\frac{\cos x}{1+\sin x}$
	$\displaystyle=$	$\displaystyle\tan x+\frac{1}{\sec x+\tan x}$
	$\displaystyle=$	$\displaystyle\frac{\tan x(\sec x+\tan x)+(\sec^{2}x-\tan^{2}x)}{\sec x+\tan x}$
	$\displaystyle=$	$\displaystyle\frac{(\sec x+\tan x)\sec x}{\sec x+\tan x}.$

We end this section by noting that we can also derive (2) by adding and subtracting $\tan x$ to $\sec x$ instead of $\sin x$ to 1 on top of $\frac{1}{\cos x}$ (this can be also independently motivated by the identity $\sec^{2}x-\tan^{2}x=1$ ).

\sec x=\tan x+\sec x-\tan x=\tan x+\frac{1}{\sec x+\tan x}=\frac{\sin x}{\cos x}+\frac{\cos x}{1+\sin x}

(3)

4 The Substitutions Used in the Integral of Secant and Stereographic Projections

The stereographic projection of a curve from a point $O$ to a line is the function that maps any point $P$ on the curve to the intersection of that line with the secant $OP$ to the curve. The usual projection parallel to a line is a particular case (the point in this case is a point at infinity).

The miraculous fact that the Weierstrass substitution always works, and turns any rational fraction in trigonometric functions into an integrable rational function in $t$ is a direct consequence of the fact that a circle is a curve of genus 0 (it has no “handles”), which also also means it has a rational parametrization. The Weierstrass substitution $t=\tan(\theta/2)$ arises naturally from the stereographic projection of the unit circle from $(-1,0)$ to the $y$ -axis. Let

(x,y)=(\cos\theta,\sin\theta)\neq(-1,0)

be a point on the unit circle with center at the origin. The line connecting $(x,y)$ to $(-1,0)$ intersects the $y$ -axis at a point $(0,t)$ . Elementary geometry (the triangle with vertices $(x,y),(-1,0),(0,0)$ is isosceles) says the angle between this line and the $x$ -axis is $\theta/2$ . Stereographic projection of the circle from the point $(-1,0)$ to the $y$ -axis is the function that maps $(x,y)$ to $(0,t)$ .

Refer to caption — Figure 1: A rational parametrization of the circle

The slope of the line is $t=y/(x+1)=\tan(\theta/2)$ . Plug $y=t(x+1)$ into the equation $x^{2}+y^{2}-1=0$ , divide by $x+1$ , and solve for $x$ to obtain

(x,y)=(\cos\theta,\sin\theta)=\left(\frac{1-t^{2}}{1+t^{2}},\frac{2t}{1+t^{2}}\right),

(4)

a rational parametrization of the unit circle. Differentiating

y=\sin\theta=\frac{2t}{1+t^{2}}

and using $x=(1-t^{2})/(1+t^{2})$ one obtains

	$\displaystyle dy=\cos\theta\;d\theta=\frac{2(1-t^{2})}{(1+t^{2})^{2}}\;dt=\frac{2\cos\theta}{1+t^{2}}\;dt$
hence
	$\displaystyle d\theta=\frac{2\;dt}{1+t^{2}}.$

These substitutions help convert any rational trigonometric differential $R(\cos\theta,\sin\theta)\;d\theta$ , where $R(x,y)$ is a quotient of polynomials (including $\sec\theta d\theta$ ), to a rational differential $S(t)\;dt$ that is sometimes easier to integrate.

We also note (see [18, p. 4]) that this rational parametrization of the circle also produces a recipe for constructing Pythagorean triples of integers $X,Y,Z$ such that $X^{2}+Y^{2}=Z^{2}$ (also see [6] for a biography of Pythagoras and [21, Theorem 6.3, p.162] for a purely algebraic description of all primitive — i.e. with no common factors — Pythagorean triples). Let $t=a/b$ where $a$ and $b\neq 0$ are integers. Plug $t$ into the stereographic projection formula (4) to obtain $(x(t),y(t))$ satisfying $x^{2}+y^{2}=1$ then clear denominators to obtain $X^{2}+Y^{2}=Z^{2}$ where $X,Y,Z$ are integers. Conversely, given a Pythagorean triple $(X,Y,Z)$ one may reverse this process to find a rational $t$ that will produce $(X,Y,Z)$ or an integer multiple of it.

The modified Weierstrass substitution $s=\tan(\theta/2+\pi/4)$ also uses a stereographic projection of the unit circle, but from the point $(0,1)$ to the $x$ -axis.

Let $(\cos\theta,\sin\theta)=(x,y)\neq(0,1)$ . The line through $(x,y)$ and $(0,1)$ intersects the $x$ -axis at $(0,s)$ , where $s=\tan(\theta/2+\pi/4)$ . Its slope is $-1/s=(y-1)/x$ so $x=s(1-y)$ . Plug that into the equation $x^{2}+y^{2}-1=0$ , divide by $1-y$ , and solve for $y$ to obtain

(x,y)=(\cos\theta,\sin\theta)=\left(\frac{2s}{s^{2}+1},\frac{s^{2}-1}{s^{2}+1}\right),

another rational parametrization of the unit circle. As before, computing $dy=\cos\theta\;d\theta$ yields a formula for $d\theta$

d\theta=\frac{2}{s^{2}+1}\;ds.

These formulas make integrating

\int\sec\theta\;d\theta=\int\frac{s^{2}+1}{2s}\frac{2}{s^{2}+1}\;ds=\int\frac{ds}{s}=\ln\lvert s\rvert+C

even simpler, without partial fractions. Finally, using $x^{2}+y^{2}=1$ ,

s=\frac{x}{1-y}=\frac{x(1+y)}{1-y^{2}}=\frac{x(1+y)}{x^{2}}=\frac{1}{x}+\frac{y}{x}=\sec\theta+\tan\theta

produces the formula that appears in the back of your textbook. This shows that the Gregory substitution is basically the same as the modified Weierstrass. Nevertheless, we give a separate geometric argument for the Gregory substitution:

The parametrization of the right branch of the hyperbola $x^{2}-y^{2}=1$ is $(x,y)=(\sec\theta,\tan\theta)$ . The line at infinity $z=0$ intersects the homogenized hyperbola $x^{2}-y^{2}=z^{2}$ at the two points at infinity $P_{-}=[1:1:0]$ and $P_{+}=[1:-1:0]$ . The stereographic projection of the hyperbola from $P_{+}$ to the $x$ -axis (which is the projection down a line $x+y=u$ parallel to the asymptote $x+y=0$ ) sends the point $(x,y)$ on the hyperbola to the point $(x+y,0)=(\sec\theta+\tan\theta,0)$ , and we choose the parameter to be $u=x+y=\sec\theta+\tan\theta$ . Solving for the point where the line $x+y=u$ intersects the hyperbola, we obtain

(x(u),y(u))=\left(\frac{u^{2}+1}{2u},\frac{u^{2}-1}{2u}\right)=(\sec\theta,\tan\theta).

This rational parametrization of the hyperbola does the job.

We have

\mathrm{d}u=(\sec\theta\tan\theta+\sec^{2}\theta)\mathrm{d}\theta=u\sec\theta\mathrm{d}\theta,

\frac{\mathrm{d}u}{u}=\sec\theta\;\mathrm{d}\theta,

and this provides another natural buildup to Gregory’s substitution.

Other rational parametrizations of the hyperbola also work. One possible parameter is $v$ in the point $(v,v)$ which is the stereographic projection from $P_{+}$ of the point $(x,y)=(\sec\theta,\tan\theta)$ on the hyperbola to the asymptote $y=x$ . Another is $w=\sqrt{2v}$ , which is the distance from $(v,v)$ to the origin. Both of these two parameters work just as well as the $u$ above, because $v=u/2$ , and $w=u/\sqrt{2}$ . Other choices include the parameters we have used before: $t=\tan(\theta/2)$ and $s=\tan(\theta/2+\pi/4)$ , but these end up requiring partial fractions. The reader may wish to work out formulas for parametrizing the hyperbola with these parameters as an exercise.

We now see that all methods for integrating the secant function have something in common: they rely on certain rational parametrizations of the circle or the hyperbola. The figure below puts all these parametrizations together on the same picture. This shows how our previous three pictures are related.

The relevant points are $B(-1,0)$ , $D(0,1)$ , $C(\cos\theta,\sin\theta)$ , $A(\sec\theta,\tan\theta)$ , and the parameters are:

•

$OT=t=\tan(\theta/2)$ (Weierstrass)
•

$OS=s=\tan(\theta/2+\pi/4)=\sec\theta+\tan\theta$ (modified Weierstrass, Barrow — see (2) or (3), Gregory)

This picture is bursting with interesting geometry problems; see for example [11]. The most striking of these geometric facts is that the stereographic projection of the circle from the point $D(0,1)$ to the $x$ -axis and the stereographic projection of the hyperbola from the point at infinity $P_{+}[1:-1:0]$ to the $x$ -axis (recall that the latter is the projection of the hyperbola to the $x$ -axis parallel to the asymptote $x+y=0$ ) turn out to be the same if we identify the circle and the hyperbola via corresponding points ( $A$ and $C$ ).

Acknowledgments

We thank Frank Miles for helpful comments.

References

[1] Wikipedia, The integral of the secant function,
https://en.wikipedia.org/wiki/Integral_of_the_secant_function
[2] Wikipedia, Weierstrass substitution,
https://en.wikipedia.org/wiki/Weierstrass_substitution
[3] MacTutor, Isaac Barrow,
https://mathshistory.st-andrews.ac.uk/Biographies/Barrow/
[4] MacTutor, James Gregory,
https://mathshistory.st-andrews.ac.uk/Biographies/Gregory/
[5] MacTutor, Gerardus Mercator,
https://mathshistory.st-andrews.ac.uk/Biographies/Mercator_Gerardus/
[6] MacTutor, Pythagoras of Samos,
https://mathshistory.st-andrews.ac.uk/Biographies/Pythagoras/
[7] MacTutor, Karl Weierstrass,
https://mathshistory.st-andrews.ac.uk/Biographies/Weierstrass/
[8] MIT OpenCourseWare, The integral of secant,
https://ocw.mit.edu/courses/mathematics/18-01sc-single-variable-calculus-fall-2010/unit-4-techniques-of-integration/part-a-trigonometric-powers-trigonometric-substitution-and-completing-the-square/session-71-integrals-involving-secant-cosecant-and-cotangent/MIT18_01SCF10_Ses71c.pdf
[9] Joel Feldman, Andrew Rechnitzer and Elyse Yeager, CLP-2 Integral Calculus, https://www.math.ubc.ca/~CLP/CLP2/clp_2_ic_text.pdf
[10] Alexander Inglis, James Gregory: A survey of his work in mathematical analysis, University of St. Andrews (United Kingdom), ProQuest Dissertations Publishing, 1933. https://www-proquest-com.libproxy.csudh.edu/docview/1823909645?pq-origsite=primo
[11] George Jennings, David Ni, Wai Yan Pong and Serban Raianu, Problem 1392, to appear in Pi Mu Epsilon J., fall 2022.
[12] Warren P. Johnson, Down with Weierstrass, Amer. Math. Monthly, 127 (2020), 649–653.
[13] Bill Kinney, Rational Function parameterization of the Unit Circle (the unit circle minus one point), YouTube channel “Bill Kinney Math”,
https://www.youtube.com/watch?v=iICpdqNny5g
[14] Michael Hardy, Efficiency in Antidifferentiation of the Secant Function, Amer. Math. Monthly, 120 (2013), page 580.
[15] V. Frederick Rickey and Philip M. Tuchinsky, An Application of Geography to Mathematics: History of the Integral of the Secant, Mathematics Magazine, 53 (1980), 162–166. https://www.jstor.org/stable/2690106
[16] Maxwell Rosenlicht, Integration in Finite Terms, Amer. Math. Monthly, 79 (1972), 963–972. https://www.jstor.org/stable/2318066?seq=1
[17] Norman Schaumberger, $\int\sec\theta\;{\rm d}\theta$ , Amer.Math.Monthly, 68 (1961), 565.
[18] Joseph H. Silverman, John T. Tate, Rational points on elliptic curves, Second edition, Undergraduate Texts in Mathematics, Springer, 2015.
[19] James Stewart, Essential Calculus, Brooks-Cole, 2013.
[20] Simon W. Strauss, The Integrals $\int\sec x\;{\rm d}x$ and $\int\csc x\;{\rm d}x$ Revisited, J. Wash. Acad. Sci., 70 (1980, 137–143.
[21] James K. Strayer, Elementary Number Theory, Waveland Press, Inc., 1994.
[22] Herbert W. Turnbull, The Exercitationes Geometricae, in James Gregory Tercentenary Memorial Volume, edited by H.W. Turnbull, 459–465, G. Bell and Sons LTD, London, 1939.