This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Computer-assisted proofs of “Kariya’s theorem” with computer algebra

Ayane Ito
University of Tsukuba
   Takefumi Kasai
University of Tsukuba
   Akira Terui
terui@math.tsukuba.ac.jp
University of Tsukuba
Abstract

We demonstrate computer-assisted proofs of “Kariya’s theorem,” a theorem in elementary geometry, with computer algebra. In the proof of geometry theorem with computer algebra, vertices of geometric figures that are subjects for the proof are expressed as variables. The variables are classified into two classes: arbitrarily given points and the points defined from the former points by constraints. We show proofs of Kariya’s theorem with two formulations according to two ways for giving the arbitrary points: one is called “vertex formulation,” and the other is called “incenter formulation,” with two methods: one is Gröbner basis computation, and the other is Wu’s method. Furthermore, we show computer-assisted proofs of the property that the point so-called “Kariya point” is located on the hyperbola so-called “Feuerbach’s hyperbola”, with two formulations and two methods.

1 Introduction

This paper discusses computer-assisted proofs of “Kariya’s theorem,” a theorem in elementary geometry with computer algebra.

In proving elementary geometry theorems with computer algebra, the hypothesis and the conclusion are expressed as a system of polynomial equations and a polynomial equation, respectively. In this formulation, variables appearing in the equations are divided into two sets: one consists of variables corresponding to arbitrarily given points, and the other one consists of variables corresponding to the points derived from the points represented by the variables in the former set, with constraints. The proof is demonstrated by showing that the algebraic variety defined by “hypothesis” equations is included in the algebraic variety defined by the “conclusion” equation. Generally, the “computation” of the proof is reduced to solving an ideal or a radical membership problem derived from “hypothesis” equations. This computation is accomplished by Gröbner basis computation [4] or Wu’s method [2].

“Kariya’s theorem” [5] is a theorem in elementary geometry related to the incenter of a triangle. It was discovered at the end of the 19th century [1] and is still a subject of study today ([6], [10]). In this paper, we show computational proof of the theorem with two different formulations: 1) with the given points located on the vertices of the triangle (which is called “vertex formulation”), and 2) with the given point located in the vertices of the base and the incenter of the triangle (which is called “incenter formulation”). For each formulation, we show the proof of Kariya’s theorem and its corollary with the methods of Gröbner basis computation and Wu’s method. Furthermore, it is known that the point appearing in the assertion of Kariya’s theorem (so-called “Kariya point”) is located on the rectangle hyperbola called “Feuerbach hyperbola.” Therefore, we also show the computer-assisted proofs of this property with the two formulations above using Gröbner basis computation and Wu’s method. We note that, to the authors’ knowledge, except for the proof of the corollary with incenter formulation (see Section˜2.2.2), the proofs shown in the present paper have never appeared in the literature.

The paper is organized as follows. In Section˜2, Kariya’s theorem and its formulations are shown. In Section˜3, methods of computer-assisted proof of Kariya’s theorem with Gröbner basis computation and Wu’s method are explained. In Section˜4, proofs of Kariya’s theorem with Gröbner basis computation and Wu’s method are demonstrated. In Section˜5, we show proofs of the property that the Kariya point is located on the Feuerbach hyperbola using the two methods with two different formulations. Finally, we make concluding remarks in Section˜6.

2 Kariya’s theorem and its formulations

“Kariya’s theorem” is a theorem in elementary geometry describing a property related to the incenter of a triangle. While several mathematicians have discovered its proof (with generalizations of the original theorem) during the end of the 19th century and the beginning of the 20th century [1], The name of Kariya [5] has remained until today.

Kariya’s theorem is as follows. Note that a claim widely known as “Kariya’s theorem” is a corollary of the following theorem (see Corollary 2).

Theorem 1 (Kariya’s theorem [5])

In triangle ABCABC, let OO be the incenter of the triangle ABCABC, and let DD^{\prime}, EE^{\prime}, and FF^{\prime} be the points where the incenter circle touches the sides BCBC, CACA, and ABAB, respectively. For a real number kk, let DD, EE and FF be the points on lines ODOD^{\prime}, OEOE^{\prime} and OFOF^{\prime}, respectively, satisfying that \vvOD=k\vvOD\vv{OD}=k\vv{OD^{\prime}}, \vvOE=k\vvOE\vv{OE}=k\vv{OE^{\prime}}, \vvOF=k\vvOF\vv{OF}=k\vv{OF^{\prime}}. Then, the lines ADAD, BEBE and CFCF are concurrent at a point GG.

The point GG in Theorem 1 is called the “Kariya point.” In Theorem 1, by setting k=1k=1, that is DD, EE and FF coincides with DD^{\prime}, EE^{\prime} and FF^{\prime}, respectively, we obtain the following corollary.

Corollary 2

In triangle ABCABC, let DD, EE and FF be the points where the incenter circle touches the sides BCBC, CACA and ABAB, respectively. Then, the lines ADAD, BEBE and CFCF are concurrent at a point GG.

In proving a theorem in elementary geometry with computer algebra, we give a coordinate system on the real plane (or space). Then, using the coordinates of the points appearing in the proof as variables, we express the relations on the geometric figures containing those points as algebraic relations of variables, which form polynomial equations. As a result, we express the hypothesis and the conclusion as a system of polynomial equations and a polynomial equation, respectively. As a coordinate system, the cartesian coordinate system is widely used.

Some points appearing in the proof whose coordinates are to be expressed as variables are arbitrarily given, and others are derived from the arbitrarily given points. For the arbitrarily given points, let their coordinates be expressed as u1,u2,,umu_{1},u_{2},\dots,u_{m}, and the coordinates of the points derived from the arbitrarily given points whose coordinate is expressed with uiu_{i} be expressed with x1,x2,,xnx_{1},x_{2},\dots,x_{n}. The variables u1,u2,,umu_{1},u_{2},\dots,u_{m} are called “free variables,” and the variables x1,x2,,xnx_{1},x_{2},\dots,x_{n} are called “dependent variables.”

Properties of geometric figures are expressed as polynomial equations with respect to the above variables. For convenience, tuples of variables are denoted as 𝐮=(u1,u2,,um)\mathbf{u}=(u_{1},u_{2},\dots,u_{m}) and 𝐱=(x1,x2,,xn)\mathbf{x}=(x_{1},x_{2},\dots,x_{n}). The number of polynomial equations expressing the hypothesis generally equals the number of dependent variables mm. Thus, let the polynomial equations expressing the hypothesis be

h1(𝐮,𝐱)=0,,hn(𝐮,𝐱)=0,h_{1}(\mathbf{u},\mathbf{x})=0,\dots,h_{n}(\mathbf{u},\mathbf{x})=0,

where h1(𝐮,𝐱),,hn(𝐮,𝐱)[𝐮,𝐱]h_{1}(\mathbf{u},\mathbf{x}),\dots,h_{n}(\mathbf{u},\mathbf{x})\in\mathbb{R}[\mathbf{u},\mathbf{x}], and the polynomial equation expressing the conclusion be g(𝐮,𝐱)=0g(\mathbf{u},\mathbf{x})=0, where g(𝐮,𝐱)[𝐮,𝐱]g(\mathbf{u},\mathbf{x})\in\mathbb{R}[\mathbf{u},\mathbf{x}].

With the discussion above, let us show formulations of Kariya’s theorem into polynomial equations. We have two kinds of formulations according to setting the arbitrarily given points as

  1. 1.

    The vertices of triangle ABCABC, and

  2. 2.

    The vertices of the base and the incenter of triangle ABCABC.

The former is called “vertex formulation,” and the latter is called “the incenter formulation.”

2.1 A formulation with setting arbitrarily given points as the vertices of a triangle (vertex formulation)

In vertex formulation, without loss of generality, let BCBC be the base of triangle ABCABC with setting A(u1,u2)A(u_{1},u_{2}), B(0,0)B(0,0), C(0,1)C(0,1) where u1>0u_{1}>0 and u2>0u_{2}>0. Formulations for Theorem 1 and Corollary 2 are shown as follows.

2.1.1 The case of Theorem 1

In the formulation of Theorem 1 (see Figure˜1), let O(x1,x13)O(x_{1},x_{13}) be the incenter and D(x1,0)D^{\prime}(x_{1},0), E(x2,x3)E^{\prime}(x_{2},x_{3}) and F(x4,x5)F^{\prime}(x_{4},x_{5}) be the points where the incenter circle touches the sides BCBC, CACA and ABAB, respectively. Let D(x1,x6)D(x_{1},x_{6}), E(x7,x8)E(x_{7},x_{8}) and F(x9,x10)F(x_{9},x_{10}) be the points on lines ODOD^{\prime}, OEOE^{\prime} and OFOF^{\prime}, respectively, satisfying that, for kk\in\mathbb{R}, \vvOD=k\vvOD\vv{OD}=k\vv{OD^{\prime}}, \vvOE=k\vvOE\vv{OE}=k\vv{OE^{\prime}} and \vvOF=k\vvOF\vv{OF}=k\vv{OF^{\prime}}. Let G(x11,x12)G(x_{11},x_{12}) be the Kariya point. Then, the hypothesis are expressed as h1,,h13h_{1},\dots,h_{13} in Equation˜1, and the conclusion is expressed as g1g_{1} in Equation˜2. Note that, in each equation, an arrow (\Longleftrightarrow) followed by a comment on the right side shows the corresponding geometric condition.

h1=x12x42x52BD=BF,h2=(1x1)2(1x2)2x32CD=CE,h3=(u1x4)2+(u2x5)2(u1x2)2(u2x3)2AF=AE,h4=u1x5u2x4AB and F are collinear,h5=u2(1x2)x3(1u1)AE and C are collinear,h6=(x2x1)2+(x3x13)2x132OD=OE,h7=(x6x13)+kx13\vvOD=k\vvOD,h8=(x7x1)k(x2x1)\vvOE=k\vvOE(with respect to the x coordinate),h9=(x8x13)k(x3x13)\vvOE=k\vvOE(with respect to the y coordinate),h10=(x9x1)k(x4x1)\vvOF=k\vvOF(with respect to the x coordinate),h11=(x10x13)k(x5x13)\vvOF=k\vvOF(with respect to the y coordinates),h12=(u2x6)(u1x11)(u1x1)(u2x12)AG and D are collinear,h13=(1x11)x10(1x9)x12CG and F are collinear.\begin{split}h_{1}&=x_{1}^{2}-x_{4}^{2}-x_{5}^{2}\,\Longleftrightarrow\,BD^{\prime}=BF^{\prime},\\ h_{2}&=(1-x_{1})^{2}-(1-x_{2})^{2}-x_{3}^{2}\,\Longleftrightarrow\,CD^{\prime}=CE^{\prime},\\ h_{3}&=(u_{1}-x_{4})^{2}+(u_{2}-x_{5})^{2}-(u_{1}-x_{2})^{2}-(u_{2}-x_{3})^{2}\,\Longleftrightarrow\,AF^{\prime}=AE^{\prime},\\ h_{4}&=u_{1}x_{5}-u_{2}x_{4}\,\Longleftrightarrow\,\text{$A$, $B$ and $F^{\prime}$ are collinear,}\\ h_{5}&=u_{2}(1-x_{2})-x_{3}(1-u_{1})\,\Longleftrightarrow\,\text{$A$, $E^{\prime}$ and $C$ are collinear,}\\ h_{6}&=(x_{2}-x_{1})^{2}+(x_{3}-x_{13})^{2}-x_{13}^{2}\,\Longleftrightarrow\,OD^{\prime}=OE^{\prime},\\ h_{7}&=(x_{6}-x_{13})+kx_{13}\,\Longleftrightarrow\,\vv{OD}=k\vv{OD^{\prime}},\\ h_{8}&=(x_{7}-x_{1})-k(x_{2}-x_{1})\,\Longleftrightarrow\,\vv{OE}=k\vv{OE^{\prime}}\quad\text{(with respect to the $x$ coordinate),}\\ h_{9}&=(x_{8}-x_{13})-k(x_{3}-x_{13})\,\Longleftrightarrow\,\vv{OE}=k\vv{OE^{\prime}}\quad\text{(with respect to the $y$ coordinate),}\\ h_{10}&=(x_{9}-x_{1})-k(x_{4}-x_{1})\,\Longleftrightarrow\,\vv{OF}=k\vv{OF^{\prime}}\quad\text{(with respect to the $x$ coordinate),}\\ h_{11}&=(x_{10}-x_{13})-k(x_{5}-x_{13})\,\Longleftrightarrow\,\vv{OF}=k\vv{OF^{\prime}}\quad\text{(with respect to the $y$ coordinates),}\\ h_{12}&=(u_{2}-x_{6})(u_{1}-x_{11})-(u_{1}-x_{1})(u_{2}-x_{12})\,\Longleftrightarrow\,\text{$A$, $G$ and $D$ are collinear,}\\ h_{13}&=(1-x_{11})x_{10}-(1-x_{9})x_{12}\,\Longleftrightarrow\,\text{$C$, $G$ and $F$ are collinear.}\end{split} (1)
g=x7x12x8x11BG and E are collinear.g=x_{7}x_{12}-x_{8}x_{11}\,\Longleftrightarrow\,\text{$B$, $G$ and $E$ are collinear.} (2)
Refer to caption
Fig. 1: An example of Theorem 1 with vertex formulation. See Section˜2.1.1 for details.

2.1.2 The case of Corollary 2

In Corollary 2 (see Figure˜2), the points DD, EE and FF coincides with D(x1,0)D^{\prime}(x_{1},0), E(x2,x3)E^{\prime}(x_{2},x_{3}) and F(x4,x5)F^{\prime}(x_{4},x_{5}), respectively, in Theorem 1. Thus, in the formulation of Corollary 2, set D(x1,0)D(x_{1},0), E(x2,x3)E(x_{2},x_{3}), F(x4,x5)F(x_{4},x_{5}) and G(x6,x7)G(x_{6},x_{7}). Then, the hypothesis are expressed as h1,,h8h_{1},\dots,h_{8} in Equation˜3, and the conclusion is expressed as gg in Equation˜4.

h1=x12x42x52BD=BF,h2=(u1x4)2+(u2x5)2(u1x2)2(u2x3)2AF=AE,h3=(x21)2+x32(1x1)2CE=CD,h4=u1x5u2x4AF and B are collinear,h5=u2(1x2)x3(1u1)AE and C are collinear,h6=(u1x1)x7(x6x1)u2AG and D are collinear,h7=x2x7x6x3BG and E are collinear.\begin{split}h_{1}&=x_{1}^{2}-x_{4}^{2}-x_{5}^{2}\,\Longleftrightarrow\,BD=BF,\\ h_{2}&=(u_{1}-x_{4})^{2}+(u_{2}-x_{5})^{2}-(u_{1}-x_{2})^{2}-(u_{2}-x_{3})^{2}\,\Longleftrightarrow\,AF=AE,\\ h_{3}&=(x_{2}-1)^{2}+x_{3}^{2}-(1-x_{1})^{2}\,\Longleftrightarrow\,CE=CD,\\ h_{4}&=u_{1}x_{5}-u_{2}x_{4}\,\Longleftrightarrow\,\text{$A$, $F$ and $B$ are collinear,}\\ h_{5}&=u_{2}(1-x_{2})-x_{3}(1-u_{1})\,\Longleftrightarrow\,\text{$A$, $E$ and $C$ are collinear,}\\ h_{6}&=(u_{1}-x_{1})x_{7}-(x_{6}-x_{1})u_{2}\,\Longleftrightarrow\,\text{$A$, $G$ and $D$ are collinear,}\\ h_{7}&=x_{2}x_{7}-x_{6}x_{3}\,\Longleftrightarrow\,\text{$B$, $G$ and $E$ are collinear.}\end{split} (3)
g=(x41)x7(x61)x5CG and F are collinear.g=(x_{4}-1)x_{7}-(x_{6}-1)x_{5}\,\Longleftrightarrow\,\text{$C$, $G$ and $F$ are collinear.} (4)
Refer to caption
Fig. 2: An example of Corollary 2 with vertex formulation. See Section˜2.1.2 for details.

2.2 A formulation with setting arbitrarily given points as the vertices of the base and the incenter of the triangle (incenter formulation)

In incenter formulation, without loss of generality, let BCBC be the base of triangle ABCABC with setting B(0,0)B(0,0) and C(1,0)C(1,0). Furthermore, let O(u1,u2)O(u_{1},u_{2}) be the incenter of triangle ABCABC. Formulations for Theorem 1 and Corollary 2 are shown as follows.

2.2.1 The case of Theorem 1

In the formulation of Theorem 1 (see Figure˜3), let A(x2,x1)A(x_{2},x_{1}) be the remaining vertex of the triangle, and let D(u1,0)D^{\prime}(u_{1},0), E(x4,x3)E^{\prime}(x_{4},x_{3}) and F(x6,x5)F^{\prime}(x_{6},x_{5}) be the points where the incenter circle touches the sides BCBC, CACA and ABAB, respectively. Let D(u1,x7)D(u_{1},x_{7}), E(x9,x8)E(x_{9},x_{8}) and F(x11,x10)F(x_{11},x_{10}) be the points on the lines ODOD^{\prime}, OEOE^{\prime} and OFOF^{\prime}, respectively, satisfying that, for kk\in\mathbb{R}, \vvOD=k\vvOD\vv{OD}=k\vv{OD^{\prime}}, \vvOE=k\vvOE\vv{OE}=k\vv{OE^{\prime}} and \vvOF=k\vvOF\vv{OF}=k\vv{OF^{\prime}}. Then, the hypothesis are expressed as h1,,h13h_{1},\dots,h_{13} in eq.˜5, and the conclusion is expressed as g1g_{1} in eq.˜6.

h1=(u11)2x1+u22x1+2(u11)u2(x21)tanBCO=tanOCA,h2=x1(u12+u22)+2u1u2x2tanCBO=tanOBA,h3=x3(x21)x1(x41)AE and C are collinear,h4=(x21)(x4u1)+x1(x3u2)OE and CA intersect perpendicularly,h5=x1x6x2x5AF and B are collinear,h6=x2(x6u1)+x1(x5u2)OF and BA intersect perpendicularly,h7=x7+u2(k1)\vvOD=k\vvOD(with respect to the y coordinate),h8=(x9u1)(k(x4u1))\vvOE=k\vvOE(with respect to the x coordinate),h9=(x8u2)(k(x3u2))\vvOE=k\vvOE(with respect to the y coordinate),h10=(x11u1)(k(x6u1))\vvOF=k\vvOF(with respect to the x coordinate),h11=(x10u2)(k(x5u2))\vvOF=k\vvOF(with respect to the y coordinate),h12=(x13u1)(x1x7)(x12x7)(x2u1)AG and D are collinear,h13=x12(x111)x10(x131)CG and F are collinear.\begin{split}h_{1}&=-(u_{1}-1)^{2}x_{1}+u_{2}^{2}x_{1}+2(u_{1}-1)u_{2}(x_{2}-1)\,\Longleftrightarrow\,\tan{\angle BCO}=\tan{\angle OCA},\\ h_{2}&=x_{1}(-u_{1}^{2}+u_{2}^{2})+2u_{1}u_{2}x_{2}\,\Longleftrightarrow\,\tan{\angle CBO}=\tan{\angle OBA},\\ h_{3}&=x_{3}(x_{2}-1)-x_{1}(x_{4}-1)\,\Longleftrightarrow\,\text{$A$, $E^{\prime}$ and $C$ are collinear,}\\ h_{4}&=(x_{2}-1)(x_{4}-u_{1})+x_{1}(x_{3}-u_{2})\,\Longleftrightarrow\,\text{$OE^{\prime}$ and $CA$ intersect perpendicularly,}\\ h_{5}&=x_{1}x_{6}-x_{2}x_{5}\,\Longleftrightarrow\,\text{$A$, $F^{\prime}$ and $B$ are collinear,}\\ h_{6}&=x_{2}(x_{6}-u_{1})+x_{1}(x_{5}-u_{2})\,\Longleftrightarrow\,\text{$OF^{\prime}$ and $BA$ intersect perpendicularly,}\\ h_{7}&=x_{7}+u_{2}(k-1)\,\Longleftrightarrow\,\vv{OD}=k\vv{OD^{\prime}}\,\text{(with respect to the $y$ coordinate),}\\ h_{8}&=(x_{9}-u_{1})-(k(x_{4}-u_{1}))\,\Longleftrightarrow\,\vv{OE}=k\vv{OE^{\prime}}\,\text{(with respect to the $x$ coordinate),}\\ h_{9}&=(x_{8}-u_{2})-(k(x_{3}-u_{2}))\,\Longleftrightarrow\,\vv{OE}=k\vv{OE^{\prime}}\,\text{(with respect to the $y$ coordinate),}\\ h_{10}&=(x_{11}-u_{1})-(k(x_{6}-u_{1}))\,\Longleftrightarrow\,\vv{OF}=k\vv{OF^{\prime}}\,\text{(with respect to the $x$ coordinate),}\\ h_{11}&=(x_{10}-u_{2})-(k(x_{5}-u_{2}))\,\Longleftrightarrow\,\vv{OF}=k\vv{OF^{\prime}}\,\text{(with respect to the $y$ coordinate),}\\ h_{12}&=(x_{13}-u_{1})(x_{1}-x_{7})-(x_{12}-x_{7})(x_{2}-u_{1})\,\Longleftrightarrow\,\text{$A$, $G$ and $D$ are collinear,}\\ h_{13}&=x_{12}(x_{11}-1)-x_{10}(x_{13}-1)\,\Longleftrightarrow\,\text{$C$, $G$ and $F$ are collinear.}\\ \end{split} (5)
g=x8x13x9x12BG and E are collinear.g=x_{8}x_{13}-x_{9}x_{12}\,\Longleftrightarrow\,\text{$B$, $G$ and $E$ are collinear.} (6)
Refer to caption
Fig. 3: An example of Theorem 1 with incenter formulation. Note that, in this figure, points DD, EE and FF divides segments ODOD^{\prime}, OEOE^{\prime} and OFOF^{\prime} externally, respectively, while, in fig.˜1, points DD, EE and FF divides segments ODOD^{\prime}, OEOE^{\prime} and OFOF^{\prime} internally, respectively. See Section˜2.1.1 for details.

2.2.2 The case of Corollary 2

In the formulation of Corollary 2 (see fig.˜4) As in Section˜2.1.2, set D(u1,0)D(u_{1},0), E(x4,x3)E(x_{4},x_{3}) and F(x6,x5)F(x_{6},x_{5}). Then, the hypothesis are expressed as h1,,h7h_{1},\dots,h_{7} in eq.˜7, and the conclusion is expressed as gg in eq.˜8. Note that this formulation has also been given in Chou [2, Example 336].

h1=u22((1x4)2+x32)(1u1)2((u1x4)2+(u2x3)2)tanBCO=tanOCA,h2=u22(x62+x52)u12((u1x6)2+(u2x5)2)tanCBO=tanOBA,h3=x3(x21)x1(x41)AE and C are collinear,h4=x3(x3u2)+(x41)(x4u1)OE and CE intersect perpendicularly,h5=x6x1x2x5AF and B are collinear,h6=x5(x5u2)+x6(x6u1)OF and BF intersect perpendicularly,h7=x8x3x4x7BG and E are collinear,h8=(x8u1)x1x7(x2u1)AG and D are collinear.\begin{split}h_{1}&=u_{2}^{2}((1-x_{4})^{2}+x_{3}^{2})-(1-u_{1})^{2}((u_{1}-x_{4})^{2}+(u_{2}-x_{3})^{2})\,\Longleftrightarrow\,\tan{\angle BCO}=\tan{\angle OCA},\\ h_{2}&=u_{2}^{2}(x_{6}^{2}+x_{5}^{2})-u_{1}^{2}((u_{1}-x_{6})^{2}+(u_{2}-x_{5})^{2})\,\Longleftrightarrow\,\tan{\angle CBO}=\tan{\angle OBA},\\ h_{3}&=x_{3}(x_{2}-1)-x_{1}(x_{4}-1)\,\Longleftrightarrow\,\text{$A$, $E$ and $C$ are collinear,}\\ h_{4}&=x_{3}(x_{3}-u_{2})+(x_{4}-1)(x_{4}-u_{1})\,\Longleftrightarrow\,\text{$OE$ and $CE$ intersect perpendicularly,}\\ h_{5}&=x_{6}x_{1}-x_{2}x_{5}\,\Longleftrightarrow\,\text{$A$, $F$ and $B$ are collinear,}\\ h_{6}&=x_{5}(x_{5}-u_{2})+x_{6}(x_{6}-u_{1})\,\Longleftrightarrow\,\text{$OF$ and $BF$ intersect perpendicularly,}\\ h_{7}&=x_{8}x_{3}-x_{4}x_{7}\,\Longleftrightarrow\,\text{$B$, $G$ and $E$ are collinear,}\\ h_{8}&=(x_{8}-u_{1})x_{1}-x_{7}(x_{2}-u_{1})\,\Longleftrightarrow\,\text{$A$, $G$ and $D$ are collinear.}\end{split} (7)
g=(x81)x5x7(x61)CG and F are collinear.g=(x_{8}-1)x_{5}-x_{7}(x_{6}-1)\,\Longleftrightarrow\,\text{$C$, $G$ and $F$ are collinear.} (8)
Refer to caption
Fig. 4: An example of Corollary 2 with incenter formulation. See Section˜2.2.2 for details.

3 Proofs of Kariya’s theorem with Gröbner basis computation and Wu’s method

We review the fundamental theory of proving geometric theorem with computer algebra, following Cox et al. [4]. Assume that the hypothesis are expressed as h1(𝐮,𝐱),,hn(𝐮,𝐱)h_{1}(\mathbf{u},\mathbf{x}),\dots,h_{n}(\mathbf{u},\mathbf{x}) and the conclusion is expressed as g(𝐮,𝐱)g(\mathbf{u},\mathbf{x}), where 𝐮\mathbf{u}, 𝐱\mathbf{x}, h1,,hnh_{1},\dots,h_{n} and gg are defined as the same as above. Proving a theorem in elementary geometry can be reduced to showing that the real zeros of the equations a h1(𝐮,𝐱)==hn(𝐮,𝐱)=0h_{1}(\mathbf{u},\mathbf{x})=\cdots=h_{n}(\mathbf{u},\mathbf{x})=0 are also zeros of the equation g(𝐮,𝐱)=0g(\mathbf{u},\mathbf{x})=0. This idea gives us a naive definition that one can deduce the conclusion. In what follows, let V=𝐕(h1,,hn)m+nV=\mathbf{V}(h_{1},\dots,h_{n})\subset\mathbb{R}^{m+n} be the affine variety defined by h1,,hnh_{1},\dots,h_{n} and let 𝐈(V)\mathbf{I}(V) be the ideal of VV.

Definition 3 (“follows strictly”)

The conclusion gg follows strictly from the hypothesis h1,,hnh_{1},\dots,h_{n} if g𝐈(V)g\in\mathbf{I}(V).

Proposition 4

If gh1,,hng\in\sqrt{\langle{h_{1},\dots,h_{n}}\rangle}, then gg follows strictly from h1,,hnh_{1},\dots,h_{n}.

From Definition 3 and Proposition 4, a naive proof of the theorem is reduced to solving the radical membership problem. However, this condition seems too strict because the converse of Proposition 4 may not be true. In such a case, there may exist a polynomial h(𝐮)h(\mathbf{u}) in the hypothesis containing only independent variables 𝐮\mathbf{u}, and h(𝐮¯)=0h(\bar{\mathbf{u}})=0 for 𝐮¯=(u¯1,,u¯m)V\bar{\mathbf{u}}=(\bar{u}_{1},\dots,\bar{u}_{m})\in V, which means a degenerate case of the configuration of geometric figures [4]. To avoid such degenerate cases, we handle a subvariety of VV satisfying that for the points in which a defining polynomial with only independent variables is always nonzero, as in the following definition.

Definition 5 (Algebraically independent)

Let WRm+nW\subseteq R^{m+n} be an irreducible affine variety with the coordinates u1,,um,x1,,xnu_{1},\dots,u_{m},x_{1},\dots,x_{n}. The variables u1,,umu_{1},\dots,u_{m} are algebraically independent on WW if there exist no nonzero polynomial with variables u1,,umu_{1},\dots,u_{m} that has zeros in WW, that is, u1,,umu_{1},\dots,u_{m} satisfy that 𝐈(W)[u1,,um]={0}\mathbf{I}(W)\cap\mathbb{R}[u_{1},\dots,u_{m}]=\{0\}.

Then, we accept non-degenerate cases for the geometric proving with the following definition.

Definition 6 (“follows generically”)

The conclusion gg follows generically from the hypothesis h1,,hnh_{1},\dots,h_{n} if

g𝐈(V)R[u1,,um,x1,,xn],g\in\mathbf{I}(V^{\prime})\subseteq R[u_{1},\dots,u_{m},x_{1},\dots,x_{n}],

where V=W1Wpm+nV^{\prime}=W_{1}\cup\cdots\cup W_{p}\subset\mathbb{R}^{m+n} satisfying that, for i=1,,pi=1,\dots,p, WiW_{i} is irreducible and u1,,umu_{1},\dots,u_{m} are algebraically independent on WiW_{i}.

Initially, for deriving a proof with Definition 6, one needs to compute irreducible components of V=𝐕(h1,,hn)V=\mathbf{V}(h_{1},\dots,h_{n}). Fortunately, we have the following proposition.

Proposition 7

Let H=h1,,hnH=\langle{h_{1},\dots,h_{n}}\rangle. If there exists a nonzero polynomial c(u1,,um)[u1,,um]c(u_{1},\dots,u_{m})\in\mathbb{R}[u_{1},\dots,u_{m}] satisfying that cgHc\cdot g\in\sqrt{H}, then the conclusion gg follows generically from the hypothesis h1,,hnh_{1},\dots,h_{n}.

Note that, if gHg\in H, then gg and HH satisfy Proposition 7.

3.1 Computing a proof with Gröbner basis computation

This section explains computing a proof with Gröbner basis computation [4]. Proposition 7 tells us that computing a proof is reduced to solving the radical membership problem. We have the following corollary.

Corollary 8

Under the conditions of Proposition 7, the following are equivalent.

  1. 1.

    There exists a nonzero polynomial c(u1,,um)[u1,,um]c(u_{1},\dots,u_{m})\in\mathbb{R}[u_{1},\dots,u_{m}] satisfying that cgHc\cdot g\in\sqrt{H}.

  2. 2.

    Let H~\tilde{H} be an ideal in (u1,,um)[x1,,xn]\mathbb{R}(u_{1},\dots,u_{m})[x_{1},\dots,x_{n}] generated by h1,,hnh_{1},\dots,h_{n}. Then, we have gH~g\in\sqrt{\tilde{H}}.

  3. 3.

    The reduced Gröbner basis of an ideal h1,,hn,1yg(u1,,um)[x1,,xn,y]\langle{h_{1},\dots,h_{n},1-yg}\rangle\subseteq\mathbb{R}(u_{1},\dots,u_{m})[x_{1},\dots,x_{n},y] is equal to {1}\{1\}.

Corollary 8 tells us that computing the proof is reduced to either solving the ideal membership problem gHg\in H or computing the reduced Gröber basis of the ideal h1,,hn,1yg\langle{h_{1},\dots,h_{n},1-yg}\rangle.

3.2 Computing a proof with Wu’s method

In this section, we explain computing proof with Wu’s method. Note that the method of computation presented here is an elementary version of Wu’s method [4], and a complete version of it can be found in other literature (for example, see Chou [2]). In Wu’s method, we first “triangulate” the polynomials corresponding to the hypothesis by pseudo-divisions. Then, we repeat pseudo-divisions on the polynomial corresponding to the conclusion by the triangulated polynomials to show that the conclusion follows from the hypothesis.

Proposition 9 (Pseudo-division [4])

Let f,gk[x1,,xn,y]f,g\in k[x_{1},\ldots,x_{n},y] be polynomials expressed as

f=cpyp++c1y+c0,g=dmym++d1y+d0,f=c_{p}y^{p}+\ldots+c_{1}y+c_{0},\quad g=d_{m}y^{m}+\ldots+d_{1}y+d_{0}, (9)

where ci,dik[x1,,xn]c_{i},d_{i}\in k[x_{1},\ldots,x_{n}] with mpm\leq p and dm0d_{m}\neq 0. Then, there exist polynomials q,rk[x1,,xn]q,r\in k[x_{1},\ldots,x_{n}] satisfying the following conditions.

  1. 1.

    r=0r=0, or degyr<m\deg_{y}r<m and there exists a nonnegative integer ss satisfying dmsf=qg+rd_{m}^{s}f=qg+r.

  2. 2.

    rf,gr\in\langle{f,g}\rangle in the ring k[x1,,xn,y]k[x_{1},\ldots,x_{n},y].

In Proposition 9, polynomials qq and rr are called a pseudoquotient and a pseudoremainder, respectively, of ff on pseudo-division by gg with respect to yy. The pseudoremainder rr is denoted by prem(f,g,y)\mathrm{prem}(f,g,y).

In the algorithm of pseudo-division, dmsd_{m}^{s} is chosen such that the division is executed in the polynomial ring k[x1,,xn]k[x_{1},\ldots,x_{n}]. Furthermore, in place of dmsd_{m}^{s}, dm/gcd(dm,cp,,c0)d_{m}/\gcd(d_{m},c_{p},\dots,c_{0}) can be used for avoiding the growth of degrees of coefficient polynomials [8].

In “triangulation” of the hypothesis polynomials h1,,hnk[𝐮,𝐱]h_{1},\ldots,h_{n}\in k[\mathbf{u},\mathbf{x}], pseudo-divisions with respect to variables xn,xn1,x1x_{n},x_{n-1}\ldots,x_{1} is executed repeatedly for reducing to a “triangulated” system of polynomials

f1(x1),f2(x1,x2),,fn(x1,,xn).\begin{split}f_{1}(x_{1}),f_{2}(x_{1},x_{2}),\dots,f_{n}(x_{1},\ldots,x_{n}).\end{split} (10)

The order of variables used for computing f1,,fnf_{1},\dots,f_{n} is denoted by

xnxn1x1,x_{n}\succ x_{n-1}\succ\cdots\succ x_{1},

and the set of polynomials in eq.˜10 is called an ascending chain.

Definition 10 (Irreducible ascending chain)

An ascending chain of polynomials in eq.˜10 is called irreducible if, for i=1,,ni=1,\dots,n, fif_{i} is irreducible in the polynomial ring k(u1,,um)[xi,,xi]/f1,,fi1k(u_{1},\dots,u_{m})[x_{i},\dots,x_{i}]/\langle{f_{1},\dots,f_{i-1}}\rangle.

Then, for the conclusion polynomial gk[𝐮,𝐱]g\in k[\mathbf{u},\mathbf{x}], pseudo-division by the polynomials in the ascending chain eq.˜10 is repeated for computing polynomials Rn1,,Rn0R_{n-1},\dots,R_{n-0} as

Rn1=prem(g,fn,xn),Rn2=prem(Rn1,fn1,xn1),,R0=prem(R1,f1,x1),R_{n-1}=\mathrm{prem}(g,f_{n},x_{n}),\,R_{n-2}=\mathrm{prem}(R_{n-1},f_{n-1},x_{n-1}),\dots,R_{0}=\mathrm{prem}(R_{1},f_{1},x_{1}), (11)

and R0R_{0} is denoted by prem(g,f1,,fn)\mathrm{prem}(g,f_{1},\dots,f_{n}). We have the following proposition.

Proposition 11

Let {f1,,fn}\{f_{1},\dots,f_{n}\} be an ascending chain derived from the hypothesis polynomials h1,,hnh_{1},\dots,h_{n} expressed as in eq.˜10, and let gg be the conclusion polynomial. Then, the following are equivalent.

  1. 1.

    prem(g,f1,,fn)=0\mathrm{prem}(g,f_{1},\dots,f_{n})=0.

  2. 2.

    There exists a nonzero polynomial c(𝐮)[𝐮]c(\mathbf{u})\in\mathbb{R}[\mathbf{u}] satisfying that cgf1,,fnc\cdot g\in\langle{f_{1},\dots,f_{n}}\rangle.

prem(g,h1,,hn)\mathrm{prem}(g,h_{1},\dots,h_{n})

In Proposition 11, note that we have f1,,fnH\langle{f_{1},\dots,f_{n}}\rangle\subset H, where HH is defined as in Proposition 7, since fiHf_{i}\in H. Thus, Corollary 8 and Proposition 11 tells us that, if we have prem(g,f1,,fn)=0\mathrm{prem}(g,f_{1},\dots,f_{n})=0, then the conclusion gg follows generically from the hypothesis h1,,hnh_{1},\dots,h_{n}.

4 Experiments

We have implemented an elementary version of Wu’s method on the Computer Algebra System (CAS) Risa/Asir [7], and have computed proofs of Theorem 1 and Corollary 2 with the Gröbner basis computation and Wu’s method using the vertex and the incenter formulations [9]. The test was conducted in the following environment: Intel Xeon Silver 4210 at 2.20 GHz, RAM 256 GB, Linux 5.4.0 (SMP), Asir Version 20210326.

4.1 Computing proofs with the Gröbner basis computation

This section separately explains computing proofs with the Gröbner basis computation for the vertex and the incenter formulations.

4.1.1 Computing proofs using the vertex formulation

In computing the proof of Theorem 1, for the hypothesis polynomials h1,,h13h_{1},\dots,h_{13} in eq.˜1, we have computed a Grob̈ner basis G1G_{1} of the ideal I=h1,,h13I=\langle{h_{1},\dots,h_{13}}\rangle with respect to the degree reverse lexicographic (DegRevLex) ordering with the variable order given as

x6x7x8x9x10x11x12x1x2x3x4x5x13.x_{6}\succ x_{7}\succ x_{8}\succ x_{9}\succ x_{10}\succ x_{11}\succ x_{12}\succ x_{1}\succ x_{2}\succ x_{3}\succ x_{4}\succ x_{5}\succ x_{13}. (12)

Then, for the conclusion polynomial gg in eq.˜2, we have verified that gH=h1,,h13g\in H=\langle{h_{1},\dots,h_{13}}\rangle by showing that the normal form of gg with respect to G1G_{1} is equal to 0.

In computing the proof of Corollary 2, for the hypothesis polynomials h1,,h7h_{1},\dots,h_{7} in eq.˜3, we have computed a Grob̈ner basis G2G_{2} of the ideal I=h1,,h7I=\langle{h_{1},\dots,h_{7}}\rangle with respect to the DegRevLex ordering with the variable order given as x7x6x5x4x3x2x1x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}. Then, for the conclusion polynomial gg in eq.˜4, we have verified that gH=h1,,h7g\in H=\langle{h_{1},\dots,h_{7}}\rangle by showing that the normal form of gg with respect to G2G_{2} is equal to 0.

4.1.2 Computing proofs using the incenter formulation

In computing the proof of Theorem 1, for the hypothesis polynomials h1,,h13h_{1},\dots,h_{13} in eq.˜5, we have computed a Grob̈ner basis G3G_{3} of the ideal I=h1,,h13I=\langle{h_{1},\dots,h_{13}}\rangle with respect to the DegRevLex ordering with the variable order given as x13x12x11x10x9x8x7x6x5x4x3x2x1x_{13}\succ x_{12}\succ x_{11}\succ x_{10}\succ x_{9}\succ x_{8}\succ x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}. Then, for the conclusion polynomial gg in eq.˜6, we have verified that gH=h1,,h13g\in H=\langle{h_{1},\dots,h_{13}}\rangle by showing that the normal form of gg with respect to G3G_{3} is equal to 0.

In computing the proof of Corollary 2, for the hypothesis polynomials h1,,h7h_{1},\dots,h_{7} in eq.˜7, we have computed a Grob̈ner basis G4G_{4} of the ideal I=h1,,h7I=\langle{h_{1},\dots,h_{7}}\rangle with respect to the DegRevLex ordering with the variable order given as x8x7x6x5x4x3x2x1x_{8}\succ x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}. We have computed that the normal form of the conclusion polynomial gg in eq.˜8 with respect G4G_{4} is not equal to zero, and the reduced Gröbner basis of the ideal h1,,h7,1yg\langle{h_{1},\dots,h_{7},1-yg}\rangle is not equal to {1}\{1\}. Then, by adding a constraint that u10u_{1}\neq 0 (with a new variable y2y_{2}), we have computed that the reduced Gröbner basis of the ideal

h1,,h7,1yg,1y2u1\langle{h_{1},\dots,h_{7},1-yg,1-y_{2}u_{1}}\rangle (13)

equals {1}\{1\}.

4.1.3 Computing time and the variable ordering

Table˜1 shows the computing time of computing the proofs in this section. For each formulation, the table shows the computing times of the Gröbner basis and the normal form. The exception is the proof of Corollary 2 using the incenter formulation: in this case, the table shows only the computing time of the reduced Gröbner basis of the ideal in eq.˜13. In the tables below, computing time with the letter {\dagger}, {\ddagger} and {\dagger}{\dagger} denote the average of repeatedly measured data for 10, 100, and 1000 times, respectively.

In each computation of the proof, variable ordering is defined as follows. For the proof of Theorem 1 using the vertex formulation, in the hypothesis polynomials in eq.˜1, the variables x6,x7,x8,x9,x10,x11,x12x_{6},x_{7},x_{8},x_{9},x_{10},x_{11},x_{12} appear in the terms of total degree 11. Thus, we have defined the variable ordering as in eq.˜12 for reducing the terms in x6,x7,x8,x9,x10,x11,x12x_{6},x_{7},x_{8},x_{9},x_{10},x_{11},x_{12} first. In the other cases, since the computing time of the Gröbner basis, as well as the normal form, was sufficiently small, we have defined the variable ordering as x13x11x1x_{13}\succ x_{11}\succ\cdots\succ x_{1} for the proof of Theorem 1 and x8x7x1x_{8}\succ x_{7}\succ\cdots\succ x_{1} for the proof of Corollary 2.

Table 1: Computing time of the proofs with Gröbner basis computation. Note that computing time with the letter {\dagger}, and {\dagger}{\dagger} denote the average of repeatedly measured data for 10 and 1000 times, respectively. See Section˜4.1.3 for details.
Formulation Theorem Computing time (sec.)
Gröbner basis The normal form
Vertex formulation Theorem 1 762.0762.0^{{\dagger}} 0.41940.4194^{{\dagger}}
Corollary 2 0.5240.524^{{\dagger}} 0.021520.02152^{{\dagger}}
Incenter formulation Theorem 1 0.051790.05179^{{\dagger}} 0.0073140.007314^{{\dagger}}
Corollary 2 0.0020690.002069^{{\dagger}{\dagger}} N/A

4.2 Computing proofs with Wu’s method

In this section, we explain computing proofs with Wu’s method separately for the vertex and the incenter formulations.

4.2.1 Computing proofs using the vertex formulation

In computing the proof of Theorem 1, we have set the order of variables as

x8x7x12x11x10x9x6x13x5x4x3x2x1,x_{8}\succ x_{7}\succ x_{12}\succ x_{11}\succ x_{10}\succ x_{9}\succ x_{6}\succ x_{13}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}, (14)

and, for h1,,h13h_{1},\dots,h_{13} in eq.˜1, computed an ascending chain as

f1,1(x1),f1,2(x1,x2),,f1,13(x8,x7,x12,,x2,x1).f_{1,1}(x_{1}),f_{1,2}(x_{1},x_{2}),\dots,f_{1,13}(x_{8},x_{7},x_{12},\dots,x_{2},x_{1}).

Then, for gg in eq.˜2, we have computed prem(g,f1,1,,f1,13)=0\mathrm{prem}(g,f_{1,1},\dots,f_{1,13})=0.

In computing the proof of Corollary 2, we have set the order of variables as x7x6x5x4x3x2x1x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}, and, for h1,,h7h_{1},\dots,h_{7} in eq.˜3, computed an ascending chain as

f2,1(x1),f2,2(x1,x2),,f2,7(x7,,x2,x1).f_{2,1}(x_{1}),f_{2,2}(x_{1},x_{2}),\dots,f_{2,7}(x_{7},\dots,x_{2},x_{1}).

Then, for gg in eq.˜4, we have computed prem(g,f2,1,,f2,7)=0\mathrm{prem}(g,f_{2,1},\dots,f_{2,7})=0.

4.2.2 Computing proofs using the incenter formulation

In computing the proof of Theorem 1, we have set the order of variables as x13x12x11x10x9x8x7x6x5x4x3x2x1x_{13}\succ x_{12}\succ x_{11}\succ x_{10}\succ x_{9}\succ x_{8}\succ x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}, and, for h1,,h13h_{1},\dots,h_{13} in eq.˜5, computed an ascending chain as

f3,1(x1),f3,2(x1,x2),,f3,13(x13,x12,,x2,x1).f_{3,1}(x_{1}),f_{3,2}(x_{1},x_{2}),\dots,f_{3,13}(x_{13},x_{12},\dots,x_{2},x_{1}).

Then, for gg in eq.˜6, we have computed prem(g,f3,1,,f3,13)=0\mathrm{prem}(g,f_{3,1},\dots,f_{3,13})=0.

In computing the proof of Corollary 2, we have set the order of variables as x8x7x6x5x4x3x2x1x_{8}\succ x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}, and, for h1,,h8h_{1},\dots,h_{8} in eq.˜7, computed an ascending chain as

f4,1(x1),f4,2(x1,x2),,f4,8(x8,,x2,x1).f_{4,1}(x_{1}),f_{4,2}(x_{1},x_{2}),\dots,f_{4,8}(x_{8},\dots,x_{2},x_{1}).

Then, for gg in eq.˜8, we have computed prem(g,f4,1,,f4,8)=0\mathrm{prem}(g,f_{4,1},\dots,f_{4,8})=0.

4.2.3 Computing time and the variable ordering

Table˜2 shows the computing time of computing this section’s proofs. For each formulation, the table shows the total computing time for calculating the ascending chain and for computing repeated pseudo-divisons of the conclusion polynomial as in eq.˜11.

In each computation of the proof, the order of variables has been defined as follows. For the proof of Theorem 1 using vertex formulation, the conclusion polynomial gg in eq.˜2 has variables x7,x8,x11,x12x_{7},x_{8},x_{11},x_{12}. Furthermore, in the hypothesis polynomials in eq.˜1, there are polynomials with x7x_{7} and x8x_{8} of degree 1, respectively. Thus, we have aimed to eliminate x7x_{7} and x8x_{8} from gg first, then x11x_{11} and x12x_{12} from pseudoremainders. After that, since there exist hypothesis polynomials in eq.˜1 which have terms in x11x_{11}, x10x_{10}, x9x_{9}, x6x_{6} and x13x_{13} of degree 1, we have aimed to eliminate these variables. As a result, we have defined the order of variables as in eq.˜14. In the other cases, since the computing time was sufficiently short, we have defined the order of variables as x13x12x1x_{13}\succ x_{12}\succ\cdots\succ x_{1} for the proof of Theorem 1 and x8x7x1x_{8}\succ x_{7}\succ\cdots\succ x_{1} for the proof of Corollary 2.

Table 2: Computing time of the proofs with Wu’s method. Note that computing time with the letter {\dagger}, and {\dagger}{\dagger} denote the average of repeatedly measured data for 10 and 1000 times, respectively. See Section˜4.2.3 for details.
Formulation Theorem Computing time (sec.)
Vertex formulation Theorem 1 2.9522.952^{{\dagger}}
Corollary 2 0.0053550.005355^{{\dagger}{\dagger}}
Incenter formulation Theorem 1 0.001240.00124^{{\dagger}{\dagger}}
Corollary 2 0.011870.01187^{{\dagger}{\dagger}}

5 Computation on the Feuerbach hyperbola

For a given triangle, the Feuerbach hyperbola is a rectangular hyperbola centered at the point of contact of the nine-point circle and the incircle and passing the triangle’s vertices. Furthermore, it is known that for changing the value of kk in Theorem 1, the Kariya point is located on the Feuerbach hyperbola [6]. This section, shows this property with the Gröbner basis computation and Wu’s method using vertex and incenter formulations.

For aa\in\mathbb{R}, a rectangular hyperbola whose focus is located at (±2a,0)(\pm\sqrt{2}\,a,0) is expressed as

x2y2=a2.x^{2}-y^{2}=a^{2}. (15)

By translating the center to (px,py)(p_{x},p_{y}) and rotating θ\theta counterclockwise, where px,py,θp_{x},p_{y},\theta\in\mathbb{R}, the hypothesis in eq.˜15 becomes as

(cx+sypx)2(sx+cypy)2=a2,(cx+sy-p_{x})^{2}-(-sx+cy-p_{y})^{2}=a^{2}, (16)

where c=cosθc=\cos\theta, s=sinθs=\sin\theta.

The proofs are computed as follows. From eq.˜16, let g=(cx+sypx)2(sx+cypy)2a2g^{\prime}=(cx+sy-p_{x})^{2}-(-sx+cy-p_{y})^{2}-a^{2} with xx and yy are replaced with appropriate variables. After computing the Gröbner basis or the ascending set from the hypothesis polynomials in Theorem 1 or Corollary 2, add the constraint c2+s21c^{2}+s^{2}-1 to the Gröbner basis or the ascending set. If the result of the reduction of gg^{\prime} by the set of polynomials is equal to 0, we see that the Kariya point is located on the Feuerbach hyperbola.

5.1 Computing the proof with Gröbner basis computation

Gröbner basis computation has been used for computing the proofs as follows.

In computing the proof with the vertex formulation, for the Gröbner basis G1G_{1} computed in Section˜4.1.1, let G¯1={c2+s21}G1\bar{G}_{1}=\{c^{2}+s^{2}-1\}\cup G_{1}. Using eq.˜16, let

g=(cx11+sx12px)2(sx11+cx12py)2a2G(x11,x12) is located on the hyperbola,g^{\prime}=(cx_{11}+sx_{12}-p_{x})^{2}-(-sx_{11}+cx_{12}-p_{y})^{2}-a^{2}\Longleftrightarrow\,\text{$G(x_{11},x_{12})$ is located on the hyperbola},

and we have computed that the normal form of gg^{\prime} with respect to G¯1\bar{G}_{1} is equal to 0 to show gG¯1g^{\prime}\in\langle{\bar{G}_{1}}\rangle.

In computing the proof with the incenter formulation, for the Gröbner basis G3G_{3} computed in Section˜4.1.2, let G¯3={c2+s21}G3\bar{G}_{3}=\{c^{2}+s^{2}-1\}\cup G_{3}. Using eq.˜16, let

g=(cx13+sx12px)2(sx13+cx12py)2a2G(x13,x12) is located on the hyperbola,g^{\prime}=(cx_{13}+sx_{12}-p_{x})^{2}-(-sx_{13}+cx_{12}-p_{y})^{2}-a^{2}\Longleftrightarrow\,\text{$G(x_{13},x_{12})$ is located on the hyperbola},

and we have computed that the normal form of gg^{\prime} with respect to G¯3\bar{G}_{3} is equal to 0 to show gG¯3g^{\prime}\in\langle{\bar{G}_{3}}\rangle.

5.2 Computing the proof with Wu’s method

Wu’s method has been used for computing the proofs as follows.

In computing the proof with the vertex formulation, for the hypothesis polynomials h1,,h9,h_{1},\dots,h_{9}, h12,h13h_{12},h_{13} in eq.˜1, we have computed an ascending chain

f5,1(x5),f5,2(x5,x13),,f5,11(x5,,x8,x10),f_{5,1}(x_{5}),f_{5,2}(x_{5},x_{13}),\dots,f_{5,11}(x_{5},\dots,x_{8},x_{10}),

with respect to the order of variables given as

x10x8x7x6x11x12x1x2x3x4x13x5.x_{10}\succ x_{8}\succ x_{7}\succ x_{6}\succ x_{11}\succ x_{12}\succ x_{1}\succ x_{2}\succ x_{3}\succ x_{4}\succ x_{13}\succ x_{5}. (17)

Then, let

g=(cx11+sx12px)2(sx11+cx12py)2a2G(x11,x12) is located on the hyperbola,g^{\prime}=(cx_{11}+sx_{12}-p_{x})^{2}-(-sx_{11}+cx_{12}-p_{y})^{2}-a^{2}\Longleftrightarrow\,\text{$G(x_{11},x_{12})$ is located on the hyperbola},

and we have computed prem(g,f5,1,,f5,11)=0\mathrm{prem}(g,f_{5,1},\dots,f_{5,11})=0.

In computing the proof with the incenter formulation, for a hyperbola in eq.˜15, translate the center to (px,py)(p_{x},p_{y}) and rotate θ\theta counterclockwise, and let c=cos2θc=\cos 2\theta, s=sin2θs=\sin 2\theta. Let the set of hypothesis polynomials consists of h1,,h9,h12,gh_{1},\dots,h_{9},h_{12},g in eq.˜5, and

h14=c2+s21(a constraint on sin2θ and cos2θ),h15=c2pxc2pysB(0,0) and C(1,0) are located on the hyperbola,h16=x22c2x2pxc+2x1x2s2x2pys2pxx1sx12c+2x1pycB(0,0) and A(x2,x1) are located on the hyperbola,h17=u12c2u1pxc+2u1u2s2u1pys2pxu2su22c+2u2pycB(0,0) and O(u1,u2) are located on the hyperbola.\begin{split}h_{14}&=c^{2}+s^{2}-1\quad\text{(a constraint on $\sin 2\theta$ and $\cos 2\theta$),}\\ h_{15}&=c-2p_{x}c-2p_{y}s\,\Longleftrightarrow\,\text{$B(0,0)$ and $C(1,0)$ are located on the hyperbola,}\\ h_{16}&=x_{2}^{2}c-2x_{2}p_{x}c+2x_{1}x_{2}s-2x_{2}p_{y}s-2p_{x}x_{1}s-x_{1}^{2}c+2x_{1}p_{y}c\\ &\quad\Longleftrightarrow\,\text{$B(0,0)$ and $A(x_{2},x_{1})$ are located on the hyperbola,}\\ h_{17}&=u_{1}^{2}c-2u_{1}p_{x}c+2u_{1}u_{2}s-2u_{1}p_{y}s-2p_{x}u_{2}s-u_{2}^{2}c+2u_{2}p_{y}c\\ &\quad\Longleftrightarrow\,\text{$B(0,0)$ and $O(u1,u2)$ are located on the hyperbola.}\end{split} (18)

For the set of the hypothesis polynomials, we have computed an ascending chain

f6,1(x1),f6,2(x1,x2),,f6,15(x1,,py,px),f_{6,1}(x_{1}),f_{6,2}(x_{1},x_{2}),\dots,f_{6,15}(x_{1},\dots,p_{y},p_{x}),

with respect to the order of variables as

pxpycsx13x12x9x8x7x6x5x4x3x2x1.p_{x}\succ p_{y}\succ c\succ s\succ x_{13}\succ x_{12}\succ x_{9}\succ x_{8}\succ x_{7}\succ x_{6}\succ x_{5}\succ x_{4}\succ x_{3}\succ x_{2}\succ x_{1}. (19)

Then, let

g=x132c2x13pxc+2x13x12s2x13pys2pxx12sx122c+2x12pycB(0,0) and G(x13,x12) are located on the hyperbola,\begin{split}g^{\prime}&=x_{13}^{2}c-2x_{13}p_{x}c+2x_{13}x_{12}s-2x_{13}p_{y}s-2p_{x}x_{12}s-x_{12}^{2}c+2x_{12}p_{y}c\\ &\quad\Longleftrightarrow\,\text{$B(0,0)$ and $G(x_{13},x_{12})$ are located on the hyperbola},\end{split}

and we have computed prem(g2,f6,1,,f6,15)=0\mathrm{prem}(g_{2},f_{6,1},\dots,f_{6,15})=0.

Note that the derivation of h15,h16,h17,g2h_{15},h_{16},h_{17},g_{2} will be explained in the Appendix.

5.3 Computing time and the variable ordering

Table˜3 shows the computing time of the proofs in this section, with Gröbner basis computation and Wu’s method, using the vertex and the incenter formulations.

The Ordering of variables is defined as follows. In the Gröbner basis computation, the order of variables used for the proof of Theorem 1 are used (see Section˜4.1.3).

In Wu’s method with the vertex formulation, the order of variables are given as in eq.˜17 by the following reason. When we compute the ascending chain, we first eliminate x10x_{10} because the number of terms in x10x_{10} which appear in h1,,h9,h12,h13h_{1},\dots,h_{9},h_{12},h_{13} is the smallest among the variables which appear in h1,,h9,h12,h13h_{1},\dots,h_{9},h_{12},h_{13}. Next, we eliminate x8x_{8} because the number of terms in x8x_{8} which appear in the input polynomials is the smallest among the variables which appear in the input polynomials. By repeating the procedure, we eliminate the variable in which the number of terms appearing in the polynomials is the smallest in each step in computing the ascending chain.

In Wu’s method with the incenter formulation, the order of variables is given as in eq.˜19 by the following reason. In computing the ascending chain, we first eliminate newly added variables pxp_{x}, pyp_{y}, cc, and ss in this order, then eliminate the rest of the variables with the same ordering as the computation for the proof of Theorem 1 (see Section˜4.2.2).

Table 3: Computing time of the proofs on Feuerbach hyperbola. Note that computing time with the letter {\dagger}, {\ddagger} and {\dagger}{\dagger} denote the average of repeatedly measured data for 10, 100, and 1000 times, respectively. See Section˜5.3 for details.
Computing method Formulation Computing time (sec.)
Gröbner basis Vertex formulation 0.011470.01147^{{\dagger}{\dagger}}
Incenter formulation 0.0022550.002255^{{\dagger}{\dagger}}
Wu’s method Vertex formulation 0.16390.1639^{{\ddagger}}
Incenter formulation 2.1622.162^{{\dagger}}

6 Concluding remarks

In this paper, we have demonstrated computational proofs of Kariya’s theorem and its corollary with the Gröbner basis computation and Wu’s method using the vertex and the incenter formulations. Furthermore, we have demonstrated computational proofs of the property that the Kariya point is similarly located on the Feuerbach hyperbola.

Computing time (see Tables˜1 and 2) suggests that the incenter formulation is more suitable for efficient computation for the proof of Theorem 1. For the proof of Corollary 2, while using the incenter formulation made computation more efficient with Gröbner basis computation, using the vertex formulation made computation more efficient with Wu’s method, thus formulation used for better efficiency was different depending on the methods.

Future research topics on computer-assisted proof of Kariya’s theorem with computer algebra include the following.

  1. 1.

    In Gröbner basis computation and Wu’s method for the proofs, other variable orderings than those used in the present paper may speed up the computation.

  2. 2.

    Setting the incenter to the origin may speed up the computation using the incenter formulation.

  3. 3.

    While Kariya’s theorem uses the incenter, the theorem may hold for the excenter(s).

  4. 4.

    With Gröbner basis computation, the formula of Feuerbach hyperbola may be derived from the hypothesis polynomials.

  5. 5.

    Although the cartesian coordinate system was used in this paper, other coordinate systems may speed up the computation. (Note that Coanda̧ et al. [3] use the barycentric coordinate system for deriving Kariya’s theorem from their theorem in a more general form.)

Acknowledgements

The research in this paper has been initiated as an undergraduate research project in the College of Mathematics, School of Science and Engineering, University of Tsukuba. The authors thank Nanako Ishii and Gaku Kuriyama for collaborating with the authors during the project.

References

  • [1] A. Bostan. 11154: Triangle Center X(79), Problems and Solutions. American Mathematics Monthly, 119(8):703–704, October 2012.
  • [2] S.-C. Chou. Mechanical Geometry Theorem Proving. D. Reidel, 1988.
  • [3] C. Coanda, F. Smarandache, and I. Patraşcu. A generalization of certain remarkable points of the triangle geometry. In I. Patraşcu and F. Smarandache, editors, Variance of Topics of Plane Geometry, pages 69–73. Education Publishing, 2013.
  • [4] D. A. Cox, J. Little, and D. O’Shea. Ideals, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative Algebra. Springer, 4th edition, 2015.
  • [5] J. Kariya. Un theoreme sur le triangle. L’Enseign. Math., 6:130–132, 1904.
  • [6] S. N. Kiss and P. Yiu. The Touchpoints Triangles and the Feuerbach Hyperbolas. Forum Geometricorum, 14:63–86, 2014.
  • [7] M. Noro. A computer algebra system: Risa/Asir. In M. Joswig and N. Takayama, editors, Algebra, Geometry and Software Systems, pages 147–162. Springer, 2003.
  • [8] K. Shiraishi, H. Kai, and M.-T. Noda. Application of stabilized Wu’s method to robot control. In Computer algebra—algorithms, implementations and applications, volume 1295 of RIMS Kôkyûroku, pages 203–208. Kyoto Univ., 2002. (Japanese).
  • [9] A. Terui, A. Ito, and T. Kasai. teamsnactsukuba/kariya: Pre-submission release (v1.0) [computer software]. Zenodo, 2023. https://doi.org/10.5281/zenodo.7828998
  • [10] P. Yiu. The Kariya Problem and Related Constructions. Forum Geometricorum, 15:191–201, 2015.

Appendix: Derivation of h15,h16,h17h_{15},h_{16},h_{17} and g2g_{2} in Equation˜18

In the appendix, we show derivation of h15,h16,h17h_{15},h_{16},h_{17} and g2g_{2} in eq.˜18.

Before deriving the formulas, we show a transform on the hyperbola in eq.˜15. By rotating the hyperbola in eq.˜15 θ\theta counterclockwise, we have

(xcosθ+ysinθ)2(xsinθ+ycosθ)2=a2.(x\cos\theta+y\sin\theta)^{2}-(-x\sin\theta+y\cos\theta)^{2}=a^{2}.

Expanding the left-hand side and collecting the terms with respect to xx and yy is expressed as

x2(cos2θsin2θ)2xy(2cosθsinθ)y2(cos2θsin2θ)=a2.x^{2}(\cos^{2}\theta-\sin^{2}\theta)-2xy(2\cos\theta\sin\theta)-y^{2}(\cos^{2}\theta-\sin^{2}\theta)=a^{2}.

By applying the double angle formula, we have

x2c2xysy2c=a2,x^{2}c-2xys-y^{2}c=a^{2},

where c=cos2θc=\cos 2\theta, s=sin2θs=\sin 2\theta. By translating the origin to (px,py)(p_{x},p_{y}), we have

(xpx)2c2(xpx)(ypy)s(ypy)2c=a2.(x-p_{x})^{2}c-2(x-p_{x})(y-p_{y})s-(y-p_{y})^{2}c=a^{2}. (20)

Now, h15h_{15} is derived as follows. In Figure˜3, since the hyperbola in eq.˜20 passes through B(0,0)B(0,0), we have

px2c2pxpyspy2c=a2.p_{x}^{2}c-2p_{x}p_{y}s-p_{y}^{2}c=a^{2}. (21)

Furthermore, since the same hyperbola passes through C(1,0)C(1,0), we have

(1px)2c2(1px)pyspy2c=a2.(1-p_{x})^{2}c-2(1-p_{x})p_{y}s-p_{y}^{2}c=a^{2}. (22)

By equating the left-hand-sides of eqs.˜21 and 22, h15h_{15} is derived.

Next, h16h_{16} is derived as follows. In Figure˜3, since the hyperbola in eq.˜20 passes through A(x2,x1)A(x_{2},x_{1}), we have

(x2px)2c2(x2px)(x1py)s(x1py)2c=a2.(x_{2}-p_{x})^{2}c-2(x_{2}-p_{x})(x_{1}-p_{y})s-(x_{1}-p_{y})^{2}c=a^{2}. (23)

By equating the left-hand-sides of eqs.˜21 and 23, h16h_{16} is derived.

Next, h17h_{17} is derived as follows. In Figure˜3, since the hyperbola in eq.˜20 passes through O(u1,u2)O(u_{1},u_{2}), we have

(u1px)2c2(u1px)(u2py)s(u2py)2c=a2.(u_{1}-p_{x})^{2}c-2(u_{1}-p_{x})(u_{2}-p_{y})s-(u_{2}-p_{y})^{2}c=a^{2}. (24)

By equating the left-hand-sides of eqs.˜21 and 24, h17h_{17} is derived.

Finally, g2g_{2} is derived as follows. In Figure˜3, since the hyperbola in eq.˜20 passes through G(x13,x12)G(x_{13},x_{12}), we have

(x13px)2c2(x13px)(x12py)s(x12py)2c=a2.(x_{13}-p_{x})^{2}c-2(x_{13}-p_{x})(x_{12}-p_{y})s-(x_{12}-p_{y})^{2}c=a^{2}. (25)

By equating the left-hand-sides of eqs.˜21 and 25, g2g_{2} is derived.