Computations about formal multiple zeta spaces defined by binary extended double shuffle relations

Tomoya Machide National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, Japan

Abstract

The formal multiple zeta space we consider with a computer is an $\mathbb{F}_{2}$ -vector space generated by $2^{k-2}$ formal symbols for a given weight $k$ , where the symbols satisfy binary extended double shuffle relations. Up to weight $k=22$ , we compute the dimensions of the formal multiple zeta spaces, and verify the dimension conjecture on original extended double shuffle relations of real multiple zeta values. Our computations adopt Gaussian forward elimination and give information for spaces filtered by depth. We can observe that the dimensions of the depth-graded formal multiple zeta spaces have a Pascal triangle pattern expected by the Hoffman mult-indices.

⁰⁰0e-mail : machide@nii.ac.jp⁰⁰0MSC-class: 11M32 (Primary); 15A03,68W30 (Secondary)⁰⁰0Key words: multiple zeta value, graded vector space, dimension calculation

1 Introduction

The space generated by multiple zeta values (MZVs for short) has been elucidated theoretically and numerically in recent years, but its structure remains mysterious. In this paper, we shed light on a formal space generated by binary analogs of MZVs by computer experiments for unraveling both of the original and formal spaces.

Let $\mathbb{N}$ denote the set of positive integers. The MZV is a real number that belongs to an image of a function (customarily denoted by $\zeta$ ) whose domain is

\displaystyle{\bf I}

\displaystyle=

\displaystyle{\textstyle\bigcup\limits_{r\geq 0}}\{{\bf k}_{r}=(k_{1},k_{2},\ldots,k_{r})\in\mathbb{N}^{r}{\,|\,}k_{1}\geq 2\},

(1.1)

where ${\bf k}_{0}=\varnothing$ is the empty mult-index and $\zeta(\varnothing)=1$ . We call ${\mathrm{w}}({\bf k}_{r})=k_{1}+\cdots+k_{r}$ and ${\mathrm{d}}({\bf k}_{r})=r$ the weight and depth, respectively. The function $\zeta$ has two definitions by the iterated integral and nested summation, which endow the $\mathbb{Q}$ -vector space $\mathcal{Z}$ spanned by MZVs with abundant linear relations. Euler [13], who solved the Basel problem $\zeta(2)=\pi^{2}/6$ and advanced the case $r=1$ , also studied the case $r=2$ .

Zagier [34] conjectured¹¹1 Zagier noted the conjectures were made after many discussions with Drinfel’d, Kontsevich and Goncharov. that $\mathcal{Z}$ is graded by weight and the dimensions of graded pieces are expressed in terms of a Fibonacci-like sequence. Let ${\bf I}_{k}$ be the subset consisting of mult-indices of weight $k$ , and let $\mathcal{Z}_{k}$ be the subspace spanned by MZVs in $\zeta({\bf I}_{k})=\{\zeta({\bf k}){\,|\,}{\bf k}\in{\bf I}_{k}\}$ . The dimension conjecture is

\displaystyle\dim_{\mathbb{Q}}\mathcal{Z}_{k}

\displaystyle\overset{?}{=}

\displaystyle d_{k},

(1.2)

where $d_{k}=d_{k-2}+d_{k-3}$ $(k\geq 3)$ , $d_{0}=d_{2}=1$ and $d_{1}=0$ . These integers fit together into the generating series

\displaystyle\sum_{k\geq 0}d_{k}X^{k}

\displaystyle=

\displaystyle\frac{1}{1-(X^{2}+X^{3})}.

(1.3)

The ultimate upper bound theorem (i.e., $\dim_{\mathbb{Q}}\mathcal{Z}_{k}\leq d_{k}$ ) was established independently by Goncharov [10, 16] and Terasoma [32]. Brown [8] furthermore proved that $\mathcal{Z}_{k}$ is generated by MZVs in $\zeta({\bf I}^{H}_{k})$ , where ${\bf I}^{H}_{k}$ is the set of Hoffman mult-indices of weight $k$ :

\displaystyle{\bf I}^{H}_{k}

\displaystyle=

\displaystyle\{{\bf k}=(k_{1},\ldots,k_{r})\in{\bf I}_{k}{\,|\,}k_{i}\in\{2,3\}\}.

(1.4)

Hoffman [18] conjectured $\zeta({\bf I}^{H}_{k})$ is a basis of $\mathcal{Z}_{k}$ , which would imply the dimension conjecture because the same recurrence relation $|{\bf I}^{H}_{k}|=|{\bf I}^{H}_{k-2}|+|{\bf I}^{H}_{k-3}|$ holds by a simple count of the number of $2$ ’s and $3$ ’s. Umezawa [33] also suggested a basis conjecture in terms of iterated log-sine integrals, in which sets of mult-indices different from ${\bf I}^{H}_{k}$ are used. Because of the difficulty to show the independence between MZVs, no non-trivial lower bounds are known.

By the upper bound theorem, it is natural to ask that what sorts of relations are needed to reduce the number of generators of $\mathcal{Z}_{k}$ to $d_{k}$ . There are several conjectural candidates: e.g., [11, 14, 17, 22, 23]. In particular, the extended double shuffle (EDS) relations [19, 29] known from early on are often selected for experimentally attacking this question, because they are easier to write down and included in the other candidates except Kawasima’s [23]. Minh and Petitot [28] verified that the class of EDS relations is a right candidate up to weight $10$ , Bigotte et al. [5] verified it up to weight $12$ , Minh et al. [27] verified it up to weight $16$ ,²²2 This experimental result was announced in their private communication (see [21, Section 1]). Espie et al. [12] verified it up to weight $19$ , and Kaneko et al. [21] verified it up to weight $20$ that seems to be the latest record. The first two experiments are by the Gröbner basis method, and the last three ones are by the vector space (or matrix) method. The fourth one of [12] was executed under modulo rational multiples of powers of $\zeta(2)$ , or module $\mathbb{Q}[\zeta(2)]$ .

The first purpose of this paper is to improve the record to weight $k=22$ . For this, we consider an $\mathbb{F}_{2}$ -vector space $\mathcal{Z}_{k}^{\mathfrak{b}}$ instead of the $\mathbb{Q}$ -vector space $\mathcal{Z}_{k}$ : roughly speaking, $\mathcal{Z}_{k}^{\mathfrak{b}}$ is generated by binary multiple zeta symbols $\zeta^{\mathfrak{b}}({\bf k})$ $({\bf k}\in{\bf I}_{k})$ (binary MZSs for short), where $\zeta^{\mathfrak{b}}({\bf k})$ satisfy binary EDS relations that are obtained from original EDS relations after the modulo $2$ arithmetic to integer coefficients. (Exact definitions of the binary analogs in this section will be stated in the next section.) We will verify $\zeta^{\mathfrak{b}}({\bf I}^{H}_{k})$ is a basis of $\mathcal{Z}_{k}^{\mathfrak{b}}$ and $\dim_{\mathbb{F}_{2}}\mathcal{Z}_{k}^{\mathfrak{b}}=d_{k}$ . Our calculation results break the record because $\dim_{\mathbb{Q}}\mathcal{Z}_{k}\leq\dim_{\mathbb{F}_{2}}\mathcal{Z}_{k}^{\mathfrak{b}}$ (as will be mentioned in Section 3). The space $\mathcal{Z}_{k}^{\mathfrak{b}}$ reduces the computation cost since $\mathbb{F}_{2}$ is the binary and simplest finite field. The field $\mathbb{F}_{2}$ makes it easy to apply useful techniques in computer since $\mathbb{F}_{2}$ is compatible with the Boolean datatype: in fact, we will employ a conflict based algorithm discussed in [24] for a fast Gaussian forward elimination.

The second and main purpose is to observe a Pascal triangle pattern in $\mathcal{Z}_{k}^{\mathfrak{b}}$ from the viewpoint of a direct sum decomposition,

\displaystyle\mathcal{Z}_{k}^{\mathfrak{b}}

\displaystyle\cong

\displaystyle\overline{\mathcal{Z}}_{k,k-1}^{\mathfrak{b}}\operatorname*{\bigoplus}\cdots\operatorname*{\bigoplus}\overline{\mathcal{Z}}_{k,0}^{\mathfrak{b}},

(1.5)

where $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ are quotient spaces defined by means of depth filtration: the descending chain $\mathcal{Z}_{k,k-1}^{\mathfrak{b}}\supset\cdots\supset\mathcal{Z}_{k,0}^{\mathfrak{b}}$ is used for $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}=\mathcal{Z}_{k,r}^{\mathfrak{b}}/\mathcal{Z}_{k,r-1}^{\mathfrak{b}}$ , where $\mathcal{Z}_{k,r}^{\mathfrak{b}}$ are the subspaces spanned by binary MZSs of weight $k$ and depth at most $r$ . We define ${\bf I}_{k,r}=\{{\bf k}\in{\bf I}_{k}{\,|\,}{\mathrm{d}}({\bf k})=r\}$ and

\displaystyle{\bf I}^{H}_{k,r}

\displaystyle=

\displaystyle{\bf I}^{H}_{k}\cap{\bf I}_{k,r},

(1.6)

with $d^{\mathfrak{b}}_{k,r}=|{\bf I}^{H}_{k,r}|$ . We denote by $\overline{\zeta}^{\mathfrak{b}}({\bf k})$ the canonical image³³3 We use the same notation $\overline{\zeta}^{\mathfrak{b}}$ for all canonical images in the quotient spaces $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ $(k>r\geq 0)$ . There should be no confusion because the quotient space under consideration is clear from context. of $\zeta^{\mathfrak{b}}({\bf k})$ in $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ for any ${\bf k}\in{\bf I}_{k,r}$ . Up to weight $k=22$ , we will verify $\overline{\zeta}^{\mathfrak{b}}({\bf I}^{H}_{k,r})$ is a basis of $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ and $\dim_{\mathbb{F}_{2}}\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}=d^{\mathfrak{b}}_{k,r}$ . Counting the number of $2$ ’s and $3$ ’s implies that the double sequence $(d^{\mathfrak{b}}_{k,r})$ satisfies a recurrence relation with a Pascal triangle pattern: $d^{\mathfrak{b}}_{k,r}=d^{\mathfrak{b}}_{k-2,r-1}+d^{\mathfrak{b}}_{k-3,r-1}$ $(k\geq 3,r\geq 1)$ , $d^{\mathfrak{b}}_{0,0}=d^{\mathfrak{b}}_{2,1}=1$ and $d^{\mathfrak{b}}_{k,r}=0$ for other $k$ and $r$ , or equivalently,

\displaystyle\sum_{k,r\geq 0}d^{\mathfrak{b}}_{k,r}X^{k}Y^{r}

\displaystyle=

\displaystyle\frac{1}{1-(X^{2}+X^{3})Y}.

(1.7)

More precisely, $d^{\mathfrak{b}}_{k,r}=\binom{r}{k-2r}$ since the integers $P_{r,k}=d^{\mathfrak{b}}_{k+2r,r}$ satisfy the same recurrence relation as the binomial coefficients $\binom{r}{k}$ . As expected from (1.5), the formula (1.7) specializes to (1.3) upon $Y=1$ .

We also try experiments on parts of EDS relations, ‘ $\mathrm{KNT}$ ’ and ‘ $\mathrm{MJPO}$ ’ relations, which are expected to be alternatives to EDS and actually employed in [21, 27] for verification, respectively. Unlike the case in $\mathcal{Z}_{k}$ , those relations do not suffice to give all relations in $\mathcal{Z}_{k}^{\mathfrak{b}}$ , but we can find a quasi Fibonacci-like rule in dimensions of spaces defined by $\mathrm{MJPO}$ relations.

The idea of the depth filtration in (1.5) was conceived by Broadhurst and Kreimer [7] to propose a refinement of the dimension conjecture. Their conjecture indicates two interesting facts in the $\mathbb{Q}$ -vector spaces of MZVs graded by both weight and depth: (i) modular forms influence the structure through quotient spaces $\overline{\mathcal{Z}}_{k,r}$ defined by the $\mathbb{Q}$ -version of (1.5); and (ii) the Hoffman values $\zeta({\bf k})$ $({\bf k}\in{\bf I}^{H}_{k})$ are irrelevant to the structure in the sense that most of the values vanish in the graded pieces of same depth. In terms of the generating series, the conjecture is

\displaystyle\sum_{k,r\geq 0}\dim\overline{\mathcal{Z}}_{k,r}X^{k}Y^{r}

\displaystyle\overset{?}{=}

\displaystyle\frac{1+E(X)Y}{1-O(X)Y+S(X)Y^{2}(1-Y^{2})},

(1.8)

where $E(X)=X^{2}/(1-X^{2})$ , $O(X)=X^{3}/(1-X^{2})$ and $S(X)=X^{12}/(1-X^{4})(1-X^{6})$ , and $S(X)$ is the generating series of the dimensions of the vector spaces of cusp forms on the full modular group. Specific examples for $r=2$ are given in [15] and a modern formulation is discussed in [9] (see also [31]). However our computational results suggest the following when we adopt $\mathbb{F}_{2}$ as the scalar field instead of $\mathbb{Q}$ : (i) the influence of modular forms disappears; but (ii) the Hoffman symbols $\zeta^{\mathfrak{b}}({\bf k})$ $({\bf k}\in{\bf I}^{H}_{k})$ remain as basis elements with a Pascal triangle pattern.

It should be noted that the Broadhurst-Kreimer conjecture has two equivalent formulations of vector and algebra (see [19, Appendix]). The equivalence requires $\mathbb{Q}[\zeta(2)]$ is isomorphic to the polynomial ring in one variable over $\mathbb{Q}$ . The isomorphy does not hold when $\mathbb{F}_{2}$ is the scalar field as will be mentioned in the final section, and we will consider only the vector formulation in this paper.

It should also be noted that Blümlein et al. [6] provided a data mine for not only MZVs but also Euler sums by experiments to Broadhurst-Kreimer type conjectures, in which it was verified that the union of EDS and duality relations suffices to reduce the number of generators of $\mathcal{Z}_{k}$ to $d_{k}$ up to weight $22$ : it was also verified up to $24$ by using modular arithmetic, and up to $26$ and more with an additional conjecture and limited depths. The duality relations, which are obtained by the integral definition of MZVs and a change of variables, are very useful to compute because they can bring down the size of relations by about half. It has not been proved yet that the EDS relations include the duality relations, although the inclusion is expected to be true conjecturally: in other words, we have not succeeded in understanding the duality of MZVs algebraically. The experimental approaches of [6] and ours differ in the use of the duality relations.

The organization of this paper is as follows. In Section 2, we state exact definitions of the binary MZVs $\zeta^{\mathfrak{b}}({\bf k})$ , the formal multiple zeta spaces $\mathcal{Z}_{k}^{\mathfrak{b}}$ and the quotient spaces $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ . We report our computational results in Section 3, and explain how our computer programs produce the results in Section 4. The programs are available at the open-source site GitHub.⁴⁴4https://github.com/machide-tomoyan/BMZS-calculator Section 5 is devoted to problems about formal multiple zeta spaces which arise from the computational results. In Appendix, we describe an essential algorithm in our experiments, which employs a conflict based search and speeds up the Gaussian forward elimination under certain conditions.

The computer only assists us in showing Proposition 4.1 by Gaussian elimination. The dimension conjecture (1.2) is true if we can theoretically show Proposition 4.1 for all weights $k$ .

2 Formal multiple zeta space over $\mathbb{F}_{2}$

The formal multiple zeta space $\mathcal{Z}_{k}^{\mathfrak{b}}$ of weight $k$ is briefly defined by

\displaystyle\mathcal{Z}_{k}^{\mathfrak{b}}

\displaystyle=

\displaystyle\frac{\langle\eta^{\mathfrak{b}}({\bf k}){\,|\,}{\bf k}\in{\bf I}_{k}\rangle_{\mathbb{F}_{2}}}{\text{ \{binary EDS relations\} }},

(2.1)

where $\eta^{\mathfrak{b}}({\bf k})$ are indeterminates. That is, $\mathcal{Z}_{k}^{\mathfrak{b}}$ is an $\mathbb{F}_{2}$ -vector space generated by formal symbols $\zeta^{\mathfrak{b}}({\bf k})\equiv\eta^{\mathfrak{b}}({\bf k})$ that satisfy binary variations of the EDS relations. Eight equivalent statements are given in [19, Theorem 2] for the EDS relations. In this paper, we choice the statement (v) in the theorem because the relations are all $\mathbb{Z}$ -linear and fewer in number.

To define (2.1) exactly, we require the algebraic setup by Hoffman [18] which allows us the steady handling of two products, the shuffle $\mathcyr{sh}$ and stuffle $*$ : the latter is also called harmonic or quasi-shuffle. Let $\mathfrak{H}$ be the polynomial ring $\mathbb{Q}\langle x,y\rangle$ in the two non-commutative variables $x$ and $y$ . We call each variable a letter, and a monomial in the variables a word. The shuffle product $\mathcyr{sh}$ is a $\mathbb{Q}$ -bilinear product on $\mathfrak{H}$ , which satisfies $w=w\;\mathcyr{sh}\;1=1\;\mathcyr{sh}\;w$ and

\displaystyle au\;\mathcyr{sh}\;bv

\displaystyle=

\displaystyle a(u\;\mathcyr{sh}\;bv)+b(au\;\mathcyr{sh}\;v)

(2.2)

for any words $u,v,w\in\mathfrak{H}$ and letters $a,b\in\{x,y\}$ . Let $z_{k}$ denote a word $x^{k-1}y$ for any $k\geq 1$ , and let $\mathfrak{H}^{1}$ be the polynomial ring $\mathbb{Q}\langle z_{1},z_{2},\ldots\rangle$ , or equivalently, the subring $\mathbb{Q}+\mathfrak{H}y$ in $\mathfrak{H}$ . The stuffle product $*$ is a $\mathbb{Q}$ -bilinear product on $\mathfrak{H}^{1}$ , which satisfies $w=w\,*\,1=1\,*\,w$ and

\displaystyle z_{i}u\,*\,z_{j}v

\displaystyle=

\displaystyle z_{i}(u\,*\,z_{j}v)+z_{j}(z_{i}u\,*\,v)+z_{i+j}(u\,*\,v)

(2.3)

for any words $u,v,w\in\mathfrak{H}$ and integers $i,j\geq 1$ . By induction on the lengths of words, both products are commutative and associative, and both $\mathfrak{H}^{1}_{\mathcyr{sh}}=(\mathfrak{H}^{1},\mathcyr{sh})$ and $\mathfrak{H}^{1}_{*}=(\mathfrak{H}^{1},*)$ are commutative $\mathbb{Q}$ -algebras. We notice $\mathfrak{H}_{\mathcyr{sh}}=(\mathfrak{H},\mathcyr{sh})$ is a parent space of $\mathfrak{H}^{1}_{\mathcyr{sh}}$ . Let $\mathfrak{H}^{0}=\mathbb{Q}+x\mathfrak{H}y=\langle z_{{\bf k}}{\,|\,}{\bf k}\in{\bf I}\rangle_{\mathbb{Q}}$ , where $z_{{\bf k}}=z_{k_{1}}\cdots z_{k_{r}}$ and $z_{\varnothing}=1$ . Both $\mathfrak{H}^{0}_{\mathcyr{sh}}=(\mathfrak{H}^{0},\mathcyr{sh})$ and $\mathfrak{H}^{0}_{*}=(\mathfrak{H}^{0},*)$ are subalgebras since $\mathfrak{H}^{0}$ is closed under $\mathcyr{sh}$ and $*$ . The pair $(\mathfrak{H}^{1},\mathfrak{H}^{0})$ of spaces satisfies the polynomial ring property in one variable: the former is freely generated by $y$ over the latter on each of $\mathcyr{sh}$ and $*$ . We thus have

\displaystyle\mathfrak{H}^{1}_{\mathcyr{sh}}\,\simeq\,\mathfrak{H}^{0}_{\mathcyr{sh}}[y],\qquad\mathfrak{H}^{1}_{*}\,\simeq\,\mathfrak{H}^{0}_{*}[y].

(2.4)

See [30] and [18] for proofs of (2.4), respectively.

We introduce the EDS relations stated in [19, Theorem 2(v)]. Let $\mathrm{reg}_{\mathcyr{sh}}$ denote a homomorphism from $\mathfrak{H}^{1}_{\mathcyr{sh}}$ to $\mathfrak{H}^{0}_{\mathcyr{sh}}$ , which is defined by taking the constant term with respect to $y$ in the first isomorphism of (2.4):⁵⁵5 The homomorphism $\mathrm{reg}_{*}$ of stuffle type exists as well, but it is intractable because EDS relations of that type are not always $\mathbb{Z}$ -linear: see [19] (or [2, 20]) for details.

\displaystyle\mathrm{reg}_{\mathcyr{sh}}:\mathfrak{H}^{1}_{\mathcyr{sh}}\,\ni\,w\,=\,\sum_{i=0}^{m}w_{i}\;\mathcyr{sh}\;y^{\mathcyr{sh}i}\quad\mapsto\quad w_{0}\,\in\,\mathfrak{H}^{0}.

(2.5)

Let ${\bf\widehat{I}}_{k}={\bf I}_{k}\cup\{(\underset{k}{\underbrace{1,\ldots,1}})\}$ , and let

\displaystyle\widehat{{\bf PI}}_{k}

\displaystyle=

\displaystyle{\textstyle\bigcup\limits_{i,j\geq 0\atop(i+j=k)}}{\bf\widehat{I}}_{i}\times{\bf I}_{j}.

For any pair $({\bf k},{\bf l})$ of mult-indices in $\widehat{{\bf PI}}_{k}$ , we define

\displaystyle\mathsf{ds}({\bf k},{\bf l})

\displaystyle:=

\displaystyle\mathrm{reg}_{\mathcyr{sh}}(z_{{\bf k}}\,*\,z_{{\bf l}})-\mathrm{reg}_{\mathcyr{sh}}(z_{{\bf k}}\;\mathcyr{sh}\;z_{{\bf l}})\,\in\,\mathfrak{H}^{0}.

(2.6)

The objective EDS relations of weight $k$ are stated as

\displaystyle Z(\mathsf{ds}({\bf k},{\bf l}))

\displaystyle=

\displaystyle 0\qquad(({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}),

(2.7)

where $Z:\mathfrak{H}^{0}\to\mathbb{R}$ is the $\mathbb{Q}$ -linear map (or evaluation map) defined by $Z(z_{{\bf k}})=\zeta({\bf k})$ $({\bf k}\in{\bf I})$ . We have by (2.5)

	$\displaystyle\mathrm{reg}_{\mathcyr{sh}}(w)$	$\displaystyle=$	$\displaystyle w\qquad(w\in\mathfrak{H}^{0}),$
	$\displaystyle\rule{0.0pt}{15.0pt}\mathrm{reg}_{\mathcyr{sh}}(y^{m}\;\mathcyr{sh}\;z_{{\bf m}})$	$\displaystyle=$	$\displaystyle 0\qquad(m>0,{\bf m}\in{\bf I}).$

We can thus divide (2.7) into two parts:

	$\displaystyle Z(z_{{\bf k}}\,*\,z_{{\bf l}})-Z(z_{{\bf k}}\;\mathcyr{sh}\;z_{{\bf l}})$	$\displaystyle=$	$\displaystyle 0\qquad(({\bf k},{\bf l})\in{\bf PI}_{k}),$		(2.8)
	$\displaystyle\rule{0.0pt}{15.0pt}Z(\mathrm{reg}_{\mathcyr{sh}}(y^{m}\,*\,z_{{\bf m}}))$	$\displaystyle=$	$\displaystyle 0\qquad(0<m<k-1,{\bf m}\in{\bf I}_{k-m}),$		(2.9)

where ${\bf PI}_{k}={\textstyle\bigcup_{i,j\geq 0\atop(i+j=k)}}{\bf I}_{i}\times{\bf I}_{j}$ . The relations in (2.8) are called the finite double shuffle (FDS) relations, because MZVs are defined by $\zeta(k_{1},\ldots,k_{r})=\sum_{m_{1}>\cdots>m_{r}>0}1/m_{1}^{k_{1}}\cdots m_{r}^{k_{r}}$ and finite (or convergent) at ${\bf k}\in{\bf I}$ . The FDS relations do not suffice to give all relations of MZVs. For instance, we can not obtain any relation in weight $3$ , in particular, the simplest formula $\zeta(2,1)=\zeta(3)$ . Therefore the relations in (2.9) are essential to the EDS conjecture.

A little more notions are required for (2.1), which are analogs of the notions mentioned above in $\mathbb{Z}$ -module and $\mathbb{F}_{2}$ -vector. Let $\mathfrak{H}^{\mathbb{Z}}$ denote the subring $\mathbb{Z}\langle x,y\rangle$ in $\mathbb{Q}\langle x,y\rangle$ . We set

\displaystyle\mathfrak{H}^{\mathbb{Z},0}\,=\,\langle z_{{\bf k}}{\,|\,}{\bf k}\in{\bf I}\rangle_{\mathbb{Z}},\qquad\mathcal{H}^{\mathfrak{b}}\,=\,\langle\eta^{\mathfrak{b}}({\bf k}){\,|\,}{\bf k}\in{\bf I}\rangle_{\mathbb{F}_{2}},

to define a canonical map from $\mathfrak{H}^{\mathbb{Z},0}$ to $\mathcal{H}^{\mathfrak{b}}$ which is induced by modulo $2$ arithmetic:

\displaystyle\mathrm{can}^{\mathfrak{b}}:\mathfrak{H}^{\mathbb{Z},0}\,\ni\,w\,=\,\sum_{{\bf k}\in{\bf I}}c_{{\bf k}}z_{{\bf k}}\quad\mapsto\quad\sum_{{\bf k}\in{\bf I}}(c_{{\bf k}}\,\mathrm{mod}\ 2)\eta^{\mathfrak{b}}({\bf k})\,\in\,\mathcal{H}^{\mathfrak{b}}.

(2.10)

For any pair $({\bf k},{\bf l})\in{\bf PI}_{k}$ , the elements $z_{{\bf k}}\,*\,z_{{\bf l}}$ and $z_{{\bf k}}\;\mathcyr{sh}\;z_{{\bf l}}$ belong to $\mathfrak{H}^{\mathbb{Z},0}$ , and the element $\mathrm{can}^{\mathfrak{b}}(\mathsf{ds}({\bf k},{\bf l}))$ is well-defined. For $0<m<k-1$ and ${\bf m}\in{\bf I}_{k-m}$ , $y^{m}\,*\,z_{{\bf m}}$ belongs to $\langle y^{n}z_{{\bf n}}{\,|\,}n\geq 0,{\bf n}\in{\bf I}\rangle_{\mathbb{Z}}$ , and $\mathrm{can}^{\mathfrak{b}}(\mathrm{reg}_{\mathcyr{sh}}(y^{m}\,*\,z_{{\bf m}}))$ is well-defined if

\displaystyle\mathrm{reg}_{\mathcyr{sh}}(y^{n}z_{{\bf n}})

\displaystyle\in

\displaystyle\mathfrak{H}^{\mathbb{Z},0}\qquad(n>0,{\bf n}\in{\bf I}),

which holds by [19, Proposition 8] (see (4.7) below). Consequently,

\displaystyle\mathcal{E}^{\mathfrak{b}}_{k}

\displaystyle:=

\displaystyle\langle\mathrm{can}^{\mathfrak{b}}(\mathsf{ds}({\bf k},{\bf l})){\,|\,}({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}\rangle_{\mathbb{F}_{2}}\,\subset\,\mathcal{H}^{\mathfrak{b}}_{k}

is well-defined.

We are in a position to define (2.1).

Definition 2.1.

For a weight $k$ , we define the formal multiple zeta space by

\displaystyle\mathcal{Z}_{k}^{\mathfrak{b}}

\displaystyle:=

\displaystyle\mathcal{H}^{\mathfrak{b}}_{k}/\mathcal{E}^{\mathfrak{b}}_{k}.

(2.11)

For a mult-index ${\bf k}\in{\bf I}_{k}$ , we denote by $\zeta^{\mathfrak{b}}({\bf k})$ the element in $\mathcal{Z}_{k}^{\mathfrak{b}}$ which is congruent to $\eta^{\mathfrak{b}}({\bf k})$ modulo $\mathcal{E}^{\mathfrak{b}}_{k}$ . We call $\zeta^{\mathfrak{b}}({\bf k})$ a binary multiple zeta symbol or a binary MZS.

Let $H^{\mathfrak{b}}$ denote the natural homomorphism from $\operatorname*{\bigoplus}_{k\geq 0}\mathcal{H}^{\mathfrak{b}}_{k}$ to $\operatorname*{\bigoplus}_{k\geq 0}\mathcal{Z}_{k}^{\mathfrak{b}}$ : each component is the canonical map of (2.11). We define the binary evaluation map by $Z^{\mathfrak{b}}=H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}$ . The binary EDS relations of weight $k$ are then stated as

\displaystyle Z^{\mathfrak{b}}(\mathsf{ds}({\bf k},{\bf l}))

\displaystyle=

\displaystyle 0\qquad(({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}).

(2.12)

We list some examples of the original and binary EDS relations for weights $k\leq 4$ in Table 1.

Let $\mathcal{Z}_{k,r}^{\mathfrak{b}}$ denote the vector subspace $\langle\zeta^{\mathfrak{b}}({\bf k}){\,|\,}{\bf k}\in{\bf I}_{k},{\mathrm{d}}({\bf k})\leq r\rangle_{\mathbb{F}_{2}}$ as introduced in the first section. We end this section with the definition of the graded pieces satisfying the direct sum decomposition (1.5).

Definition 2.2.

For a weight $k$ , we define the depth graded formal multiple zeta spaces by

\displaystyle\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}

\displaystyle:=

\displaystyle\mathcal{Z}_{k,r}^{\mathfrak{b}}/\mathcal{Z}_{k,r-1}^{\mathfrak{b}}\qquad(k>r\geq 0),

(2.13)

where $\mathcal{Z}_{k,-1}^{\mathfrak{b}}=\{0\}$ .

Table 1: EDS relations in

\mathcal{Z}_{k}

and

\mathcal{Z}_{k}^{\mathfrak{b}}

for weights

k\leq 4

${\bf k},{\bf l}$	Original EDS relation (over $\mathbb{Z}$ )	Binary EDS relation (over $\mathbb{F}_{2}$ )
$(1),(2)$	$-\zeta(2,1)+\zeta(3)=0$	$\zeta^{\mathfrak{b}}(2,1)+\zeta^{\mathfrak{b}}(3)=0$
$(1),(3)$	$-\zeta(2,2)-\zeta(3,1)+\zeta(4)=0$	$\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)+\zeta^{\mathfrak{b}}(4)=0$
$(1),(2,1)$	$-\zeta(2,1,1)+\zeta(2,2)+\zeta(3,1)=0$	$\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)=0$
$(1,1),(2)$	$\zeta(2,1,1)-\zeta(2,2)-\zeta(3,1)=0$	$\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)=0$
$(2),(2)$	$-4\zeta(3,1)+\zeta(4)=0$	$\zeta^{\mathfrak{b}}(4)=0$

3 Computational result

We report our computational results. How we obtain them will be explained in the next section.

We begin with a typical result related to (1.2).

Experiment 3.1.

For any weight $k$ with $2\leq k\leq 22$ , we verify $\zeta^{\mathfrak{b}}({\bf I}^{H}_{k})$ is a basis of $\mathcal{Z}_{k}^{\mathfrak{b}}$ , and

\displaystyle\dim_{\mathbb{F}_{2}}\mathcal{Z}_{k}^{\mathfrak{b}}

\displaystyle=

\displaystyle 2^{k-2}-\dim_{\mathbb{F}_{2}}\mathcal{E}^{\mathfrak{b}}_{k}\,=\,d_{k}.

(3.1)

The EDS conjecture states that, for every weight $k$ , the relations in (2.7) suffice to reduce the number of generators of $\mathcal{Z}_{k}$ to $d_{k}$ :

\displaystyle\dim_{\mathbb{Q}}\mathcal{E}_{k}

\displaystyle\geq

\displaystyle 2^{k-2}-d_{k},

(3.2)

where $\mathcal{E}_{k}=\langle\mathsf{ds}({\bf k},{\bf l}){\,|\,}({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}\rangle_{\mathbb{Q}}$ . This can be confirmed by Experiment 3.1, as follows. We denote by $\mathcal{E}^{\mathbb{Z}}_{k}=\langle\mathsf{ds}({\bf k},{\bf l}){\,|\,}({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}\rangle_{\mathbb{Z}}$ the $\mathbb{Z}$ -module counterpart of $\mathcal{E}_{k}$ . Since $\mathbb{Q}$ is the field of fractions of $\mathbb{Z}$ and $\mathrm{can}^{\mathfrak{b}}$ is a surjective homomorphism from $\mathcal{E}^{\mathbb{Z}}_{k}$ to $\mathcal{E}^{\mathfrak{b}}_{k}$ ,

\displaystyle\dim_{\mathbb{Q}}\mathcal{E}_{k}

\displaystyle=

\displaystyle\mathrm{rank}_{\mathbb{Z}}\,\mathcal{E}^{\mathbb{Z}}_{k}\,\geq\,\dim_{\mathbb{F}_{2}}\mathcal{E}^{\mathfrak{b}}_{k},

which, together with (3.1), proves (3.2) for $k\leq 22$ .

We recall $d^{\mathfrak{b}}_{k,r}=\binom{r}{k-2r}$ that is the number of the Hoffman mult-indices of weight $k$ and depth $r$ . We define $\mathcal{H}^{\mathfrak{b}}_{k,r}=\langle\eta^{\mathfrak{b}}({\bf k}){\,|\,}{\bf k}\in{\bf I}_{k},{\mathrm{d}}({\bf k})\leq r\rangle_{\mathbb{F}_{2}}\subset\mathcal{H}^{\mathfrak{b}}_{k}$ , and

\displaystyle\overline{\mathcal{E}}^{\mathfrak{b}}_{k,r}

\displaystyle=

\displaystyle(\mathcal{H}^{\mathfrak{b}}_{k,r}\cap\mathcal{E}^{\mathfrak{b}}_{k})/\mathcal{H}^{\mathfrak{b}}_{k,r-1}.

The main result is a refinement of Experiment 3.1. Taking the sum for $r=1,\ldots,k-1$ in (3.3) induces (3.1) because of (1.5): note that $\overline{\mathcal{Z}}_{k,0}^{\mathfrak{b}}=\{0\}$ unless $k=0$ .

Experiment 3.2.

For any weight $k$ and depth $r$ with $1\leq r<k\leq 22$ , we verify $\overline{\zeta}^{\mathfrak{b}}({\bf I}^{H}_{k,r})$ is a basis of $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ , and

\displaystyle\dim_{\mathbb{F}_{2}}\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}

\displaystyle=

\displaystyle\binom{k-2}{r-1}-\dim_{\mathbb{F}_{2}}\overline{\mathcal{E}}^{\mathfrak{b}}_{k,r}\,=\,d^{\mathfrak{b}}_{k,r}.

(3.3)

The first equality in (3.3) is by the isomorphism theorems. In fact, we have

$\displaystyle\mathcal{Z}_{k,r}^{\mathfrak{b}}/\mathcal{Z}_{k,r-1}^{\mathfrak{b}}$	$\displaystyle\simeq$	$\displaystyle\raisebox{4.0pt}[0.0pt][0.0pt]{$\mathcal{H}^{\mathfrak{b}}_{k,r}/(\mathcal{H}^{\mathfrak{b}}_{k,r}\cap\mathcal{E}^{\mathfrak{b}}_{k})$}\Big{/}\raisebox{-4.0pt}[0.0pt][0.0pt]{$\mathcal{H}^{\mathfrak{b}}_{k,r-1}/(\mathcal{H}^{\mathfrak{b}}_{k,r-1}\cap\mathcal{E}^{\mathfrak{b}}_{k})$}$	(3.4)
	$\displaystyle\simeq$	$\displaystyle\rule{0.0pt}{15.0pt}\raisebox{4.0pt}[0.0pt][0.0pt]{$\mathcal{H}^{\mathfrak{b}}_{k,r}$}\Big{/}\raisebox{-4.0pt}[0.0pt][0.0pt]{$(\mathcal{H}^{\mathfrak{b}}_{k,r-1}+\mathcal{H}^{\mathfrak{b}}_{k,r}\cap\mathcal{E}^{\mathfrak{b}}_{k})$}$
	$\displaystyle\simeq$	$\displaystyle\rule{0.0pt}{15.0pt}\raisebox{4.0pt}[0.0pt][0.0pt]{$\mathcal{H}^{\mathfrak{b}}_{k,r}/\mathcal{H}^{\mathfrak{b}}_{k,r-1}$}\Big{/}\raisebox{-4.0pt}[0.0pt][0.0pt]{$(\mathcal{H}^{\mathfrak{b}}_{k,r}\cap\mathcal{E}^{\mathfrak{b}}_{k})/\mathcal{H}^{\mathfrak{b}}_{k,r-1}$},$

and

\displaystyle\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}

\displaystyle\simeq

\displaystyle\raisebox{4.0pt}[0.0pt][0.0pt]{$\mathcal{H}^{\mathfrak{b}}_{k,r}/\mathcal{H}^{\mathfrak{b}}_{k,r-1}$}\Big{/}\raisebox{-4.0pt}[0.0pt][0.0pt]{$\overline{\mathcal{E}}^{\mathfrak{b}}_{k,r}$}.

Since $\binom{k-2}{r-1}=|\mathcal{H}^{\mathfrak{b}}_{k,r}/\mathcal{H}^{\mathfrak{b}}_{k,r-1}|$ by counting the number of the mult-indices of weight $k$ and depth $r$ , we obtain the desired equality.

We demonstrate the numbers $d^{\mathfrak{b}}_{k,r}$ for $k\leq 22$ in Table 2. They are expressed in terms of binomial coefficients, and we can observe a (shifted) Pascal triangle pattern: the column $r=0$ has the sequence $(1)$ from the row $k=0$ , the column $r=1$ has $(1,1)$ from $k=2$ , the column $r=2$ has $(1,2,1)$ from $k=4$ , the column $r=3$ has $(1,3,3,1)$ from $k=6$ , and so on. For comparison, the dimensions of ${\overline{\mathcal{Z}}_{k,r}}$ conjectured in (1.8) are listed in Table 3.

Table 2: The numbers

d^{\mathfrak{b}}_{k,r}

for

0\leq r<k\leq 22

: the unlisted numbers

d^{\mathfrak{b}}_{k,r}

(r>11)

are

0

. The total number of each row is

d_{k}

and that of each column is

2^{r}

(for

r\leq 7

${\left.\begin{array}[]{|c|cccccccccccc|c|}\hline\cr k/r&0&1&2&3&4&5&6&7&8&9&10&11&\text{Total}\\ \hline\cr 0&1&0&0&0&0&0&0&0&0&0&0&0&1\\ 1&0&0&0&0&0&0&0&0&0&0&0&0&0\\ 2&0&1&0&0&0&0&0&0&0&0&0&0&1\\ 3&0&1&0&0&0&0&0&0&0&0&0&0&1\\ 4&0&0&1&0&0&0&0&0&0&0&0&0&1\\ 5&0&0&2&0&0&0&0&0&0&0&0&0&2\\ 6&0&0&1&1&0&0&0&0&0&0&0&0&2\\ 7&0&0&0&3&0&0&0&0&0&0&0&0&3\\ 8&0&0&0&3&1&0&0&0&0&0&0&0&4\\ 9&0&0&0&1&4&0&0&0&0&0&0&0&5\\ 10&0&0&0&0&6&1&0&0&0&0&0&0&7\\ 11&0&0&0&0&4&5&0&0&0&0&0&0&9\\ 12&0&0&0&0&1&10&1&0&0&0&0&0&12\\ 13&0&0&0&0&0&10&6&0&0&0&0&0&16\\ 14&0&0&0&0&0&5&15&1&0&0&0&0&21\\ 15&0&0&0&0&0&1&20&7&0&0&0&0&28\\ 16&0&0&0&0&0&0&15&21&1&0&0&0&37\\ 17&0&0&0&0&0&0&6&35&8&0&0&0&49\\ 18&0&0&0&0&0&0&1&35&28&1&0&0&65\\ 19&0&0&0&0&0&0&0&21&56&9&0&0&86\\ 20&0&0&0&0&0&0&0&7&70&36&1&0&114\\ 21&0&0&0&0&0&0&0&1&56&84&10&0&151\\ 22&0&0&0&0&0&0&0&0&28&126&45&1&200\\ \hline\cr\text{Total}&1&2&4&8&16&32&64&128&-&-&-&-&\lx@intercol\hfil\hfil\lx@intercol\\ \cline{1-13}\cr\end{array}\right.}$

Table 3: The conjectural numbers

\dim\overline{\mathcal{Z}}_{k,r}

for

0\leq r<k\leq 22

: the unlisted numbers

\dim\overline{\mathcal{Z}}_{k,r}

(r>11)

are

0

${\left.\begin{array}[]{|c|cccccccccccc|c|}\hline\cr k/r&0&1&2&3&4&5&6&7&8&9&10&11&\text{Total}\\ \hline\cr 0&1&0&0&0&0&0&0&0&0&0&0&0&1\\ 1&0&0&0&0&0&0&0&0&0&0&0&0&0\\ 2&0&1&0&0&0&0&0&0&0&0&0&0&1\\ 3&0&1&0&0&0&0&0&0&0&0&0&0&1\\ 4&0&1&0&0&0&0&0&0&0&0&0&0&1\\ 5&0&1&1&0&0&0&0&0&0&0&0&0&2\\ 6&0&1&1&0&0&0&0&0&0&0&0&0&2\\ 7&0&1&2&0&0&0&0&0&0&0&0&0&3\\ 8&0&1&2&1&0&0&0&0&0&0&0&0&4\\ 9&0&1&3&1&0&0&0&0&0&0&0&0&5\\ 10&0&1&3&3&0&0&0&0&0&0&0&0&7\\ 11&0&1&4&3&1&0&0&0&0&0&0&0&9\\ 12&0&1&3&6&2&0&0&0&0&0&0&0&12\\ 13&0&1&5&6&4&0&0&0&0&0&0&0&16\\ 14&0&1&5&9&4&2&0&0&0&0&0&0&21\\ 15&0&1&6&8&10&3&0&0&0&0&0&0&28\\ 16&0&1&5&14&11&6&0&0&0&0&0&0&37\\ 17&0&1&7&13&18&7&3&0&0&0&0&0&49\\ 18&0&1&6&19&18&17&4&0&0&0&0&0&65\\ 19&0&1&8&17&31&19&10&0&0&0&0&0&86\\ 20&0&1&7&25&30&35&12&4&0&0&0&0&114\\ 21&0&1&9&22&48&37&29&5&0&0&0&0&151\\ 22&0&1&8&32&45&65&33&16&0&0&0&0&200\\ \hline\cr\end{array}\right.}$

Refinements of the EDS conjecture have been proposed. Minh et al. [27] conjectured that a part of the EDS relations obtained from

\displaystyle\widehat{{\bf PI}}_{k}^{\mathrm{MJPO}}

\displaystyle=

\displaystyle{\bf PI}_{k}\cup({\bf\widehat{I}}_{1}\times{\bf I}_{k-1})

(3.5)

is a right candidate, and verified it up to $k=16$ . The relations

\displaystyle Z(\mathsf{ds}({\bf k},{\bf l}))

\displaystyle=

\displaystyle 0\qquad(({\bf k},{\bf l})\in{\bf\widehat{I}}_{1}\times{\bf I}_{k-1})

are known as Hoffman’s relations ([18]), and their conjecture says that FDS relations and Hoffman’s relations suffice to give all relations among MZVs. Kaneko et al. [21] conjectured the above relations are too much, i.e., a smaller part obtained from

\displaystyle\widehat{{\bf PI}}_{k}^{\mathrm{KNT}}

\displaystyle=

\displaystyle(\{(3),(2,1)\}\times{\bf I}_{k-3})\cup(\{(2)\}\times{\bf I}_{k-2})\cup({\bf\widehat{I}}_{1}\times{\bf I}_{k-1})

(3.6)

is a right candidate. They verified it up to $k=20$ .

In the space $\mathcal{Z}_{k}^{\mathfrak{b}}$ , neither the relations obtained from (3.5) nor those obtained from (3.6) suffice to give all relations among binary MZSs.

Experiment 3.3.

Let $\bullet\in\{\mathrm{KNT},\mathrm{MJPO}\}$ and let $\mathcal{E}^{\mathfrak{b},\bullet}_{k}=\langle\mathrm{can}^{\mathfrak{b}}(\mathsf{ds}({\bf k},{\bf l})){\,|\,}({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}^{\bullet}\rangle_{\mathbb{F}_{2}}$ . There exist weights $k\leq 22$ such that

\displaystyle\dim_{\mathbb{F}_{2}}\mathcal{E}^{\mathfrak{b},\bullet}_{k}

\displaystyle<

\displaystyle 2^{k-2}-d_{k}.

(3.7)

Table 4: The numbers

d^{\mathfrak{b},\bullet}_{k}

(\bullet\in\{\mathrm{KNT},\mathrm{MJPO}\})

with

d_{k}

: they are same when

k\leq 6

$\left.\begin{array}[]{|c|ccc|}\hline\cr k\rule[-6.0pt]{0.0pt}{20.0pt}&d^{\mathfrak{b},\mathrm{KNT}}_{k}&d^{\mathfrak{b},\mathrm{MJPO}}_{k}&d_{k}\\ \hline\cr 7&4&4&3\\ 8&6&4&4\\ 9&8&6&5\\ 10&12&8&7\\ 11&21&10&9\\ 12&30&14&12\\ 13&44&18&16\\ 14&66&24&21\\ 15&100&33&28\\ 16&140&42&37\\ 17&208&57&49\\ 18&300&75&65\\ 19&441&99&86\\ 20&644&132&114\\ 21&-&174&151\\ 22&-&231&200\\ \hline\cr\end{array}\right.$

Computational results of $d^{\mathfrak{b},\bullet}_{k}=2^{k-2}-\dim\mathcal{E}^{\mathfrak{b},\bullet}_{k}$ are shown in Table 4. In general,

\displaystyle d^{\mathfrak{b},\mathrm{KNT}}_{k}

\displaystyle>

\displaystyle d^{\mathfrak{b},\mathrm{MJPO}}_{k}\,>\,d_{k}.

We can find that the sequence $(d^{\mathfrak{b},\mathrm{MJPO}}_{k})_{0\leq k\leq 22}$ has a quasi Fibonacci-like rule,

\displaystyle d^{\mathfrak{b},\mathrm{MJPO}}_{k}

\displaystyle=

\displaystyle d^{\mathfrak{b},\mathrm{MJPO}}_{k-2}+d^{\mathfrak{b},\mathrm{MJPO}}_{k-3}+\delta_{M,k},

(3.8)

where $M=\{7,15\}$ and $\delta_{M,k}$ is the Kronecker delta function defined by $\delta_{M,k}=1$ if $k\in M$ and $\delta_{M,k}=0$ otherwise. It appears that $(d^{\mathfrak{b},\mathrm{KNT}}_{k})_{0\leq k\leq 22}$ does not have an obvious law.

4 Computer program

Our computer programs, that perform the Gaussian forward elimination on the linear combinations in $\mathcal{E}^{\mathfrak{b}}_{k}$ , show the following proposition.

Proposition 4.1.

Let $k$ and $r$ be a weight and depth, respectively, with $r<k\leq 22$ . For a mult-index ${\bf k}$ in ${\bf I}_{k,r}$ , the following statements hold.
(i) If ${\bf k}\notin{\bf I}^{H}_{k,r}$ , there exists a combination $c\in\mathcal{H}^{\mathfrak{b}}_{k,r}\cap\mathcal{E}^{\mathfrak{b}}_{k}$ such that

\displaystyle\eta^{\mathfrak{b}}({\bf k})

\displaystyle\in

\displaystyle c+\langle\eta^{\mathfrak{b}}({\bf h}){\,|\,}{\bf h}\in{\bf I}^{H}_{k,r}\cup{\bf I}^{H}_{k,r-1}\cup\cdots\cup{\bf I}^{H}_{k,\lfloor k/3\rfloor}\rangle_{\mathbb{F}_{2}}.

(4.1)

(ii) If ${\bf k}\in{\bf I}^{H}_{k,r}$ , there exists no combination $c$ such as $\mathrm{(\ref{4_PRP1_IncBHS})}$ .

Here $\lfloor\cdot\rfloor$ is the floor function defined by $\lfloor t\rfloor=\max\left\{a\in\mathbb{Z}{\,|\,}a\leq t\right\}$ for a real number $t$ .

Proposition 4.1 verifies Experiment 3.2. Suppose ${\bf k}\in{\bf I}_{k,r}\setminus{\bf I}^{H}_{k,r}$ . By the statement (i),

\displaystyle\zeta^{\mathfrak{b}}({\bf k})

\displaystyle\in

\displaystyle\langle\zeta^{\mathfrak{b}}({\bf h}){\,|\,}{\bf h}\in{\bf I}^{H}_{k,r}\cup{\bf I}^{H}_{k,r-1}\cup\cdots\cup{\bf I}^{H}_{k,\lfloor k/3\rfloor}\rangle_{\mathbb{F}_{2}},

(4.2)

\displaystyle\overline{\zeta}^{\mathfrak{b}}({\bf k})

\displaystyle\in

\displaystyle\langle\overline{\zeta}^{\mathfrak{b}}({\bf h}){\,|\,}{\bf h}\in{\bf I}^{H}_{k,r}\rangle_{\mathbb{F}_{2}},

(4.3)

which, together with the statement (ii), implies $\overline{\zeta}^{\mathfrak{b}}({\bf I}^{H}_{k,r})$ is a basis of $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ for $r<k\leq 22$ .

Imaginarily, the Gaussian elimination can elucidate any vector space whose corresponding matrix (or set of defining linear combinations) is clearly given: but practically, it is limited to a space that are not too big. The bound $k=22$ in Proposition 4.1 indicates a performance threshold of our computing environments. Below we will describe the environments and prove Proposition 4.1.

The programs are written almost by Python language and partly by Cython language. The machine is as follows: a Linux-based PC having two CPUs with $12$ -core at 2.70GHz (Intel Xeon Gold 6226) and a $3$ TB RAM. The package of the programs is available at https://github.com/machide-tomoyan/BMZS-calculator.

The executable files are in the directories named as $\mathtt{Main\_make}$ and $\mathtt{Main\_cal}$ . The former contains five files that produce datas of binary systems (or binary matrices) obtained from the binary EDS relations, and the latter contains one file that calculates dimensions of $\mathcal{Z}_{k}^{\mathfrak{b}}$ and $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ (or row echelon forms of the corresponding binary matrices). The produced datas are stocked in $\mathtt{Data}$ , almost of which are saved in Python pickle format to reduce data size. Class files in which essential precesses are performed are stored in $\mathtt{Work}$ . Files of config, license and readme are also placed in the root directory of the package. (See Figure 1 for a layout of the package).

Refer to caption — Figure 1: Layout of our package for the executable files.

We have a convenient expression for a linear combination in $\mathcal{Z}_{k}^{\mathfrak{b}}$ since $\mathbb{F}_{2}$ consists of only two elements. A subset ${\bf J}$ in ${\bf I}_{k}$ is identified with a combination such as

\displaystyle{\bf J}\quad\,\longleftrightarrow\,\quad\sum_{{\bf k}\in{\bf J}}\zeta^{\mathfrak{b}}({\bf k}).

(4.4)

For instance, ${\bf J}_{1}=\{(2,1,1),(2,2),(3,1)\}$ corresponds to $\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)$ and ${\bf J}_{2}=\{(2,2),(3,1),(4)\}$ corresponds to $\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)+\zeta^{\mathfrak{b}}(4)$ . By (4.4), the symmetric difference $\bigtriangleup$ of two sets is equivalent to the plus of two combinations: ${\bf J}_{1}\bigtriangleup{\bf J}_{2}=({\bf J}_{1}\setminus{\bf J}_{2})\cup({\bf J}_{2}\setminus{\bf J}_{1})=\{(2,1,1),(4)\}$ corresponds to $(\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)+\zeta^{\mathfrak{b}}(4))+(\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1))=\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(4)$ . The expression (4.4) is also applied to a linear relation $\sum_{{\bf k}\in{\bf J}}\zeta^{\mathfrak{b}}({\bf k})=0$ in the same way. We compute binary EDS relations (or defining combinations in $\mathcal{E}^{\mathfrak{b}}_{k}$ ) through (4.4) with the set datatype in Python. This set based expression can be realized by built-in objects.⁶⁶6 We use frozenset and s.symmetric_difference(t) (or the operator notation ‘s^t’), where frozenset is an immutable datatype for set datas and s,t are its instances.

We will explain the executable files and report their statics. We do not mention actual command lines to use the files in a linux OS, but we can find them in the beginning of each file.

4.1 Executable file in $\mathtt{Main\_make}$

We will require many maps to save midway datas for the binary linear systems induced from the binary EDS relations. The prime reason is that, by (2.6), each EDS relation is composed of a combination of $\mathcyr{sh}$ , $*$ and $\mathrm{reg}_{\mathcyr{sh}}$ . For the maps or the midway datas, we will use dictionary datatype, which is a built-in object in Python and consists of a collection of tuples of two objects called ‘key’ and ‘value’: a key-object is mapped to its associated value-object.

The file $\mathtt{0\_preparation.py}$ prepares two dictionary datas for each weight $k\leq 22$ . Let $[n]$ denote the set $\{1,\ldots,n\}$ for a positive integer $n$ . One data gives a one-to-one mapping from the integers in $[2^{k}]$ to the words of degree $k$ , and another data gives a one-to-one mapping from the integers in $[2^{k-1}]$ to the mult-indices in ${\textstyle\bigcup_{r=1}^{k}}\mathbb{N}^{r}$ of weight $k$ : if $n\in[2^{k-1}]$ and the associated mult-index is ${\bf k}$ , the associated word is $z_{{\bf k}}$ . The objects of the set which our programs select for the set based expression in the left of (4.4) are the integers (for which the integer datatype is necessary) instead of the mult-indices and words (for which the tuple and string datatypes are necessary), because the integer datatype is reasonable in data size and running time.

The file $\mathtt{1\_product.py}$ creates dictionary datas for shuffle and stuffle products. The defining equations (2.2) and (2.3) suggest that creating datas of shuffle will take more time since shuffle products can contain more terms. For a speed-up, we improve (2.2):

			$\displaystyle a_{1}\cdots a_{m}\;\mathcyr{sh}\;b_{1}\cdots b_{n}$		(4.5)
		$\displaystyle=$	$\displaystyle\sum_{{i+j=l}\atop\left(0\leq i\leq\min\left\{l,m\right\}\atop 0\leq j\leq\min\left\{l,n\right\}\right)}\left(a_{1}\cdots a_{i}\;\mathcyr{sh}\;b_{1}\cdots b_{j}\right)\left(a_{i+1}\cdots a_{m}\;\mathcyr{sh}\;b_{j+1}\cdots b_{n}\right),$		(4.5)

where $a_{1},\ldots,a_{m},b_{1},\ldots,b_{n}\in\{x,y\}$ and $0<l\leq m+n$ . This is a spacial case of (2.2) if $l=1$ and can be proved by induction on $l$ . Let $k=m+n$ . Using (4.5) with $l=\lfloor k/2\rfloor$ , we can reduce shuffle products of weight $k$ to combinations of those of about half weight. We denote by $\mathtt{Sh}$ and $\mathtt{St}$ created datas of shuffle and stuffle, respectively. They map pairs of mult-indices to combinations including temporal indeterminates ${\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,\ldots,1,{\bf n})$ $({\bf n}\in{\bf I}\setminus\{\varnothing\})$ that are binary versions of regularized MZVs. For instance,

\begin{split}\mathtt{Sh}((1),(2))&\,=\,{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,2),\\ \mathtt{St}((1),(2))&\,=\,{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,2)+\zeta^{\mathfrak{b}}(2,1)+\zeta^{\mathfrak{b}}(3),\\ \mathtt{Sh}((1,1),(2))&\,=\,{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,1,2)+\zeta^{\mathfrak{b}}(2,1,1),\\ \mathtt{St}((1,1),(2))&\,=\,{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,1,2)+{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,2,1)+\zeta^{\mathfrak{b}}(2,1,1)+{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,3)+\zeta^{\mathfrak{b}}(3,1).\end{split}

(4.6)

In the case of shuffle, for using (4.5) with $l=\lfloor k/2\rfloor$ , we also create maps from pairs of words to combinations of words up to weight $22/2=11$ . In those additional maps, we allow the words that can not be written in terms of mult-indices (e.g., $x$ and $yx=z_{1}x$ ).

Let ${\bf k}$ be a mult-index that is expressed as $z_{{\bf k}}=y^{n}z_{{\bf n}}$ , where $n\geq 0$ and ${\bf n}\in{\bf I}\setminus\{\varnothing\}$ . Let ${\bf n}^{\prime}$ denote a mult-index such that $z_{{\bf n}}=xz_{{\bf n}^{\prime}}$ . By [19, Proposition 8],

\displaystyle\mathrm{reg}_{\mathcyr{sh}}(y^{n}z_{{\bf n}})

\displaystyle=

\displaystyle(-1)^{n}x(y^{n}\;\mathcyr{sh}\;z_{{\bf n}^{\prime}}).

(4.7)

Since the regularized MZV of $z_{{\bf k}}$ is $Z\circ\mathrm{reg}_{\mathcyr{sh}}(y^{n}z_{{\bf n}})$ , its binary version should be

\displaystyle{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}({\bf k})

\displaystyle=

\displaystyle Z^{\mathfrak{b}}\circ\mathrm{reg}_{\mathcyr{sh}}(y^{n}z_{{\bf n}})\,=\,H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}(x(y^{n}\;\mathcyr{sh}\;z_{{\bf n}^{\prime}})).

(4.8)

The dictionary datas that the file $\mathtt{2\_regularized\_product.py}$ creates are obtained by applying (4.8) to ones that the previous file creates. For instance, we have by (4.8)

$\displaystyle{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,2)$	$\displaystyle=$	$\displaystyle H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}(x(y\;\mathcyr{sh}\;y))\,=\,H^{\mathfrak{b}}(\mathrm{can}^{\mathfrak{b}}(2xyy))\,=\,0,$
$\displaystyle\rule{0.0pt}{15.0pt}{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,1,2)$	$\displaystyle=$	$\displaystyle H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}(x(y^{2}\;\mathcyr{sh}\;y))\,=\,H^{\mathfrak{b}}(\mathrm{can}^{\mathfrak{b}}(3xyyy))\,=\,\zeta^{\mathfrak{b}}(2,1,1),$
$\displaystyle\rule{0.0pt}{15.0pt}{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,2,1)$	$\displaystyle=$	$\displaystyle H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}(x(y\;\mathcyr{sh}\;yy))\,=\,H^{\mathfrak{b}}(\mathrm{can}^{\mathfrak{b}}(3xyyy))\,=\,\zeta^{\mathfrak{b}}(2,1,1),$
$\displaystyle\rule{0.0pt}{15.0pt}{\zeta}_{\mathcyr{sh}}^{\mathfrak{b}}(1,3)$	$\displaystyle=$	$\displaystyle H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}(x(y\;\mathcyr{sh}\;xy))\,=\,H^{\mathfrak{b}}(\mathrm{can}^{\mathfrak{b}}(xyxy+2xxyy))\,=\,\zeta^{\mathfrak{b}}(2,2),$

and so the previous datas in (4.6) are converted to

\begin{split}\mathtt{Sh}_{\mathcyr{sh}}((1),(2))&\,=\,0,\\ \mathtt{St}_{\mathcyr{sh}}((1),(2))&\,=\,\zeta^{\mathfrak{b}}(2,1)+\zeta^{\mathfrak{b}}(3),\\ \mathtt{Sh}_{\mathcyr{sh}}((1,1),(2))&\,=\,0,\\ \mathtt{St}_{\mathcyr{sh}}((1,1),(2))&\,=\,\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1),\end{split}

(4.9)

where $\mathtt{Sh}_{\mathcyr{sh}}$ and $\mathtt{St}_{\mathcyr{sh}}$ stand for the maps of regularized shuffle and stuffle, respectively.

The file $\mathtt{3\_extended\_relation.py}$ makes binary EDS relations,

\displaystyle\mathtt{St}_{\mathcyr{sh}}({\bf k},{\bf l})+\mathtt{Sh}_{\mathcyr{sh}}({\bf k},{\bf l})

\displaystyle=

\displaystyle 0\qquad(({\bf k},{\bf l})\in\widehat{{\bf PI}}_{k}),

by combining the previous datas. For instance, the datas in (4.9) create the two relations,

\displaystyle\zeta^{\mathfrak{b}}(2,1)+\zeta^{\mathfrak{b}}(3)

\displaystyle=

\displaystyle 0,

\displaystyle\zeta^{\mathfrak{b}}(2,1,1)+\zeta^{\mathfrak{b}}(2,2)+\zeta^{\mathfrak{b}}(3,1)

\displaystyle=

\displaystyle 0.

The file $\mathtt{4\_binary\_system.py}$ converts the binary EDS relations of a weight $k$ to a binary linear system (which we call a binary EDS linear system) in both of text and pickle formats. The text format is organized as follows:

1.

A line with the first character ‘#’ is a comment line. Comment lines typically occur at the beginning of the file, but are allowed to appear throughout the file.
2.

The remainder of the file contains lines defining the binary linear relations, one by one.
3.

A relation is defined by positive integers numbering binary MZSs. A number ‘0’ is typically placed at the last of the line, but it is optional.

For example, the line “2 4 0” is corresponding to $\zeta^{\mathfrak{b}}(2,1)+\zeta^{\mathfrak{b}}(3)=0$ , if $\zeta^{\mathfrak{b}}(2,1)$ and $\zeta^{\mathfrak{b}}(3)$ are numbered as $2$ and $4$ , respectively. The pickle files are not necessary but useful: e.g., when loading the large size system. For Experiment 3.3, we also make binary $\mathrm{KNT}$ and $\mathrm{MJPO}$ linear systems by restricting binary EDS relations.

The above programs run under the parallel process since the datas can be created independently if $\widehat{{\bf PI}}_{k}$ is divided into a plurality of blocks. The filenames of the datas by the parallel process have strings ‘ $\_\mathtt{Bn}$ ’ $(\mathtt{n}\in\mathbb{N})$ at their tails. Editing the file $\mathtt{config.txt}$ we can control the max number of parallel threads.

In Table 5, we present computation times (or elapsed real times) to execute all files for $k\geq 18$ , where $\mathtt{i\_M.py}$ stands for the $i$ -th file mentioned above from $0$ to $4$ . We find that calculating the regularizations in $\mathtt{2\_M.py}$ is the dominant process. Table 6 lists the file sizes of the binary linear systems in pickle format for $\widehat{{\bf PI}}_{k}^{\mathrm{KNT}}$ , $\widehat{{\bf PI}}_{k}^{\mathrm{MJPO}}$ and $\widehat{{\bf PI}}_{k}=\widehat{{\bf PI}}_{k}^{\mathrm{EDS}}$ . As expected from $\widehat{{\bf PI}}_{k}^{\mathrm{KNT}}\subset\widehat{{\bf PI}}_{k}^{\mathrm{MJPO}}\subset\widehat{{\bf PI}}_{k}$ , the file of $\mathrm{KNT}$ is smallest and that of EDS is largest for each weight. The size of text format file is about $1.5$ times the size of pickle one. For each weight $k$ , the maximum memory size (or maximum resident set size) to execute the files $\mathtt{i\_M.py}$ is the size required by $\mathtt{4\_M.py}$ , which is about half size used in Gaussian forward elimination (see Table 7). Computationally, making linear systems is not harder than calculating their coranks (or dimensions of cokernel) as we will see below.

Table 5: Elapsed real times [sec] to make binary systems.

$k$	$\mathtt{0\_M.py}$	$\mathtt{1\_M.py}$	$\mathtt{2\_M.py}$	$\mathtt{3\_M.py}$	$\mathtt{4\_M.py}$	Total
18	0	75	246	53	54	428 ( $\fallingdotseq$ 7min)
19	1	188	805	133	186	1313 ( $\fallingdotseq$ 22min)
20	3	469	3137	510	543	4662 ( $\fallingdotseq$ 1.3hour)
21	7	1529	15607	1384	2362	20889 ( $\fallingdotseq$ 5.8hour)
22	15	3018	61898	3675	6578	75184 ( $\fallingdotseq$ 21hour)

Table 6: File sizes of binary linear systems in pickle format.

$k$	$\mathrm{KNT}$	$\mathrm{MJPO}$	EDS
18	8.3M	150M	274M
19	21M	509M	922M
20	47M	1.6G	2.8G
21	105M	5G	8.6G
22	233M	16G	26G

4.2 Executable file in $\mathtt{Main\_cal}$

The file $\mathtt{dimensions.py}$ ( $\mathtt{d\_C.py}$ for short) executes the Gaussian forward elimination on a given binary linear system of a weight $k$ by using Algorithm A.6. In the process, an order of mult-indices (or binary MZSs) have to be determined to convert the inputted binary linear system into the corresponding binary matrix. We employ a sequence $({{\bf k}_{1}},\ldots,{{\bf k}_{2^{k-2}}})$ satisfying the following: if $i<j$ ,

(a)

${\mathrm{d}}({\bf k}_{i})>{\mathrm{d}}({\bf k}_{j})$ ; or
(b)

${\mathrm{d}}({\bf k}_{i})={\mathrm{d}}({\bf k}_{j})$ and $({\bf k}_{i},{\bf k}_{j})\notin{\bf I}^{H}_{k}\times({\bf I}_{k}\setminus{\bf I}^{H}_{k})$ .

The condition (a) means that the mult-indices (or columns in the corresponding matrix) are sectioned into $k-1$ blocks by depth: the mult-indices in a left block have a greater depth than those in a right block. The condition (b) means that the Hoffman mult-indices of depth $r$ are at the rightmost place in the $(k-r)$ th block. For example, the order of weight $4$ determined by ${\bf k}_{1}=(2,1,1)$ , ${\bf k}_{2}=(3,1)$ , ${\bf k}_{3}=(2,2)$ and ${\bf k}_{4}=(4)$ satisfies (a) and (b): they are sectioned as $({\bf k}_{1})|({\bf k}_{2},{\bf k}_{3})|({\bf k}_{4})$ and the only Hoffman mult-index ${\bf k}_{3}$ is located rightmost in the $2$ th block.

Proposition 4.1 is shown as follows.

Proof of Proposition 4.1. We consider the situation where we run $\mathtt{d\_C.py}$ by inputing the binary EDS linear system of weight $k$ . We then obtain a row echelon matrix satisfies the following.

(E1)

There exists a non-zero pivot at any column ${\bf k}$ in ${\bf I}_{k}\setminus{\bf I}^{H}_{k}$ .
(E2)

There exists no non-zero pivot at any column ${\bf k}$ in ${\bf I}^{H}_{k}$ .

For a non-zero combination $c=\eta^{\mathfrak{b}}({\bf k}_{i_{1}})+\cdots+\eta^{\mathfrak{b}}({\bf k}_{i_{j}})$ in $\mathcal{H}^{\mathfrak{b}}_{k}$ with $i_{1}<\cdots<i_{j}$ , we define the leading term of $c$ by

\displaystyle L(c)

\displaystyle=

\displaystyle\eta^{\mathfrak{b}}({\bf k}_{i_{1}}).

By (a) and (b), the statements (E1) and (E2) are equivalent to (e1) and (e2), respectively:

(e1)

There exists a combination $c\in\mathcal{E}^{\mathfrak{b}}_{k}$ such that $L(c)=\eta^{\mathfrak{b}}({\bf k})$ for any ${\bf k}$ in ${\bf I}_{k}\setminus{\bf I}^{H}_{k}$ .
(e2)

There exists no combination $c\in\mathcal{E}^{\mathfrak{b}}_{k}$ such that $L(c)=\eta^{\mathfrak{b}}({\bf k})$ for any ${\bf k}$ in ${\bf I}^{H}_{k}$ .

Under (e1) and (e2), the back substitution (performed imaginarily) implies Proposition 4.1, where the fact that ${\bf I}^{H}_{k,r}=\phi$ for any depth $r<\lfloor k/3\rfloor$ is used for (4.1). $\Box$

We give examples of (4.3) for $k\leq 7$ excluding the case that $\overline{\zeta}^{\mathfrak{b}}({\bf k})=0$ . Note that $\overline{\zeta}^{\mathfrak{b}}({\bf k})$ is always zero if ${\bf k}\in{\bf I}_{k,r}$ and ${\bf I}^{H}_{k,r}=\phi$ .

$\displaystyle\overline{\zeta}^{\mathfrak{b}}(3,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\rule{0.0pt}{25.0pt}\overline{\zeta}^{\mathfrak{b}}(4,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,3)+\overline{\zeta}^{\mathfrak{b}}(3,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\rule{0.0pt}{25.0pt}\overline{\zeta}^{\mathfrak{b}}(2,1,3)\,=\,\overline{\zeta}^{\mathfrak{b}}(3,2,1)\,=\,\overline{\zeta}^{\mathfrak{b}}(4,1,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,2,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\overline{\zeta}^{\mathfrak{b}}(5,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(3,3),$
$\displaystyle\rule{0.0pt}{15.0pt}\rule{0.0pt}{25.0pt}\overline{\zeta}^{\mathfrak{b}}(5,1,1)\,=\,\overline{\zeta}^{\mathfrak{b}}(3,1,3)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,2,3)+\overline{\zeta}^{\mathfrak{b}}(2,3,2)+\overline{\zeta}^{\mathfrak{b}}(3,2,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\overline{\zeta}^{\mathfrak{b}}(3,3,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,3,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\overline{\zeta}^{\mathfrak{b}}(4,2,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,2,3)+\overline{\zeta}^{\mathfrak{b}}(2,3,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\overline{\zeta}^{\mathfrak{b}}(4,1,2)\,=\,\overline{\zeta}^{\mathfrak{b}}(2,1,4)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,2,3)+\overline{\zeta}^{\mathfrak{b}}(3,2,2),$
$\displaystyle\rule{0.0pt}{15.0pt}\overline{\zeta}^{\mathfrak{b}}(2,4,1)$	$\displaystyle=$	$\displaystyle\overline{\zeta}^{\mathfrak{b}}(2,3,2)+\overline{\zeta}^{\mathfrak{b}}(3,2,2).$

Examining the Gaussian forward elimination performed by $\mathtt{d\_C.py}$ in detail, we can find a part of the inputted binary EDS relations which forms a basis of $\mathcal{E}^{\mathfrak{b}}_{k}$ . We give examples of bases for $k\leq 6$ , where only the pairs of mult-indices are written (see Table 1 that lists associated relations for $k\leq 4$ ).

$k=3$

$((1),(2))$ .
$k=4$

$((1),(2,1))$ , $((1),(3))$ , $((2),(2))$ .
$k=5$

$((1),(2,1,1))$ , $((1),(2,2))$ , $((1),(3,1))$ , $((1),(4))$ , $((2),(2,1))$ , $((2),(3))$ .
$k=6$

$((1),(2,1,1,1))$ , $((1),(2,1,2))$ , $((1),(2,2,1))$ , $((1),(2,3))$ , $((1),(3,1,1))$ , $((1),(3,2))$ , $((1),(4,1))$ , $((1),(5))$ , $((2),(2,1,1))$ , $((2),(2,2))$ , $((2),(3,1))$ , $((2,1),(2,1))$ , $((2,1),(3))$ , $((3),(3))$ .

We can verify Experiment 3.3 similarly to Experiment 3.2. We input $\mathrm{KNT}$ and $\mathrm{MJPO}$ linear systems into $\mathtt{d\_C.py}$ . By Table 4, in most cases, row echelon matrices that do not satisfy (E1) are outputted. The fails of (E1) induce (3.7), and ensure Experiment 3.3.

The program in $\mathtt{d\_C.py}$ applies the parallel process to determine an order of mult-indices since mult-indices can be divided by depth. For instance, $(k-1)$ parallel threads occur as preprocessing if a binary EDS linear system of weight $k$ is inputted. Algorithm A.6, the main process for computing a row echelon matrix, is executed in single. It appears that the parallelization of Algorithm A.6 is not easy because a non-simple search procedure is incorporated.

In Table 7, we present the statics of the executions by $\mathtt{d\_C.py}$ whose inputs are the binary $\mathrm{KNT}$ , $\mathrm{MJPO}$ and EDS relations. We observe that the computation for $\mathrm{KNT}$ requires much more time than $\mathrm{MJPO}$ and EDS, although the number of relations of $\mathrm{KNT}$ is quite small such that the corresponding matrix is square for any $k\geq 7$ . This phenomenon expresses a characteristic of Algorithm A.6. It employs a conflict based search procedure inspired by the conflict-driven clause learning (CDCL), a modern method with many successes to practical applications in solving the Boolean satisfiability (SAT) problem. Roughly speaking, relations with good structures for finding conflict combinations can accelerate searching a pivot relation (see Remark A.7 for more information). The memory cost is bad in comparison with the statics in [21], but the runtime is about 10 times more faster. Therefore we can improve the record of calculating (3.2) from $k=20$ to $22$ by the use of a machine with large memory capacity.

Table 7: Statistics of the computations of Experiments 3.2 and 3.3. ‘Rels’ is the number of relations. ‘MeanNum’ is the average number of terms per relation. ‘Memory’ and ‘Time’ are the resident set size and elapsed real time, respectively. In each block with respect to the weight

k

, top row indicates information on

\mathrm{KNT}

, middle row indicates that on

\mathrm{MJPO}

and bottom row indicates that on EDS.

$k$	$2^{k-2}$	Rels	MeanNum	Memory	Time
18	65536	65536	30.1	4.6G	8.6hour	$\mathrm{KNT}$
\cdashline3-7		155711	230.4	7.3G	8.8min	$\mathrm{MJPO}$
\cdashline3-7		188470	364.4	11.4G	9.8min	EDS
19	131072	131072	33.7	16.5G	68hour
\cdashline3-6		327679	339.5	22.9G	42.4min
\cdashline3-6		393206	523.1	34.3G	43.7min
20	262144	262144	37.6	61G	22day
\cdashline3-6		688254	500.5	82G	5.3hour
\cdashline3-6		819316	751.7	110G	4.7hour
21	524288	-	-	-	-
\cdashline3-6		1441791	739.8	256G	30hour
\cdashline3-6		1703925	1083.3	329G	25hour
22	1048576	-	-	-	-
\cdashline3-6		3014911	1094.4	789G	8day
\cdashline3-6		3539188	1564.1	982G	7day

5 Problem

Some problems arise in connection with the experiments in Section 3.

Experiments 3.1 and 3.2 indicate typical problems on the dimensions of $\mathcal{Z}_{k}^{\mathfrak{b}}$ and $\overline{\mathcal{Z}}_{k,r}^{\mathfrak{b}}$ : obviously, Problem 5.2 includes Problem 5.1.

Problem 5.1.

Does (3.1) hold for any weight $k$ ?

Problem 5.2.

Does (3.3) (or Proposition 4.1) hold for any weight $k$ and depth $r$ ?

Experiment 3.3 yields the following:

Problem 5.3.

(i) Is there a subset $M\subset\mathbb{N}$ such that $M\cap[22]=\{7,15\}$ and the sequence $(d^{\mathfrak{b},\mathrm{MJPO}}_{k})_{k\geq 0}$ satisfies (3.8)?
(ii) Can we find a law in the sequence $(d^{\mathfrak{b},\mathrm{KNT}}_{k})_{k\geq 0}$ ?

We have adopted the binary field $\mathbb{F}_{2}$ for the scalar field of the formal multiple zeta space and for the computation of corank. (It is worth noting that the experiments of [21] employ $\mathbb{F}_{16381}$ and $\mathbb{F}_{31991}$ .) There are no particular reasons for choosing $\mathbb{F}_{2}$ except computational science techniques are easy to apply. A discovery of a regularity of $d^{\mathfrak{b}}_{k,r}$ in Table 2 is a product of good luck.

Problem 5.4.

(i) Can we find a theoretical reason why the dimensions $d^{\mathfrak{b}}_{k,r}$ $(r<k\leq 22)$ have a Pascal triangle pattern?
(ii) What will the dimensions be if we adopt other finite fields $\mathbb{F}_{p}$ ?

Like MZVs, we can make an assumption that binary MZSs satisfy a multiplication compatible with the shuffle and stuffle products. Under the assumption, we have $\zeta^{\mathfrak{b}}(2)^{2}=0$ since $Z^{\mathfrak{b}}(z_{2})Z^{\mathfrak{b}}(z_{2})=Z^{\mathfrak{b}}(z_{2}\,\mathcyr{sh}\,z_{2})=H^{\mathfrak{b}}\circ\mathrm{can}^{\mathfrak{b}}(2z_{2,2}+4z_{3,1})=0$ . This means that the algebras of MZV and binary MZS are different. In particular, $\mathbb{F}_{2}[\zeta^{\mathfrak{b}}(2)]=\langle 1,\zeta^{\mathfrak{b}}(2)\rangle_{\mathbb{F}_{2}}$ is not isomorphic to the polynomial ring in one variable, and statements and conjectures involving $\mathcal{Z}/\zeta(2)\mathcal{Z}$ (e.g., those involving finite and symmetric multiple zeta values introduced in [20]) can not be varied to $\mathcal{Z}^{\mathfrak{b}}/\zeta^{\mathfrak{b}}(2)\mathcal{Z}^{\mathfrak{b}}$ directly. It seems a mysterious problem that whether the algebra of binary MZS has a good property and a connection to the algebra of MZV.

Acknowledgements

The author would like to thank Tomohiro Sonobe for help with computing environments, and Junichi Teruyama for a recommendation to use (4.5) which made it possible to reduce computation costs. This work was supported by Japan Society for the Promotion of Science, Grant-in-Aid for Scientific Research (C) 20K03727.

Appendix

We will introduce a technique to speed up the Gaussian forward elimination over any field $K$ for a system of linear combinations that have some structure. An essential part of the technique appears in [24] to decide the full rankness of a binary matrix.

Let $x_{1},\ldots,x_{n}$ be variables, and we order the variables according to their subscripts. For a non-zero linear combination $p=p(x_{1},\ldots,x_{n})=\sum{}c_{i}x_{i}$ over $K$ , we denote by $s_{\mathrm{min}}(l)$ and $c_{\mathrm{min}}(l)$ the subscript and coefficient of the minimum variable, respectively. That is, $s_{\mathrm{min}}(p)=\min\left\{i{\,|\,}c_{i}\neq 0\right\}$ and $c_{\mathrm{min}}(p)=c_{s_{\mathrm{min}}(p)}$ . We define $s_{\mathrm{min}}(p)=n+1$ and $c_{\mathrm{min}}(p)=0$ when $p=0$ .

In what follows we will handle mainly linear combinations over $K$ , and we just call them combinations. Let $\mathcal{K}_{p_{1},\ldots,p_{m}}$ denote the $K$ -vector space spanned by combinations $p_{1},\ldots,p_{m}$ , and let $\mathcal{K}^{*}_{p_{1},\ldots,p_{m}}=\mathcal{K}_{p_{1},\ldots,p_{m}}\setminus\{0\}$ . We say that $p_{i}$ is a pivot combination if $s_{\mathrm{min}}(p_{i})=i$ , and

\displaystyle(p_{i_{g}})_{1\leq g\leq h}

\displaystyle=

\displaystyle(p_{i_{1}},\ldots,p_{i_{h}})

is a pivot sequence if $1\leq i_{1}<\cdots<i_{h}\leq n$ and every $p_{i_{g}}$ is a pivot combination.

There are two key processes for the speed-up technique. One is a conflict search procedure.

Process A.1.

Input: Combinations $L=\{l_{1},\ldots,l_{m}\}$ and a pivot sequence $(p_{1},\ldots,p_{j-1})$ .

Output: Either $(0,\varnothing)$ or $(q_{i},{\bf k}_{i})$ such that

(a)

$q_{i}\in L$ with $s_{\mathrm{min}}(q_{i})=i\leq j$ ;
(b)

${\bf k}_{i}=(k_{i},\ldots,k_{j-1},k_{j},\ldots,k_{n})\in K^{n-i+1}$ with $k_{j}=1$ and $k_{j+1}=\cdots=k_{n}=0$ ;
(c)

$q_{i}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}\in K^{*}$ ; and
(d)

$p_{i}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}=\cdots=p_{j-1}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}=0$ .

1.

Set ${\bf k}_{j}=(k_{j},\ldots,k_{n})=(1,0,\ldots,0)\in K^{n-j+1}$ and $i=j$ .
2.

Search $q_{i}$ from $\{l\in L{\,|\,}s_{\mathrm{min}}(l)=i\}$ such that $q_{i}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}\in K^{*}$ .
3.

Return $(q_{i},{\bf k}_{i})$ if such $q_{i}$ exists.
4.

Return $(0,\varnothing)$ if $i=1$ .
5.

Evaluate $k_{i-1}=-\dfrac{p_{i-1}-c_{\mathrm{min}}(p_{i-1})x_{i-1}}{c_{\mathrm{min}}(p_{i-1})}{\,\bigg{|}\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}\in K$ .⁷⁷7 This evaluation is well-defined since $s_{\mathrm{min}}(p_{i-1})=i-1$ and $c_{\mathrm{min}}(p_{i-1})\neq 0$ . The condition (d) follows from $\displaystyle p_{i-1}{\,|\,}_{(x_{i-1},\ldots,x_{n})=(k_{i-1},\ldots,k_{n})}$ $\displaystyle=$ $\displaystyle c_{\mathrm{min}}(p_{i-1})k_{i-1}+(p_{i-1}-c_{\mathrm{min}}(p_{i-1})x_{i-1}){\,|\,}_{(x_{i},\ldots,x_{n})=(k_{i},\ldots,k_{n})}\,=\,0.$
6.

Set ${\bf k}_{i-1}=(k_{i-1},{\bf k}_{i})$ .
7.

Update $i\leftarrow i-1$ , and go back to step $2$ .

Another is the classical elimination procedure with an evidence of conflict.

Process A.2.

: Input: A pivot sequence $(p_{i},\ldots,p_{j-1})$ and a pair $(q_{i},{\bf k}_{i})\neq(0,\varnothing)$ which satisfies the output conditions in Process A.1.
: Output: A combination $q_{j}\in\mathcal{K}^{*}_{q_{i},p_{i},\ldots,p_{j}}$ such that $s_{\mathrm{min}}(q_{j})=j$ .⁸⁸8 The theory of Gaussian elimination only ensures $q_{j}\in\mathcal{K}_{q_{i},p_{i},\ldots,p_{j}}$ and $s_{\mathrm{min}}(q_{j})\geq j$ . However, updating method of $q$ in step 2, together with the output conditions (c) and (d) in Process A.1, implies $q_{j}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}\in K^{*}$ . It also implies $s_{\mathrm{min}}(q_{j})=j$ . In fact, if $s_{\mathrm{min}}(q_{j})>j$ , $\displaystyle q_{j}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}$ $\displaystyle=$ $\displaystyle q_{j}{\,|\,}_{(x_{j+1},\ldots,x_{n})=(k_{j+1},\ldots,k_{n})}\,=\,q_{j}{\,|\,}_{x_{j+1}=\cdots=x_{n}=0}\,=\,0,$ which is a contradiction. Therefore the output condition in Process A.2 holds.

1.

Set $q=q_{i}$ .
2.

For $h$ from $i$ to $j-1$ , update $q\leftarrow q-\dfrac{c_{\mathrm{min}}(q)}{c_{\mathrm{min}}(p_{h})}p_{h}$ if $h=s_{\mathrm{min}}(q)$ .
3.

Return $q_{j}=q$ .

We can construct a process to find a new pivot combination by combining Processes A.1 and A.2.

Process A.3.

: Input: Combinations $l_{1},\ldots,l_{m}$ and a pivot sequence $(p_{1},\ldots,p_{j-1})$ .
: Output: Either $0$ or a combination $p_{j}\in\mathcal{K}^{*}_{l_{1},\ldots,l_{m},p_{1},\ldots,p_{j}}$ such that $s_{\mathrm{min}}(p_{j})=j$ .

1.

Receive $(q_{i},{\bf k}_{i})$ from Process A.1 for the inputs $L=\{l_{1},\ldots,l_{m}\}$ and $(p_{1},\ldots,p_{j-1})$ .
2.

Return $0$ if $q_{i}=0$ .
3.

Receive $q_{j}$ from Process A.2 for the inputs $(p_{i},\ldots,p_{j-1})$ and $(q_{i},{\bf k}_{i})$ .
4.

Return $p_{j}=q_{j}$ .

Process A.3 is essential for finding a pivot combination whose minimum variable is $x_{j}$ , because we can find out it by Process A.3 if and only if it exists.

Proposition A.4.

For combinations $l_{1},\ldots,l_{m}$ and a pivot sequence $(p_{1},\ldots,p_{j-1})$ , the following statements are equivalent.
(i) Process A.3 outputs $p_{j}\in\mathcal{K}^{*}_{l_{1},\ldots,l_{m},p_{1},\ldots,p_{j-1}}$ such that $s_{\mathrm{min}}(p_{j})=j$ .
(ii) There exists a combination $p_{j}\in\mathcal{K}^{*}_{l_{1},\ldots,l_{m},p_{1},\ldots,p_{j-1}}$ such that $s_{\mathrm{min}}(p_{j})=j$ .

Proof. Obviously, (i) implies (ii). Suppose (ii) is true to prove the converse. Then there exist elements $c_{1},\ldots,c_{m},d_{1},\ldots,d_{j-1}$ in $K$ such that

\displaystyle p_{j}

\displaystyle=

\displaystyle\sum_{h}c_{h}l_{h}+\sum_{i}d_{i}p_{i}.

We have $p_{j}{\,|\,}_{(x_{j},x_{j+1},\ldots,x_{n})=(1,0,\ldots,0)}\in K^{*}$ since $x_{j}$ is the minimum variable in $p_{j}$ .

We first consider the situation where we run Process A.1 for the inputs $L=\{l_{1},\ldots,l_{m}\}$ and $(p_{1},\ldots,p_{j-1})$ : however, we temporally assume that step 3 is skipped and the process ends with the output $(0,\varnothing)$ at step 4 of $i=1$ . Let $k_{1},\ldots,k_{j-1}$ be the elements in $K$ which are recursively determined as at step $5$ , and let ${\bf k}=(k_{1},\ldots,k_{j-1},1,0,\ldots,0)\in K^{n}$ . Then $p_{1}({\bf k})=\cdots=p_{j-1}({\bf k})=0$ , and

\displaystyle p_{j}({\bf k})

\displaystyle=

\displaystyle\sum_{h}c_{h}l_{h}({\bf k})+\sum_{i}d_{i}p_{i}({\bf k})\,=\,\sum_{h}c_{h}l_{h}({\bf k}).

Since $p_{j}({\bf k})=p_{j}{\,|\,}_{(x_{j},x_{j+1},\ldots,x_{n})=(1,0,\ldots,0)}\in K^{*}$ , this implies $l_{h}({\bf k})\in K^{*}$ for some $h$ , which means that Process A.1 can find out $q_{i}$ in step 2 such that $q_{i}{\,|\,}_{(x_{i},\ldots,x_{n})={\bf k}_{i}}\in K^{*}$ , at least when $i=s_{\mathrm{min}}(l_{h})$ . Therefore, Process A.1 without the temporal assumption always outputs $(q_{i},{\bf k}_{i})\neq(0,\varnothing)$ .

We input $L=\{l_{1},\ldots,l_{m}\}$ and $(p_{1},\ldots,p_{j-1})$ into Process A.3. At step $1$ , we receive $(q_{i},{\bf k}_{i})\neq(0,\varnothing)$ from Process A.1. Thus step 2 is skipped, and $q_{j}$ is received from Process A.2 at step 3, which satisfies the condition required in (i). Since $p_{j}=q_{j}$ is returned at step 4, we conclude (i) holds. $\Box$

For a subscript $j$ and a pivot sequence $(p_{i_{g}})=(p_{i_{g}})_{1\leq g\leq h}$ with $i_{h}<j$ , we define

\displaystyle D_{(p_{i_{g}}),j}

\displaystyle:=

\displaystyle[j-1]\setminus\{i_{1},\ldots,i_{h}\}.

We call an integer in $D_{(p_{i_{g}}),j}$ a deficient subscript, and a variable $x_{i}$ with $i\in D_{(p_{i_{g}}),j}$ a deficient variable.

We need to modify Process A.3 for practical use.

Process A.5.

: Input: Combinations $l_{1},\ldots,l_{m}$ , a subscript $j$ , and a pivot sequence $(p_{i_{g}})_{1\leq g\leq h}$ with $i_{h}<j$ .
: Output: Either $0$ or a combination $p_{j}\in\mathcal{K}^{*}_{l_{1},\ldots,l_{m},p_{i_{1}},\ldots,p_{i_{h}}}$ with $s_{\mathrm{min}}(p_{j})\in D_{(p_{i_{g}}),j}\cup\{j\}$ .

1.

Change the variable order by moving the deficient variables backward.
2.

Prepare the pivot sequence $(p^{\prime}_{1},\ldots,p^{\prime}_{j-1-|D_{(p_{i_{g}}),j}|})$ for the new variable order.
3.

Receive $p^{\prime}_{j-|D_{(p_{i_{g}}),j}|}$ from Process A.3 for the inputs $l_{1},\ldots,l_{m}$ and $(p^{\prime}_{1},\ldots,p^{\prime}_{j-1-|D_{(p_{i_{g}}),j}|})$ .
4.

Undo the variable order by putting the deficient variables back to their original places.
5.

Return $p_{j}=p^{\prime}_{j-|D_{(p_{i_{g}}),j}|}$ .

We are in a position to state Algorithm A.6 for a fast Gaussian forward elimination.

Algorithm A.6.

: Input: Combinations $L=\{l_{1},\ldots,l_{m}\}$ .
: Output: A pivot sequence $(p_{i_{g}})$ .

1.

Create subsets $L_{i}=\{l\in L{\,|\,}s_{\mathrm{min}}(l)=i\}$ $(i=1,\ldots,n)$ .
2.

Set $j=0$ and $(p_{i_{g}})=\phi$ .
3.
Execute the following loop process to make a pivot sequence $(p_{i_{g}})$ :
1. (i)
  
  Update $j\leftarrow j+1$ if $j<n$ ; otherwise break.
2. (ii)
  
  If $L_{j}\neq\phi$ , append a combination in $L_{i}$ to $(p_{i_{g}})$ and go back to (i).
3. (iii)
  
  Receive $p_{j}$ from Process A.5 for the inputs $L_{1}\cup\cdots\cup L_{j-1}$ and $(p_{i_{g}})$ .
4. (iv)
  
  If $p_{j}=0$ , go back to (i).
5. (v)
  
  Append $p_{j}$ to $(p_{i_{g}})$ ,⁹⁹9 The loop process ensures $s_{\mathrm{min}}(p_{j})=j$ . To show this, we may prove $s_{\mathrm{min}}(p_{j})\notin D_{(p_{i_{g}}),j}$ by the output condition in Process A.5. Suppose $s_{\mathrm{min}}(p_{j})\in D_{(p_{i_{g}}),j}$ and set $j^{\prime}=s_{\mathrm{min}}(p_{j})<j$ . Then, on the $j^{\prime}$ -round in the loop process, Process A.5 at (iii) must return a non-zero combination by Proposition A.4 and the existence of $p_{j}$ , where note that Process A.5 is essentially Process A.3. This means a combination $p_{j^{\prime}}$ satisfying $s_{\mathrm{min}}(p_{j^{\prime}})=j^{\prime}$ must be appended to $(p_{i_{g}})$ at (v) on the $j^{\prime}$ -round, which contradicts $j^{\prime}=s_{\mathrm{min}}(p_{j})\in D_{(p_{i_{g}}),j}$ . and back to (i).
4.

Return $(p_{i_{g}})$ .

The pivot sequence $(p_{i_{g}})$ outputted by Algorithm A.6 is a row echelon matrix under the order $x_{1}<\cdots<x_{n}$ thanks to Proposition A.4 (see the footnote in (v) of step 3 for details).

Remark A.7.

Process A.1 is influenced by the unit propagation (UP) in the algorithm to solve the Boolean satisfiability (SAT) problem (see, e.g., [4, Chapter 1]). SAT is the first problem that was proved to be NP-complete, which means that all NP-problems are at most as difficult as SAT. UP is a technique to determine an assignment value for the variable we watch while searching a conflict combination (or a conflict clause in SAT terminology).

Process A.2 is inspired by the conflict-driven clause learning (CDCL) proposed in [3, 25, 26] (see also [4, Chapter 5]). CDCL enable us to find (or learn) a new pivot combination from the conflict evidence found by UP.

The performance of UP tends to increase when combinations have good structures for finding conflict combinations under a good variable order: i.e., not too few number of combinations, high frequency of small size combinations, bias of occurrences of variables, and so on. We have seen in Table 7 that the runtimes of $\mathrm{MJPO}$ and EDS are much better than those of $\mathrm{KNT}$ , which seems to be due to the difference in numbers of relations (or combinations).

References

[1]
[2] H. Bachmann, Multiple zeta values and modular forms, Lecture notes (under construction, but available at https://www.henrikbachmann.com/mzv2020.html) in Nagoya University, Spring 2020.
[3] R. J. Bayardo Jr. and R. C. Schrag, Using CSP look-back techniques to solve real-world SAT instances, Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence, 203–208, 1997.
[4] A. Biere, M. Heule, H. V. Maaren, and T. Walsh (eds.), Handbook of satisfiability, Frontiers in Artificial Intelligence and Applications, Volume 185, IOS Press, Amsterdam, The Netherlands, 2009.
[5] M. Bigotte, G. Jacob, N. E. Oussous and M. Petitot, Lyndon words and shuffle algebras for generating the coloured multiple zeta values relations tables, Theor. Comput. Sci. 273 (2002), 271–282.
[6] J. Blümlein, D. J. Broadhurst and J. A. M. Vermaseren, The multiple zeta value data mine, Comput. Phys. Commun. 181 (2010), 582–625.
[7] D. Broadhurst and D. Kreimer, Association of multiple zeta values with positive knots via Feynman diagrams up to $9$ loops, Phys. Lett. B 393 (1997), 403–412.
[8] F. Brown, Mixed Tate motives over $\mathbb{Z}$ , Ann. Math. 175 (2012), 949–976.
[9] F. Brown, Depth-graded motivic multiple zeta values, Compos. Math. 157 (2021), 529–572.
[10] P. Deligne and A. B. Goncharov, Groupes fondamentaux motiviques de Tate mixte, Ann. Sci. École Norm. Sup. 38 (2005), 1–56.
[11] V. G. Drinfeld, On quasitriangular quasi-Hopf algebras and a group closely connected with $\mathrm{Gal}(\overline{\mathbb{Q}}/\mathbb{Q})$ , Leningrad Math. J. 2 (1991), 829–860.
[12] M. Espie, J-C. Novelli and G. Racinet, Formal computations about multiple zeta values, in “From Combinatorics to Dynamical Systems” (Strasbourg, 2002), IRMA Lect. Math. Theor. Phys. 3, F. Fauvet and C. Mitschi (eds.), de Gruyter, Berlin, (2003), 1–16.
[13] L. Euler, Meditationes circa singulare serierum genus, Novi Comm. Acad. Sci. Petropol. 20 (1776), 140–186 ; reprinted in Opera Omnia Ser. I, vol. 15, 217–267.
[14] H. Furusho, The multiple zeta value algebra and the stable derivation algebra, Publ. Res. Inst. Math. Sci. 39 (2003), 695–720.
[15] H. Gangl, M. Kaneko and D. Zagier, Double zeta values and modular forms, in “Automorphic forms and zeta functions”, Proceedings of the Conference in Memory of Tsuneo Arakawa, World Sci. Publ., Hackensack, NJ, (2006), 71–106.
[16] A. B. Goncharov, Periods and mixed motives, preprint (arXiv:math/0202154), 2002.
[17] M. Hirose and N. Sato, Iterated integrals on $\mathbb{P}\setminus\{0,1,\infty,z\}$ and a class of relations among multiple zeta values, Adv. Math. 348 (2019), 163–182.
[18] M. E. Hoffman, The algebra of multiple harmonic series, J. Algebra 194 (1997), 477–495.
[19] K. Ihara, M. Kaneko, and D. Zagier, Derivation and double shuffle relations for multiple zeta values, Compos. Math. 142 (2006), 307–338.
[20] M. Kaneko, An introduction to classical and finite multiple zeta values, Publ. math. Besançon. Algèb. Théor. Nr. 1 (2019), 103–129.
[21] M. Kaneko, M. Noro and K. Tsurumaki, On a conjecture for the dimension of the space of the multiple zeta values, Software for algebraic geometry 148 (2008), 47–58.
[22] M. Kaneko and S. Yamamoto, A new integral-series identity of multiple zeta values and regularizations, Sel. Math. New Ser. 24 (2018), 2499–2521.
[23] G. Kawashima, A class of relations among multiple zeta values, J. Number Theory 129 (2009), 755–788.
[24] T. Machide and T. Sonobe, Determination method influenced by SAT solver for the full rankness of a matrix (in Japanese) , The 32nd Annual Conference of the Japanese Society for Artificial Intelligence, 2018.
[25] J. P. Marques-Silva and K. A. Sakallah, GRASP – a new search algorithm for satisfiability, Proceedings of the 1996 IEEE/ACM international conference on Computer-aided design, 220–227, 1996.
[26] J. P. Marques-Silva and K. A. Sakallah, GRASP: a search algorithm for propositional satisfiability, IEEE Transactions on Computers 48 (1999), 506–521.
[27] H. N. Minh, G. Jacob, M. Petitot and N. E. Oussous, Aspects combinatoires des polylogarithmes et des sommes d’Euler-Zagier, J. Électr. Sém. Lothar. Combin. 43 (2000), Art. B43e, 29 pp.
[28] H. N. Minh and M. Petitot, A Lyndon words, polylogarithms and the Riemann $\zeta$ function, Discrete Math. 217 (2000), 273–292.
[29] G. Racinet, Doubles mélanges des polylogarithmes multiples aux racines de l’unité, Publ. Math. IHÉS. 95 (2002), 185–231.
[30] C. Reutenauer, Free Lie algebras, London Mathematical Society Monographs. New Series, 7. Oxford Science Publications. The Clarendon Press, Oxford University Press, New York, 1993.
[31] K. Tasaka, On linear relations among totally odd multiple zeta values related to period polynomials, Kyushu J. Math. 70 (2016), 1–28.
[32] T. Terasoma, Mixed Tate motives and multiple zeta values, Invent. Math. 149 (2002), 339–369.
[33] R. Umezawa, Evaluation of iterated log-sine integrals in terms of multiple polylogarithms, preprint (arXiv:1912.07201 [math.NT]), 2019.
[34] D. Zagier, Values of zeta functions and their applications, First European Congress of Mathematics, Vol. II (Paris, 1992), 497–512, Progr. Math., 120, Birkh $\ddot{\mathrm{a}}$ user, Basel. 1994.