This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Tangent complexes and the Diamond Lemma

Vladimir Dotsenko Institut de Recherche Mathématique Avancée, UMR 7501, Université de Strasbourg et CNRS, 7 rue René-Descartes, 67000 Strasbourg, France vdotsenko@unistra.fr  and  Pedro Tamaroff School of Mathematics, Trinity College, Dublin 2, Ireland pedro@maths.tcd.ie
Abstract.

The celebrated Diamond Lemma of Bergman gives an effectively verifiable criterion of uniqueness of normal forms for term rewriting in associative algebras. We present a new way to interpret and prove this result from the viewpoint of homotopical algebra. Our main result states that every multiplicative free resolution of an algebra with monomial relations gives rise to its own Diamond Lemma, so that Bergman’s condition of “resolvable ambiguities” becomes the first non-trivial component of the Maurer–Cartan equation in the corresponding tangent complex. The same approach works for many other algebraic structures, emphasizing the relevance of computing multiplicative free resolutions of algebras with monomial relations.

Key words and phrases:
deformation theory, Diamond Lemma, Gröbner basis, free resolution, rewriting system, tangent complex

Introduction

Context of our work

When studying algebras presented by generators and relations, the central general result is the statement known as the Diamond Lemma. Historically, one would say that it was already implicit in Shirshov’s work [86] who proved an analogous but technically more involved result, the Composition Lemma, in the case of Lie algebras. About a decade after that, this result was proved in the case of associative algebras by Bokut in [17], and independently by Bergman [11], who also proposed the name “Diamond Lemma” in order to emphasize the analogy with the classical result of Newman [68]. Paraphrasing the opening phrase of [11], the main results of our paper are doubly trivial: Bergman’s Diamond Lemma is, in his own words, trivial, and our main goal is to explain a new trivial proof of this trivial statement, as well as some other trivial statements, from the point of view of homotopical algebra. However, we believe that readers of this paper can benefit from it in a number of ways. For a reader whose intuition comes from homotopical algebra, our proof will hopefully feel like a conceptual explanation of useful but seemingly technical criteria of “resolvable ambiguities” for uniqueness of normal forms. For a reader with background in Gröbner bases or term rewriting, our proof will offer intuition behind both the Diamond Lemma and its optimisation, known as the Chain Criterion in the commutative case [21, 27, 48] and as the Triangle Lemma in the case of noncommutative associative algebras [20, 54, 85], as well as precise guidance on how to generalise those for other algebraic structures. Specifically, our work means that computing models of algebras with monomial relations explicitly helps both to state the relevant Diamond Lemmas and to optimise them.

From Gröbner bases to deformation theory and back

The idea of using Gröbner bases for computing homological invariants of associative algebras is well known. The seemingly earliest instance appears in the work of Priddy [70] who constructed resolutions for the ground field, viewed as the trivial module, for algebras presented by quadratic–linear relations; this was later generalised by Anick [3] to arbitrary presentations. Existence of such resolutions is not at all surprising for the following reason. Defining relations of an associative algebra AA form a Gröbner basis if the same monomials give a vector space basis for both algebras AA and AmonA_{\mathrm{mon}}; the latter is the algebra with monomial relations given by the leading monomials of the relations of AA with respect to a suitable ordering. The trivial module for AmonA_{\mathrm{mon}} admits a combinatorially defined free resolution, implicit in the work [4] of Backelin, and explicitly determined by Green, Happel and Zacharia in [32]. Such resolution can be obtained as a contraction of the bar resolution; incorporating the lower terms of relations from a Gröbner basis can obtained from such a contraction by homological perturbation techniques [10, 34, 35, 53]. In fact, the one-sided module resolution of Anick can be generalised to a bimodule resolution, as established by Bardzell [5, 6], leading to a computational method for determining the Hochschild (co)homology of an algebra, and thus for studying deformations of a given algebra. This construction works under a weaker assumption of a convergent rewriting system instead of a Gröbner basis, as shown by Chouhy and Solotar in [23].

Our main wish, motivated by interest in generalising these methods to algebraic structures other than associative algebras, is to go in the opposite direction and re-discover effective criteria for reduction systems with unique normal forms from the deformation theory viewpoint. Since in those situations the multiplication table of an algebra AA is obtained from the simple combinatorial multiplication table of AmonA_{\mathrm{mon}} by incorporating appropriate lower terms, we naturally find ourselves in the framework of deformation theory. In fact, very recently the bimodule resolutions mentioned above have been used to study deformations of algebras with monomial relations by Barmeier and Wang in their work on deformation theory of quiver algebras [7] and by Redondo and Rossi Bertone in [75], using the following idea. A free bimodule resolution

AmonCAmonAmonA_{\mathrm{mon}}\otimes C_{\bullet}\otimes A_{\mathrm{mon}}\simeq A_{\mathrm{mon}}

leads to the representative Hom(C,Amon)\operatorname{Hom}(C_{\bullet},A_{\mathrm{mon}}) of the deformation complex, and one can use a strong deformation retract relating it to Hochschild complex to compute explicitly its LL_{\infty}-algebra structure using the homotopy transfer theorem. Studying deformations amounts to studying solutions to the Maurer–Cartan equation in that LL_{\infty}-algebra, and one may obtain various results this way. However, if one adopts this viewpoint, the Diamond Lemma criterion of “resolvable ambiguities” cannot be recovered instantly: it involves a calculation in the free associative algebra which cannot be reproduced directly since the target space of the deformation complex is the monomial algebra AmonA_{\mathrm{mon}}. The two deformation complexes are homotopy equivalent, so the necessary result can be proved in principle. However, recovering the classical Diamond Lemma will require some translation, in the spirit of Anick’s slogan “the perturbative construction of the second differential of the resolution is precisely the procedure of resolving ambiguities”, see [3, Sec. 2]. We also refer the reader to a discussion of this phenomenon in the survey of Ufnarovski [85, Sec. 3.8] where one also finds the Triangle Lemma as a way to optimise the algorithm.

Deformations via tangent complexes

It turns out that a natural way to remedy the situation is to move from homology to homotopy, and study deformations of algebras in terms of homotopical algebra. This means working with multiplicative free resolutions, that is resolutions which are free as algebras rather than as bimodules. In this case, the homotopy class of the deformation complex of an algebra can be represented by its tangent complex, that is, the differential graded Lie algebra of derivations of its free resolution. Historically, this approach to deformations first emerged in the case of commutative associative algebras. In this context, it makes sense to note that some of the pioneering works both in the theory of Gröbner bases and in the deformation theory appeared in the algebro-geometric context: the former in the solution of the resolution of singularities problem in characteristic zero by Hironaka [39, 40], and the latter in Palamodov’s work on deformation of analytic spaces [69], where the tangent complex approach is attributed to unpublished work of Tyurina (who tragically died in a kayaking accident at the age of 32). However, the standard bases of Hironaka were just one of many tools in a paper of more than two hundred pages, and the power of this method in algebraic geometry and commutative algebra became apparent only after work of Buchberger [22] who highlighted their algorithmic nature. The tangent complex approach to deformation theory of algebras became widely known from the famous manuscript [78] by Schlessinger and Stasheff. We would like to also remark that a seemingly completely independent path to multiplicative resolutions which however stays away from deformation theory questions emerges in higher-categorical rewriting theory. Original work of Squier [52, 80] on homological finiteness conditions for monoids admitting a convergent presentation first received a higher-categorical flavour in works of Lafont [50] and Citterio [24]; corresponding multiplicative resolutions appear in work of Lafont and Métayer [51] and Guetta [33] relying on theory of resolutions of categories by polygraphs developed by Métayer [67]. This formalism was recently extended to the 𝕜\mathbbold{k}-linear context by Guiraud, Hoffbeck, and Malbos [37], who in particular explained how to construct a polygraphic resolution of an associative algebra by techniques similar to the perturbative construction of AA_{\infty}-structures [36] and free resolutions of algebras [43].

Structure of the paper

For us, it is crucial that working with free resolutions means that recovering Bergman’s criterion of resolvable ambiguities stands a chance, at least in principle, since our deformation complexes of algebras with monomial relations have free associative algebras as target spaces. Our approach blends ideas and methods from several different areas, and we tried to offer enough detail when recalling the necessary background information. Our first homotopical re-formulation of uniqueness of normal forms is a general criterion in terms of perturbations of free resolutions (Theorem 4.1). Using that result, we prove another general criterion in terms of approximate solutions to the Maurer–Cartan equation in the tangent complex (Theorem 5.1). Using this latter result, we establish the main result of this paper (Theorem 5.2): every model extending the Shafarevich complex of the given relations gives rise to its own Diamond Lemma, so that the resolvable ambiguities condition of the Diamond Lemma becomes the first non-trivial component of the Maurer–Cartan equation in the tangent complex of the model. Applying that latter result to some specific combinatorial models, we recover the classical Diamond Lemma in the case of the inclusion–exclusion model of the first author and Khoroshkin [25] and the Triangle Lemma in the case of the minimal model of the second author [82]. We discuss some analogues and possible generalisations of our results for other types of algebras towards the end of the article. Already in the case of operads with monomial relations, explicit formulas for minimal models are not known in general, and only a much weaker version of the Triangle Lemma is available [20]; we hope that our work will attract due attention to this question.

Precursors: Evgeny Solomonovich Golod and Victor Nikolaevich Latyshev

To conclude the introduction, we would like to mention two crucial sources of inspiration for our work. The first of them is the work of Golod who discovered a proof of the Diamond Lemma [29] using the non-commutative analogue of the Koszul complex, the Shafarevich complex associated to a system of elements in the free algebra originally introduced in [31]. That complex is a differential graded algebra whose homotopy type depends on the presentation of the original algebra. Our homotopy invariant free resolutions are obtained from the Shafarevich complex at the cost of adding extra generators of higher degrees; we believe that the benefit of the resulting clarity outweighs the cost. Our second inspiration comes from the work of Latyshev who used normal forms to resolve some particular cases [58, 59] of the celebrated Specht problem on identities of associative algebras [79], and advocated general importance of normal forms and standard bases [55, 56, 57]. Both Golod and Latyshev passed away relatively recently (in July 2018 and April 2020, respectively). We would like to dedicate this work to their memory.

Acknowledgements.

The authors thank Leonid Arkadievich Bokut, Eric Hoffbeck, Anton Khoroshkin and Muriel Livernet for useful discussions. The first author also thanks Dmitri Piontkovski who introduced him to Shafarevich complexes many years ago, and Emil Sköldberg for sharing some computations of minimal models of monomial operads in low homological degrees. The idea of this article emerged during the workshop “Homotopy meets homology” at Hamilton Mathematics Institute (Trinity College Dublin) in May 2019, and was primarily guided by our wish to unveil homotopical algebra behind the work [75] presented at that workshop by María Julia Redondo; we are grateful to the Science Foundation Ireland and the Simons Foundation for allocating funding that made that workshop possible. Part of this work was done during the second author’s visits to Strasbourg funded by IRMA (Institut de Recherche Mathématique Avancée), he is thankful for both the financial support of those visits and the excellent working conditions.

1. Conventions

Unless otherwise indicated, all objects in this paper are defined over an arbitrary ground field 𝕜\mathbbold{k}. By a graded vector space we mean a vector space VV of the form

V=nVn,V=\bigoplus_{n\in\mathbb{Z}}V_{n},

where we write |v|=n|v|=n for vVnv\in V_{n}, and refer to nn as the homological degree of an element vv. The adjective ‘homological’ means that these degrees create “Koszul signs” arising from exchanging factors in tensor products. Recall that one defines the tensor product of two graded vector spaces by the formula

(VW)n:=i+j=kViWj(V\otimes W)_{n}:=\bigoplus_{i+j=k}V_{i}\otimes W_{j}

summarised in words by “degrees add under tensor products”, and homological degrees enter the formula for the symmetry isomorphism

τV,W:VWWV\tau_{V,W}\colon V\otimes W\to W\otimes V

given by τ(vw)=(1)|v||w|wv\tau(v\otimes w)=(-1)^{|v||w|}w\otimes v. (This, for a trained eye, creates signs in a lot of places; for example, if one applies the tensor product of linear maps to a tensor product of two vectors, the formula (ϕψ)(vw)=(1)|ψ||v|ϕ(v)ψ(w)(\phi\otimes\psi)(v\otimes w)=(-1)^{|\psi||v|}\phi(v)\otimes\psi(w) has to be used.) Occasionally, our graded vector spaces will have extra gradings which do not create any extra signs in formulas; in such cases, we shall use the word “grade” for such degrees.

2. Term rewriting

In this section, we give a short introduction to term rewriting in the linear context. Mathematically, all of this goes back to [11]; terminologically, we choose to follow the recent literature on rewriting systems, see, for example, [37] and references therein.

2.1. Rewriting systems

Let us fix once and for all a finite set XX; we shall denote by X\langle X\rangle the free monoid generated by XX, so that the linear span 𝕜𝕏\mathbbold{k}\langle X\rangle is the free associative 𝕜\mathbbold{k}-algebra generated by XX. For an element g𝕜𝕏g\in\mathbbold{k}\langle X\rangle we shall denote by supp(g)\operatorname{supp}(g) the set of all elements of X\langle X\rangle that appear in gg with a non-zero coefficient.

Definition (Rewriting system).

A rewriting system on 𝕜𝕏\mathbbold{k}\langle X\rangle is a triple (X,W,f)(X,W,f), where WXW\subset\langle X\rangle, and ff is a function on WW with values in 𝕜𝕏\mathbbold{k}\langle X\rangle.

The right way to think of a rewriting system is as of a collection of rules that allow to rewrite each monomial wWw\in W into the element f(w)𝕜𝕏f(w)\in\mathbbold{k}\langle X\rangle. In this way, each rewriting system Σ\Sigma naturally gives rise to an associative algebra AΣA_{\Sigma} with relations R={wf(w):wW}R=\{w-f(w)\colon w\in W\}:

AΣ=𝕜𝕏.A_{\Sigma}=\mathbbold{k}\langle X\mid R\rangle.

Each rewriting rule of Σ\Sigma becomes a way to replace elements of 𝕜𝕏\mathbbold{k}\langle X\rangle by other representatives of the same coset in AΣA_{\Sigma}, as follows.

Definition (Reduction).

Suppose that wWw\in W, and a,bXa,b\in\langle X\rangle. A basic reduction associated to the triple (a,w,b)(a,w,b) is the 𝕜\mathbbold{k}-linear endomorphism ρ\rho of 𝕜𝕏\mathbbold{k}\langle X\rangle such that for each mXm\in\langle X\rangle we have

ρ(m)={af(w)b,if m=awb,motherwise. \rho(m)=\begin{cases}af(w)b,\ \ \text{if }m=awb,\\ \quad\ \ m\qquad\text{otherwise. }\end{cases}

A finite sequence of basic reductions is called a reduction; it defines a 𝕜\mathbbold{k}-linear endomorphism of 𝕜𝕏\mathbbold{k}\langle X\rangle by composing, in the given order, the basic reductions appearing in it.

Using reductions is intended to “simplify” representatives of cosets. This is formalised by the notion of a normal form.

Definition (Normal form).

An element g𝕜𝕏g\in\mathbbold{k}\langle X\rangle is called irreducible if ρ(g)=g\rho(g)=g for every basic reduction ρ\rho. A normal form of an element g𝕜𝕏g\in\mathbbold{k}\langle X\rangle is an irreducible element g¯\bar{g} such that the cosets of gg and g¯\bar{g} in AΣA_{\Sigma} are equal.

In principle, it is possible that one can perform basic reductions infinitely many times. We shall only work with rewriting systems for which it does not happen. To formalise that, we give the following definition.

Definition (Terminating rewriting system).

A pseudo-reduction is an infinite sequence

ρ=(ρ1,ρ2,ρ3,)\rho=(\rho_{1},\rho_{2},\rho_{3},\ldots)

of basic reductions. To every pseudo-reduction ρ\rho and every g𝕜𝕏g\in\mathbbold{k}\langle X\rangle we can associate the sequence

ρ(g)=(ρ1(g),ρ2ρ1(g),ρ3ρ2ρ1(g),)\rho(g)=(\rho_{1}(g),\rho_{2}\rho_{1}(g),\rho_{3}\rho_{2}\rho_{1}(g),\ldots)

of elements in 𝕜𝕏\mathbbold{k}\langle X\rangle. We say that an element g𝕜𝕏g\in\mathbbold{k}\langle X\rangle is reduction finite if for every pseudo-reduction ρ\rho, the associated sequence ρ(g)\rho(g) is eventually constant. A rewriting system Σ\Sigma is terminating if all elements g𝕜𝕏g\in\mathbbold{k}\langle X\rangle are reduction finite.

Note that if an element gg is reduction finite, there exists a reduction that sends this element to an irreducible one, for otherwise we could easily find a pseudo-reduction ρ\rho for which ρ(g)\rho(g) is not eventually constant. Thus, for a terminating rewriting system, each element has at least one normal form. All rewriting systems considered in this paper are assumed to be terminating. We shall now recall how to recast termination in terms of partial orderings.

Definition (Compatible rewriting ordering).

For a given rewriting system Σ\Sigma, we say that a partial order \leqslant on X\langle X\rangle is a rewriting ordering compatible with Σ\Sigma if the following conditions hold:

  1. (1)

    multiplicativity: if m<mm<m^{\prime} for two elements m,mm,m^{\prime}, then amb<ambamb<am^{\prime}b for any a,bXa,b\in\langle X\rangle;

  2. (2)

    well-ordering: every non-empty set of monomials has a minimal element;

  3. (3)

    compatibility with Σ\Sigma: for all wWw\in W and all wsupp(f(w))w^{\prime}\in\operatorname{supp}(f(w)), we have w<ww^{\prime}<w.

Among rewriting orderings compatible with Σ\Sigma (if they exist at all), there exists the weakest possible one obtained as follows. We first define <Σ<_{\Sigma} as the smallest transitive binary relation that is multiplicative and compatible with Σ\Sigma. If <Σ<_{\Sigma} is a well-ordering, the reflexive closure Σ\leqslant_{\Sigma} is a rewriting ordering compatible with Σ\Sigma, and each other rewriting ordering compatible with Σ\Sigma obviously refines the ordering Σ\leqslant_{\Sigma}. According to [11, 23], the rewriting system Σ\Sigma is terminating if and only if <Σ<_{\Sigma} is a well-ordering.

For a terminating rewriting system, the associative algebra

AΣ0=𝕜𝕏𝕎A_{\Sigma_{0}}=\mathbbold{k}\langle X\mid W\rangle

obtained from the rewriting system Σ0=(W,0)\Sigma_{0}=(W,0) and called the monomial algebra associated to Σ\Sigma is a relevant object of study. Indeed, an element g𝕜𝕏g\in\mathbbold{k}\langle X\rangle is irreducible if and only if no monomial msupp(g)m\in\operatorname{supp}(g) is divisible by elements of WW, so irreducible monomials form a basis of the algebra AΣ0A_{\Sigma_{0}}. However, since the same element may have several different normal forms, the cosets of irreducible monomials might not be linearly independent in the algebra AΣA_{\Sigma}.

Definition (Convergent rewriting system).

A (terminating) rewriting system is convergent if each element g𝕜𝕏g\in\mathbbold{k}\langle X\rangle has exactly one normal form.

Among the rewriting systems, the best known class is given by Gröbner bases: those are rewriting systems where there is a total order \leqslant on X\langle X\rangle satisfying the above three conditions. Let us recall an example [20, 37] showing that sometimes using a partial order is really advantageous. For that, we consider the algebra A=𝕜𝕩,𝕪,𝕫𝕩𝟛+𝕪𝟛+𝕫𝟛𝕩𝕪𝕫A=\mathbbold{k}\langle x,y,z\mid x^{3}+y^{3}+z^{3}-xyz\rangle. Note that for a total multiplicative order, we always have xyz<max(x3,y3,z3)xyz<\max(x^{3},y^{3},z^{3}), and one can check that for such an order the corresponding rewriting system is not convergent, so one needs to extend it by further rewriting rules. However, one can consider the rewriting system Σ=(X,W,f)\Sigma=(X,W,f) with X={x,y,z}X=\{x,y,z\}, W={xyz}W=\{xyz\} and f(xyz)=x3+y3+z3f(xyz)=x^{3}+y^{3}+z^{3} which does not arise from a total multiplicative order. Let us explain why this rewriting system is terminating. For that, we define, for each monomial mm, Φ(m)=3A(m)+B(m)\Phi(m)=3A(m)+B(m), where A(m)A(m) is the number of divisors of mm equal to xyzxyz and B(m)B(m) is the number of divisors of mm equal to yy. Then one can check that any application of our rewriting rule replaces a monomial by a linear combination of terms for which the parameter Φ\Phi is strictly smaller, which guarantees the termination. Later we shall see that this rewriting system is convergent, showing that in this case there is an advantage over the theory of Gröbner bases.

Bergman’s Diamond Lemma [11] establishes a criterion of convergence in terms of the so called “resolvable ambiguities”. If this were a paper on rewriting systems, at this point we would formulate and prove that result. However, our goal is to explain how such criterion can be found, so that one can determine its analogues for other algebraic structures. It is easy to see that a rewriting system Σ\Sigma is convergent if and only if the corresponding irreducible monomials are linearly independent. In other words, Σ\Sigma is convergent if and only if the linear map AΣ0AΣA_{{\Sigma_{0}}}\to A_{\Sigma} sending each element of the standard monomial basis of AΣ0A_{{\Sigma_{0}}} to its coset in AΣA_{\Sigma} is an isomorphism of vector spaces. This suggests that convergent rewriting systems may be studied from the viewpoint of deformation theory of associative algebras. In general, the notion of a deformation implies that we deform the structure by adding lower terms with respect to a suitable filtration. For example, for the classical deformation theory set-up [65] defined over an Artinian local ring containing 𝕜\mathbbold{k}, the filtration may be defined by powers of the maximal ideal 𝔪\mathfrak{m} of that ring. In our case, the filtration is defined out of the partial ordering <Σ<_{\Sigma}, or, better to say, out of any order that extends it to a total well-ordering. The multiplication table in the algebra AΣ0A_{{\Sigma_{0}}} is very simple: the product of two basis elements is either a basis element or zero. For a convergent rewriting system, the product of two basis elements in the algebra AΣA_{{\Sigma}} is either a basis element or the linear combination of smaller basis elements obtained by term rewriting. Thus, saying that Σ\Sigma is convergent is equivalent to saying that the algebra AΣA_{\Sigma} is a deformation of the algebra AΣ0A_{\Sigma_{0}} with respect to the corresponding filtration. This suggests that using deformation theory of algebras might shed light on convergence of rewriting systems. The main part of this paper uses this viewpoint extensively.

3. Homotopy theory of associative algebras

Deformation theory of algebras borrows a lot of intuition from algebraic topology: one may say that algebraic tools for studying continuous deformations of spaces are very much amenable to the case of deformations of algebras. In algebraic topology, an important invariant of a space is its cohomology algebra. However, that algebra itself does not capture many important homotopy invariants. It turns out that to remedy that one should either consider cohomology together with certain higher structures, or work with a bigger algebra of the same homotopy type, for example the differential graded algebra of singular cochains. This led Quillen to the general philosophy of homotopical algebra [73] suggesting that one should extend categories of algebras to better behaved “model categories” with a notion of weak equivalence, an abstraction of homotopy equivalence, and consider the homotopy category formed by equivalence classes. Alternatively, one may say that instead of studying an algebra it is often beneficial to study its model, which is a better behaved algebra of the same homotopy type in the model category. This section gives a recollection of the corresponding definitions to make the rest of the paper more readable to experts in Gröbner bases and rewriting.

3.1. Differential graded associative algebras

A sufficiently general recipe to define model categories is to consider simplicial algebras of the given type. However, in the case of associative algebras, it turns out to be possible to do homotopy theory in a slightly more hands-on way, using differential graded algebras.

Definition (Differential graded associative algebra).

A differential graded associative algebra, is a pair (A,d)(A,d) where AA is a graded associative algebra, that is, a graded vector space equipped with the associative degree zero map AAAA\otimes A\to A, and d:AAd\colon A\to A is a degree 1-1 map satisfying the condition d2=0d^{2}=0 as well as the graded derivation property d(a1a2)=d(a1)a2+(1)|a1|a1d(a2)d(a_{1}a_{2})=d(a_{1})a_{2}+(-1)^{|a_{1}|}a_{1}d(a_{2}).

We denote by 𝖣𝖦𝖠\operatorname{\mathsf{DGA}} the category whose objects are differential graded associative algebras and whose morphisms from (A,dA)(A,d_{A}) to (B,dB)(B,d_{B}) are (degree zero) algebra homomorphisms f:ABf\colon A\to B that are chain maps, that is satisfy fdA=dBff\circ d_{A}=d_{B}\circ f. Such a morphism ff induces a morphism f:H(A,dA)H(B,dB)f_{\bullet}\colon H_{\bullet}(A,d_{A})\to H_{\bullet}(B,d_{B}) of graded associative algebras obtained by computing homology. We say that ff is a quasi-isomorphism if the induced homology morphism ff_{\bullet} is an isomorphism.

3.2. Models of algebras

As indicated above, our general plan is to replace an associative algebra with a better behaved differential graded algebra of the same homotopy type. A mathematically precise meaning of the words “better behaved” is given by the notion of a cofibrant object in a closed model category. A motivated reader is invited to consult [41, 73] for foundations of the general theory of model categories. According to Hinich [38, Th. 4.1.1], the category 𝖣𝖦𝖠\operatorname{\mathsf{DGA}} admits a closed model category structure for which weak equivalences are quasi-isomorphisms, and the fibrations are surjections. In fact, Hinich works in the general framework of algebras over operads, and in the only case relevant for this paper, that of differential graded algebras concentrated in non-negative homological degrees, the same result is established by Jardine [45]. To describe cofibrant objects, we need one more definition.

Definition (Triangulated quasi-free algebra).

A quasi-free algebra is a differential graded algebra (B,dB)(B,d_{B}) for which the underlying graded associative algebra is free, that is BT(V)B\cong T(V) for some graded vector space VV. Such algebra is said to be triangulated if its space of generators admits an extra decomposition into a direct sum (of graded vector spaces)

Vj1V(j),V\cong\bigoplus_{j\geq 1}V^{(j)},

for which the differential of an element vV(j+1)v\in V^{(j+1)} belongs to the subalgebra generated by 1kjV(k)\bigoplus_{1\leq k\leq j}V^{(k)} for all j0j\geq 0.

The cofibrant objects for Hinich’s model category structure on 𝖣𝖦𝖠\operatorname{\mathsf{DGA}} are exactly the retracts of triangulated quasi-free algebras. We remark that it is useful to think that triangulated algebras are an abstraction of the construction of “killing cycles” of Tate [46, 83]. The relevance of triangulated differential graded commutative algebras in homotopy theory became apparent after the seminal paper of Sullivan [81]; for this reason, the are frequently called Sullivan algebras.

Definition (Model of an algebra).

A model of a differential graded algebra (A,dA)(A,d_{A}) is a triangulated quasi-free algebra (T(V),d)(T(V),d) equipped with a surjective quasi-isomorphism ff to (A,dA)(A,d_{A}). Such a model is said to be minimal if its differential is decomposable, that is for each vVv\in V, the element d(v)d(v) is a combination of products of length at least two in the tensor algebra.

While minimal models of algebras are unique up to isomorphism, their existence is a much more subtle matter; see, for example, the unpublished note of Keller [47]. However, the target algebras (A,dA)(A,d_{A}) of interest for us are in fact non-graded and non-differential, in other words, An=0A_{n}=0 for n0n\neq 0 and dA=0d_{A}=0. Since we work over a field, minimal models for such algebras exist under very mild assumptions (see, for example, [8, 43]) which hold in all cases that are of interest to us.

3.3. Shafarevich complexes

We shall now recall the notion of a Shafarevich complex of a system of elements in an algebra. Those complexes are very rarely models themselves, but they offer a good starting point for constructing a model, which will be one of the guiding principles for our main results.

Definition (Shafarevich complex).

Let AA be an associative (non-differential non-graded) algebra. The Shafarevich complex of a subset SAS\subset A is the differential graded algebra

𝕏(S,A)=(AT(U),d).\Sha(S,A)=(A*T(U),d).

Here AT(U)A*T(U) is the free product of AA and the tensor algebra, UU is the graded vector space whose only non-zero component is U1𝕜𝕊U_{1}\cong\mathbbold{k}S, and dd is the unique graded derivation satisfying d(a)=0d(a)=0 for aAa\in A and d(es)=sAd(e_{s})=s\in A for each basis element sU1s\in U_{1}.

It is an immediate consequence of the definition that the degree zero homology of 𝕏(S,A)\Sha(S,A) is isomorphic to the quotient A/(S)\cong A/(S). In this paper, we shall only consider the Shafarevich complexes 𝕏(S,𝕜𝕏)\Sha(S,\mathbbold{k}\langle X\rangle) for the free algebra A=𝕜𝕏A=\mathbbold{k}\langle X\rangle; it is precisely those that were first defined in work of Golod and Shafarevich [31] and later called the Shafarevich complexes by Golod in [29]. In this particular case, the degree zero homology is the algebra 𝕜𝕏𝕊\mathbbold{k}\langle X\mid S\rangle. Of course, 𝕏(S,𝕜𝕏)=(𝕋(𝕍),𝕕)\Sha(S,\mathbbold{k}\langle X\rangle)=(T(V),d) where VV is the graded vector space whose only non-zero components are V0=𝕜𝕏V_{0}=\mathbbold{k}X and V1𝕜𝕊V_{1}\cong\mathbbold{k}S, and dd is the unique graded derivation satisfying d(x)=0d(x)=0 for xXx\in X and d(es)=s𝕜𝕏𝕋(𝕍𝟘)d(e_{s})=s\in\mathbbold{k}\langle X\rangle\cong T(V_{0}) for each basis element sV1s\in V_{1}. Thus, 𝕏(S,𝕜𝕏)\Sha(S,\mathbbold{k}\langle X\rangle) is a triangulated quasi-free algebra whose space of generators is concentrated in homological degrees zero and one corresponding to killing certain cycles in T(V0)T(V_{0}). In that capacity, it was rediscovered by Lemaire [60], and used by Anick in his work on Hilbert series of associative algebras [1, 2].

In general, the positive degree homology of 𝕏(S,𝕜𝕏)\Sha(S,\mathbbold{k}\langle X\rangle) does not vanish, so that differential graded algebra is not a model of 𝕜𝕏𝕊\mathbbold{k}\langle X\mid S\rangle. In fact, it is known that the homology of 𝕏(S,𝕜𝕏)\Sha(S,\mathbbold{k}\langle X\rangle) is generated by elements of homological degrees zero and one [30], but this remarkable result will not be immediately useful for us. We however will be mainly interested in models that extend Shafarevich complexes.

Definition (Acyclic extension of the Shafarevich complex).

Let A𝕜𝕏𝕊A\cong\mathbbold{k}\langle X\mid S\rangle be an associative algebra presented by generators and relations. We say that a model of the algebra AA is an acyclic extension of the Shafarevich complex 𝕏(S,𝕜𝕏)\Sha(S,\mathbbold{k}\langle X\rangle) if it is concentrated in non-negative homological degrees, and in homological degrees zero and one is isomorphic to the Shafarevich complex 𝕏(S,𝕜𝕏)\Sha(S,\mathbbold{k}\langle X\rangle).

4. Rewriting systems and perturbation of models

Since any model of an associative algebra is quasi-free, it is not particularly interesting as an algebra: all the complexity is hidden in the differential. Thus, if we replace an algebra by a model, it is reasonable to expect that studying deformation theory is done via studying perturbations of the differential. The main result of this section confirms this expectation: essentially, it says that once we fix a filtration, deforming a monomial algebra with respect to that filtration is equivalent to perturbing the differential of its model.

Definition (Word-homogeneous algebra).

Let us call a differential associative graded algebra (B,d)(B,d) word-homogeneous if it has an extra grading by the free monoid X\langle X\rangle, that is we have a decomposition into a direct sum of chain subcomplexes

(B,d)=uX(Bu,d),(B,d)=\bigoplus_{u\in\langle X\rangle}(B_{u},d),

such that Bu1Bu2Bu1u2B_{u_{1}}B_{u_{2}}\subseteq B_{u_{1}u_{2}} for all u1,u2Xu_{1},u_{2}\in\langle X\rangle. For an element bBub\in B_{u}, we say the element bb is word-homogeneous of grade uu.

As above, let us consider a rewriting system Σ=(X,W,f)\Sigma=(X,W,f). Clearly, the non-differential non-graded algebra (AΣ0,0)(A_{\Sigma_{0}},0) is word-homogeneous. It is easy to establish that its minimal model is word-homogeneous; while we shall consider models that are not necessarily non-minimal, we shall restrict ourselves to word-homogeneous models when considering monomial algebras. Moreover, since the algebra AΣ0A_{\Sigma_{0}} is concentrated in homological degree zero, it is enough to work with word-homogeneous models concentrated in non-negative homological degrees. For such a model (T(V),d)(T(V),d), the differential graded subalgebra (T(V0),0)(T(V_{0}),0) surjects onto AΣ0A_{\Sigma_{0}}; in other words, the image of the vector space V0V_{0} generates the algebra AΣ0A_{\Sigma_{0}}. In the context of studying the rewriting system Σ\Sigma, we are working with a distinguished set of generators XX, so it makes sense to focus on models with V0𝕜𝕏V_{0}\cong\mathbbold{k}X. Moreover, since H(T(V),d)AΣ0H_{\bullet}(T(V),d)\cong A_{\Sigma_{0}}, the image of V1V_{1} under dd generates the two-sided ideal of T(V0)𝕜𝕏T(V_{0})\cong\mathbbold{k}\langle X\rangle generated by WW; it makes sense to focus on models with V1𝕜𝕎V_{1}\cong\mathbbold{k}W. Using the terminology we recalled earlier, this means that we work with word-homogeneous acyclic extensions of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle). Finally, we make one slightly less trivial assumption: similar to many situations in rational homotopy theory, we shall consider models of finite type, that is require that dimVn<\dim V_{n}<\infty for all n0n\geq 0. This is true for the minimal model of a monomial algebra with finitely many relations, as established by the second author in [82], and for some other models of interest.

The following result is a mild generalisation of [82, Th. 5.1], which mimics [25, Th. 4.1], and is an adaptation to our case of the argument of Anick [3, Th. 1.4]. We say that a linear endomorphism of a word-homogeneous algebra is Σ\Sigma-filtered if it sends every element of grade uXu\in\langle X\rangle to a combination of elements of grades strictly less than uu with respect to the ordering <Σ<_{\Sigma}.

Theorem 4.1.

Suppose that (T(V),d)(T(V),d) is a word-homogeneous acyclic extension of finite type of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle). The rewriting system Σ\Sigma is convergent if and only if there exists an acyclic extension (T(V),d+d)(T(V),d+d^{\prime}) of the Shafarevich complex 𝕏(R,𝕜𝕏)\Sha(R,\mathbbold{k}\langle X\rangle) with a Σ\Sigma-filtered perturbation dd^{\prime}.

Note that being an acyclic extension of the Shafarevich complex 𝕏(R,𝕜𝕏)\Sha(R,\mathbbold{k}\langle X\rangle) means in particular that d|V0=0d^{\prime}|_{V_{0}}=0 and that for each basis element ewV1e_{w}\subset V_{1}, we have

d(ew)=f(w),d^{\prime}(e_{w})=-f(w),

so dd^{\prime} is indeed Σ\Sigma-filtered on elements of degree one.

Proof.

Suppose first that such a model exists. Extend the partial ordering <Σ<_{\Sigma} to a total well ordering arbitrarily, and consider the filtration of the graded vector space T(V)T(V) associated to that ordering. Since we are dealing with a countable well-order that is not necessarily isomorphic to \mathbb{N}, one has to be careful, and either consider a sequence of spectral sequences, or consider generalised spectral sequences of countably filtered modules, such as the transfinite spectral sequences of Hu [42] or spectral sequences of transfinitely filtered modules of Rahmati [74]; either strategy works in our case, and allows one to construct a spectral sequence converging to H(T(V),d+d)H_{\bullet}(T(V),d+d^{\prime}) with the zeroth term E0E^{0} given by (T(V),d)(T(V),d). This forces our spectral sequence to collapse on the following page, since the homology H(T(V),d)H_{\bullet}(T(V),d) is concentrated in homological degree zero. Examining the individual word-homogeneous components, we conclude that cosets of the monomial basis of AΣ0A_{\Sigma_{0}} form a basis of the vector space AΣA_{\Sigma}, and therefore Σ\Sigma is convergent.

Suppose now that Σ\Sigma is convergent. If we ignore the differential, there exist surjective homomorphisms from the algebra T(V)T(V) to both the monomial algebra AΣ0A_{\Sigma_{0}} and the algebra AΣA_{\Sigma}: one may project it onto its part T(V0)𝕜𝕏T(V_{0})\cong\mathbbold{k}\langle X\rangle of homological degree 0, and the latter admits obvious projection maps to its respective quotients. Let us choose splittings for those projections; this amounts to exhibiting two idempotent endomorphisms π¯\bar{\pi} and π\pi of T(V)T(V) such that both of them annihilate all elements of positive homological degree, the former annihilates the ideal of T(V0)𝕜𝕏T(V_{0})\cong\mathbbold{k}\langle X\rangle generated by WW, and the latter annihilates the ideal of T(V0)𝕜𝕏T(V_{0})\cong\mathbbold{k}\langle X\rangle generated by RR. Since the model T(V)T(V) is of finite type, there exists a word-homogeneous map h:T(V)T(V)h\colon T(V)\longrightarrow T(V) such that [d,h]=1π¯[d,h]=1-\bar{\pi}.

We are going to define a derivation d:T(V)T(V)d^{\prime}\colon T(V)\longrightarrow T(V) of homological degree 1-1 satisfying the requested conditions; since dd^{\prime} is a derivation, it is enough to define it on the generators. We shall also define a map

h:ker(d+d)T(V)h^{\prime}\colon\ker(d+d^{\prime})\to T(V)

of homological degree 11 such that

(d+d)(h+h)|ker(d+d)=1π.\left.(d+d^{\prime})(h+h^{\prime})\right|_{\ker(d+d^{\prime})}=1-\pi.

This latter condition instantly implies that (T(V),d+d)(T(V),d+d^{\prime}) is a model of AΣA_{\Sigma}.

We shall construct the maps and prove their properties inductively. More specifically, we shall prove by induction on k0k\geq 0 that one can define the values of the perturbation dd^{\prime} on generators of homological degree k+1k+1 and the values of hh^{\prime} on elements of ker(d+d)\ker(d+d^{\prime}) of homological degree kk so that the following conditions hold:

perturbation: both maps dd^{\prime} (on generators of homological degree k+1k+1) and hh^{\prime} (on elements of ker(d+d)\ker(d+d^{\prime}) of homological degree kk) are Σ\Sigma-filtered,

square-zero: we have (d+d)2=0(d+d^{\prime})^{2}=0 on generators of homological degree k+1k+1,

homotopy: we have (d+d)(h+h)=1π(d+d^{\prime})(h+h^{\prime})=1-\pi on elements of ker(d+d)\ker(d+d^{\prime}) of homological degree kk; of course, according to the definition of the map π\pi, this condition reads (d+d)(h+h)=1(d+d^{\prime})(h+h^{\prime})=1 on elements of positive homological degree.

As a basis of induction, we recall that the formula d(ew)=f(w)d^{\prime}(e_{w})=-f(w) indicated in the statement of the theorem is Σ\Sigma-filtered, and the square-zero condition is satisfied for degree reasons, as there are no elements of negative homological degree. Suppose that xx is a degree zero element of grade uu. We define

h(x)={0, if x(W),(h+h)((1π)(x)(d+d)h(x)), if x(W).h^{\prime}(x)=\begin{cases}\qquad\qquad\qquad\qquad 0,\ \qquad\quad\qquad\qquad\text{ if }x\notin(W),\\ -(h+h^{\prime})((1-\pi)(x)-(d+d^{\prime})h(x)),\text{ if }x\in(W).\end{cases}

Since the map hh is word-homogeneous and since we already know that dd^{\prime} is Σ\Sigma-filtered on elements of homological degree 11, the map dhd^{\prime}h is Σ\Sigma-filtered. Also, the map

(1π)dh=(1π)(1π¯)=(π¯π)(1-\pi)-dh=(1-\pi)-(1-\bar{\pi})=(\bar{\pi}-\pi)

is Σ\Sigma-filtered simply by definition of <Σ<_{\Sigma}. By induction with respect to the well-ordering <Σ<_{\Sigma}, we may assume the Σ\Sigma-filtered condition for the map hh^{\prime} evaluated on sum of those elements, and the perturbation condition for hh^{\prime} evaluated on xx follows. Finally, to show the homotopy property evaluated on elements of homological degree zero, we note that on elements x(W)x\notin(W) this becomes 0=00=0, and otherwise we have

(d+d)(h+h)(x)=(d+d)h(x)(d+d)(h+h)((1π)(x)(d+d)h(x))(d+d^{\prime})(h+h^{\prime})(x)=(d+d^{\prime})h(x)-(d+d^{\prime})(h+h^{\prime})((1-\pi)(x)-(d+d^{\prime})h(x))

which, by induction on the well-ordering <Σ<_{\Sigma}, is equal to

(d+d)h(x)(1π)((1π)(x)(d+d)h(x))=(1π)2(x)(1π)(d+d)h(x)=(1π)(x),(d+d^{\prime})h(x)-(1-\pi)((1-\pi)(x)-(d+d^{\prime})h(x))=\\ (1-\pi)^{2}(x)-(1-\pi)(d+d^{\prime})h(x)=(1-\pi)(x),

since π\pi vanishes on the image of d+d=(R)d+d^{\prime}=(R) and 1π1-\pi is a projector.

To make the step of induction, we proceed in a similar way. To define d(x)d^{\prime}(x) for a word-homogeneous generator xx of homological degree k+1>1k+1>1, we put

d(x)=(h+h)(d+d)d(x).d^{\prime}(x)=-(h+h^{\prime})(d+d^{\prime})d(x).

The Σ\Sigma-filtered property for dd^{\prime} now easily follows by induction. For the square-zero property, we note that

(d+d)2(x)=(d+d)(d(x)(h+h)(d+d)d(x))=(d+d)d(x)(d+d)(h+h)((d+d)d(x))=(d+d)d(x)(1π)(d+d)d(x)=π((d+d)d(x))=0,(d+d^{\prime})^{2}(x)=(d+d^{\prime})(d(x)-(h+h^{\prime})(d+d^{\prime})d(x))=\\ (d+d^{\prime})d(x)-(d+d^{\prime})(h+h^{\prime})((d+d^{\prime})d(x))=\\ (d+d^{\prime})d(x)-(1-\pi)(d+d^{\prime})d(x)=\pi((d+d^{\prime})d(x))=0,

since (d+d)d(x)ker(d+d)(d+d^{\prime})d(x)\in\ker(d+d^{\prime}), and since π\pi vanishes on the image of d+dd+d^{\prime}. Suppose that xx is word-homogeneous of grade uu. Note that whenever xker(d+d)x\in\ker(d+d^{\prime}), we have x(d+d)h(x)ker(d+d)x-(d+d^{\prime})h(x)\in\ker(d+d^{\prime}) as well. Since π¯\bar{\pi} vanishes on elements of positive homological degree, we have dh(x)=(1π¯)(x)=xdh(x)=(1-\bar{\pi})(x)=x, and so, using the perturbation condition for dd^{\prime}, we see that x(d+d)h(x)x-(d+d^{\prime})h(x) is a combination of elements of grades strictly less than uu with respect to the ordering <Σ<_{\Sigma}. Consequently, we may define hh^{\prime} on elements of ker(d+d)\ker(d+d^{\prime}) of homological degree k>0k>0 by the same inductive argument: we put

h(x)=(h+h)(x(d+d)h(x)).h^{\prime}(x)=(h+h^{\prime})(x-(d+d^{\prime})h(x)).

Once again, a simple inductive argument shows that the Σ\Sigma-filtered property for the map hh^{\prime} and the homotopy condition are satisfied.

We conclude that (T(V),d+d)(T(V),d+d^{\prime}) a differential graded algebra, with the homology H(T(V),d+d)H_{\bullet}(T(V),d+d^{\prime}) isomorphic to AΣA_{\Sigma}, and a Σ\Sigma-filtered map dd^{\prime}, as required. It remains to check that it is triangulated. For that, we first decompose the space of generators VV according to the homological degree, and then for the fixed homological degree nn, decompose VnV_{n} into word-homogeneous components. Since we work with a model of finite type, there are finitely many word-homogeneous components in each homological degree, and so once we linearly order them extending the ordering <Σ<_{\Sigma}, the total order on thus obtained components is either finite or isomorphic to \mathbb{N}. This total order gives a triangulation: by construction, the differential of each component is made of products of elements from the previous components. This completes the proof of the theorem. ∎

5. Differential graded Lie algebras and the Diamond Lemma

In this section, we present a deformation-theoretic version of the Diamond Lemma. This corresponds to the heuristics of deformation theory going back to Deligne, Drinfeld, and Feigin (and later formalised by Lurie [63, 64] and Pridham [71]) that, infinitesimally, any moduli space of deformations in characteristic zero is controlled by a Maurer–Cartan equation in a suitable differential graded Lie algebra. We remark that while using this heuristics requires to restrict ourselves to a ground field 𝕜\mathbbold{k} of characteristic zero, one can easily see that replacing, in Theorem 5.1, the Maurer–Cartan equation [d+F,d+F]=0[d+F,d+F]=0 by the square-zero condition (d+F)2=0(d+F)^{2}=0 leads to a result that is valid without any assumption on the ground field.

5.1. Differential graded Lie algebras

We begin with a brief recollection of differential graded Lie algebras and their Maurer–Cartan elements.

Definition (Differential graded Lie algebra).

A differential graded Lie algebra is a pair (L,d)(L,d), where LL is a graded Lie algebra, that is a graded vector space equipped with a degree zero operation LLLL\otimes L\to L, ab[a,b]a\otimes b\mapsto[a,b], called the Lie bracket, which satisfies the graded skew-symmetry property [a,b]=(1)|a||b|[b,a][a,b]=-(-1)^{|a||b|}[b,a] for homogeneous elements a,bLa,b\in L and the graded Jacobi identity

[a,[b,c]]=[[a,b],c]+(1)|a||b|[b,[a,c]][a,[b,c]]=[[a,b],c]+(-1)^{|a||b|}[b,[a,c]]

for homogeneous elements a,b,cLa,b,c\in L, and d:LLd\colon L\to L is a degree 1-1 map satisfying the condition d2=0d^{2}=0 as well as the graded derivation property

d[a,b]=[d(a),b]+(1)|a|[a,d(b)].d[a,b]=[d(a),b]+(-1)^{|a|}[a,d(b)].

From the homotopical algebra point of view, one of the protagonist in the homotopy theory for differential graded Lie algebras is the set of Maurer–Cartan elements of such algebra.

Definition.

Let (L,d)(L,d) be a differential graded Lie algebra over a field 𝕜\mathbbold{k} of characteristic different from two. The condition

d(x)+12[x,x]=0d(x)+\frac{1}{2}[x,x]=0

is called the Maurer–Cartan equation of LL, and any solution xL1x\in L_{-1} is called a Maurer–Cartan element.

Let LL be a graded Lie algebra. Note that for an element xLx\in L of degree 1-1 and any aLa\in L, the Jacobi identity implies

[x,[x,a]]=[[x,x],a]+(1)|x||x|[x,[x,a]]=[[x,x],a][x,[x,a]],[x,[x,a]]=[[x,x],a]+(-1)^{|x||x|}[x,[x,a]]=[[x,x],a]-[x,[x,a]],

and therefore

[x,[x,a]]=12[[x,x],a].[x,[x,a]]=\frac{1}{2}[[x,x],a].

Thus, if we suppose that xx satisfies the condition [x,x]=0[x,x]=0 (note that for elements of odd degree the graded antisymmetry property is actually symmetry, so this condition is non-empty), the degree 1-1 endomorphism [x,][x,-] squares to zero; the Jacobi identity implies that it also satisfies the derivation property. Thus, (L,[x,])(L,[x,-]) is a differential graded Lie algebra.

5.2. Tangent complexes and deformations

Our Lie algebras of interest arise as Lie algebras of derivations of models of associative algebras. Suppose that AA is an associative algebra, and that (B,d)(B,d) is a model of AA. The graded vector space Der(B)\operatorname{Der}(B) of all derivations of the graded associative algebra BB has a natural structure of a graded Lie algebra: the graded bracket

[D1,D2]=D1D2(1)|D1||D2|D2D1[D_{1},D_{2}]=D_{1}\circ D_{2}-(-1)^{|D_{1}||D_{2}|}D_{2}\circ D_{1}

of two derivations is again a derivation. Clearly, dDer(B)d\in\operatorname{Der}(B) and

[d,d]=dd(1)|d||d|dd=2d2=0;[d,d]=d\circ d-(-1)^{|d||d|}d\circ d=2d^{2}=0;

as we saw above, the map [d,]:Der(B)Der(B)[d,-]\colon\operatorname{Der}(B)\to\operatorname{Der}(B) makes Der(B)\operatorname{Der}(B) a differential graded Lie algebra. We note that in the previous section, dd denoted the differential of the Lie algebra, and now our context forces us to consider differential graded Lie algebras whose differentials are of the form [d,][d,-] with dd being an element of the algebra, the differential of the model; we hope that this does not lead to a confusion. This algebra is called the tangent complex associated to the model BB. Using the work [77] of Schlessinger and Stasheff as an inspiration, Hinich showed in [38] that the homotopy type of this Lie algebra does not depend on the choice of a model.

In our context, the Maurer–Cartan heuristics of deformation theory acquires very precise meaning. Since the differential of the tangent complex of a model is of the form [d,][d,-], the Maurer–Cartan equation in that differential graded Lie algebra is equivalent to the equation [d+x,d+x]=0[d+x,d+x]=0, which in turn is equivalent to the square-zero condition for the derivation d+xd+x. Thus, the Maurer–Cartan equation for the differential graded Lie algebra (Der(B),[d,])(\operatorname{Der}(B),[d,-]) describes perturbations of the differential of BB.

5.3. Perturbing solutions to the Maurer–Cartan equation

In this section, we shall prove what is essentially the Diamond Lemma in disguise. It claims that having the Maurer–Cartan equation satisfied on generators of homological degree at most two of a model is sufficient to ensure existence of a perturbation. As above, we consider a rewriting system Σ=(X,W,f)\Sigma=(X,W,f).

Theorem 5.1.

Suppose that (T(V),d)(T(V),d) is a word-homogeneous acyclic extension of finite type of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle). The rewriting system Σ\Sigma is convergent if and only if there exists an degree 1-1 element FDer(T(V))F\in\operatorname{Der}(T(V)) such that

  • for each ewV1e_{w}\in V_{1}, we have F(ew)=f(w)F(e_{w})=-f(w),

  • F|V2F|_{V_{2}} is Σ\Sigma-filtered,

  • the Maurer–Cartan equation for the element FF holds when evaluated on generators of homological degree at most two.

Proof.

In view of Theorem 4.1, we have to prove that finding a perturbation that only satisfies the Maurer–Cartan equation on generators of homological degree at most two is enough to ensure that there exists a perturbation that is a Maurer–Cartan element. We shall show by induction on kk that we can find a Σ\Sigma-filtered derivation FF for which the Maurer–Cartan equation holds on generators of homological degree at most kk, the basis of induction being k=2k=2.

Note that [d+F,d+F][d+F,d+F] is the Lie bracket of derivations, thus itself a derivation, so since it vanishes on generators of homological degree at most kk, it vanishes on all elements of T(V)T(V) of homological degree at most kk. Let us consider a word-homogeneous element cVk+1c\in V_{k+1}. We note that the element

(c)=((F)+12[F,F])(c)=d(F(c))+F(d(c))+F(F(c))T(V)\nabla(c)=\left(\partial(F)+\frac{1}{2}[F,F]\right)(c)=d(F(c))+F(d(c))+F(F(c))\in T(V)

is in the kernel of d+Fd+F. Indeed, we have

d((c))=d(d(F(c))+F(d(c))+F(F(c)))=(dF)((d+F)(c)),d(\nabla(c))=d(d(F(c))+F(d(c))+F(F(c)))=(dF)((d+F)(c)),

and since the element (d+F)(c)(d+F)(c) is of homological degree at most kk, the Maurer–Cartan equation for FF holds when evaluated on that element. This means that

d((c))=(Fd+F2)((d+F)(c))=F(d(F(c)))F(F(d(c)))F(F(F(c)))=F((c)),d(\nabla(c))=-(Fd+F^{2})((d+F)(c))\\ =-F(d(F(c)))-F(F(d(c)))-F(F(F(c)))=-F(\nabla(c)),

as required. The element (c)\nabla(c) is of homological degree k+12=k1k+1-2=k-1, and we know that the Maurer–Cartan equation means that the derivation d+Fd+F squares to zero on elements of degree at most kk. Thus, it is not unreasonable to ask whether (c)\nabla(c) is in the image of d+Fd+F. Let us show that it is indeed the case. We write (c)=x+y\nabla(c)=x+y, where xx is the word-homogeneous component of maximal grade uu. Since the differential dd is word-homogeneous, and the derivation FF is Σ\Sigma-filtered on elements of degree at most kk, it follows that

{d(x)=0,d(y)+F(x+y)=0.\begin{cases}d(x)=0,\\ d(y)+F(x+y)=0.\end{cases}

Since |x|=|(c)|=k11|x|=|\nabla(c)|=k-1\geq 1 and the model (T(V),d)(T(V),d) is acyclic in positive degrees, it follows that we can write x=dax=da for some aT(V)a\in T(V). The element

yF(a)=(c)(d+F)(a)y-F(a)=\nabla(c)-(d+F)(a)

represents the same homology class for d+Fd+F, but its component of grade uu has been replaced by a combination of elements of grades strictly less than uu with respect to the ordering <Σ<_{\Sigma}. By induction with respect to the well-ordering <Σ<_{\Sigma}, we see that (c)\nabla(c) is a boundary of d+Fd+F, so we can write (c)=(d+F)(b)\nabla(c)=(d+F)(b). Let us now consider a new derivation FF^{\prime} which coincides with FF on elements of homological degree at most kk, but satisfies

F(c)=F(c)b.F^{\prime}(c)=F(c)-b.

We note that since \nabla is Σ\Sigma-filtered, each element aa above (and hence the resulting element bb) is a linear combination of elements of grades strictly less than the grade of cc with respect to the ordering <Σ<_{\Sigma}, so FF^{\prime} is still Σ\Sigma-filtered. Moreover, since the elements d(c)d(c) and F(c)F^{\prime}(c) are of homological degree kk, we have

d(F(c))+F(d(c))+F(F(c))=d(F(c)b)+F(d(c))+F(F(c)b)=(c)d(b)F(b)=0.d(F^{\prime}(c))+F^{\prime}(d(c))+F^{\prime}(F^{\prime}(c))\\ =d(F(c)-b)+F(d(c))+F(F(c)-b)=\nabla(c)-d(b)-F(b)=0.

Making such adjustments for all elements of homological degree k+1k+1, we obtain a perturbation that satisfies the Maurer–Cartan equation up to degree k+1k+1, which allows us to proceed by induction. Since the correction does not change values of FF on generators of low homological degrees, we may consider the limiting value of FF; it is a bona fide Maurer–Cartan element extending the original perturbation of the differential. ∎

5.4. Maurer–Cartan equation and the Diamond Lemma

In this section, we are finally rewarded for going through all the previous arguments involving models and the Maurer–Cartan equation; the reward is a general result stating that each word-homogeneous acyclic extension of 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle) leads to its own Diamond Lemma criterion for convergence of Σ\Sigma. At the core of such result is the following definition of an S-polynomial associated to a generator of homological degree two of such model.

Definition (Obstruction).

Let (T(V),d)(T(V),d) be a word-homogeneous acyclic extension of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle). For each element cV2c\in V_{2}, the element d(c)d(c) is of degree 11, and therefore is of the form

d(c)=iaiewibiT(V0)V1T(V0).d(c)=\sum_{i}a_{i}e_{w_{i}}b_{i}\in T(V_{0})\otimes V_{1}\otimes T(V_{0}).

We define the obstruction Sc𝕜𝕏S_{c}\in\mathbbold{k}\langle X\rangle associated to the element cc by the formula

Sc=iaif(wi)bi.S_{c}=\sum_{i}a_{i}f(w_{i})b_{i}.

We remark that our usage of the term “obstruction” is different from that of Anick in [3]: for us, as it will become apparent below, ScS_{c} is an actual obstruction to convergence of Σ\Sigma.

We are now ready to state and prove our model-specific Diamond Lemma. As above, we consider a rewriting system Σ=(X,W,f)\Sigma=(X,W,f).

Theorem 5.2.

Suppose that (T(V),d)(T(V),d) is a word-homogeneous acyclic extension of finite type of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle). The rewriting system Σ\Sigma is convergent if and only if for each word-homogeneous element cV2c\in V_{2}, the corresponding obstruction ScS_{c} is mapped to zero by some reduction with respect to Σ\Sigma.

Proof.

According to Theorem 5.1, Σ\Sigma is convergent if and only if there exists a degree 1-1 element FDer(T(V))F\in\operatorname{Der}(T(V)) such that

  • for each ewV1e_{w}\subset V_{1}, we have (d+F)(ew)=wf(w)(d+F)(e_{w})=w-f(w),

  • F|V2F|_{V_{2}} is Σ\Sigma-filtered,

  • the Maurer–Cartan condition [d+F,d+F]=0[d+F,d+F]=0 for the element FF holds when evaluated on generators of homological degree at most two.

Since the Maurer–Cartan equation evaluated on an element of homological degree kk is an element of homological degree k2k-2, we may replace “at most two” by “exactly two” without changing the statement.

Suppose cV2c\in V_{2} is word-homogeneous of grade uu. We note that the formula

Sc=iaif(wi)biS_{c}=\sum_{i}a_{i}f(w_{i})b_{i}

for the obstruction ScS_{c} means that ScS_{c} is a combination of terms of grade less than uu, so existence of its reduction equal to zero is clearly equivalent to existence of a representation of ScS_{c} as a two-sided linear combination

Sc=iai(wif(wi))bi,S_{c}=\sum_{i}a_{i}^{\prime}(w_{i}^{\prime}-f(w_{i}^{\prime}))b_{i}^{\prime},

where all the terms in that sum are of grade less than uu. Indeed, a sequence of basic reductions produces such a combination, and vice versa, existence each such combination immediately suggests a sequence of basic reductions.

Recall that the Maurer–Cartan equation for FF is exactly the square-zero condition for d+Fd+F. Let us evaluate it on an element cV2c\in V_{2} of grade uu. We obtain

d(d(c))+F(d(c))+d(F(c))+F(F(c))=0.d(d(c))+F(d(c))+d(F(c))+F(F(c))=0.

We already remarked that the element d(c)T(V)d(c)\in T(V) is of homological degree one, and therefore can be written as

d(c)=iaiewibiT(V0)V1T(V0).d(c)=\sum_{i}a_{i}e_{w_{i}}b_{i}\in T(V_{0})\otimes V_{1}\otimes T(V_{0}).

The first term in the square-zero condition vanishes, while the second is

F(d(c))=F(iaiewibi)=iaiF(ewi)bi=iaif(wi)bi=Sc.F(d(c))=F\left(\sum_{i}a_{i}e_{w_{i}}b_{i}\right)=\sum_{i}a_{i}F(e_{w_{i}})b_{i}=-\sum_{i}a_{i}f(w_{i})b_{i}=-S_{c}.

The remaining terms in the square-zero condition are

d(F(c))+F(F(c))=(d+F)(F(c)).d(F(c))+F(F(c))=(d+F)(F(c)).

Since F|V2F|_{V_{2}} is Σ\Sigma-filtered, the element F(c)F(c) is of the form iaiewibi\sum_{i}a_{i}^{\prime}e_{w_{i}^{\prime}}b_{i}^{\prime}, where all the terms in the sum are of grades less than uu. Finally, we obtain

d(F(c))+F(F(c))=(d+F)(iaiewibi)=iai(wif(wi))bi.d(F(c))+F(F(c))=(d+F)\left(\sum_{i}a_{i}^{\prime}e_{w_{i}^{\prime}}b_{i}^{\prime}\right)=\sum_{i}a_{i}^{\prime}(w_{i}^{\prime}-f(w_{i}^{\prime}))b_{i}^{\prime}.

We conclude that the validity of the Maurer–Cartan equation on cV2c\in V_{2} of grade uu is equivalent to the equation

Sc=iai(wif(wi))bi,S_{c}=\sum_{i}a_{i}^{\prime}(w_{i}^{\prime}-f(w_{i}^{\prime}))b_{i}^{\prime},

where all the terms in that sum are of grade less than uu, which, according to our observation above, completes the proof. ∎

6. Examples: specific cases of models and their Diamond Lemmas

In this section, we discuss two particular examples of models of monomial algebras that allow us to recover the Diamond Lemma and the Triangle Lemma for classical term rewriting.

6.1. The inclusion-exclusion model

In [25], the first author and Khoroshkin constructed a combinatorial (usually non-minimal) model for a shuffle operad with monomial relations; specialising to shuffle operads generated by elements of arity 11, one obtains a model for an associative algebra with monomial relations. For the benefit of the reader who does not wish to go through the operad construction, we describe that model here. As above, we denote AΣ0=𝕜𝕏𝕎A_{\Sigma_{0}}=\mathbbold{k}\langle X\mid W\rangle.

Let uXu\in\langle X\rangle. Suppose that there are exactly mm different divisors of the word uu that are among the monomial relations WW. We introduce formal symbols Du,1,,Du,mD_{u,1},\ldots,D_{u,m} in one-to-one correspondence with those divisors, and denote by AuA_{u} the Grassmann algebra Λ(Du,1,,Du,m)\Lambda(D_{u,1},\ldots,D_{u,m}). The direct sum

AW:=uXAuA_{W}:=\bigoplus_{u\in\langle X\rangle}A_{u}

has an associative algebra structure defined as follows. For an element uXu^{\prime}\in\langle X\rangle which is a divisor of uu, there is a natural inclusion AuAuA_{u^{\prime}}\hookrightarrow A_{u} sending each generator Du,iD_{u^{\prime},i} to the corresponding Du,jD_{u,j}, where jj is the position of the ii-th divisor of uu^{\prime} (viewed now as a divisor of uu) in the list of divisors of uu. The product of two elements a1Au1a_{1}\in A_{u_{1}} and a2Au2a_{2}\in A_{u_{2}} is the product in Au1u2A_{u_{1}u_{2}} of their images with respect to the inclusions Au1,Au2Au1u2A_{u_{1}},A_{u_{2}}\hookrightarrow A_{u_{1}u_{2}}. If we set the homological degree of each Du,iD_{u,i} to be equal to one, the algebra AWA_{W} is graded. Moreover, it has a differential graded algebra structure, where the differential dd, when restricted to each Grassmann algebra AuA_{u}, is the unique derivation of that algebra sending all generators Du,iD_{u,i} to 11.

Let us call a Grassmann monomial Du,i1Du,ikAuD_{u,i_{1}}\wedge\cdots\wedge D_{u,i_{k}}\in A_{u} indecomposable if there does not exist a factorisation u=u1u2u=u_{1}u_{2} into a product of two non-empty monomials u1u_{1} and u2u_{2} for which each of the divisors of uu corresponding to Su,ipS_{u,i_{p}} is either a divisor of u1u_{1} or a divisor of u2u_{2}. It is easy to see that as an associative algebra, the algebra AWA_{W} is freely generated by its indecomposable elements. Moreover, according to [25, Th. 2.2], the differential graded algebra (AW,d)(A_{W},d) is a model of the algebra AΣ0A_{\Sigma_{0}}. We call this model the inclusion-exclusion model. It is immediately seen to be a word-homogeneous acyclic extension of finite type of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle). Moreover, all homological degree two word-homogeneous generators cc of that model of grade uXu\in\langle X\rangle are of two possible forms:

  • inclusion: c=Su,iSu,jc=S_{u,i}\wedge S_{u,j} where the ii-th divisor of uu is its proper divisor uu^{\prime} and the jj-th divisor of uu is equal to uu,

  • overlap: c=Su,iSu,jc=S_{u,i}\wedge S_{u,j} where the ii-th divisor of uu is a left divisor uu^{\prime} of uu, the jj-th divisor of uu is a right divisor u′′u^{\prime\prime} of uu, and these two subwords have a non-trivial common divisor.

For each generator of the first type, we have u=aubu=au^{\prime}b for some a,bXa,b\in\langle X\rangle, therefore such generators are in one-to-one correspondence with the inclusion ambiguities of [11]. We note that for such generator, we have

d(c)=euaeub,d(c)=e_{u}-ae_{u^{\prime}}b,

and the obstruction associated to this generator is the basic reduction associated to the triple (a,u,b)(a,u^{\prime},b) applied to the relation uf(u)u-f(u). Similarly, for each generator of the second type, we have u=ua=bu′′u=u^{\prime}a=bu^{\prime\prime} for some a,bXa,b\in\langle X\rangle, therefore such generators are in one-to-one correspondence with the overlap ambiguities of op. cit.. Moreover, for such generator, we have

d(c)=beu′′eua,d(c)=be_{u^{\prime\prime}}-e_{u^{\prime}}a,

and the obstruction associated to this generators is the element usually called the S-polynomial corresponding to the overlap of uu^{\prime} and u′′u^{\prime\prime} [20]. This proves the following result.

Proposition 6.1 (Bergman’s Diamond Lemma [11, Th. 1.2]).

The rewriting system Σ\Sigma is convergent if and only if each basic reduction of each relation from RR and each S-polynomial between two relations from RR is mapped to zero by some reduction with respect to Σ\Sigma.

As an immediate consequence, we see that the rewriting system

xyzx3+y3+z3xyz\mapsto x^{3}+y^{3}+z^{3}

on 𝕜𝕩,𝕪,𝕫\mathbbold{k}\langle x,y,z\rangle discussed in Section 2 is convergent, since we already established its termination, and the word xyzxyz does not have self-overlaps.

6.2. The minimal model

In [82], the second author constructed, for each monomial algebra AΣ0=𝕜𝕏𝕎A_{\Sigma_{0}}=\mathbbold{k}\langle X\mid W\rangle, the minimal model which we shall now recall and use. In this section, the rewriting system Σ\Sigma is assumed minimal: WW contains no words of length one and no words from WW divide one another.

We begin with recalling the definition of (right) Anick chains, a notion that was to some extent present implicitly in work of Backelin [4], then used by Green, Happel and Zacharia [32], and became known to a wider community from the work of Anick [3].

Definition (Anick chain).

We say that an Anick 0-chain is an element xXx\in X, and define the tail of such element to be equal to it. For n>0n>0, we say that an Anick nn-chain is a monomial cXc\in\langle X\rangle such that

  1. (1)

    we can write c=ctc=c^{\prime}t where cc^{\prime} is an Anick (n1)(n-1)-chain,

  2. (2)

    if tt^{\prime} is the tail of cc^{\prime}, then ttt^{\prime}t has a right divisor which is a relation from WW,

  3. (3)

    no proper left divisor of cc satisfies the first two conditions.

The tail of cc is the element tc:=tt_{c}:=t.

In particular, the Anick 11-chains are the defining relations WW, with the tail of each monomial relation given by the monomial obtained by deleting its first letter, and the Anick 22-chains are precisely the monomials cXc\in\langle X\rangle which can be written as c=ua=bu′′c=u^{\prime}a=bu^{\prime\prime} for two relations u,u′′Wu^{\prime},u^{\prime\prime}\in W, and which do not have other divisors from WW.

It is shown in [3] that a monomial cc admits at most one structure of a chain: if cc is an nn-chain with tail tct_{c}, then both nn and tct_{c} are uniquely determined. We define the homological degree of an Anick nn-chain cc to be equal to nn, and consider the graded associative algebra BB freely generated by Anick chains. This algebra has a unique derivation dd whose value d(c)d(c) on an Anick nn-chain cc is the sum over all ways to represent cc, viewed as a monomial, as a concatenation c1ckc_{1}\cdots c_{k} of an Anick n1n_{1}-chains c1c_{1}, …, an Anick nkn_{k}-chain ckc_{k} with n1++nk=n1n_{1}+\cdots+n_{k}=n-1, of the terms

(1)(n+12)+n1c1ck.-(-1)^{\binom{n+1}{2}+n_{1}}c_{1}\otimes\cdots\otimes c_{k}.

The main theorem of [82] states that for a minimal rewriting system Σ\Sigma, the quasi-free algebra (B,d)(B,d) is the minimal model of AΣ0A_{\Sigma_{0}}. It is a word-homogeneous acyclic extension of finite type of the Shafarevich complex 𝕏(W,𝕜𝕏)\Sha(W,\mathbbold{k}\langle X\rangle).

The description of Anick 22-chains mentioned above indicates that they are among the overlaps from the previous section; in fact, they are minimal overlaps that have only two divisors from WW. For the differential of the model, we have d(c)=beu′′euad(c)=be_{u^{\prime\prime}}-e_{u^{\prime}}a, as before. This proves the following result.

Proposition 6.2 (Triangle Lemma [20, Sec. 2.4.3]).

A minimal rewriting system Σ\Sigma is convergent if and only if each S-polynomial corresponding to a minimal overlap is mapped to zero by some reduction with respect to Σ\Sigma.

For example, for the rewriting system x3xyzy3z3x^{3}\mapsto xyz-y^{3}-z^{3} arising in the example discussed in Section 2 for a total multiplicative ordering, the monomial x3x^{3} has two self-overlaps, x4=x3x=xx3x^{4}=x^{3}\cdot x=x\cdot x^{3} and x5=x3x2=x2x3x^{5}=x^{3}\cdot x^{2}=x^{2}\cdot x^{3}. Only the first of them is minimal. This means that there is no need to consider the S-polynomial corresponding to the overlap x5x^{5} when checking the Diamond Lemma criterion. That said, in this case already the S-polynomial corresponding to the overlap x4x^{4} cannot be reduced to zero, so the rewriting system is not convergent, as we indicated when discussing that example.

7. Analogues and generalisations

In words of Kontsevich and Soibelman [49], “the deformation theory of associative algebras is a guide for developing the deformation theory of many algebraic structures; conversely, all the concepts of what should be the “deformation theory of everything” must be tested in the case of associative algebras”. We believe that our work constitutes a successful test of putting the Diamond Lemma in the homotopical context, and it is natural to conclude this paper with a discussion of analogues and possible generalisations of our results.

7.1. Multiplicative algebraic structures

We begin with outlining a context in which an immediate generalisation of our approach is available.

Definition (Multiplicative algebraic structure).

We say that an algebraic structure 𝖯\mathsf{P} is multiplicative if it is described by a coloured operad obtained as a 𝕜\mathbbold{k}-linear span of a set-theoretic operad. In classical terms, we require that the free 𝕜\mathbbold{k}-linear 𝖯\mathsf{P}-algebra generated by a vector space VV is equal to the 𝕜\mathbbold{k}-linear span of the free 𝖯\mathsf{P}-algebra in the category of sets generated by a basis of VV.

The key consequence of this definition is that free 𝕜\mathbbold{k}-linear 𝖯\mathsf{P}-algebras are naturally equipped with monomial bases, so one can talk about rewriting systems and orderings of monomials, and most of our results apply mutatis mutandis if one can talk about models of algebras in the same way as above. Work of Hinich [38] mentioned earlier allows one to verify that condition easily enough.

Some remarks are in order. First, talking about coloured operads means that this formalism includes, for instance, quotients of path algebras, since the latter are obtained as 𝕜\mathbbold{k}-linear spans of small categories and are not just algebras in a very classical sense. Second, there are situations where the deviation from the set-theoretic property is “moderate”, i.e. the value of any structure operation on basis elements is a basis element up to a non-zero constant; it is easy to extend our work to such algebras. Finally, it is also possible to include algebras that are complete in a suitable sense; we already indicated in the introduction that complete commutative associative algebras were one of the central examples at the moment of inception of both topics discussed in this paper, through the theory of standard bases of Hironaka [40, Sec. III.1] and the deformation theory of analytic spaces of Palamodov [69].

7.1.1. Commutative associative algebras

The free commutative associative algebra coincides with the linear span of the free commutative monoid, and the homotopy theory for commutative associative algebras suggests that our results work over a field of characteristic zero. An analogue of the inclusion-exclusion model exists, and generators of homological degree at most two of the minimal model of a given monomial algebra can be determined directly (for further information on models of monomial algebras, we refer the reader to [9]). If the partial order <Σ<_{\Sigma} comes from a total multiplicative order of monomials, the result corresponding to the inclusion-exclusion model is the Buchberger criterion [22], and the result corresponding to the minimal model is known as the Chain Criterion [21, 48].

7.1.2. Non-associative algebras

If one considers free non-associative algebras, better known as magmatic algebras for one binary operation and as absolutely free algebras in the more general case, the free algebra is the linear span of the free set-theoretic magma (consisting of appropriate decorated planar trees), and models are surprisingly manageable. In particular, the inclusion-exclusion model of a given monomial algebra is readily available, and the minimal model for a monomial algebra whose set of generators and relations are already chosen minimal is simply the non-associative Shafarevich complex, since there are no non-associative overlaps. As a consequence, for a general rewriting system convergence is equivalent to the fact that all basic reductions between the relations RR are mapped to zero by some reduction with respect to Σ\Sigma, and, in particular, a rewriting system where pairwise reductions are impossible is convergent. This result goes back to [87], see also [19, Appendix 3] and a more recent paper [28].

7.1.3. Shuffle algebras

Shuffle algebras [26, 76], known also as permutads [62], are associative algebras for the monoidal category of non-symmetric sequences with respect to the monoidal structure given by Cauchy tensor product. The corresponding definition is also available for sets; elements of the free monoid are decorated permutations and their suitable generalisations. All results of this paper, including a generalisation of the inclusion-exclusion model and the second author’s minimal model, can be adapted; the corresponding results are the Diamond Lemma for shuffle algebras [20, Th. 4.5.1.4] and the Triangle Lemma for shuffle algebras [20, Prop. 4.5.3.2].

7.1.4. Non-symmetric operads and shuffle operads

When one considers operads, whether non-symmetric or shuffle ones, each free operad is the linear span of the corresponding free set operad consisting of appropriate decorated planar trees. It is important to note that the algebraic structure on the space of decorated trees in this case is much richer than the one considered in the case of magmatic algebras above, so the corresponding Diamond Lemmas (and especially their applications) are sufficiently non-trivial. Models are once again available under relatively mild assumptions [66]. In particular, the inclusion-exclusion model can be defined for every shuffle operad with monomial relations [25, Sec. 2]. This way one obtains the Diamond Lemma for non-symmetric operads and for shuffle operads [20].

Finding an explicit description of the minimal model of the given operad with monomial relations is an open problem. Partial results in the spirit of the Triangle Lemma are available for operads [20], but even in homological degree two, the computation of the spaces of generators of minimal models of monomial operads still has to be completed in full generality. One calculation hinting at the noticeable complexity of this question was made by Sköldberg several years ago (unpublished), and we shall now describe the result of that calculation; another discussion of that phenomenon can be found in the recent preprint of Iyudu and Vlassopoulos [44].

The minimal model of a monomial operad is homogeneous: one can separate generators according to their underlying tree monomials. In the case of associative algebras, minimal models of monomial algebras satisfy the homological purity condition: for each monomial, the corresponding component of the space of generators is concentrated in one homological degree. Somewhat surprisingly, it turns out that in the case of operads, this homological purity condition does not hold for some underlying tree monomials. The simplest possible example is obtained as follows. Consider the free non-symmetric operad generated by one binary operation, and consider the set of monomial relations WW consisting of all tree monomials of arity four in that operad. Then for the tree monomial

of arity seven, as well as the monomials obtained from it by reflections at internal vertices, the corresponding component of the space generators of the minimal model is two-dimensional, with one generator in degree three and one generator in degree four. This example is a particular case of the remarkable construction from the very recent preprint of Qi, Xu, Zhang and Zhao [72, Sec. 5]: it is the “operadization” of the algebra 𝕜𝕩,𝕪/𝔪𝟛\mathbbold{k}\langle x,y\rangle/\mathfrak{m}^{3}. We feel that explicitly computing minimal models for monomial operads obtained by that construction is likely to exhibit some of the most interesting phenomena behind minimal models of general monomial operads.

7.2. Non-multiplicative algebraic structures

For an algebraic structure that is not multiplicative, our approach does not adapt immediately. Upon examining the existing literature on Gröbner–Shirshov bases, see, for example, [12, 13, 14] and references therein, we feel that the most obvious way to deal with a non-multiplicative algebraic structure is to embed it into a bigger multiplicative one. For example, Lie algebras can be embedded in their universal envelopes, and so one can work with a rewriting system within a multiplicative algebraic structure. Equivalence of the associative algebra methods applied to the universal envelopes and the Lie-algebraic methods of Shirshov was established by Bokut and Malcolmson [16]. Alternatively, a Lie algebra can be regarded as an anti-commutative magmatic algebra; the corresponding study of Gröbner–Shirshov bases was undertaken in [15]. Recent papers on Gröbner–Shirshov bases for pre-Lie algebras [18], Novikov algebras [84], and Sabinin algebras [61] follow the same logic. This raises a question of computing not only models of monomial algebras for multiplicative algebraic structures, but also models of meaningful classes of algebras obtained when extending non-multiplicative structures to multiplicative ones. We hope to address this in more detail elsewhere.

References

  • [1] David J. Anick. A counterexample to a conjecture of Serre. Ann. of Math. (2), 115(1):1–33, 1982.
  • [2] David J. Anick. Noncommutative graded algebras and their Hilbert series. J. Algebra, 78(1):120–140, 1982.
  • [3] David J. Anick. On the homology of associative algebras. Trans. Amer. Math. Soc., 296(2):641–659, 1986.
  • [4] Jörgen Backelin. La série de Poincaré-Betti d’une algèbre graduée de type fini à une relation est rationnelle. C. R. Acad. Sci. Paris Sér. A-B, 287(13):A843–A846, 1978.
  • [5] Michael J. Bardzell. Noncommutative Gröbner bases and Hochschild cohomology. In Symbolic computation: solving equations in algebra, geometry, and engineering (South Hadley, MA, 2000), volume 286 of Contemp. Math., pages 227–240. Amer. Math. Soc., Providence, RI, 2001.
  • [6] Michael Joseph Bardzell. Resolutions and cohomology of finite dimensional algebras. Thesis (Ph.D.)–Virginia Polytechnic Institute and State University. ProQuest LLC, Ann Arbor, MI, 1996.
  • [7] Severin Barmeier and Zhengfang Wang. Deformations of path algebras of quivers with relations. arXiv e-print arXiv:2002.10001, February 2020.
  • [8] H. J. Baues and J.-M. Lemaire. Minimal models in homotopy theory. Math. Ann., 225(3):219–242, 1977.
  • [9] Alexander Berglund. Poincaré series of monomial rings. J. Algebra, 295(1):211–230, 2006.
  • [10] Alexander Berglund. Homological perturbation theory for algebras over operads. Algebr. Geom. Topol., 14(5):2511–2548, 2014.
  • [11] George M. Bergman. The diamond lemma for ring theory. Adv. in Math., 29(2):178–218, 1978.
  • [12] L. A. Bokut and Yuqun Chen. Gröbner-Shirshov bases for Lie algebras: after A. I. Shirshov. Southeast Asian Bull. Math., 31(6):1057–1076, 2007.
  • [13] L. A. Bokut and Yuqun Chen. Gröbner-Shirshov bases and their calculation. Bull. Math. Sci., 4(3):325–395, 2014.
  • [14] L. A. Bokut and Yuqun Chen. Gröbner-Shirshov bases for universal algebras. J. South China Normal Univ. Natur. Sci. Ed., 46(6):1–9, 2014.
  • [15] L. A. Bokut, YuQun Chen, and Yu Li. Anti-commutative Gröbner-Shirshov basis of a free Lie algebra. Sci. China Ser. A, 52(2):244–253, 2009.
  • [16] L. A. Bokut and P. Malcolmson. Gröbner-Shirshov bases for relations of a Lie algebra and its enveloping algebra. In Algebras and combinatorics (Hong Kong, 1997), pages 47–54. Springer, Singapore, 1999.
  • [17] L. A. Bokut. Imbeddings into simple associative algebras. Algebra i Logika, 15(2):117–142, 245, 1976.
  • [18] L. A. Bokut, YuĭtsyunChèn, and Yuĭ Li. Gröbner-Shirshov bases Vinberg-Koszul-Gerstenhaber for right-symmetric algebras. Fundam. Prikl. Mat., 14(8):55–67, 2008.
  • [19] L. A. Bokutand G. P. Kukin. Algorithmic and combinatorial algebra, volume 255 of Mathematics and its Applications. Kluwer Academic Publishers Group, Dordrecht, 1994.
  • [20] Murray R. Bremner and Vladimir Dotsenko. Algebraic operads: an algorithmic companion. CRC Press, Boca Raton, FL, 2016.
  • [21] B. Buchberger. A criterion for detecting unnecessary reductions in the construction of Gröbner-bases. In Symbolic and algebraic computation (EUROSAM ’79, Internat. Sympos., Marseille, 1979), volume 72 of Lecture Notes in Comput. Sci., pages 3–21. Springer, Berlin-New York, 1979.
  • [22] Bruno Buchberger. An algorithm for finding the basis elements of the residue class ring of a zero dimensional polynomial ideal. J. Symbolic Comput., 41(3-4):475–511, 2006. Translated from the 1965 German original by Michael P. Abramson.
  • [23] Sergio Chouhy and Andrea Solotar. Projective resolutions of associative algebras and ambiguities. J. Algebra, 432:22–61, 2015.
  • [24] Maurizio G. Citterio. Classifying spaces of categories and term rewriting. Theory Appl. Categ., 9:92–105, 2001/02.
  • [25] Vladimir Dotsenko and Anton Khoroshkin. Quillen homology for operads via Gröbner bases. Doc. Math., 18:707–747, 2013.
  • [26] Vladimir Dotsenko and Anton Khoroshkin. Shuffle algebras, homology, and consecutive pattern avoidance. Algebra Number Theory, 7(3):673–700, 2013.
  • [27] Christian Eder. Predicting zero reductions in gröbner basis computations. In Proceedings of the 2014 Symposium on Symbolic-Numeric Computation, page 109–110. Association for Computing Machinery, New York, NY, 2014.
  • [28] Lothar Gerritzen. Tree polynomials and non-associative Gröbner bases. J. Symbolic Comput., 41(3-4):297–316, 2006.
  • [29] E. S. Golod. Standard bases and homology. In Algebra—some current trends (Varna, 1986), volume 1352 of Lecture Notes in Math., pages 88–95. Springer, Berlin, 1988.
  • [30] E. S. Golod. On the homology algebra of the Shafarevich complex of a free algebra. Fundam. Prikl. Mat., 5(1):97–100, 1999.
  • [31] E. S. Golod and I. R. Šafarevič. On the class field tower. Izv. Akad. Nauk SSSR Ser. Mat., 28:261–272, 1964.
  • [32] E. L. Green, D. Happel, and D. Zacharia. Projective resolutions over Artin algebras with zero relations. Illinois J. Math., 29(1):180–190, 1985.
  • [33] Léonard Guetta. Homology of categories via polygraphic resolutions. arXiv e-print arXiv:2003.10734, Mar 2020.
  • [34] V. K. A. M. Gugenheim and L. A. Lambe. Perturbation theory in differential homological algebra. I. Illinois J. Math., 33(4):566–582, 1989.
  • [35] V. K. A. M. Gugenheim, L. A. Lambe, and J. D. Stasheff. Perturbation theory in differential homological algebra. II. Illinois J. Math., 35(3):357–373, 1991.
  • [36] V. K. A. M. Gugenheim and J. D. Stasheff. On perturbations and AA_{\infty}-structures. Bull. Soc. Math. Belg. Sér. A, 38:237–246 (1987), 1986.
  • [37] Yves Guiraud, Eric Hoffbeck, and Philippe Malbos. Convergent presentations and polygraphic resolutions of associative algebras. Math. Z., 293(1-2):113–179, 2019.
  • [38] Vladimir Hinich. Homological algebra of homotopy algebras. Comm. Algebra, 25(10):3291–3323, 1997.
  • [39] Heisuke Hironaka. On resolution of singularities (characteristic zero). In Proc. Internat. Congr. Mathematicians (Stockholm, 1962), pages 507–521. Inst. Mittag-Leffler, Djursholm, 1963.
  • [40] Heisuke Hironaka. Resolution of singularities of an algebraic variety over a field of characteristic zero. I, II. Ann. of Math. (2) 79 (1964), 109–203; ibid. (2), 79:205–326, 1964.
  • [41] Mark Hovey. Model categories, volume 63 of Mathematical Surveys and Monographs. American Mathematical Society, Providence, RI, 1999.
  • [42] Po Hu. Transfinite spectral sequences. In Homotopy invariant algebraic structures (Baltimore, MD, 1998), volume 239 of Contemp. Math., pages 197–216. Amer. Math. Soc., Providence, RI, 1999.
  • [43] Johannes Huebschmann and Tornike Kadeishvili. Small models for chain algebras. Math. Z., 207(2):245–280, 1991.
  • [44] Natalia Iyudu and Ioannis Vlassopoulos. Homologies of monomial operads and algebras. arXiv e-print arXiv:2008.00985, August 2020.
  • [45] J. F. Jardine. A closed model structure for differential graded algebras. In Cyclic cohomology and noncommutative geometry (Waterloo, ON, 1995), volume 17 of Fields Inst. Commun., pages 55–58. Amer. Math. Soc., Providence, RI, 1997.
  • [46] T. Józefiak. Tate resolutions for commutative graded algebras. Bull. Acad. Polon. Sci. Sér. Sci. Math. Astronom. Phys., 17:617–621, 1969.
  • [47] Bernhard Keller. Notes on minimal models. Preprint available on the author’s webpage https://webusers.imj-prg.fr/~bernhard.keller/publ/index.html, 2003.
  • [48] C. Kollreider and B. Buchberger. An improved algorithmic construction of gröbner-bases for polynomial ideals. SIGSAM Bull., 12(2):27–36, May 1978.
  • [49] Maxim Kontsevich and Yan Soibelman. Deformations of algebras over operads and the Deligne conjecture. In Conférence Moshé Flato 1999, Vol. I (Dijon), volume 21 of Math. Phys. Stud., pages 255–307. Kluwer Acad. Publ., Dordrecht, 2000.
  • [50] Yves Lafont. A new finiteness condition for monoids presented by complete rewriting systems (after Craig C. Squier). J. Pure Appl. Algebra, 98(3):229–244, 1995.
  • [51] Yves Lafont and François Métayer. Polygraphic resolutions and homology of monoids. J. Pure Appl. Algebra, 213(6):947–968, 2009.
  • [52] Yves Lafont and Alain Prouté. Church-Rosser property and homology of monoids. Math. Structures Comput. Sci., 1(3):297–326, 1991.
  • [53] Larry A. Lambe. Homological perturbation theory, Hochschild homology, and formal groups. In Deformation theory and quantum groups with applications to mathematical physics (Amherst, MA, 1990), volume 134 of Contemp. Math., pages 183–218. Amer. Math. Soc., Providence, RI, 1992.
  • [54] V. N. Latyshev. Combinatorial ring theory. Standard bases. Izdatelstvo MGU, 1988.
  • [55] V. N. Latyshev. General version of standard bases in linear structures. In Algebra (Moscow, 1998), pages 215–226. de Gruyter, Berlin, 2000.
  • [56] V. N. Latyshev. A general version of standard basis and its application to TT-ideals. Acta Appl. Math., 85(1-3):219–223, 2005.
  • [57] V. N. Latyshev. A general version of a standard basis in associative algebras and their derivative constructions. Fundam. Prikl. Mat., 15(3):183–203, 2009.
  • [58] V. N. Latyšev. Algebras with identical relations. Dokl. Akad. Nauk SSSR, 146:1003–1006, 1962.
  • [59] V. N. Latyšev. On the choice of basis in a TT-ideal. Sibirsk. Mat. Ž., 4:1122–1127, 1963.
  • [60] J.-M. Lemaire. “Autopsie d’un meurtre” dans l’homologie d’une algèbre de chaînes. Ann. Sci. École Norm. Sup. (4), 11(1):93–100, 1978.
  • [61] Yu Li, Qiuhui Mo, and L. A. Bokut. Generalized anti-commutative Gröbner-Shirshov basis theory and free Sabinin algebras. Comm. Algebra, 48(12):5086–5109, 2020.
  • [62] Jean-Louis Loday and María Ronco. Permutads. J. Combin. Theory Ser. A, 120(2):340–365, 2013.
  • [63] Jacob Lurie. Moduli problems for ring spectra. In Proceedings of the International Congress of Mathematicians. Volume II, pages 1099–1125. Hindustan Book Agency, New Delhi, 2010.
  • [64] Jacob Lurie. Derived algebraic geometry X: Formal moduli problems. Preprint available on the author’s webpage https://www.math.ias.edu/~lurie/, 2011.
  • [65] Marco Manetti. Differential graded Lie algebras and formal deformation theory. In Algebraic geometry—Seattle 2005. Part 2, volume 80 of Proc. Sympos. Pure Math., pages 785–810. Amer. Math. Soc., Providence, RI, 2009.
  • [66] Martin Markl. Models for operads. Comm. Algebra, 24(4):1471–1500, 1996.
  • [67] François Métayer. Resolutions by polygraphs. Theory Appl. Categ., 11:No. 7, 148–184, 2003.
  • [68] M. H. A. Newman. On theories with a combinatorial definition of “equivalence.”. Ann. of Math. (2), 43:223–243, 1942.
  • [69] V. P. Palamodov. Deformations of complex spaces. Uspehi Mat. Nauk, 31(3):129–194, 1976.
  • [70] Stewart B. Priddy. Koszul resolutions. Trans. Amer. Math. Soc., 152:39–60, 1970.
  • [71] J. P. Pridham. Unifying derived deformation theories. Adv. Math., 224(3):772–826, 2010.
  • [72] Zihao Qi, Yongjun Xu, James J. Zhang, and Xiangui Zhao. Growth of nonsymmetric operads. arXiv e-print arXiv:2010.12172, Oct 2020.
  • [73] Daniel G. Quillen. Homotopical algebra. Lecture Notes in Mathematics, No. 43. Springer-Verlag, Berlin-New York, 1967.
  • [74] Saeed Rahmati. Theory of Spectral Sequences of Exact Couples: Applications To Countably And Transfinitely Filtered Modules. ProQuest LLC, Ann Arbor, MI, 2013. Thesis (Ph.D.)–University of Alberta (Canada).
  • [75] María Julia Redondo and Fiorella Rossi Bertone. L{L}_{\infty}-structure on Bardzell’s complex for monomial algebras. arXiv e-print arXiv:2008.08122, August 2020.
  • [76] María Ronco. Shuffle bialgebras. Ann. Inst. Fourier (Grenoble), 61(3):799–850, 2011.
  • [77] Michael Schlessinger and James Stasheff. The Lie algebra structure of tangent cohomology and deformation theory. J. Pure Appl. Algebra, 38(2-3):313–322, 1985.
  • [78] Mike Schlessinger and Jim Stasheff. Deformation theory and rational homotopy type. arXiv e-print arXiv:1211.1647, November 2012.
  • [79] Wilhelm Specht. Gesetze in Ringen. I. Math. Z., 52:557–589, 1950.
  • [80] Craig C. Squier. Word problems and a homological finiteness condition for monoids. J. Pure Appl. Algebra, 49(1-2):201–217, 1987.
  • [81] Dennis Sullivan. Infinitesimal computations in topology. Inst. Hautes Études Sci. Publ. Math., (47):269–331 (1978), 1977.
  • [82] Pedro Tamaroff. Minimal models for monomial algebras. Homology Homotopy Appl., to appear, 2021.
  • [83] John Tate. Homology of Noetherian rings and local rings. Illinois J. Math., 1:14–27, 1957.
  • [84] Rabigul Tuniyaz, L. A. Bokut, Marhaba Xiryazidin, and Abdukadir Obul. Gröbner-Shirshov bases for free Gelfand-Dorfman-Novokov algebras and for right ideals of free right Leibniz algebras. Comm. Algebra, 46(10):4392–4402, 2018.
  • [85] V. A. Ufnarovskij. Combinatorial and asymptotic methods in algebra. In Algebra, VI, volume 57 of Encyclopaedia Math. Sci., pages 1–196. Springer, Berlin, 1995.
  • [86] A. I. Širšov. Some algorithm problems for Lie algebras. Sibirsk. Mat. Ž., 3:292–296, 1962.
  • [87] A. I. Žukov. Reduced systems of defining relations in non-associative algebras. Mat. Sbornik N.S., 27(69):267–280, 1950.