Free structures and limiting density

Johanna N. Y. Franklin, Meng-Che “Turbo” Ho , and Julia Knight The first author was supported in part by Simons Foundation Collaboration Grant #420806.The second author acknowledges support from the National Science Foundation under Grant No. DMS-2054558.The first and third authors acknowledge support from the National Science Foundation under Grant #DMS-1800692.This material is based upon work supported by the National Science Foundation under Grant No. DMS-1928930 while the second and third authors participated in a program hosted by the Mathematical Sciences Research Institute in Berkeley, California, during Fall 2020 and Summer 2022 programs.

Abstract

Gromov asked what a typical (finitely presented) group looks like, and he suggested a way to make the question precise in terms of limiting density. The typical finitely generated group is known to share some important properties with the non-abelian free groups. We ask Gromov’s question more generally, for structures in an arbitrary algebraic variety (in the sense of universal algebra), with presentations of a specific form. We focus on elementary properties. We give examples illustrating different behaviors of the limiting density. Based on the examples, we identify sufficient conditions for the elementary first-order theory of the free structure to match that of the typical structure; i.e., a sentence is true in the free structure iff it has limiting density $1$ .

1 Introduction

In the paper where he introduced the notion of hyperbolic group, Gromov [8] asked what a typical group looks like. He was thinking of finitely presented groups. He described, in terms of limiting density, what it might mean for a typical group to have some property $Q$ , and he stated that the typical group is hyperbolic. Gromov’s notion has been made precise in different ways; see, for instance, the survey [17]. Ol’shanskii [18] cleaned up the statement and the proof that the typical group is hyperbolic. The third author conjectured that for the typical group obtained from a presentation consisting of $m\geq 2$ generators and a single relator, the elementary first order theory matches that of the non-abelian free groups. In this paper, we generalize Gromov’s question to arbitrary equational classes, or algebraic varieties (in the sense of universal algebra). Here, as for groups, the notions of finite presentation and free structure make sense. We find examples exhibiting different behavior. Our main results are for varieties with only unary functions. With some rather strong conditions on the variety, and on the presentations, we obtain the analogue of the conjecture for groups: the sentences true in the free structure are exactly those with limiting density $1$ .

1.1 Organization

We begin with Gromov’s original question, which concerned finitely presented groups. We then generalize this question to finitely presented members of a variety $V$ . Section 2 has some background on algebraic varieties. Section 3 has four examples illustrating different possible behaviors and also introduces key ideas that will appear in later proofs. The first example shows that in a bijective structure with a single identity, the sentences true in the free structures have limiting density $1$ . In the second example, the set of sentences with limiting density 1 is the theory of a structure in the variety, but this structure is not finitely generated, nor is it free. In the third example, we look at bijective structures as in the first example but with two identities, and we give a sentence for which the limiting density does not exist. In the fourth example, we look at abelian groups and give sentences where the limiting density is strictly between $0$ and $1$ .

In Section 4, we consider more general bijective varieties. We show that for these varieties and presentations with a single generator and a single identity, a sentence has limiting density $1$ if and only if it is true in the free structure. For a language with unary function symbols $f_{1},\ldots,f_{n}$ , injective and commuting, we say how to find elementary invariants.¹¹1We are grateful to Sergei Starchenko, who, having seen our result for bijective structures, suggested that we look further at varieties for languages with unary functions. We use a version of Gaifman’s Locality Theorem, which we prove using saturation.²²2We are grateful to Phokion Kolaitis for alerting us to Gaifman’s Theorem and providing references. In Section 5, we consider sentences with constants. For an arbitrary variety and presentations with a fixed tuple $\bar{a}$ of generators, we give conditions guaranteeing that, for sentences $\varphi$ in the language with added constants naming the generators, $\varphi$ has limiting density $1$ if and only if it is true in the free structure on $\bar{a}$ . In Section 6, we give further examples illustrating the results from Sections 4 and 5.

1.2 Gromov’s question about groups

Here, we recall Gromov’s original question and mention some prior work on typical or random groups. The usual language for groups has a binary operation symbol (for the group operation), a unary operation symbol (for the inverse), and a constant (for the identity). Let $T$ be the theory of groups. Recall that a group presentation consists of a tuple $\bar{a}$ of generators and a set $R$ of words $w_{i}(\bar{a})$ on these generators, called relators. In the group $G$ with presentation $\langle\bar{a}|R\rangle$ , $G\models t(\bar{a})=e$ if and only if $T\cup\{w(\bar{a})=e:w(\bar{a})\in R\}\models t(\bar{a})=e$ . Suppose $F$ is the free group on $\bar{a}$ and $N$ is the subgroup of $F$ consisting of the elements $t(\bar{a})$ such that $T\cup\{w(\bar{a})=e\}\models t(\bar{a})=e$ . Then $G\cong F/N$ .

The notion of limiting density depends not just on the variety, but also on the allowable group presentations, Ol’shanskii [18] considered presentations with $m$ generators and $k$ relators, all reduced. Kapovich and Schupp [11] considered the case where $k=1$ . In the Gromov “density” model, the number of relators may vary but is bounded in terms of the length of the relators and a parameter $d$ . It is important to bound the number of relators in some way; otherwise, the typical group will almost surely be trivial [18].

Definition 1.1.

Let $Q$ be a property of interest. Let $P_{s}$ be the number of presentations in which the relators have length at most $s$ , and let $P_{s}(Q)$ be the number of these presentations for which the resulting group has property $Q$ . The limiting density for $Q$ is $\lim_{s\rightarrow\infty}\frac{P_{s}(Q)}{P_{s}}$ , if this limit exists.

We consider the typical group to have property $Q$ if $Q$ has density $1$ . We are particularly interested in elementary properties. For a sentence $\varphi$ , the density of the property of satisfying $\varphi$ will be called simply the density of $\varphi$ . The typical group, in the sense of limiting density, is also called the random group.³³3Harrison-Trainor, Khoussainov, and Turetsky [9] took a different approach and considered random structures more along the lines of the Rado graph. The typical group has some properties of free groups. Gromov introduced the property of hyperbolicity and stated that the typical group is hyperbolic. Ol’shanskii [18] showed that for presentations with $m$ generators and $k$ reduced relators, the property of being hyperbolic has limiting density $1$ . Kapovich and Schupp [11] showed that for presentations with $m$ generators and $1$ reduced relator, the property that all minimal generating tuples are Nielsen equivalent has limiting density $1$ . Nielsen equivalence means that one tuple can be transformed into the other by a finite sequence of simple, obviously reversible, kinds of steps.

Benjamin Fine, in conversation with the third author at the JMM in January of 2013, made an off-hand comment to the effect that in the limiting density sense, all groups look free. Fine’s comment gave rise to the conjecture below, saying that for presentations with $m\geq 2$ generators and $1$ relator, the typical group has the same elementary first order theory as the the free group. By a result of Sela [20] (see also Kharlampovich-Miasnikov [12]), the elementary first-order theory of all non-abelian free groups is the same. The conjecture is given in [10, Conjecture 2.2].

Conjecture 1.2 (Knight).

Take groups given by presentations with a fixed $m$ -tuple of generators, for $m\geq 2$ , and $1$ relator (the Kapovich-Schupp model). For all elementary first order sentences $\varphi$ ,

1.

the limiting density exists, and
2.

the density has value $1$ if $\varphi$ is true in the non-abelian free groups, and $0$ otherwise.

There is some evidence for the conjecture. By a result of Arzhantseva and Ol’shanskii [2, 1], a random group obtained from a presentation with $m$ generators and $k$ relators has many free subgroups. Thus, an existential sentence true in the free group is also true in a random group. Kharlampovich and Sklinos [13] used Gromov’s density model, with parameter $d$ . In this setting, they showed the following.

Theorem 1.3 (Kharlampovich and Sklinos [13]).

A random group, in Gromov’s density model with $d\leq 1/16$ , satisfies a universal sentence if and only if the sentence is true in the non-abelian free groups.

The Kharlampovich-Sklinos result implies the conjecture (for universal sentences), but we will not give a proof here.

2 Generalizing Gromov’s question

The question that Gromov asked about groups makes sense for other algebraic varieties as well. We begin by presenting our definition of an algebraic variety; then we discuss the types of presentations we will allow and give some basic lemmas.

2.1 Algebraic varieties

Definition 2.1.

A language is algebraic if it consists only of function symbols and constants.

The term “algebraic variety” is used to mean different things in algebraic geometry and in universal algebra. The definition that we give below is the one from universal algebra.

Definition 2.2 (Algebraic variety).

For a fixed algebraic language $L$ , a class $V$ of $L$ -structures is an algebraic variety, or simply variety, if it is closed under substructures, homomorphic images, and direct products.

For our purposes, it is convenient to use the following equivalent definition, of “equational class.”

Definition 2.3 (Equational class).

For a fixed algebraic language $L$ , a class $V$ of $L$ -structures is an equational class if it is axiomatized by sentences of the form $(\forall\bar{x})t(\bar{x})=t^{\prime}(\bar{x})$ —universal quantifiers in front of an equation.

Birkhoff showed that these two definitions are equivalent. Mal’tsev defined a broader class of theories whose models have well-defined presentations. See [3] for a general overview of universal algebra, where the result below appears as Theorem 11.9.

Theorem 2.4.

For a fixed algebraic language $L$ , a class of $L$ -structures is an equational class if and only if it is a variety.

In the usual language for groups, namely $\{\cdot,^{-1},e\}$ , the group axioms have the required form. Thus, groups form a variety.

Now we consider an arbitrary algebraic variety $V$ . For a fixed generating tuple $\bar{a}$ , there is a well-defined free structure $F$ generated by $\bar{a}$ . If $\mathcal{A}$ is a structure in $V$ generated by $\bar{a}$ , then $\mathcal{A}$ is a quotient of $F$ under an appropriate equivalence relation on terms $t(\bar{a})$ . This equivalence relation becomes equality in $\mathcal{A}$ .

Definition 2.5.

For a variety $V$ , a presentation has the form $\bar{a}|R$ , where $\bar{a}$ is a generating tuple and $R$ is a set of identities on $\bar{a}$ . We write $\langle\bar{a}|R\rangle$ for the structure $\mathcal{A}$ such that the identities $t(\bar{a})=t^{\prime}(\bar{a})$ true in $\mathcal{A}$ are just the ones that follow logically from $R$ and the axioms for $V$ .

We ask what the typical behavior is for members of a variety given by presentations of a specific form.

2.2 Allowable presentations

In this paper, almost all of the languages of our varieties will be either the group language or a language with just unary function symbols. We consider presentations with a fixed generating tuple $\bar{a}$ , say of length $m$ . For the analogue of the Ol’shanskii setting, we consider presentations with $k$ identities for some fixed $k$ . For the analogue of the Kapovich-Schupp setting, we set $k=1$ . This is the primary case we will consider. Where we do consider $k>1$ , our presentations have the form $\bar{a}|R$ , where $R$ is an unordered set of identities.

We may restrict the identities in certain natural ways. For groups, we do what the group theorists do; that is, we suppose that the identities have the form $w(\bar{a})=e$ , where $w(\bar{a})$ is a word representing a product of various $a_{i}$ and $a_{i}^{-1}$ . For the variety in the language consisting of two unary function symbols $S,S^{-1}$ with axioms saying that the two functions are inverses, we may restrict in a similar way, allowing only identities of the form $t(a_{i})=a_{j}$ ; that is, with function symbols only on the left. For the language with finitely many unary function symbols $f_{1},\ldots,f_{n}$ and varieties that do not have axioms explicitly saying that one $f_{j}$ is the inverse of another $f_{i}$ , our identities have the form $t(a_{i})=t^{\prime}(a_{j})$ , where $t(x)$ and $t^{\prime}(x)$ are terms built up from the function symbols.

2.2.1 Length

We will need to measure length of identities in our presentations. We will use the following conventions, based on the restrictions described above.

Definition 2.6.

•

In the setting of groups, the length of an identity of the form $w(\bar{a})=e$ is the number of occurrences of the various $a_{i}$ and $a_{i}^{-1}$ in the word $w(\bar{a})$ . This is the usual length of the relator.
•

For varieties in the language $L$ with just the unary function symbols $f_{1},\ldots,f_{n}$ , the length of an identity of the form $t(a_{i})=t^{\prime}(a_{j})$ is the total number of occurrences of the function symbols in the terms $t$ and $t^{\prime}$ .

2.2.2 Limiting density

As for groups, we consider limiting density. Here is the formal definition of limiting density.

Definition 2.7.

Fix a language, a variety, and a set of allowable presentations with an $m$ -tuple $\overline{a}$ of generators and $k$ identities. We write $P_{s}$ for the number of presentations in which all of the identities have length at most $s$ . For a property $Q$ , let $P_{s}(Q)$ be the number of these presentations for which the resulting structure has property $Q$ . Then the limiting density of $Q$ is $\lim_{s\rightarrow\infty}\frac{P_{s}(Q)}{P_{s}}$ , provided that this limit exists.

We are particularly interested in the case where $Q$ is the property of satisfying an elementary first-order sentence $\varphi$ in the language of the variety, possibly with added constants for the generators. We write $P_{s}(\varphi)$ for the number of presentations in which the identities have length at most $s$ , and the resulting structure satisfies $\varphi$ . We say that $\varphi$ has limiting density $d$ if $\lim\limits_{s\to\infty}\frac{P_{s}(\varphi)}{P_{s}}=d$ .

Definition 2.8.

We say the variety $V$ , with a specified set of allowable presentations, satisfies the zero–one law if for every elementary first-order sentence $\varphi$ in $L$ , $\varphi$ has limiting density $1$ or $0$ .

2.2.3 Sets versus tuples of identities

We have said that our presentations consist of a tuple of generators and an unordered set of distinct identities. Other possibilities would be to consider ordered tuples of identities, with or without repetition. In practice, most of the time, we will consider a single identity. When we do consider more than one identity, we show that the results would be the same for ordered tuples of identities allowing repetition, ordered tuples not allowing repetition, and unordered sets of identities.

As above, we write $P_{s}$ for the number of unordered sets of $k$ identities of length at most $s$ . In the result below, we write $P^{*}_{s}$ for the number of ordered $k$ -tuples allowing repetition, and $P^{**}_{s}$ for the number of ordered $k$ -tuples not allowing repetition.

Proposition 2.9.

Let $N(s)$ be the number of identities in $L$ of length at most $s$ and suppose that $\lim\limits_{s\to\infty}N(s)=\infty$ . Then for any sentence $\varphi$ ,

\lim\limits_{s\to\infty}\frac{P_{s}(\varphi)}{P_{s}}=\lim\limits_{s\to\infty}\frac{P^{*}_{s}(\varphi)}{P^{*}_{s}}=\lim\limits_{s\to\infty}\frac{P^{**}_{s}(\varphi)}{P^{**}_{s}}\ .

Proof.

By definition, $P_{s}={N(s)\choose k}$ , $P^{*}_{s}=N(s)^{k}$ , and $P^{**}_{s}=k!\cdot P_{s}$ . Each unordered set of $k$ identities yields $k!$ ordered $k$ -tuples of identities. Thus, it is clear that

\lim\limits_{s\to\infty}\frac{P_{s}(\varphi)}{P_{s}}=\lim\limits_{s\to\infty}\frac{P^{**}_{s}(\varphi)}{P^{**}_{s}}\ .

To compare $\frac{P_{s}(\varphi)}{P_{s}}$ and $\frac{P^{*}_{s}(\varphi)}{P^{*}_{s}}$ , we need the following:

Claim:

1.

$\frac{k!P_{s}}{P^{*}_{s}}\rightarrow 1$ ,
2.

$\frac{N(s)^{k}-k!P_{s}}{P^{*}_{s}}\rightarrow 0$ .

Proof of Claim.

For (1), the denominator is $N(s)^{k}$ and the numerator is a polynomial in $N(s)$ with leading term $N(s)^{k}$ . For (2), the numerator is a polynomial in $N(s)$ of degree less than $k$ , and the denominator is $N(s)^{k}$ . ∎

Now, we note that

k!P_{s}(\varphi)\leq P^{*}_{s}(\varphi)\leq k!P_{s}(\varphi)+N(s)^{k}-k!{N(s)\choose k}.

Dividing by $P^{*}_{s}=N(s)^{k}$ and letting $s\to\infty$ , we get

\lim\limits_{s\to\infty}\frac{k!P_{s}(\varphi)}{P^{*}_{s}}\leq\lim\limits_{s\to\infty}\frac{P^{*}_{s}(\varphi)}{P^{*}_{s}}\leq\lim\limits_{s\to\infty}\frac{k!P_{s}(\varphi)+N(s)^{k}-k!{N(s)\choose k}}{P^{*}_{s}}.

Using the claim, we see that the right-hand side is

\lim\limits_{s\to\infty}\frac{k!P^{*}_{s}(\varphi)}{k!P^{*}_{s}}=\lim\limits_{s\to\infty}\frac{P^{*}_{s}(\varphi)}{P^{*}_{s}}.

∎

We can now phrase the questions we are interested in more formally.

Question 2.10.

1.

Which varieties (with allowable presentations involving a fixed $m$ -tuple $\bar{a}$ of generators) satisfy the zero–one law?
2.

Given that the zero–one law holds, when do the sentences with limiting density $1$ match those true in the free structure?

2.3 Basic lemmas

Before we begin, we state three lemmas that hold very generally.

Lemma 2.11.

$(\varphi\vee\psi)$ has limiting density $0$ if and only if $\varphi$ and $\psi$ both have limiting density $0$ ; in fact, this holds for any finite disjunction.

Proof.

We have $\frac{P_{s}(\varphi)}{P_{s}},\frac{P_{s}(\psi)}{P_{s}}\leq\frac{P_{s}(\varphi\vee\psi)}{P_{s}}\leq\frac{P_{s}(\varphi)}{P_{s}}+\frac{P_{s}(\psi)}{P_{s}}$ . From this, the lemma is clear. ∎

Lemma 2.12.

$\varphi$ has limiting density $0$ just in case $\neg{\varphi}$ has limiting density $1$ .

Proof.

We have $1=\frac{P_{s}(\varphi)}{P_{s}}+\frac{P_{s}(\neg{\varphi})}{P_{s}}$ . Again, the lemma is clear. ∎

Lemma 2.13.

Let $S$ be the set of $L$ -sentences with limiting density 1. Then $S$ is consistent and is closed under logical implication—if $\varphi_{1},\ldots,\varphi_{n}\in S$ and $\varphi_{1},\ldots,\varphi_{n}\vdash\psi$ , then $\psi\in S$ .

Proof.

Suppose $S$ is not consistent. By the Compactness Theorem, some finite subset is inconsistent. As every sentence in this set has density 1, there is a model of $T$ that realizes all these (finitely many) sentences, a contradiction. By the previous two lemmas, we have that each $\neg\varphi_{i}$ has limiting density 0, so $\bigvee\neg\varphi_{i}$ also has limiting density 0, and so $\bigwedge\varphi_{i}$ has limiting density 1. But, $\bigwedge\varphi_{i}\vdash\psi$ , so $\frac{P_{s}({\psi})}{P_{s}}\geq\frac{P_{s}(\bigwedge\varphi_{i})}{P_{s}}=1$ , and the lemma follows. ∎

3 Illustrative examples

In this section, we consider some of varieties and classes of presentations that illustrate different possibilities. First, we consider the variety of bijective structures, and presentations with a single generator and a single identity in which the function symbols occur only on the left. We show that the sentences true in the free structure are exactly those with limiting density $1$ . Second, we consider a variety of a single unary function and presentations with a single generator and a single identity. Here, we show that a specific sentence that is true in the free structure has limiting density $0$ . Next, we again consider bijective structures, and presentations with a single generator but with two identities. Here, we give sample sentences for which the limiting density does not exist. Finally, we consider the variety of abelian groups, and presentations with a single generator and a single relator. We give sentences for which the limiting density does not exist, and sentences for which the limiting density exists but is neither $0$ nor $1$ .

3.1 Bijective structures

We start with the variety of bijective structures. Recall that the language consists of two unary function symbols $S,S^{-1}$ . The axioms are

(\forall x)SS^{-1}(x)=x\mbox{\ \ and\ \ }(\forall x)S^{-1}S(x)=x\ .

These guarantee that the function $S$ is $1-1$ and onto and that $S^{-1}$ is the inverse of $S$ . Let $T$ be the theory generated by these axioms. The models consist of infinite $\mathbb{Z}$ -chains and finite cycles $\mathbb{Z}_{m}$ . While these structures lack the mathematical interest and importance of groups, it is instructive to consider them because there are relatively simple elementary invariants, and for presentations with a single generator and a single identity, we can calculate the limiting densities for these sentences. It turns out that the analogue of Conjecture 1.2 holds.

Lemma 3.1.

Over the theory $T$ , every sentence is equivalent to a Boolean combination of sentences of the following basic types:

1.

$\alpha(n,k)$ , saying that there are at least $k$ cycles of size $n$ ,
2.

$\beta(n)$ , saying that there is a chain of length at least $n$ .

Proof.

For any model $\mathcal{A}$ of $T$ , we have a natural equivalence relation $\sim$ on the universe, where $a\sim b$ if $S^{m}(a)=b$ for some integer $m$ . Each $\sim$ -class is a copy of $\mathbb{Z}$ or a finite cycle. The isomorphism type of $\mathcal{A}$ is determined by the number of $\sim$ -classes of different types. Each model $\mathcal{A}$ of $T$ is elementarily equivalent to a saturated model $\mathcal{A}^{*}$ , where $\mathcal{A}^{*}$ has infinitely many copies of $\mathbb{Z}$ if there is no finite bound on the sizes of the $\sim$ -classes. From this, we see that the isomorphism type of $\mathcal{A}^{*}$ and the elementary first order theory of $\mathcal{A}$ are determined by the sentences $\alpha(n,k)$ and $\beta(n)$ . ∎

We consider bijective structures with a single generator $a$ . There is a single $\sim$ -class, which has the form $\mathbb{Z}$ , an infinite chain, or $\mathbb{Z}_{m}$ , a cycle of size $m$ . We note that in either $\mathbb{Z}$ or $\mathbb{Z}_{m}$ , all elements are automorphic. The following lemma is clear from the meanings of the sentences $\alpha(n,k)$ and $\beta(n)$ .

Lemma 3.2.

1.

For $k>1$ , $\alpha(n,k)$ is false in both $\mathbb{Z}$ and $\mathbb{Z}_{m}$ ,
2.

$\alpha(n,1)$ is true only in $\mathbb{Z}_{n}$ ,
3.

$\beta(n)$ is true in $\mathbb{Z}$ ; it is true in $\mathbb{Z}_{m}$ if and only if $m>n$ .

For models with a single generator, $\alpha(n,k)$ is false for $k>1$ , and $\beta(n)$ is equivalent to $\bigwedge_{m\leq n}\neg{\alpha(n,1)}$ . Thus, it is enough to consider the elementary invariants of the form $\alpha(n,1)$ .

Here our presentations have a single identity, of the form $t(a)=a$ (function symbols occur only on the left). We may refer to the term $t(a)$ as a relator. For a single relator $t(a)$ , we get $\mathbb{Z}$ if for some $k$ , $t(a)$ has $k$ occurrences of $S$ and $k$ occurrences of $S^{-1}$ . We get $\mathbb{Z}_{m}$ if for some $k$ , $t(a)$ has either $m+k$ occurrences of $S$ and $k$ of $S^{-1}$ or $m+k$ occurrences of $S^{-1}$ and $k$ of $S$ .

We will show that for all $n\geq 1$ , $\alpha(n,1)$ has limiting density $0$ . This implies that $\neg\alpha(n,1)$ , which is true in the free structure, has limiting density $1$ . We will use two combinatorial lemmas. The first is an approximation for $\left(\begin{array}[]{c}2k\\ k\end{array}\right)$ , good for large $k$ . The proof requires the use of Stirling’s formula on all three factorials (see the website of Das [5]).

Lemma 3.3.

$\left(\begin{array}[]{c}2k\\ k\end{array}\right)=(1+o(1))\frac{2^{2k}}{\sqrt{\pi k}}$ .

The second combinatorial lemma is an inequality.

Lemma 3.4.

For all $n\geq 1$ and all $k$ , $2\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)<\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)$ .

Proof.

Recall Pascal’s Identity

\left(\begin{array}[]{c}n\\ k\end{array}\right)+\left(\begin{array}[]{c}n\\ k+1\end{array}\right)=\left(\begin{array}[]{c}n+1\\ k+1\end{array}\right).

We prove the inequality by induction on $k$ . First, for $k=0$ , the inequality just says that $2<n+2$ . Now, suppose $k>0$ . Applying Pascal’s Identity to the right side of the inequality, we get

\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)=\left(\begin{array}[]{c}n+2k+1\\ k\end{array}\right)+\left(\begin{array}[]{c}n+2k+1\\ k+1\end{array}\right)

and then

\left(\begin{array}[]{c}n+2k\\ k-1\end{array}\right)+\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)+\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)+\left(\begin{array}[]{c}n+2k\\ k+1\end{array}\right)\ .

This is clearly greater than $2\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)$ . ∎

To show that $\alpha(n,1)$ has limiting density $0$ , we use several further lemmas.

Lemma 3.5.

$P_{s}=2^{s+1}-1$ .

Proof.

The number of terms of length $m$ is $2^{m}$ , so the number of terms of length at most $s$ is $1+2+\ldots+2^{s}=2^{s+1}-1$ . ∎

Lemma 3.6.

$P_{s}(\alpha(m,1))=\sum_{m+2k\leq s}2\left(\begin{array}[]{c}m+2k\\ k\end{array}\right)$ .

Proof.

For each $m\geq 1$ , and each $k$ , we have $\left(\begin{array}[]{c}m+2k\\ k\end{array}\right)$ terms with $m+k$ occurrences of $S$ and $k$ of $S^{-1}$ , and the same number with the symbols switched. ∎

The next lemma bounds the sum $P_{n+2k}(\alpha(n,1))$ by a single term.

Lemma 3.7.

For all $n\geq 1$ and all $k\geq 0$ , $P_{n+2k}(\alpha(n,1))<\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)$ .

Proof.

We fix $n$ and proceed by induction on $k$ . For $k=0$ , the left side is $P_{n}(\alpha(n,1))=\left(\begin{array}[]{c}n\\ 0\end{array}\right)=1$ , and the right side is $\left(\begin{array}[]{c}n+2\\ 1\end{array}\right)=n+2>1$ . Supposing that the statement holds for $k$ , we prove it for $k+1$ . By Lemma 3.4,

\left(\begin{array}[]{c}n+2(k+2)\\ k+2\end{array}\right)>2\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)=\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)+\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)\ .

By the Induction Hypothesis, this is greater than

\left(\begin{array}[]{c}n+2(k+1)\\ k+1\end{array}\right)+P_{n+2k}(\varphi_{n})=P_{n+2(k+1)}(\varphi_{n})\ .

∎

Now we can show that the limiting density of $\alpha(n,1)$ is $0$ for any $n\geq 1$ . To do so, we must make an odd/even case distinction because the only way to get presentations of different lengths of the same structure is for these lengths to differ by a multiple of two, so $P_{n+2k+1}(\alpha(n,1))$ will equal $P_{n+2k}(\alpha(n,1))$ . However, if $s=n+2k$ for some $k$ , then $P_{s}(\alpha(n,1))$ has a new last term $\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)$ , and $P_{s+1}(\alpha(n,1))=P_{s}(\alpha(n,1))$ . Therefore, it is enough to show that

\frac{P_{n+2k}(\alpha(n,1))}{P_{n+2k}}\rightarrow 0\ .

By Lemma 3.4, the last term of $P_{n+2k}(\alpha(n,1))$ is greater than the sum of the earlier terms. Thus, $P_{n+2k}(\alpha(n,1))<2\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)$ . Recall that $P_{n+2k}=2^{n+2k+1}-1$ , which is strictly greater than $2^{n+2k}$ , so $\frac{P_{n+2k}(\alpha(n,1))}{P_{n+2k}}<\frac{2}{2^{n+2k}}\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)$ . To prove that the limiting density of $\alpha(n,1)$ is $0$ , it is enough to prove that

\frac{2}{2^{n+2k}}\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)\rightarrow 0.

We can express $\frac{2}{2^{n+2k}}\left(\begin{array}[]{c}n+2k\\ k\end{array}\right)$ as a product of two factors, one involving the fixed $n$ and the other not. The first factor is

\frac{2}{2^{n}}\left(\frac{2k+n}{k+n}\right)\left(\frac{2k+(n-1)}{k+(n-1)}\right)\ldots\left(\frac{2k+1}{k+1}\right)\ .

This is an $(n+1)$ -fold product with limit $2$ as $k\rightarrow\infty$ . The second factor is $\frac{1}{2^{2k}}\left(\begin{array}[]{c}2k\\ k\end{array}\right)$ . By Lemma 3.3 above, this is $(1+o(1))\frac{1}{\sqrt{\pi k}}$ , which has limit $0$ . All together, we have shown the following.

Proposition 3.8.

For $n\geq 1$ , $\lim_{s\rightarrow\infty}\frac{P_{s}(\alpha(n,1))}{P_{s}}=0$ .

From this, we get the following.

Theorem 3.9.

For bijective structures with a single generator and a single relator, each sentence $\varphi$ has limiting density equal to $1$ if $\ \mathbb{Z}\models\varphi$ and $\ 0$ otherwise.

For later use, we state below another immediate consequence of Proposition 3.8. For a term $t(a)$ , let $X$ be the difference between the number of occurrences of $S$ and the number of occurrences of $S^{-1}$ in $t(a)$ . The relators that make $\alpha(n,1)$ true are exactly those for which $|X|=n$ .

Lemma 3.10.

For each $k\in\mathbb{Z}$ , the set of presentations $a,t(a)=a$ for which $X=k$ has density $0$ .

3.2 Structures with a single unary function

Next, take the language with a single unary function symbol $f$ , and the variety with no axioms. We consider presentations with a single generator $a$ and a single identity. At first, we focus on the sentence $\varphi$ saying that $f$ is not $1-1$ . We will see that this sentence is false in the free structure, but it has limiting density $1$ . After that, we will consider arbitrary sentences, and prove a zero-one law.

For an identity $f^{(r)}(a)=f^{(r^{\prime})}(a)$ , where $r,r^{\prime}\geq 0$ , the length is $r+r^{\prime}$ . If $r=r^{\prime}$ , then we get an $\omega$ -chain. If $0<r<r^{\prime}$ , then we get a chain of length $r$ leading to a cycle of length $r^{\prime}-r$ . If $0=r<r^{\prime}$ , then we get a cycle of length $r^{\prime}$ . Similarly, if $0<r^{\prime}<r$ , we get a chain of length $r^{\prime}$ leading to a cycle of length $r-r^{\prime}$ , and if $0=r^{\prime}<r$ , we get a cycle of length $r$ .

We write $m+\mathbb{Z}_{n}$ for a chain of length $m$ leading to a cycle of length $n$ , allowing the possibility that $m=0$ . Any structure in our variety obtained from a single generator and a single identity has one of the following forms:

1.

an $\omega$ -chain—this is the free structure,
2.

a finite chain leading to a finite cycle, or
3.

a finite cycle.

Lemma 3.11.

$P_{s}=\frac{1}{2}(s^{2}+3s+2)$ .

Proof.

The number of identities of length $m$ is $m+1$ . Then
$P_{s}=1+2+\ldots+(s+1)=\frac{1}{2}(s+2)(s+1)=\frac{1}{2}(s^{2}+3s+2)$ . ∎

Recall that $\varphi$ holds if the function is not $1-1$ . We can see that $\varphi$ is false in $\omega$ and the finite cycles $\mathbb{Z}_{n}$ , and true in the structures $m+\mathbb{Z}_{n}$ for $m\geq 1$ , so an identity $f^{n}(a)=f^{n^{\prime}}(a)$ yields a structure in which $\varphi$ is false if $n=n^{\prime}$ or if one of $n,n^{\prime}$ is $0$ , and true otherwise. We can easily count the identities of length $m$ that make $\varphi$ false. For $m=0$ , there is just one identity, and it makes $\varphi$ false. For $m=1$ , there are two identities, and both make $\varphi$ false. For $m\geq 2$ , if $m$ is odd, there are just two identities that make $\varphi$ false, and if $m$ is even, there are three identities that make $\varphi$ false. Thus, $P_{s}(\neg{\varphi})=1+2+3+2+\ldots+(\frac{5}{2}+\frac{1}{2}(-1)^{s})$ .

Proposition 3.12.

The limiting density for $\varphi$ is $1$ .

Proof.

For $s\geq 1$ , $P_{s}(\neg{\varphi})\leq 3s$ , and $\lim_{s\rightarrow\infty}\frac{3s}{P_{s}}=0$ . Therefore, $\neg{\varphi}$ has limiting density $0$ , and $\varphi$ has limiting density $1$ . ∎

Figure 1:

m+\mathbb{Z}_{n}

where

m=3

and

n=4

In fact, we will see that each sentence has limiting density $1$ or $0$ . The set of sentences with limiting density $1$ is not that of the free structure. It is the theory of a structure $\mathcal{A}$ that we may think of as a limit of the structures $m+\mathbb{Z}_{n}$ (see Figure 1). The limit structure $\mathcal{A}$ consists of an $\omega$ -chain together with two $\omega^{*}$ -chains and a single $\omega$ -chain that come together at a special point—this point is the end of the two $\omega^{*}$ -chains and the beginning of the $\omega$ -chain (see Figure 2). The chain of length $m$ is replaced, in the limit, by an $\omega$ -chain plus one of the $\omega^{*}$ -chains, and the $n$ -cycle is replaced, in the limit, by the other $\omega$ -chain and $\omega^{*}$ -chain. We note that $\mathcal{A}$ is not finitely generated.

Figure 2: The limit structure

Lemma 3.13.

The theory of $\mathcal{A}$ is generated by the following sentences; note that the elements $a$ and $c$ are defined and not named by constants.

1.

$\psi_{a}$ , saying that there is a unique element $a$ with no $f$ -pre-image,
2.

$\psi_{c}$ , saying that there is a unique element $c$ with two $f$ -pre-images,
3.

$\psi$ , saying that there is no element with more than two $f$ -pre-images,
4.

for every $n\in\omega$ , $\alpha_{n}$ saying that there is no cycle of length $n$ , and
5.

for every $n\in\omega$ , $\beta_{n}$ saying that $a$ and $c$ are not connected by a chain of length $n$ .

Proof.

In any structure in our variety, there is an equivalence relation $\sim$ , where $x\sim y$ if there is a finite sequence $x_{0},\ldots,x_{n}$ such that $x=x_{0}$ , $y=x_{n}$ , and for each $i<n$ , either $f(x_{i})=x_{i+1}$ or $f(x_{i+1})=x_{i}$ . If $\mathcal{B}$ and $\mathcal{C}$ are two saturated models of the sentences above, of the same cardinality, then $\mathcal{B}\cong\mathcal{C}$ . We map the special elements $a$ and $c$ in $\mathcal{B}$ to the corresponding elements of $\mathcal{C}$ . The $\sim$ -class of $a$ is an $\omega$ -chain that does not include $c$ . The $\sim$ -class of $c$ has two $\omega^{*}$ -chains ending in $c$ and one $\omega$ -chain starting with $c$ . The other $\sim$ -classes, not containing $a$ or $c$ , are $\mathbb{Z}$ -chains. There are the same number of these in $\mathcal{B}$ and $\mathcal{C}$ . ∎

Lemma 3.14.

The sentences above that generate the theory of $\mathcal{A}$ all have limiting density $1$ .

Proof.

The sentence $\psi$ , saying that there is no element with more than two $f$ -pre-images, is true in all of the models that we get from a single generator $a$ and a single identity, so this has limiting density $1$ . We have seen that the set of presentations that give models in which $f$ is not $1-1$ has limiting density $1$ . The models have the form $m+\mathbb{Z}_{n}$ for $m,n>0$ . The sentences $\psi_{a}$ , saying that there is a unique element $a$ with no $f$ -pre-image, $\psi_{c}$ , saying that there is a unique element $c$ with two $f$ -pre-images, and $\psi$ are true in all of these models, so these sentences have limiting density $1$ .

Consider $\alpha_{n}$ , saying that there is no cycle of length $n$ . Let $B$ be the set of presentations that make $\alpha_{n}$ false—the resulting model has a cycle of length $n$ . These identities have the form $f^{n+k}(a)=f^{k}(a)$ or $f^{k}(a)=f^{n+k}(a)$ . The number of such identities of length $m$ is $0$ if $m-n$ is odd and $2$ if $m-n$ is even. Then $P_{s}(\alpha_{n})\leq 2s$ . Since $P_{s}=O(s^{2})$ , the limiting density of $B$ is $0$ , so the density of $\alpha_{n}$ is $1$ . Finally, consider $\beta_{n}$ , saying that $a$ and $c$ are not connected by a chain of length $n$ . Let $C$ be the set of presentations that make this false. These identities have the form $f^{n}(a)=f^{n+k}(a)$ or $f^{n+k}(a)=f^{n}(a)$ . The number of such identities of length $m$ is $2$ for all $m\geq 2n$ . Then $P_{s}(C)\leq 2s$ . Again the limiting density of $C$ is $0$ , so the density of $\beta_{n}=1$ . ∎

We conclude the following.

Proposition 3.15.

For all sentences $\varphi$ true in $\mathcal{A}$ , the limiting density is $1$ .

3.3 Bijective structures with two identities

In the next example, we return to the variety of bijective structures as in Section 3.1. As before, our presentations have a single generator $a$ , but there are two identities instead of just one. We will show that the limiting density need not exist. Recall that the language consists of unary function symbols $S,S^{-1}$ and that the axioms say that $S$ and $S^{-1}$ are inverses. The identities have the form $t(a)=a$ , with function symbols only on the left. Each identity is thus equivalent to one of the form $S^{m}(a)=a$ , where $m\in\mathbb{Z}$ .

Proposition 3.16.

For bijective structures, and presentations with a single generator $a$ and two identities, the sentence $\varphi$ saying that the structure is a $1$ -cycle does not have a limiting density.

The proof is somewhat involved. We begin with some elementary lemmas, but eventually we will consider a random walk on a group and appeal to results from random group theory that depend on the Central Limit Theorem. The lemma below tells us when the sentence $\varphi$ is true.

Lemma 3.17.

Let $\mathcal{A}$ be the structure given by an unordered set consisting of two identities, equivalent to $S^{m}(a)=a$ and $S^{m^{\prime}}(a)=a$ . Then $\mathcal{A}$ is a $1$ -cycle if and only if $GCD(m,m^{\prime})=1$ .

Proof.

Note that $S^{k}(a)=a$ if and only if $a=S^{-k}(a)$ . Thus, we may suppose that both $m,m^{\prime}\geq 0$ . First, suppose that $GCD(m,m^{\prime})=1$ . In this case, there are $r,s\in\mathbb{Z}$ such that $mr+m^{\prime}s=1$ . Then we have

S(a)=S^{mr+m^{\prime}s}(a)=S^{mr}(S^{m^{\prime}s}(a))=S^{m}\circ\cdots\circ S^{m}\circ S^{m^{\prime}}\circ\cdots\circ S^{m^{\prime}}(a)=a,

so $\mathcal{A}$ is a $1$ -cycle. Now, suppose that $\mathcal{A}$ is a $1$ -cycle, and let $GCD(m,m^{\prime})=d$ . The axioms of $T$ and the identities $S^{m}(a)=a$ and $S^{m^{\prime}}(a)=a$ are both satisfied in a $d$ -cycle, and $\mathcal{A}$ can only be a $1$ -cycle if $d=1$ . ∎

Our presentations have two identities, but we also need some facts about single identities. We indicate with ^′ that we are considering single identities, writing $P^{\prime}_{=m}$ for the number of identities of length $m$ and $P^{\prime}_{s}$ for the number of length at most $s$ , and writing $P^{\prime}_{=m}(B)$ , $P^{\prime}_{s}(B)$ for the number of these identities in a set $B$ . We reserve $P_{s}$ for the number of unordered pairs of identities of length at most $s$ , and we write $P_{s}(B^{2})$ for the number such that both identities are in $B$ .

For a single identity of the form $t(a)=a$ , let $X$ be the difference between the number of occurrences of $S$ and the number of occurrences of $S^{-1}$ in $t$ . Intuition may suggest that the statement $n|X$ should have limiting density $\frac{1}{n}$ . This turns out to be true for odd $n$ . However, for $n=2$ , we find that the limiting density for the statement $2|X$ does not exist. Essentially, the reason is that the last term of $P^{\prime}_{s}(2|X)=\sum_{m\leq s}P^{\prime}_{=m}(2|X)$ may be greater than the sum of all earlier terms, and this term depends on the parity of $s$ . The lemma below says what happens to $P_{=s}(2|X)$ as the parity of $s$ changes.

Lemma 3.18.

•

For even $m$ , all identities of length $m$ satisfy $2\mid X$ ; none satisfies $2\nmid X$ .
•

For odd $m$ , all identities of length $m$ satisfy $2\nmid X$ and none satisfies $2\mid X$ .

Proof.

For $m=0$ , there is just one identity, and for this identity, $X=0$ . Supposing that the statements hold for $m$ , if $t$ has length $m$ , then $t$ has two extensions of length $m+1$ , and the parity of $X$ changes. ∎

The next lemma gives the proportion of single identities of length at most $s$ for which $2|X$ holds. The value depends on the parity of $s$ .

Lemma 3.19.

1.

$\displaystyle\lim\limits_{s\rightarrow\infty}\frac{P^{\prime}_{2s}(2\mid X)}{P^{\prime}_{2s}}=\frac{2}{3}$ and $\displaystyle\lim\limits_{s\rightarrow\infty}\frac{P^{\prime}_{2s}(2\nmid X)}{P^{\prime}_{2s}}=\frac{1}{3}$ .
2.

$\displaystyle\lim_{s\rightarrow\infty}\frac{P^{\prime}_{2s+1}(2\mid X)}{P^{\prime}_{2s+1}}=\frac{1}{3}$ and $\displaystyle\lim_{s\rightarrow\infty}\frac{P^{\prime}_{2s+1}(2\nmid X)}{P^{\prime}_{2s+1}}=\frac{2}{3}$ .

Proof.

The calculation is based on Lemma 3.18. For (1), the even case, we have $P^{\prime}_{2s}=2^{2s+1}-1=4^{s}\cdot 2-1$ , and

P^{\prime}_{2s}(2\mid X)=\sum_{m\leq s}2^{2m}=\sum_{m\leq s}4^{m}=\frac{4^{s+1}-1}{3}.

Then $\displaystyle\frac{P^{\prime}_{2s}(2\mid X)}{P^{\prime}_{2s}}=\frac{4^{s+1}-1}{3(4^{s}\cdot 2-1)}\rightarrow\frac{2}{3}$ .

For (2), the odd case, we have $P^{\prime}_{2s+1}=2^{2s+2}-1=4^{s+1}-1$ and

P^{\prime}_{2s+1}(2\mid X)=P^{\prime}_{2s}(2\mid X)=\frac{4^{s+1}-1}{3}.

Then $\displaystyle\frac{P^{\prime}_{2s+1}(2\mid X)}{P^{\prime}_{2s+1}}=\frac{4^{s+1}-1}{3\cdot(4^{s+1}-1)}=\frac{1}{3}$ . ∎

So far, the lemmas have involved only elementary calculations. The next result is from random group theory [4, 19], concerning a random walk on a group. The elements of the group $G=\mathbb{Z}_{n}$ represent the possible remainders after division of an integer $z$ by $n$ . In general, for a random walk, there are finitely many states, and given just the current state $s$ , with no more prior history, we have fixed probabilities of passing next to state $s^{\prime}$ . We allow $s^{\prime}=s$ .

Our states are group elements. We write $\mu(g)$ for the probability of going in one step from the identity to $g$ , and we write $\mu^{(k)}(g)$ for the probability of going in $k$ steps from the identity to $g$ . For the result below, the probability measure $\mu$ , defined on $G$ , is supported on a special generating set $\Sigma$ . For $\mu$ to be supported on $\Sigma$ means that $\mu$ assigns non-zero probability to the elements of $\Sigma$ , i.e., the set $\Sigma$ will consist of the group elements reachable from the identity in one step. For any $n$ , $\Sigma$ is also the set of differences $g^{\prime}-g$ , where $g^{\prime}$ is a successor of $g$ reachable in one step. The values of $\mu^{k}(g)$ , for $k>0$ are obtained by considering the tree of $1$ -step extensions of length $k$ starting from the identity. We multiply probabilities along the paths, and then sum over the paths leading to $g$ .

The result below tells us that the probability of each remainder $g$ in $\mathbb{Z}_{n}$ is approximately $\frac{1}{n}$ , and that the convergence (as $k\rightarrow\infty$ ) has a great deal of uniformity.

Theorem 3.20 ([19, Theorem 7.3]).

There exist $\alpha,\beta>0$ such that for any group $G$ of the form $\mathbb{Z}_{n}$ , any generating set $\Sigma$ containing the group identity element, and a probability measure $\mu$ supported on $\Sigma$ , we have that for all $g\in G$ and all $k\in\omega$ ,

\left|\mu^{(k)}(g)-\frac{1}{n}\right|<\alpha e^{-(\frac{\beta k}{n^{2}})}

To adapt this theorem to our setting, we will consider an odd $n$ and identities of even length $m$ . We break the identity into pieces of length 2, so each piece has $1/4$ chance of being each of $SS$ , $S^{-1}S^{-1}$ , $SS^{-1}$ , or $S^{-1}S$ . These correspond to $2,-2$ , and $0$ in the random walk on $\mathbb{Z}_{n}$ , and when $n$ is odd, $\Sigma=\{-2,0,2\}$ generates $\mathbb{Z}_{n}$ . Just as the identities of even length approach a uniform distribution, so do the identities of odd length, and, consequently, so do the identities of length at most $s$ . We have the following:

Corollary 3.21.

For any odd number $n$ and any $s$ ,

\displaystyle\lim_{s\to\infty}\frac{P^{\prime}_{s}(n\mid X)}{P^{\prime}_{s}}=\frac{1}{n}\text{ and }\displaystyle\lim_{s\to\infty}\frac{P^{\prime}_{s}(n\nmid X)}{P^{\prime}_{s}}=\frac{n-1}{n}.

Our presentations have an unordered set of two identities. However, it is easier to count ordered pairs, allowing repetition. By Proposition 2.9, we get the same limiting densities, so we will count ordered pairs allowing repetition of elements instead. Let $C$ be the set of presentations in which the difference functions $X_{1},X_{2}$ are both divisible by some prime $p$ . It follows from Lemma 3.10 that $X_{1}=0$ , and $X_{2}=0$ both have limiting density $0$ . The important part of $C$ consists of the presentations such that $X_{1}\not=0$ and $X_{2}\not=0$ , and in what follows, we write $C$ for this important part. For the presentations in $C$ , there is some prime that divides both $X_{1}$ and $X_{2}$ , and both $X_{1},X_{2}$ are non-zero.

Definition 3.22.

For each $s$ , let $p_{s}$ be the greatest prime $p$ such that
$2\cdot 3\cdots p\leq ln(s)$ .

For every $s$ , we split $C$ into two parts, $C_{1}=C_{s,1}$ and $C_{2}=C_{s,2}$ . Note that this splitting depends on $s$ . For a presentation in $C$ , let $d$ be the least prime that divides both $X_{1},X_{2}$ . Then the presentation is in $C_{s,1}$ if $d\leq p_{s}$ and in $C_{s,2}$ if $d>p_{s}$ . We will show that $C_{s,2}$ has limiting density $\lim\limits_{s\to\infty}\frac{P_{s}(C_{s,2})}{P_{s}}=0$ and that the limiting density of $C_{1}$ does not exist—it toggles between two values, one for even $s$ and the other for odd $s$ . Among the primes, $2$ behaves differently from the odd primes. We have shown that, for single identities with difference $X$ , the limiting density of $2|X$ does not exist. We will see later that this explains why the limiting density of $C_{1}$ does not exist.

Our first goal is to show that $C_{2}$ has limiting density $0$ . Toward this, we consider a single identity with difference $X$ .

Lemma 3.23.

For each odd prime $p$ and all $s$ , $\displaystyle\frac{P^{\prime}_{s}(p|X\ \&\ X\not=0)}{P^{\prime}_{s}}\leq\frac{2}{p+1}$ .

Proof.

We will first prove that $\displaystyle\frac{P^{\prime}_{=m}(p|X\ \&\ X\not=0)}{P^{\prime}_{=m}}\leq\frac{2}{p+1}$ for all $m$ . Note that the numbers $P^{\prime}_{=m}(X=n)$ , for $-m\leq n\leq m$ , form a Pascal triangle. At the top, for $m=0$ , we have $1$ , corresponding to $X=0$ . For $m=1$ , we have $1$ ’s corresponding to $X=\pm 1$ . In general, for even $m$ , $X$ takes the even values $n$ in the interval $[-m,m]$ , and for odd $m$ , $X$ takes the odd values in the interval $[-m,m]$ . In both cases, $P^{\prime}_{=m+1}(X=n)=P^{\prime}_{=m}(X=n-1)+P^{\prime}_{=m}(X=n+1)$ . We can see that $P^{\prime}_{=m}(X=n)$ decreases as $|n|$ increases, and that $P^{\prime}_{=m}(X=n)=P^{\prime}_{=m}(X=-n)$ .

For odd $m$ (so that $X$ is odd), we have

P^{\prime}_{=m}(X=\pm 1)\geq P^{\prime}_{=m}(X=\pm 3)\geq\cdots\geq P^{\prime}_{=m}(X=\pm p)\ \,

P^{\prime}_{=m}(X=\pm(p+2))\geq P^{\prime}_{=m}(X=\pm(p+4))\geq\cdots\geq P^{\prime}_{=m}(X=\pm 3p)

\cdots

Note that there are $p$ terms in each of the lines, except the first line, which has only $\frac{p+1}{2}$ terms. Therefore, we have

	$\displaystyle 1=\sum\limits_{n:\text{odd}}\frac{P^{\prime}_{=m}(X=n)}{P^{\prime}_{=m}}$	$\displaystyle=\frac{P^{\prime}_{=m}(X=\pm 1)}{P^{\prime}_{=m}}+\frac{P^{\prime}_{=m}(X=\pm 3)}{P^{\prime}_{=m}}+\cdots$
		$\displaystyle\geq\left(\frac{p+1}{2}\right)\frac{P^{\prime}_{=m}(X=\pm p)}{P^{\prime}_{=m}}+p\cdot\frac{P^{\prime}_{=m}(X=\pm 3p)}{P^{\prime}_{=m}}+\cdots$
		$\displaystyle\geq\left(\frac{p+1}{2}\right)\left(\frac{P^{\prime}_{=m}(X=\pm p)}{P^{\prime}_{=m}}+\frac{P^{\prime}_{=m}(X=\pm 3p)}{P^{\prime}_{=m}}+\cdots\right)$
		$\displaystyle=\left(\frac{p+1}{2}\right)\frac{P^{\prime}_{=m}(p\mid X\ \&\ X\neq 0)}{P^{\prime}_{=m}}.$

If $m$ is even (so that $X$ is even), then we have

P^{\prime}_{=m}(X=\pm 2)\geq P^{\prime}_{=m}(X=\pm 4)\geq\cdots\geq P^{\prime}_{=m}(X=\pm 2p)

P^{\prime}_{=m}(X=\pm(2p+2))\geq P^{\prime}_{=m}(X=\pm(2p+4))\geq\cdots\geq P^{\prime}_{=m}(X=\pm 4p)

\cdots

In this case, each line has $p$ terms, and we get the following slightly stronger inequality:

	$\displaystyle 1=\sum\limits_{n:\text{even}}\frac{P^{\prime}_{=m}(X=n)}{P^{\prime}_{=m}}$	$\displaystyle=\frac{P^{\prime}_{=m}(X=0)}{P^{\prime}_{=m}}+\frac{P^{\prime}_{=m}(X=\pm 2)}{P^{\prime}_{=m}}+\frac{P^{\prime}_{=m}(X=\pm 4)}{P^{\prime}_{=m}}+\cdots$
		$\displaystyle\geq\frac{P^{\prime}_{=m}(X=\pm 2)}{P^{\prime}_{=m}}+\frac{P^{\prime}_{=m}(X=\pm 4)}{P^{\prime}_{=m}}+\cdots$
		$\displaystyle\geq p\cdot\frac{P^{\prime}_{=m}(X=\pm 2p)}{P^{\prime}_{=m}}+p\cdot\frac{P^{\prime}_{=m}(X=\pm 4p)}{P^{\prime}_{=m}}+\cdots$
		$\displaystyle=p\cdot\frac{P^{\prime}_{=m}(p\mid X\ \&\ X\neq 0)}{P^{\prime}_{=m}}.$

Combining the even and odd case, we get the desired $\frac{P^{\prime}_{=m}(p|X\ \&\ X\not=0)}{P^{\prime}_{=m}}\leq\frac{2}{p+1}$ .

Now, we turn our attention back to the inequality in the lemma, which concerns identities up to a certain length. The quotient $\frac{P^{\prime}_{s}(p\mid X\ \&\ X\neq 0)}{P^{\prime}_{s}}$ is a weighted average (weighted by the proportion of identities of each length) of the probabilities $\frac{P^{\prime}_{=m}(p\mid X\ \&\ X\neq 0)}{P^{\prime}_{=m}}$ , where $m\leq s$ . Thus, the lemma follows from the inequality on identities of a fixed length $\frac{P^{\prime}_{=m}(p|X\ \&\ X\not=0)}{P^{\prime}_{=m}}\leq\frac{2}{p+1}$ . ∎

We are now ready to consider both identities.

Lemma 3.24.

$\displaystyle\lim\limits_{s\to\infty}\frac{P_{s}(C_{2})}{P_{s}}=0$ .

Proof.

Below, we will appeal to Proposition 2.9 and consider, for each $s$ , the probability space consisting of the ordered pairs of identities, each of length at most $s$ . Then the random variables $X_{1},X_{2}$ are independent. Counting ordered pairs of identities and allowing repetition, we see that for each $s$ ,

	$\displaystyle P_{s}(C_{2})$	$\displaystyle\leq\sum\limits_{p>p_{s}}P_{s}(X_{1},X_{2}\not=0\ \&\ p\|X_{1}\ \&\ p\|X_{2})$
		$\displaystyle=\sum\limits_{p>p_{s}}P^{\prime}_{s}(X_{1}\neq 0\ \&\ p\|X_{1})P^{\prime}_{s}(X_{2}\neq 0\ \&\ p\|X_{2}).$

So, it follows from the previous lemma that

	$\displaystyle\frac{P_{s}(C_{2})}{P_{s}}$	$\displaystyle\leq\sum\limits_{p>p_{s}}\frac{P^{\prime}_{s}(X_{1}\neq 0\ \&\ p\|X_{1})}{P^{\prime}_{s}}\frac{P^{\prime}_{s}(X_{2}\neq 0\ \&\ p\|X_{2})}{P^{\prime}_{s}}$
		$\displaystyle\leq\sum_{p>p_{s}}\left(\frac{2}{p+1}\right)^{2}$

By a well-known fact from number theory, the sum of the squares of the reciprocals of primes (or of all natural numbers) converges. Since $\lim\limits_{s\to\infty}p_{s}=\infty$ , we have that $\lim\limits_{s\to\infty}\sum\limits_{p>p_{s}}(\frac{2}{p+1})^{2}=0$ . Thus, $C_{2}$ has limiting density $0$ . ∎

We turn to $C_{1}$ . Again, we consider first a single identity.

Lemma 3.25.

We write $D_{s}$ for the set of identities of length at most $s$ but greater than $\sqrt{s}$ . Then $\frac{P^{\prime}_{s}(D_{s})}{P^{\prime}_{s}}\rightarrow 1$ .

Proof.

We have $P^{\prime}_{s}(D_{s})=P^{\prime}_{s}-P^{\prime}_{\sqrt{s}}$ , and $\frac{P^{\prime}_{s}(D_{s})}{P^{\prime}_{s}}=1-\frac{2^{\sqrt{s}+1}-1}{2^{s+1}-1}\rightarrow 1$ . ∎

Lemma 3.25 may be interpreted as saying that most identities of length at most $s$ have length at least $\sqrt{s}$ . We write $P_{s}(D_{s}^{2})$ for the number of pairs of identities of length at most $s$ such that both have length at least $\sqrt{s}$ . The next lemma says that for most pairs of identities of length at most $s$ , the length of both is at least $\sqrt{s}$ .

Lemma 3.26.

$\lim_{s\rightarrow\infty}\frac{P_{s}(D_{s}^{2})}{P_{s}}\rightarrow 1$ .

Now, $\frac{P_{s}(C_{1})}{P_{s}}$ is the probability that, among pairs of identities of length at most $s$ , with difference functions $X_{1}$ and $X_{2}$ , there is some prime $p\leq p_{s}$ such that $p|X_{1}\ \&\ p|X_{2}$ . We may suppose that both identities have length greater than $\sqrt{s}$ . We have seen that the limiting probability that $2$ divides both $X_{1},X_{2}$ does not exist—for even $s$ , it approaches $\frac{4}{9}$ , while for odd $s$ , it approaches $\frac{1}{9}$ 3.21. Here, we consider odd primes. We have justified thinking of the random variables $X_{1},X_{2}$ (for identities of length at most $s$ ) as independent.

We would like to assume that for $i=1,2$ , the events $p|X_{i}$ for different primes $p$ are independent. This turns out to be “approximately” true. The probability that $X_{1},X_{2}$ are not both divisible by $3$ is approximately $1-\frac{1}{3^{2}}$ . The probability that $X_{1},X_{2}$ are not both divisible by $3$ and not both divisible by $5$ is approximately $(1-\frac{1}{3^{2}})(1-\frac{1}{5^{2}})$ . The probability that $X_{1},X_{2}$ are not both divisible by any odd prime $p\leq p_{s}$ is approximately $\prod\limits_{3\leq p\leq p_{s}}(1-\frac{1}{p^{2}})$ . This formula matches what we would get by laborious inclusion-exclusion counting.

In fact, the divisibilities of $X_{i}$ by different primes may not be independent. However, we can apply the Chinese Remainder Theorem and consider the residue of $X_{i}$ modulo $N_{s}=\prod\limits_{p\leq p_{s}}p$ , which is $2\prod\limits_{3\leq p\leq p_{s}}p$ . It follows from the Definition of $p_{s}$ (Definition 3.22) that $N_{s}\leq\ln(s)$ . This is where the random walk on the group comes in. We will use Theorem 3.20.

Theorem 3.27.

Below, we let $p$ range over all primes:

1.

$\displaystyle\frac{P_{2s}(C_{s,1}^{c})}{P_{2s}}\rightarrow\frac{5}{9}\cdot\prod_{3\leq p}\left(1-\frac{1}{p^{2}}\right)$ .
2.

$\displaystyle\frac{P_{2s+1}(C_{s,1}^{c})}{P_{2s+1}}\rightarrow\frac{8}{9}\cdot\prod_{3\leq p}\left(1-\frac{1}{p^{2}}\right)$ .

Proof.

We prove (1). Take $\alpha,\beta>0$ as in Theorem 3.20. Recall that $X_{1}$ and $X_{2}$ are the difference functions associated with the first and second identities, where in each identity, the function symbols are all on the left. Fixing $s$ , we consider the residue of $X_{1}$ and $X_{2}$ modulo $N_{s}=2\prod\limits_{3\leq p\leq p_{s}}p$ , where $N_{s}$ is at most $\ln(s)$ . For a single identity, we consider a string $t$ of length $k$ . For the fixed $s$ , let $G_{s}$ be the group of possible remainders after division of $X_{1}$ by $N_{s}$ . Theorem 3.20 tells us that for every $0\leq a<N_{s}$ and every $k$ ,

\left|\frac{P^{\prime}_{=k}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{=k}}-\frac{2}{N_{s}}\right|<\alpha e^{\frac{-\beta k}{N_{s}^{2}}}\text{ if $a$ and $k$ have the same parity, and}

\frac{P^{\prime}_{=k}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{=k}}=0\text{ if $a$ and $k$ have different parities}.

When we sum up the identities of length at most $s$ , the previous lemma says that most of them will have length some $k\geq\sqrt{s}$ . Thus, we may assume that $k\geq\sqrt{s}$ . The previous inequality yields

\left|\frac{P^{\prime}_{=k}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{=k}}-\frac{2}{N_{s}}\right|<\alpha e^{\frac{-\beta k}{N_{s}^{2}}}<\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}.

By Lemma 3.19, among the identities of length up to $2s$ , $2/3$ of them are even, and $1/3$ of them are odd. The probability $\frac{P^{\prime}_{2s}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{2s}}$ is a weighted sum of $\frac{P^{\prime}_{=k}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{=k}}$ . Noticing that the rest of the previous inequality does not depend on $k$ , doing a weighted sum gives

\left|\frac{P^{\prime}_{2s}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{2s}}-\frac{2}{3}\cdot\frac{2}{N_{s}}\right|<\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}\text{ if }a\text{ is even, and}

\left|\frac{P^{\prime}_{2s}(X_{1}=a\pmod{N_{s}})}{P^{\prime}_{2s}}-\frac{1}{3}\cdot\frac{2}{N_{s}}\right|<\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}\text{ if }a\text{ is odd.}

By the independence of $X_{1},X_{2}$ , we see that for all sufficiently large $s$ ,

\left|\frac{P_{2s}(X_{1}=a\pmod{N_{s}}\ \&\ X_{2}=b\pmod{N_{s}})}{P_{2s}}-c\frac{4}{N_{s}^{2}}\right|<\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}

where $c=\begin{cases}\frac{1}{9}&\mbox{if $a,b$ are both odd,}\\ \frac{2}{9}&\mbox{if one of $a,b$ is odd and the other is even,}\\ \frac{4}{9}&\mbox{if $a,b$ are both even.}\\ \end{cases}$

Now, we consider pairs $(a,b)$ modulo $N_{s}$ such that no $p\leq p_{s}$ divides both $a$ and $b$ . Note that $(a,b)$ cannot both be even. As we have seen, up to the parities of $a$ and $b$ , for large $s$ , the distribution of $X_{i}$ is approximately uniform, and the distribution of ordered pairs $(X_{1},X_{2})$ is also approximately uniform. For each odd prime $p$ , the fraction of the pairs $(a,b)$ such that $a,b$ are both divisible by $p$ is approximately $\frac{1}{p^{2}}$ . Thus, considering all primes, approximately $\prod\limits_{3\leq p\leq p_{s}}(1-\frac{1}{p^{2}})$ of the possible pairs do not have a common odd prime factor $\leq p_{s}$ . More precisely,

\left|\frac{P_{2s}(\text{no odd prime }p\leq p_{s}\text{ divides both }X_{1},X_{2})}{P_{2s}}-\prod\limits_{3\leq p\leq p_{s}}\left(1-\frac{1}{p^{2}}\right)\right|<N_{s}^{2}\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}.

Finally, considering $p=2$ , the probability that both $a,b$ are even is approximately $\frac{4}{9}$ . Thus, we have that $\frac{P_{2s}(C_{s,1}^{c})}{P_{2s}}$ , the probability that no prime $p\leq p_{s}$ divides both $X_{1}$ and $X_{2}$ , satisfies

\left|\frac{P_{2s}(C_{s,1}^{c})}{P_{2s}}-\frac{5}{9}\cdot\prod\limits_{3\leq p\leq p_{s}}\left(1-\frac{1}{p^{2}}\right)\right|<N_{s}^{2}\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}\leq(\ln s)^{2}\alpha e^{\frac{-\beta\sqrt{s}}{\ln(s)^{2}}}.

Note that the right hand side of the inequality goes to 0 as $s\to\infty$ . Thus,

\lim\limits_{s\to\infty}\frac{P_{2s}(C_{s,1}^{c})}{P_{2s}}=\frac{5}{9}\cdot\prod_{3\leq p}\left(1-\frac{1}{p^{2}}\right).

The proof of (2) is similar. ∎

3.4 Abelian groups

Let $V$ be the variety of abelian groups. To axiomatize $V$ , we add to the group axioms the sentence $(\forall x)(\forall y)x+y=y+x$ .

3.4.1 Elementary invariants

Szmielew [21] carried out an elimination of quantifiers for abelian groups, and she gave elementary invariants. Later, Eklof and Fisher [6] used saturation to give elementary invariants for modules. Their methods also yield the Szmielew invariants for abelian groups. We give invariants for abelian groups below. For a prime $p$ , we write $G[p]$ for the set $\{x\in G:px=0\}$ , which consists of the identity and the elements of order $p$ .

1.

$\alpha(p,n,k)$ , saying $|p^{n}G|\geq k$ ,
2.

$\beta(p,n,k)$ , saying $dim(p^{n}G/_{p^{n+1}G})\geq k$ ,
3.

$\gamma(p,n,k)$ , saying $dim(p^{n}G[p])\geq k$ ,
4.

$\delta(p,n,k)$ , saying $dim(p^{n}G[p]/_{p^{n+1}G[p]})\geq k$ .

We consider presentations with a single generator $a$ and a single relator. The free abelian group on one generator is $\mathbb{Z}$ , and the other abelian groups on one generator are the finite cyclic groups $C_{m}$ . We focus on the sentences $\beta(p,n,1)$ , which say that there is an element divisible by $p^{n}$ and not by $p^{n+1}$ . We will see that these sentences are true in $\mathbb{Z}$ and do not have limiting density $0$ or $1$ . For $p=2$ , the limiting density does not exist, while for odd primes $p$ , the limiting density exists and has a value strictly between $0$ and $1$ .

Lemma 3.28.

1.

$\beta(p,n,1)$ is true in $\mathbb{Z}$ .
2.

$\beta(p,n,1)$ is true in $C_{m}$ if and only if $p^{n+1}|m$ .

Proof.

For (1), we note that in $\mathbb{Z}$ , the element $p^{n}$ witnesses the truth of $\beta(p,n,1)$ . For (2), consider $C_{m}$ . For some $r$ (possibly $0$ ) and some $m^{\prime}$ relatively prime to $p$ , we have $m=p^{r}\cdot m^{\prime}$ , and then $C_{m}\cong C_{p^{r}}\oplus C_{m^{\prime}}$ . If $r>n$ , then $C_{p^{r}}$ has an element divisible by $p^{n}$ and not by $p^{n+1}$ , and otherwise, there is no such element. Furthermore, all elements of $C_{m^{\prime}}$ are divisible by all powers of $p$ . So, $C_{m}$ has elements divisible by $p^{n}$ if and only if $r>n$ . ∎

A relator of length $m$ has the form $w(a)=\sum_{i\leq m}d_{i}a$ , where $d_{i}=\pm 1$ . We consider the relator of length $0$ , representing the empty sum, to be $0$ .

Lemma 3.29.

$P_{s}=1+2+\ldots+2^{s}=2^{s+1}-1$ .

Proof.

There is one relator of length $0$ . For $m\geq 1$ , there are $2^{m}$ possible relators $t(a)=d_{1}a+\ldots+d_{m}a$ , $d_{i}=\pm 1$ . Then we have
$P_{s}=1+2+2^{2}+\ldots+2^{s}=\frac{1-2^{s+1}}{1-2}=2^{s+1}-1$ . ∎

We consider the limiting density for $\beta(p,n,1)$ for various combinations of $p$ and $n$ . Recall that the sentence $\beta(p,n,1)$ is true in $\mathbb{Z}$ , and it is true in $C_{m}$ provided that $p^{n+1}|m$ . Let $A$ be the set of relators that give $\mathbb{Z}$ , and for $r<p^{n+1}$ , let $B_{r}$ be the set of relators that give $\mathbb{Z}_{m}$ for the various $m\equiv_{p^{n+1}}r$ . We write $P_{s}(A)$ and $P_{s}(B_{r})$ for the number of relators of length at most $s$ in the sets $A$ and $B_{r}$ .

Lemma 3.30.

1.

$P_{s}(A)=1+\sum\limits_{0<2m\leq s}\left(\begin{array}[]{cc}2m\\ m\end{array}\right)$ ,
2.

$A$ has limiting density $0$ .

Proof.

For (1), note that the relator $t(a)$ gives $\mathbb{Z}$ just in case $t(a)$ has even length $2m$ for some $m$ , and $a$ and $-a$ each occur $m$ times. For (2), look back at Section 3.1 where we encountered the same quantities as the current $P_{s}$ and $P_{s}(A)$ . There, we saw that $\lim_{s\rightarrow\infty}\frac{P_{s}(A)}{P_{s}}=0$ . ∎

Recall that for a relator $t(a)$ , $X$ is the difference between the number of occurrences of $a$ and the number of occurrences of $a^{-1}$ in the term $t(a)$ . For a relator of even length $2m$ , $X$ takes only even values $0,\pm 2,\ldots,\pm 2m$ . For an identity of odd length $2m+1$ , $X$ takes only odd values $\pm 1$ , $\pm 3$ , $\ldots$ , $\pm(2m+1)$ . Using arguments similar to those in Section 3.3, we will show that the limiting density of $2^{n+1}|X$ does not exist, and for odd primes $p$ , the limiting density of $p^{n+1}|X$ is $\frac{1}{p^{n+1}}$ .

Lemma 3.31.

For $p=2$ and $n\geq 0$ , the limiting density of $2^{n+1}|X$ does not exist. In particular, $\frac{P_{2s}(2^{n+1}|X)}{P_{2s}}\rightarrow(\frac{2}{3})(\frac{1}{2^{n}})$ , and $\frac{P_{2s+1}(2^{n+1}|X)}{p_{2s+1}}\rightarrow(\frac{1}{3})(\frac{1}{2^{n}})$ .

Proof.

We begin with the case where $n=0$ . Here the calculations are straightforward. We have $P_{=2m}(2|X)=2^{2m}$ , and $P_{=2m+1}(2|X)=0$ . Then $P_{2s}(2|X)=1+2^{2}+2^{4}+\ldots+2^{2s}=1+4+4^{2}+\ldots+4^{s}=\frac{4^{s+1}-1}{3}$ . Therefore, $\frac{P_{2s}(2|X)}{P_{2s+1}}=\frac{(\frac{4^{s+1}-1}{3})}{2^{2s+2}-1}\rightarrow\frac{2}{3}$ . Since $P_{2s+1}(2|X)=P_{2s}$ , we have $\frac{P_{2s+1}(2|X)}{P_{2s+1}}=\frac{(\frac{4^{s+1}-1}{3})}{2^{2s+2}-1}\rightarrow\frac{1}{3}$ . What we have shown is that if $E$ is the set of relators of even length, then $\frac{P_{2s}(E)}{P_{2s}}\rightarrow\frac{2}{3}$ and $\frac{P_{2s+1}(E)}{P_{2s+1}}\rightarrow\frac{1}{3}$ .

For $n\geq 1$ , we again use Theorem 3.20. For every presentation in $E^{c}$ , we have $2^{n+1}\nmid X$ , so we may condition on $E$ when we consider even length $2m$ . In the relation $w(a)=\sum_{i\leq 2m}d_{i}a$ , we can consider the sums $d_{1}+d_{2},d_{3}+d_{4},\dots,d_{2m-1}+d_{2m}$ . This gives us an $m$ -step random walk on $\mathbb{Z}_{2^{n+1}}$ with each step being $2$ with probability $\frac{1}{4}$ , $-2$ with probability $\frac{1}{4}$ , and $0$ with probability $\frac{1}{2}$ . Dividing everything by 2, we get a random walk with support $\{1,0,-1\}$ on $\mathbb{Z}_{2^{n}}$ . Thus, Theorem 3.20 applies. We have $2^{n+1}\mid X$ exactly when the random walk ends at $0\in\mathbb{Z}_{2^{n}}$ . The probability of this is $\frac{P_{=2m}(2^{n+1}|X)}{P_{=2m}}\to 1/2^{n}$ .

Now, as in the proof of Theorem 3.27, we have that most identities of length at most $s$ will have length at least $\geq\sqrt{s}$ . Since the rate of convergence in Theorem 3.20 is exponential and all identities in $E$ have even length, we can pass from the probability for identities of a fixed length to the probability for identities with length $\leq s$ , and we get $\frac{P_{2s}(2^{n+1}|X\mid E)}{P_{2s}(E)}\to 1/2^{n}$ .

Since $2^{n+1}\mid X$ only when $X$ is even, i.e., the identity is in $E$ , and the above probability $1/2^{n}$ was conditioned to $E$ , we have the desired

\frac{P_{2s}(2^{n+1}|X)}{P_{2s}}=\frac{P_{2s}(2^{n+1}|X\mid E)}{P_{2s}}=\frac{P_{2s}(2^{n+1}|X\mid E)}{P_{2s}(E)}\cdot\frac{P_{2s}(E)}{P_{2s}}\to(\frac{1}{2^{n}})\cdot(\frac{2}{3}).

The odd case can be proved similarly. ∎

Lemma 3.32.

For odd primes $p$ , $p^{n+1}|X$ has limiting density $\frac{1}{p^{n+1}}$ .

Proof.

For a fixed even length $2m$ , we get a random walk on $\mathbb{Z}_{p^{n+1}}$ supported on $\{2,0,-2\}$ —a single step increases the length by $2$ . By Theorem 3.20, we have $\frac{P_{2m}(p^{n+1}\mid X)}{P_{2n}}\to\frac{1}{p^{n+1}}$ . This random walk converges to the uniform distribution for even lengths, and the same is true for the odd lengths.

As before, we split the set of relators of length at most $s$ into two parts, those of length less than $\sqrt{s}$ , and those of length at least $\sqrt{s}$ . Let $S_{s}$ be the number of relators of length at most $s$ for which the length is less than $\sqrt{s}$ , and let $L_{s}$ be the number for which the length is at least $\sqrt{s}$ . Then $\frac{S_{s}}{P_{s}}\rightarrow 0$ , so $\frac{L_{s}}{P_{s}}\rightarrow 1$ . Then the exponential rate of convergence of Theorem 3.20 gives

\frac{P_{s}(p^{n+1}|X)}{P_{s}}\rightarrow\frac{1}{p^{n+1}}.

∎

4 Generalizing

In this section, we give general conditions that imply some of the behaviors that we saw in Section 3. Our languages will have finitely many unary function symbols, and we may also allow finitely many constants.

4.1 Generalized bijective varieties

In Section 3.1, we considered the variety with axioms saying of a pair of unary function symbols $S,S^{-1}$ that they are inverses, and we showed that for presentations with a single generator $a$ and a single identity of the form $t(a)=a$ , the sentences true in the free structure are exactly those with limiting density $1$ . In this subsection and the next, we turn our attention to varieties of structures with multiple bijective unary functions, possibly with additional axioms. We might suppose that the language of has unary function symbols $g_{1},g_{1}^{-1},\cdots,g_{n},g_{n}^{-1}$ , and that our varieties have axioms saying that for each $i$ , $g_{i}$ and $g_{i}^{-1}$ are inverses. However, the assumption that the functions have inverses named by function symbols turns out to be unnecessary once we know that the functions are $1-1$ and onto.

Definition 4.1.

Let $L$ be a language with unary function symbols $f_{1},\ldots,f_{n}$ , and let $V$ be an algebraic variety with theory $T$ . The variety is generalized bijective if for all $i$ , $T\vdash(\forall x,y)(f_{i}(x)=f_{i}(y)\rightarrow x=y)$ and
$T\vdash(\forall y)(\exists x)\ f_{i}(x)=y$ .

The result below says that for a generalized bijective variety, the basic functions have inverses named by terms.

Proposition 4.2.

Let $T$ be the theory of a generalized bijective variety in the language $\{f_{1},\ldots,f_{n}\}$ . Then for each $f_{i}$ , there is some word $u_{i}$ such that
$T\vdash(\forall x)f_{i}\circ u_{i}(x)=u_{i}\circ f_{i}(x)=x$ .

Proof.

Fix $i$ , and let $F$ be the free structure on one generator $a$ . There is some $b\in F$ with $f_{i}(b)=a$ . We can express $b$ as $u_{i}(a)$ for some word $u_{i}$ . Then $F\models f_{i}\circ u_{i}(a)=a$ . Recall that in a variety, if an atomic formula is true of the generating tuplea in the free structure, then it holds on all tuples in all structures [3, Theorem 11.4]. Thus, $T\vdash(\forall x)\ f_{i}\circ u_{i}(x)=x$ . In $F$ , let $x=f_{i}(a)$ . We have $f_{i}\circ u_{i}\circ f_{i}(a)=f_{i}(a)$ . Since $f_{i}$ is injective, this means that $F\models u_{i}\circ f_{i}(a)=a$ . Hence, $T\models(\forall x)\ u_{i}\circ f_{i}(x)=x$ . This completes the proof. ∎

Definition 4.3.

Let $V$ be a variety in the language $\{f_{1},\ldots,f_{n}\}$ , where each $f_{i}$ is unary. The variety is commutative if the axioms imply $(\forall x)f_{i}(f_{j}(x))=f_{j}(f_{i}(x))$ for all $i,j$ .

Our main general result, Theorem 4.26, says that for a commutative generalized bijective variety $V$ and presentations with a single generator and a single identity, the zero–one law holds. Moreover, the sentences with density $1$ are those true in the free structure. To prove Theorem 4.26, we will use a version of Gaifman’s Locality Theorem, which we discuss below.

4.2 Gaifman’s Locality Theorem

We state a special version of Gaifman’s Locality Theorem for generalized bijective varieties, and we sketch a proof using saturation. Fix a language $L$ consisting of unary function symbols $f_{1},\ldots,f_{m}$ . Below, we define the Gaifman graph of an $L$ -structure. Gaifman defined the graph for structures in a finite relational language. When convenient, we treat the unary functions as binary relations.

Definition 4.4.

Let $\mathcal{A}$ be an $L$ -structure. The Gaifman graph of $\mathcal{A}$ is the undirected graph with universe equal to that of $\mathcal{A}$ , and with an edge between $x$ and $y$ if and only if $f_{i}(x)=y$ or $f_{i}(y)=x$ for some $i$ .

We define an equivalence relation $\sim$ on $\mathcal{A}$ such that $x\sim y$ if $x$ and $y$ belong to the same connected component in the Gaifman graph; i.e., there is a finite path leading from $x$ to $y$ .

Definition 4.5 (distance, $d(x,y)$ ).

For $x,y\in\mathcal{A}$ , the distance between $x$ and $y$ is the least $r$ such that there is a path of length $r$ from $x$ to $y$ . We write $d(x,y)\geq r$ , $d(x,y)>r$ to indicate that the distance is, respectively, at least $r$ , or greater than $r$ .

Remark: Elements $x,y$ lie in different connected components just in case $d(x,y)>r$ for all $r$ .

We consider substructures of $\mathcal{A}$ . Note that two connected components, thought of as substructures, are isomorphic if there is a map from one onto the other that preserves the unary functions $f_{i}$ , which we think of as binary relations. The structure $\mathcal{A}$ is determined, up to isomorphism, by the number of connected components of different isomorphism types.

Definition 4.6 ( $r$ -ball, $B_{r}(a)$ , $B_{r}(\bar{a})$ ).

Let $\mathcal{A}$ be a structure and let $r\in\omega$ .

1.

For $a\in\mathcal{A}$ , the $r$ -ball around $a$ is $B_{r}(a)=\{x\in\mathcal{A}:d(a,x)\leq r\}$ .
2.

For $\bar{a}\in\mathcal{A}^{n}$ , we write $B_{r}(\bar{a})$ for the set $\cup_{i<n}B_{r}(a_{i})$ .
3.

We write $B_{\infty}(a)$ for the connected component of $a$ , or $\cup_{r}B_{r}(a)$ .
4.

We write $B_{\infty}(\bar{a})$ for the union of the connected components of elements of $\bar{a}$ , or $\cup_{i}B_{\infty}(a_{i})$ .

Let $V$ be a generalized bijective variety for the language $L$ . For $\mathcal{A}\in V$ , each element has a unique image and a unique pre-image under each $f_{i}$ . We show that for each $r$ and $n$ , there is a finite set of formulas $\alpha(\bar{x})$ that describe, for all $\mathcal{A}\in V$ , the possible substructures $B_{r}(\bar{a})$ for $n$ -tuples $\bar{a}$ .

Lemma 4.7.

Let $V$ be a generalized bijective variety for the language $L$ . For each $r$ and $n$ , there is a finite set $C_{r,n}$ of formulas $\alpha(\bar{x})$ , such that

1.

for each $\mathcal{A}\in V$ , each $n$ -tuple $\bar{a}$ in $\mathcal{A}$ satisfies a unique formula $\alpha(\bar{x})\in C_{r,n}$ ,
2.

for all $\mathcal{A},\mathcal{A}^{\prime}\in V$ , if $n$ -tuples $\bar{a}$ in $\mathcal{A}$ and $\bar{a}^{\prime}$ in $\mathcal{A}^{\prime}$ satisfy the same formula $\alpha(\bar{x})\in C_{r,n}$ , then there is an isomorphism from $B_{r}(\bar{a})$ onto $B_{r}(\bar{a}^{\prime})$ that takes $\bar{a}$ to $\bar{a}^{\prime}$ .

Moreover, we may take the formulas $\alpha(\bar{x})$ in $C_{r,n}$ to be existential. We may equally well take them to be universal.

Proof.

We describe the possible elements of $B_{r}(\bar{x})$ inductively as follows. The set $B_{0}(\bar{x})$ has just the members of the $n$ -tuple $\bar{x}$ as possible elements. Now, suppose we have the possible elements of $B_{r}(\bar{x})$ for some $r\geq 0$ . We will set the possible elements of $B_{r+1}(\bar{x})$ to be the elements of $B_{r}(\bar{x})$ together with additional possible elements $z$ obtained as follows: Take some $y\in B_{r}(\bar{x})$ corresponding to a node at a distance $r$ from some $x\in\bar{x}$ and follow an arrow labeled $f_{i}$ or $f_{i}^{-1}$ from $y$ to $z$ ; please note that $f_{i}^{-1}$ is shorthand for the term that acts as an inverse to $f_{i}$ from Proposition 4.2.

We may think of the possible elements of $B_{r}(\bar{x})$ as terms $u(x)$ , where $u$ is a string of $f_{i}$ , $f_{i}^{-1}$ of length at most $r$ . For an actual structure in our generalized bijective variety, with an actual tuple $\bar{a}$ corresponding to $\bar{x}$ , we may have equalities—different paths may lead to the same point. For $\mathcal{A}\in V$ generated by $\bar{a}$ , the elements of $B_{r}(\bar{a})$ are equivalence classes of terms $u(a_{i})$ , where $u$ is a string of $f_{i},f_{i}^{-1}$ of length at most $r$ . We have an existential formula saying that there exist $y$ ’s corresponding to the possible elements of $B_{r}(\bar{x})$ such that the structure has a specific atomic diagram. We also have a universal formula saying that for all $y$ ’s corresponding to the possible elements of $B_{r}(\bar{x})$ , the structure has a specific atomic diagram. ∎

We fix sets of formulas $C_{r,n}$ as in the lemma. Gaifman’s Locality Theorem says that any formula $\varphi(\bar{x})$ (in a relational language) can be expressed as a finite Boolean combination of “local” formulas and “local” sentences (see the references [15], [7], [14]). For our setting, we take the local formulas and local sentences to be as follows.

Definition 4.8.

1.

The $r$ -local formulas $\bar{x}$ are those in $C_{r,n}$ for various $n$ .
2.

The $r$ -local sentences have one of the following forms:
1. (a)
  
  $(\exists v_{1},\cdots,v_{s})\left(\bigwedge\limits_{i}\alpha_{i}(v_{i})\ \&\bigwedge\limits_{i<j}d^{>2r}(v_{i},v_{j})\right)$ ,
  for some $s$ and $\alpha_{i}(x)\in C_{r,1}$ ,
2. (b)
  
  $(\exists v)\alpha(v)$ , for some $\alpha\in C_{r,1}$ .

Remark.

This definition is similar to Gaifman’s, except that we allow only special formulas in $C_{r,n}$ . Note that the formulas in $C_{r,n}$ already give information on whether the distance between $x_{i}$ and $x_{j}$ is greater than $2r$ . Indeed, if $d(x_{i},x_{j})\leq 2r$ , the formula will contain a conjunct that says (in the rational language) $t(x_{i})=t^{\prime}(x_{j})$ for some $t,t^{\prime}$ of length at most $r$ . Thus, we may equivalently replace 2(a) by $(\exists v_{1},\cdots,v_{s})\alpha(v_{1},\cdots,v_{s})$ for some $\alpha\in C_{r,s}$ . We chose the form above to stay closer to Gaifman’s definition.

Definition 4.9.

A formula or sentence is local if it is $r$ -local for some $r$ .

Here is our special version of Gaifman’s Locality Theorem, where the local formulas and sentences are as defined above.

Theorem 4.10.

Let $V$ be a generalized bijective variety with theory $T$ .

1.

Any elementary first order sentence $\varphi$ is equivalent over $T$ to a sentence $\varphi^{*}$ that is a finite Boolean combination of local sentences.
2.

Any elementary first order formula $\varphi(\bar{x})$ with free variables $\bar{x}$ is equivalent over $T$ to a formula $\varphi^{*}(\bar{x})$ that is finite Boolean combination of local sentences and local formulas. In fact, we may take $\varphi^{*}(\bar{x})$ to be a finite disjunction of formulas $\alpha_{i}(\bar{x})\ \&\ \beta$ , where for each $i$ , $\alpha_{i}(\bar{x})$ is a single local formula, and $\beta$ is a finite conjunction of local sentences and negations of local sentences.

We sketch a proof using saturation. We begin with some definitions and lemmas.

Definition 4.11.

•

For $\mathcal{A}\in V$ , the local theory of $\mathcal{A}$ is the set of all local sentences and negations of local sentences that are true in $\mathcal{A}$ .
•

For $\bar{a}$ in $\mathcal{A}$ , the local type of $\bar{a}$ is the set of formulas generated by the local theory and the set of local formulas true of $\bar{a}$ in $\mathcal{A}$ .

Note that for $\bar{a}$ in $\mathcal{A}$ and $\bar{a}^{\prime}$ in $\mathcal{A}^{\prime}$ of the same length, if the local type of $\bar{a}$ in $\mathcal{A}$ is contained in the local type of $\bar{a}^{\prime}$ in $\mathcal{A}^{\prime}$ , then the local types are the same.

Lemma 4.12.

Let $\mathcal{A},\mathcal{A}^{\prime}\in V$ . If $n$ -tuples $\bar{a}$ in $\mathcal{A}$ and $\bar{a^{\prime}}$ in $\mathcal{A}^{\prime}$ satisfy the same local type, then there is a partial isomorphism $f$ from $B_{\infty}(\bar{a})$ onto $B_{\infty}(\bar{a}^{\prime})$ such that $f(\bar{a})=\bar{a}^{\prime}$ .

Proof.

The fact that the tuples $\bar{a}$ and $\bar{a}^{\prime}$ satisfy the same local type means that the structures $\mathcal{A}$ and $\mathcal{A}^{\prime}$ satisfy the same local theory, and for each $r$ , the tuples $\bar{a},\bar{a}^{\prime}$ satisfy the same unique formula $\alpha(\bar{x})\in C_{r,n}$ . By Lemma 4.7, for each $r$ , there is an isomorphism $p$ from $B_{r}(\bar{a})$ onto $B_{r}(\bar{a}^{\prime})$ taking $\bar{a}$ to $\bar{a}^{\prime}$ . We have a tree of these finite partial isomorphisms $p$ between $B_{\infty}(\bar{a})\subseteq\mathcal{A}$ and $B_{\infty}(\bar{a}^{\prime})\subseteq\mathcal{A}^{\prime}$ , where at level $r$ , we put the isomorphisms from $B_{r}(\bar{a})$ onto $B_{r}(\bar{a}^{\prime})$ that take $\bar{a}$ to $\bar{a}^{\prime}$ , and at level $r+1$ , the successors of a given partial isomorphism $p$ from level $n$ are the extensions of $p$ taking $B_{r+1}(\bar{a})$ isomorphically onto $B_{r+1}(\bar{a}^{\prime})$ . If $B_{\infty}(\bar{a})$ is infinite, then the tree is infinite, and it is finitely branching, so by König’s Lemma, there is a path $(p_{r})_{r\in\omega}$ . The desired isomorphism is $\cup_{r}p_{r}$ . If the substructure $B_{\infty}(\bar{a})$ is finite, then it is contained in $B_{r}(\bar{a})$ for some $r$ , and $p_{r}$ is the desired isomorphism. ∎

For any $\mathcal{A}\in V$ , the isomorphism type of $\mathcal{A}$ is determined by the number of connected components of each isomorphism type. Suppose $\mathcal{A}$ is saturated, of infinite cardinality $\kappa$ . In $\mathcal{A}$ , a local type $\Gamma(\bar{x})$ is satisfied if it is finitely satisfied. For a local type $\Gamma(x)=\{\alpha_{r}(x):r\in\omega\}$ , there are at least $n$ realizations of $\Gamma(x)$ on different connected components if and only if for all $r$ , $\mathcal{A}$ satisfies the $r$ -local sentence saying that there are at least $n$ elements satisfying $\alpha_{r}(x)$ and at a distance greater than $2r$ . The number of connected components with an element satisfying $\Gamma(x)$ is either finite or $\kappa$ . This yields the following.

Lemma 4.13.

Suppose $\mathcal{A},\mathcal{A}^{\prime}\in V$ are saturated and of the same cardinality $\kappa$ . If $\mathcal{A},\mathcal{A}^{\prime}$ satisfy the same local sentences, then $\mathcal{A}\cong\mathcal{A}^{\prime}$ .

Proof.

Since $\mathcal{A},\mathcal{A}^{\prime}$ are saturated, of the same cardinality, and satisfy the same local sentences, they realize the same local types, and they have the same number of connected components of each isomorphism type. Hence, they are isomorphic. ∎

Knowing what the saturated structures in the variety $V$ look like, we see that for any countable $\mathcal{A}\in V$ , there exists a saturated structure $\mathcal{A}^{*}$ of cardinality $2^{\aleph_{0}}$ such that $\mathcal{A},\mathcal{A}^{*}$ satisfy the same local sentences.

Lemma 4.14.

If $\mathcal{A},\mathcal{A}^{\prime}\in V$ have the same local theory, then they are elementarily equivalent.

Proof.

Let $\mathcal{A}^{*}$ and $(\mathcal{A}^{\prime})^{*}$ be saturated models of the common local theory of $\mathcal{A},\mathcal{A}^{\prime}$ such that $\mathcal{A}^{*},(\mathcal{A}^{\prime})^{*}$ both have cardinality $2^{\aleph_{0}}$ . Applying Lemma 4.13, we see that $\mathcal{A}^{*}\cong(\mathcal{A}^{\prime})^{*}$ . Hence, $\mathcal{A},\mathcal{A}^{\prime}$ are elementarily equivalent. ∎

Lemma 4.15.

Take $n$ -tuples $\bar{a},\bar{a}^{\prime}$ in $\mathcal{A}$ . If $\bar{a},\bar{a}^{\prime}$ satisfy the same local type, then there is an automorphism of $\mathcal{A}$ that takes $\bar{a}$ to $\bar{a}^{\prime}$ .

Proof.

We have a partial isomorphism $f$ from $B_{\infty}(\bar{a})$ onto $B_{\infty}(\bar{a}^{\prime})$ such that $f(\bar{a})=\bar{a}^{\prime}$ . This extends to an automorphism that agrees with $f$ on $B_{\infty}(\bar{a})$ , with $f^{-1}$ on $B_{\infty}(\bar{a}^{\prime})$ , and with the identity on the rest of $\mathcal{A}$ . ∎

Lemma 4.16.

If $\mathcal{A}\models\varphi$ , then there is a sentence $\psi$ true in $\mathcal{A}$ such that $\psi$ is a finite conjunction of local sentences and negations of local sentences and $T\vdash(\psi\rightarrow\varphi)$ .

Proof.

If $S$ is the local theory of $\mathcal{A}$ , then $T\cup S\vdash\varphi$ . Then there is some $\psi$ , the conjunction of a finite subset of $S$ , such that $T\vdash(\psi\rightarrow\varphi)$ . ∎

For a formula $\varphi(\bar{x})$ with an $n$ -tuple $\bar{x}$ of variables, we have the following.

Lemma 4.17.

If $\mathcal{A}\models\varphi(\bar{a})$ , then there is a formula $\psi(\bar{x})=(\alpha(\bar{x})\ \&\ \beta)$ such that $\beta$ is a finite conjunction of sentences in the local theory of $\mathcal{A}$ , $\alpha(\bar{x})$ is a local formula satisfied by $\bar{a}$ in $\mathcal{A}$ , and $T\vdash(\forall\bar{x})(\psi(\bar{x})\rightarrow\varphi(\bar{x}))$ .

Proof.

We have a saturated model $\mathcal{B}$ of cardinality $2^{\aleph_{0}}$ with a tuple $\bar{b}$ satisfying the type of $\bar{a}$ . If $\mathcal{B}^{\prime}$ is saturated and satisfies the local theory of $\mathcal{A}$ and $\mathcal{B}$ , there is an isomorphism $f$ from $\mathcal{B}$ onto $\mathcal{B}^{\prime}$ . If $\bar{b}^{\prime}$ is an $n$ -tuple in $\mathcal{B}^{\prime}$ satisfying the local type of $\bar{a}$ and $\bar{b}$ , we may suppose that $f(\bar{b})=\bar{b}^{\prime}$ . Hence, $\bar{b}^{\prime}$ realizes the complete type of $\bar{a}$ . This shows that the local theory of $\mathcal{A}$ and the local type of $\bar{a}$ generate the full theory and type. If $\chi(\bar{x})$ is a finite conjunction of local formulas and negations of local formulas in the local type of $\bar{a}$ , then there is a single formula $\alpha(\bar{x})$ in the local type of $\bar{a}$ that implies $\chi(\bar{x})$ —take $\alpha(\bar{x})\in C_{r,n}$ for sufficiently large $r$ . ∎

A standard model-theoretic argument gives the following.

Proposition 4.18.

Any elementary first order sentence $\varphi$ is equivalent over $T$ to a finite disjunction of local sentences and negations of such sentences.

Proof.

For each $\mathcal{A}\in V$ satisfying $\varphi$ , choose $\psi$ as in Lemma 4.16, a finite conjunction of local sentences and negations, true in $\mathcal{A}$ , such that $T\vdash(\psi\rightarrow\varphi)$ . Let $S$ be the set of chosen sentences. Now, $T\cup\{\neg{\psi}:\psi\in S\}\cup\{\varphi\}$ is inconsistent, so there is a finite set $S^{\prime}\subseteq S$ such that $T\vdash(\varphi\rightarrow\bigvee_{\psi\in S^{\prime}}\psi)$ . Then $\varphi$ is equivalent over $T$ to the disjunction of the sentences in $S^{\prime}$ . ∎

Here is the companion result for formulas with free variables.

Proposition 4.19.

For any formula $\varphi(\bar{x})$ with free variables among $\bar{x}$ , there is a formula $\varphi^{*}(\bar{x})$ equivalent over $T$ to $\varphi(\bar{x})$ such that $\varphi^{*}(\bar{x})$ is a finite disjunction of formulas $(\alpha(\bar{x})\ \&\ \beta)$ , where $\beta$ is a conjunction of local sentences and negations and $\alpha(\bar{x})$ is a local formula.

Proof.

We replace $\bar{x}$ with a tuple of constants $\bar{c}$ . For each model $\mathcal{A}$ of $T$ and each tuple $\bar{a}$ satisfying $\varphi(\bar{x})$ , choose a formula $\psi(\bar{x})$ in the local type of $\bar{a}$ such that $T\vdash(\psi(\bar{c})\rightarrow\varphi(\bar{c}))$ . Let $S$ be the set of chosen formulas. Now, $T\cup\{\neg{\psi(\bar{c})}:\psi(\bar{c})\in S\}\cup\{\varphi(\bar{c})\}$ is inconsistent, so for some finite $S^{\prime}\subseteq S$ , $T\vdash(\varphi(\bar{c})\rightarrow\bigvee_{\psi(\bar{c})\in S^{\prime}}\psi(\bar{c}))$ . We may take $\psi$ of the form $\alpha(\bar{x})\ \&\ \beta$ , where $\beta$ is the conjunction of the local sentences in $S^{\prime}$ and $\alpha(\bar{x})$ is the local formula in $C_{r,n}$ that is true of $\bar{a}$ , where $r$ is greatest such that $S^{\prime}$ contains a formula in $C_{r,n}$ . ∎

Remark.

For our special version of Gaifman’s Locality Theorem, the local formulas may be taken to be either existential or universal. Thus, over a completion of $T$ (or over the set of local sentences in the complete theory), each formula is equivalent to an existential formula, and to a universal formula.

4.3 The group associated to a generalized bijective variety

Let $V$ be a generalized bijective variety with theory $T$ . There is an equivalence relation on strings of function symbols such that strings $t,t^{\prime}$ are equivalent if $T\vdash(\forall x)t(x)=t^{\prime}(x)$ . For a string of symbols $t$ , we may write $\operatorname{len}(t)$ for the length of $t$ . We will associate to the variety $V$ a group $G(V)$ , whose elements are the equivalence classes of strings.

Definition 4.20 (Gaifman group, $G(V)$ ).

For a generalized bijective variety $V$ , the Gaifman group is the group $G(V)$ consisting of equivalence classes of strings of symbols under the operation induced by concatenation of strings.

The identity in $G(V)$ is the equivalence class of the empty string. For each function symbol $f_{i}$ , we fix a term $u_{i}$ that names the inverse, as in Proposition 4.2. We may write $f_{i}^{-1}$ for $u_{i}$ . The inverse function extends in a natural way to any word $v$ in $f_{1},\cdots,f_{n}$ . Let $F$ be the element of $V$ obtained as the free structure generated by the finite tuple $\bar{a}$ . The group $G(V)$ has a natural action on $F$ , taking $t\in G(V)$ and $b\in F$ to $t(b)$ . Since $b=t^{\prime}(a)$ for some $t^{\prime}$ , the action takes $t^{\prime}(a)$ to $t\circ t^{\prime}(a)$ .

Definition 4.21 (orbit under action of $G(V)$ ).

For $\mathcal{A}\in V$ and $b\in\mathcal{A}$ , the orbit of $b$ under the action of $G(V)$ is the set of all $x$ such that for some $t\in G(V)$ , $t(b)=x$ .

Note.

For $\mathcal{A}\in V$ and $b\in\mathcal{A}$ , the orbit of $b$ under the action of $G(V)$ is just the set of elements of $\mathcal{A}$ generated by $b$ . The automorphism orbit of $b$ results from the action of the group of automorphisms.

Lemma 4.22.

Let $V$ be a generalized bijective variety, and let $F$ be the free structure in $V$ generated by the tuple $\bar{a}$ . The action of $G(V)$ on $F$ is well defined and simply transitive on the orbits.

Proof.

We first prove that the action is well defined. Suppose $t_{1}=t_{2}$ in $G(V)$ . Without loss of generality, we assume that $t_{1}$ is obtained from $t_{2}$ by applying an identity of $G(V)$ , say $w=w^{\prime}$ . This means that $t_{1}=uw(w^{\prime})^{-1}v$ and $t_{2}=uv$ for some words $u,v$ . Then $(t_{1},t^{\prime}(a))\mapsto t_{1}\circ t^{\prime}(a)=uw(w^{\prime})^{-1}vt^{\prime}(a)$ . Since $w=w^{\prime}$ is an identity in $G(V)$ , we have $T\vdash(\forall x)w(x)=(w^{\prime})(x)$ and for an element $a$ of $F$ , $F\models uw(w^{\prime})^{-1}vt^{\prime}(a)=uw^{\prime}(w^{\prime})^{-1}vt^{\prime}(a)=uvt^{\prime}(a)=t_{2}t^{\prime}(a)$ . Thus, the action is well defined.

Recall that every element $x$ of $F$ has the form $t(a_{i})$ for some generator $a_{i}$ , and every such $x$ is in the orbit of $a_{i}$ . Thus, every orbit in $F$ has the form $\{t(a_{i}):t\in G(V)\}$ for some generator $a_{i}$ . Now, take $x=t(a_{i})$ in $F$ and suppose that $F\models u\circ t(a_{i})=v\circ t(a_{i})$ . Since $F$ is free, we have that $T\models(\forall x)ut(x)=vt(x)$ . Therefore, $ut=vt$ holds in the group $G(V)$ , so by cancellation, we have $u=v$ . Thus, the action is simply transitive on its orbits. ∎

For our commutative generalized bijective variety with theory $T$ , we have the following.

Lemma 4.23.

1.

For $u,v,w\in G(V)$ , $T\vdash(\forall x)(u(w(x))=v(w(x))\leftrightarrow u(x)=v(x))$ .
2.

For $\alpha\in C_{r,1}$ , $T\vdash(\forall x)(\alpha(w(x))\leftrightarrow\alpha(x))$ .

For structures $\mathcal{A}\in V$ with a single generator $a$ , all elements have the same local type. In fact, they are in the same automorphism orbit as well as the same orbit under the action of $G(V)$ .

Lemma 4.24.

Suppose $\mathcal{A}\in V$ is generated by $a$ . For $\alpha\in C_{r,1}$ ,
$\mathcal{A}\models(\exists x)\alpha(x)\leftrightarrow(\forall x)\alpha(x)$ .

Consider a local sentence $\rho$ saying that there exists $\bar{x}$ with $x_{i}$ satisfying $\alpha_{i}\in C_{r,1}$ and with $d(x_{i},x_{j})>2r$ for $i<j$ . For $\mathcal{A}$ generated by a single element $a$ , $\rho$ cannot be true unless the $\alpha_{i}$ ’s are all the same and $\mathcal{A}$ has a tuple of elements $\bar{x}$ such that $d(x_{i},x_{j})>2r$ for $i<j$ . Thus, the important local invariants are the sentences $(\exists x)\alpha(x)$ for $\alpha\in C_{r,1}$ and the sentences saying that there are at least $n$ elements at a distance at least $2r$ . We will show that for these important sentences, the ones true in $F$ have density $1$ .

For a string $t$ of function symbols, we write $t^{n}$ for the $n$ -fold concatenation of $t$ . We write $\langle t\rangle$ for the subgroup of $G(V)$ generated by the equivalence class of $t$ —the elements are the equivalence classes of the strings $t^{n}$ , $t^{-n}=(t^{-1})^{n}$ . We need to understand truth in the structure $\mathcal{A}$ with presentation $a|R$ , where $R$ is a single identity. Any identity is equivalent over $T$ to a canonically chosen identity of the form $t^{*}(a)=a$ , where the length of $t^{*}$ is bounded by a constant multiple of the length of $R$ . The next lemma will tell us a great deal about truth in $\mathcal{A}$ .

Lemma 4.25.

Let $V$ be a generalized bijective variety, and consider presentations $a|R$ , where $R$ is an identity equivalent to one of the form $t^{*}(a)=a$ . Then for $u,v\in G(V)$ , $\langle a|R\rangle\models u(a)=v(a)$ iff $u,v$ are in the same left coset of $\langle t^{*}\rangle$ .

Proof.

Let $\mathcal{A}=\langle a|R\rangle$ .

$\Leftarrow$ : Without loss of generality, suppose $v=u(t^{*})^{n}$ . In $\mathcal{A}$ , we have

v(a)=u(t^{*})^{n}(a)=u(t^{*})^{n-1}(a)=\cdots=u(a)\ .

$\Rightarrow$ : Now, suppose $u(a)=v(a)$ in $\mathcal{A}$ . Then $T\cup\{t^{*}(a)=a\}$ must prove

u(a)=x_{0}(a)=x_{1}(a)=\cdots=x_{\ell}(a)=v(a)\ ,

where for each $i<\ell$ , we have one of the following:

(i)

$x_{i+1}=x_{i}t^{*}$ ,
(ii)

$x_{i}=x_{i+1}t^{*}$ , or
(iii)

$x_{i}(a)=x_{i+1}(a)$ .

In the first two cases, $x_{i}$ and $x_{i+1}$ are clearly in the same left coset of $\langle t^{*}\rangle$ . In the third case, $x_{i}=x_{i+1}$ in $G(V)$ , so again $x_{i}$ and $x_{i+1}$ are in the same left coset of $\langle t^{*}\rangle$ . ∎

For a given identity $u(a)=v(a)$ , we are interested in the identities $R$ such that $\langle a|R\rangle\models u(a)=v(a)$ . The lemma above lets us recognize these identities. We come to the theorem that gives conditions under which the sentences true in the free structure have limiting density $1$ .

Theorem 4.26.

Let $V$ be a commutative generalized bijective variety in the language $\{f_{1},\cdots,f_{n}\}$ , and consider presentations with a single generator $a$ and a single identity. Let $F$ be the free structure on $a$ . If $F$ is infinite, then the sentences true in $F$ have limiting density $1$ .

Proof.

We show that for the important sentences $\alpha$ , if $\alpha$ is true in $F$ , then it has density $1$ , and if $\alpha$ is false in $F$ , then it has density $0$ . For structures in $V$ with generator $a$ , the important sentences say one of the following:

1.

$(\exists x)\alpha(x)$ for $\alpha\in C_{r,1}$ —this is equivalent to a finite conjunction of formulas of the form $u(a)=v(a)$ or $u(a)\not=v(a)$ .
2.

$(\exists x_{1},\ldots x_{n})\bigwedge_{i<j}d(x_{i},x_{j})>2r$ .

If $F$ is infinite, then we can show that any sentence of the second form true in $F$ is implied over $T$ by a sentence of the first form true in $F$ . A saturated model of the theory of $F$ has infinitely many connected components, and the sentence $(\exists x_{1},\ldots x_{n})\bigwedge_{i<j}d(x_{i},x_{j})>2r$ is clearly true in this model. Therefore, it is true in $F$ . Take witnesses $x_{1},\ldots,x_{n}$ , where $x_{i}=w_{i}(a)$ . Choose $k$ such that all $x_{i}$ are in $B_{k}(a)$ , and take $\alpha\in C_{1,k}$ true of $a$ in $F$ . Then over $T$ , $(\exists x)\alpha(x)$ implies $(\exists x_{1},\ldots x_{n})\bigwedge_{i<j}d(x_{i},x_{j})>2r$ .

The group $G(V)$ is abelian and finitely generated, so it is a finite direct product of cyclic groups generated by some elements $b_{1},\cdots,b_{k}$ . We write $\Pi_{i}(x)$ for the projection of an element $x$ on the subgroup generated by $b_{i}$ . Since $G(V)$ is infinite, some $b_{i}$ must have infinite order. Without loss of generality, we suppose $b_{1}$ has infinite order and generates a copy of $\mathbb{Z}$ . We focus on $\Pi_{1}(x)$ , and we suppose that the values are integers.

Each identity $R$ has the form $t(a)=t^{\prime}(a)$ , but this is equivalent to an identity of the form $t^{*}(a)=a$ . Let $e_{0}=\max_{i}|\Pi_{1}(f_{i})|$ . If $\operatorname{len}(t)\leq r$ , then the projection $\Pi_{1}(t)$ is an integer bounded by $r\cdot e_{0}$ . If $\operatorname{len}(t),\operatorname{len}(t^{\prime})\leq r$ , then $d(t,t^{\prime})\leq 2r$ . Then $|\Pi_{1}(t)-\Pi_{1}(t^{\prime})|\leq 2r\cdot e_{0}$ . To prove Theorem 4.26, it is enough to show that all statements of the form $t(a)=t^{\prime}(a)$ or $t(a)\not=t^{\prime}(a)$ true in $F$ have limiting density $1$ . The proof consists of two steps.

1.

The first step is to show that for a fixed $k$ , the set of presentations
$a|t(a)=t^{\prime}(a)$ such that $|\Pi_{1}(t)-\Pi_{1}(t^{\prime})|<k$ has limiting density $0$ .
2.

The second step is to show that for a fixed $k$ and a fixed identity $R$ of the form $t(a)=t^{\prime}(a)$ , if $|\Pi_{1}(t)-\Pi_{1}(t^{\prime}))|>e_{0}k$ , then for any $u$ , $v$ such that $d(u,v)\leq k$ in the Gaifman graph $G(F)$ , we have $F\models u(a)=v(a)$ if and only if $\langle a\mid R\rangle\models u(a)=v(a)$ .

Toward the first step, we prove some lemmas.

Lemma 4.27.

1.

The number of identities of length $m$ is $n^{m}(m+1)$ . Furthermore, for every $0\leq k\leq m+1$ , there are exactly $n^{m}$ identities of length $m$ in which $t$ (the string of function symbols on the left side) has length $k$ .
2.

$P_{s}=\frac{n^{s+1}(s+2)(n-1)+1}{(n-1)^{2}}$ .

Proof.

For (1), the number of strings of function symbols of length $m$ is $n^{m}$ . To determine an identity $t(a)=t^{\prime}(a)$ , we choose one of the $m+1$ initial segments to serve as the left-hand side. For (2), we simply note that

	$\displaystyle P_{s}$	$\displaystyle=$	$\displaystyle\sum_{0\leq m\leq s}(m+1)n^{m}=(1+2n+\ldots+(s+1)n^{s})$
		$\displaystyle=$	$\displaystyle\displaystyle\frac{(s+2)n^{s+2}-(s+2)n^{s+1}+1}{(n-1)^{2}}=\frac{n^{s+1}(s+2)(n-1)+1}{(n-1)^{2}}.$

∎

The next lemma may by interpreted as saying that a random identity of length $\leq s$ has length $>\sqrt{s}$ .

Lemma 4.28.

$\lim_{s\rightarrow\infty}\frac{P_{s^{2}}-P_{s}}{P_{s^{2}}}=1$ .

Proof.

Using Lemma 4.27, we get $\frac{P_{s}}{P_{s^{2}}}=\frac{n^{s+1}(s+2)(n-1)+1}{n^{s^{2}+1}(s^{2}(n-1)+2)+1}$ . This clearly has limit $0$ , so $\frac{P_{s^{2}}-P_{s}}{P_{s^{2}}}=1-\frac{P_{s}}{P_{s^{2}}}$ has limit $1$ . ∎

Let $P_{=m}$ be the number of identities of length exactly $m$ , and let $P_{=m}(A)$ be the number of identities in $A$ of length equal to $m$ . Calculating the limit of $\frac{P_{=s}(A)}{P_{=s}}$ is often easier than calculating the limit of $\frac{P_{s}(A)}{P_{s}}$ . The lemma below gives us permission to do that.

Lemma 4.29.

For any set $A$ of identities of arbitrary length, if $\frac{P_{=s}(A)}{P_{=s}}$ has limit $0$ , then so does $\frac{P_{s}(A)}{P_{s}}$ .

Proof.

We show that for $\epsilon>0$ , there is some $m$ such that for $s\geq m$ , $\frac{P_{s}(A)}{P_{s}}<\epsilon$ . Take $m_{1}$ such that for all $s\geq m_{1}$ , we have $\frac{P_{=s}(A)}{P_{=s}}<\frac{\epsilon}{2}$ , and take $m_{2}$ such that for all $s$ such that $\sqrt{s}\geq m_{2}$ , we have $\frac{P_{\sqrt{s}}}{P_{s}}<\frac{\epsilon}{2}$ . Let $s\geq m_{1},m_{2}$ . Then

P_{s}(A)-P_{\sqrt{s}}(A)=\sum_{\sqrt{s}<m\leq s}P_{=m}(A)<\frac{\epsilon}{2}\sum_{\sqrt{s}<m\leq s}P_{=m}=\frac{\epsilon}{2}(P_{s}-P_{\sqrt{s}}).

This gives us

\frac{P_{s}(A)}{P_{s}}=\frac{P_{\sqrt{s}}}{P_{s}}+\frac{P_{s}(A)-P_{\sqrt{s}}(A)}{P_{s}}<\frac{\epsilon}{2}+\frac{\epsilon}{2}\cdot\frac{P_{s}-P_{\sqrt{s}}}{P_{s}}<\frac{\epsilon}{2}+\frac{\epsilon}{2}=\epsilon.\

∎

The next lemma will complete the first step of the proof of Theorem 4.26. We write $t$ and $t^{\prime}$ for both strings of function symbols and elements of $G(V)$ .

Lemma 4.30.

For every $k\in\mathbb{N}$ , we have

\lim\limits_{s\to\infty}\frac{P_{s}(|\Pi_{1}(t)-\Pi_{1}(t^{\prime})|<k)}{P_{s}}=0\ .

Proof.

By Lemma 4.29, it suffices to prove that

\lim\limits_{s\to\infty}\frac{P_{=s}(|\Pi_{1}(t)-\Pi_{1}(t^{\prime})|<k)}{P_{=s}}=0.

Furthermore, since $k$ is fixed, it is enough to prove that for every $k\in\mathbb{Z}$ ,

\lim\limits_{s\to\infty}\frac{P_{=s}(\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k)}{P_{=s}}=0.

Fix $s$ . The identities of length $s$ form a finite probability space, and the random variables $\Pi_{1}(t)$ and $\Pi_{1}(t^{\prime})$ are not independent. By Lemma 4.27, we may consider $\Pi_{1}(t)-\Pi_{1}(t^{\prime})$ conditioned on $t$ having length $\ell$ . Then $\operatorname{len}(t^{\prime})=s-\ell$ . For each $\ell\leq s$ , the number of identities with $\operatorname{len}(t)=\ell$ and $\operatorname{len}(t^{\prime})=s-\ell$ is equal to the number of strings of length $s$ , so the probability that $\operatorname{len}(t)=\ell$ is $\frac{1}{s+1}$ . The probability that $\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k$ is the sum over $\ell\leq s$ of the probability that $\operatorname{len}(t)=\ell$ times the conditional probability that $\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k$ given $\operatorname{len}(t)=\ell$ . We have

\frac{P_{=s}(\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k)}{P_{=s}}

=\frac{1}{s+1}\sum\limits_{\ell=0}^{s}\frac{P_{=s}(\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k\ \&\ \operatorname{len}(t)=\ell\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}{P_{=s}(\operatorname{len}(t)=\ell\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}.

We write $X_{\ell}$ for $\Pi_{1}(t)$ conditioned on $t$ having length $\ell$ . Then, as a random variable, $X_{\ell}$ is a sum of $\ell$ i.i.d. random variables $Y_{\ell_{k}}$ whose value is equal to the projection of the $k^{th}$ symbol. All function symbols are equally likely. Thus, with probability $\frac{1}{n}$ , $Y$ will be $\Pi_{1}(f_{i})$ for $1\leq i\leq n$ . As $s\to\infty$ , we have $\ell\to\infty$ . By the Central Limit Theorem, we have that $X_{\ell}/\ell$ converges to a normal distribution. This means that, in particular, for every $\epsilon$ , there is some $\ell_{\epsilon}$ such that for every $\ell>\ell_{\epsilon}$ , the probability that $X_{\ell}=i$ is less than $\epsilon$ for all $i$ ; i.e.,

\frac{P_{=s}(\Pi_{1}(t)=i\ \&\ \operatorname{len}(t)=\ell)}{P_{=s}(\operatorname{len}(t)=\ell)}<\epsilon.

Without loss of generality, we will assume that $\operatorname{len}(t)\geq\operatorname{len}(t^{\prime})$ , so $\ell\geq s/2$ . Thus, $\ell>\ell_{\epsilon}$ whenever $s>2\ell_{\epsilon}$ .

Now, we have that

		$\displaystyle\frac{P_{=s}(\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k\ \&\ \operatorname{len}(t)=\ell\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}{P_{=s}(\operatorname{len}(t)=\ell\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}$
	$\displaystyle=$	$\displaystyle\sum\limits_{i}\frac{P_{=s}(\Pi_{1}(t)=i\ \&\ \operatorname{len}(t)=\ell)}{P_{=s}(\operatorname{len}(t)=\ell)}\cdot\frac{P_{=s}(\Pi_{1}(t^{\prime})=i-k\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}{P_{=s}(\operatorname{len}(t^{\prime})=s-\ell)}$
	$\displaystyle<$	$\displaystyle\sum\limits_{i}\epsilon\cdot\frac{P_{=s}(\Pi_{1}(t^{\prime})=i-k\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}{P_{=s}(\operatorname{len}(t^{\prime})=s-\ell)}$
	$\displaystyle<$	$\displaystyle\ \ \epsilon.$

Combining these, we get

		$\displaystyle\lim\limits_{s\to\infty}\frac{P_{=s}(\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k)}{P_{=s}}$
	$\displaystyle=$	$\displaystyle\lim\limits_{s\to\infty}\frac{1}{s+1}\sum\limits_{\ell=0}^{s}\frac{P_{=s}(\Pi_{1}(t)-\Pi_{1}(t^{\prime})=k\ \&\ \operatorname{len}(t)=\ell\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}{P_{=s}(\operatorname{len}(t)=\ell\ \&\ \operatorname{len}(t^{\prime})=s-\ell)}$
	$\displaystyle=$	$\displaystyle\ \ 0.$

∎

We proceed to the second step of the proof. Recall that $e_{0}=\max_{i}|\Pi_{1}(f_{i})|$ .

Lemma 4.31.

Fix $R$ of the form $t(a)=t^{\prime}(a)$ , and fix $k$ such that
$|\Pi_{1}(t)-\Pi_{1}(t^{\prime}))|>e_{0}k$ . For any $u$ , $v$ at a distance $\leq k$ in the Gaifman graph of $F$ , we have $F\models u(a)=v(a)$ if and only if $\langle a\mid R\rangle\models u(a)=v(a)$ , where $\langle a\mid R\rangle$ is the structure given by the presentation $a|R$ .

Proof.

Based on the discussion before Lemma 4.27, we can see that if $u(a)$ and $v(a)$ are adjacent in the Gaifman graph, then $|\Pi_{1}(u)-\Pi_{1}(v)|=|\Pi_{1}(f_{i})|\leq e_{0}$ for some $f_{i}$ . Thus, if $d(u,v)\leq k$ in the Gaifman graph, then $|\Pi_{1}(u)-\Pi_{1}(v)|\leq e_{0}k$ . We will also write $t^{*}=t^{-1}\circ t^{\prime}$ , where $t^{-1}$ is the term that is the inverse of $t$ in the theory of the commutative generalized bijective variety. Note that $t^{-1}$ may be longer than $t$ , but this does not affect the argument below.

It is easy to see that if $u(a)=v(a)$ holds in $F$ , then it holds in the structure $\langle a|R\rangle$ , where $R$ is $t(a)=t^{\prime}(a)$ , which is equivalent to $t^{*}(a)=a$ . Suppose $\langle a\mid R\rangle\models u(a)=v(a)$ . By Lemma 4.25, this implies that $u,v$ are in the same left coset of $\langle t^{*}\rangle$ ; i.e., $u^{-1}v\in\langle t^{*}\rangle$ . Taking the projection $\Pi_{1}$ , we see that $\Pi_{1}(u^{-1}v)\in\Pi_{1}(\langle t^{*}\rangle)$ . For some integer $k$ , we have $u^{-1}v=(t^{*})^{k}\in\langle t^{*}\rangle$ , and $\Pi_{1}((t^{*})^{k})=k\cdot\Pi_{1}(t^{*})$ . However, by assumption, $|\Pi_{1}(t^{*})|=|\Pi_{1}(t^{-1}t^{\prime})|>e_{0}|k|$ , and we have $|\Pi_{1}(u^{-1}v)|\leq e_{0}|k|$ . Therefore, we must have $k=0$ . It follows that $\Pi_{1}(u^{-1}v)=0\cdot\Pi_{1}(t^{*})=0$ . Moreover, $u^{-1}v=(t^{*})^{0}$ . It follows that $u=v$ in $G(V)$ , and $F\models u(a)=v(a)$ . ∎

We are ready to complete the proof of the theorem. We just need to show that the sentences of the form $u(a)=v(a)$ or $u(a)\not=v(a)$ true in $F$ have limiting density $1$ . By Lemma 4.30, for any integer $k$ , the set of identities $t(a)=t^{\prime}(a)$ such that $|\Pi_{1}(t)-\Pi_{1}(t^{\prime})|>\epsilon_{0}|k|$ has density $1$ . For a fixed sentence $u(a)=v(a)$ , take $k$ such that $u,v$ both have length at most $\frac{k}{2}$ , so that $u(a),v(a)$ are at distance at most $k$ . Then by Lemma 4.31, the sentence $u(a)=v(a)$ holds in $F$ iff it holds in the structures given by identities $t(a)=t^{\prime}(a)$ such that $|\Pi_{1}(t)=\Pi_{1}(t^{\prime})|>\epsilon_{0}k$ , where this set has density $1$ . ∎

This theorem can be generalized to presentations with multiple generators.

Proposition 4.32.

Let $V$ be a commutative generalized bijective variety in the language $\{f_{1},\cdots,f_{n}\}$ and suppose that the free structure on $a$ is infinite. Then for the structures in $V$ with an $m$ -tuple $\overline{a}$ of generators and a single identity, the sentences true in the free structure on $\overline{a}$ have limiting density $1$ .

To do so, we need the following lemma.

Lemma 4.33.

Let $V$ be a commutative generalized bijective variety, with theory $T$ . Let $F_{m}$ be the free structure on $m$ generators. Suppose that $F_{1}$ is infinite. Then for all $m\geq 1$ , $F_{m}$ and $F_{1}$ satisfy the same theory.

Proof.

All elements of $F_{1}$ have the same local type. Now, $F_{1}$ has a saturated elementary extension $F^{*}$ whose Gaifman graph has infinitely many connected components. Let $A$ be the substructure of $F^{*}$ extending $F_{1}$ and generated by an $m$ -tuple $a_{1},\ldots,a_{m}$ from different connected components. Clearly, $F_{1}$ and $F^{*}$ satisfy the same special local sentences. Since the sentences are existential, any special local sentence true in $F_{1}$ is true in $A$ , and any special local sentence true in $A$ is true in $F^{*}$ .

We may suppose that $F_{m}$ has generators $a_{1},\ldots,a_{n}$ . The connected component of $a_{i}$ in $F_{m}$ and in $A$ is generated by $a_{i}$ —the elements are named by terms $t(a_{i})$ . The special $r$ -local formula $\alpha(x)\in C_{r,1}$ true of the elements of $F_{1}$ is true of each $a_{i}$ in $F_{m}$ and in $A$ . We have an isomorphism from $F_{m}$ onto $A$ that takes $a_{i}$ to $a_{i}$ and takes $B_{r}(a_{i})$ in $F_{m}$ to $B_{r}(a_{i})$ in $A$ . Then $F_{1}$ and $F_{m}$ have the same theory. ∎

Proof of Proposition 4.32.

For presentations with $m$ generators and a single identity, we consider separately the set $M$ of presentations in which the identity involves a single generator and the complementary set $\neg{M}$ in which the identity involves two distinct generators. For a presentation $\bar{a}|t_{1}(a_{i})=t_{2}(a_{i})$ in $M$ , the resulting structure is the disjoint union of the structure $\langle a_{i}|t_{1}(a_{i})=t_{2}(a_{i})\rangle$ (with generator $a_{i}$ ) and $(m-1)$ copies of $F_{1}$ (one for each of the other $a_{j}$ ’s). The identities in $\neg{M}$ have the form $t_{1}(a_{i})=t_{2}(a_{j})$ for $i\not=j$ . In the structure $\langle\overline{a}|t_{1}(a_{i})=t_{2}(a_{j})\rangle$ , the connected component of $a_{i}$ and the connected component of $a_{j}$ are collapsed via the relation $t(a_{i})=t_{1}^{-1}t_{2}t(a_{j})$ . Thus, the structure is a disjoint union of $(m-1)$ copies of the free structure on one generator.

For fixed $s$ , we have a finite probability space. For a sentence $\varphi$ , the probability that $\varphi$ is true is $\frac{P_{s}(\varphi)}{P_{s}}=\frac{P_{s}(M\ \&\ \varphi)}{P_{s}}+\frac{P_{s}(\neg{M}\ \&\ \varphi)}{P_{s}}$ . For presentations with a single generator, we write $P^{\prime}_{s}$ and $P_{s}^{\prime}(\varphi)$ . By Theorem 4.26,

\frac{P^{\prime}_{s}(\varphi)}{P^{\prime}_{s}}\rightarrow\left\{\begin{array}[]{cc}1&\mbox{if $F_{1}\models\varphi$,}\\ 0&\mbox{otherwise}.\end{array}\right.

Now, $\frac{P_{s}(M\ \&\ \varphi)}{P_{s}}$ is the probability of $(M\ \&\ \varphi)$ . This is equal to the probability of $M$ times the conditional probability of $\varphi$ given $M$ . The probability of $M$ is $\frac{1}{n}$ . The conditional probability of $\varphi$ given $M$ is the same as the probability of $\varphi$ for presentations with a single generator; namely, $\frac{P^{\prime}_{s}(\varphi)}{P^{\prime}_{s}}$ . Thus, $\frac{P_{s}(M\ \&\ \varphi)}{P_{s}}=(\frac{1}{n})(\frac{P^{\prime}_{s}(\varphi)}{P^{\prime}_{s}})$ . As $s\rightarrow\infty$ , this approaches $\frac{1}{n}$ if $\varphi$ is true in the free structures and $0$ otherwise.

Similarly, the probability of $(\neg{M}\ \&\ \varphi)$ is the probability of $\neg{M}$ times the conditional probability of $\varphi$ given $\neg{M}$ . The probability of $\neg{M}$ is $\frac{(n-1)}{n}$ . The conditional probability of $\varphi$ given $\neg{M}$ is $1$ if $\varphi$ is true in $F_{m-1}$ and $0$ otherwise. Thus,

\frac{P_{s}(\neg{M}\ \&\ \varphi)}{P_{s}}=\left\{\begin{array}[]{cc}\frac{(n-1)}{n}&\mbox{if $F_{m-1}\models\varphi$}\\ 0&\mbox{otherwise.}\end{array}\right.

In total, $\frac{P_{s}(\varphi)}{P_{s}}$ has limit $\frac{1}{n}+\frac{n-1}{n}=1$ if $\varphi$ is true in the free structures and $0$ otherwise. ∎

Remark.

Using the multidimensional Central Limit Theorem [22], we can generalize the theorem and corollary above to any commutative generalized bijective variety $V$ where $\mathbb{Z}^{k}$ embeds into $G(V)$ . In this case, the random structures in $V$ with a single generator and $k$ identities satisfy the zero–one conjecture, and the limiting theory agrees with the theory of the free structure. However, without the condition that $\mathbb{Z}^{k}$ embeds in $G(V)$ , the statement is false, as witnessed by the bijective structures with two identities considered in Section 3.3.

4.3.1 Superstability

We make a brief comment on the superstability of completions of the theory of generalized bijective varieties. Recall that for an infinite cardinal $\kappa$ , a (complete) theory $T$ is $\kappa$ -stable if for every set $A$ in a model of $T$ , if $A$ has cardinality $\kappa$ , then the set of complete types over $A$ has cardinality $\kappa$ as well. A theory is stable if it is $\kappa$ -stable for some $\kappa$ , and it is superstable if it is $\kappa$ -stable for all sufficiently large $\kappa$ . If the language of $T$ is countable, then $\kappa\geq 2^{\aleph_{0}}$ will suffice. For more on stable theories, see Chapter 4 of [16].

Proposition 4.34.

All completions of the theory of a generalized bijective variety are superstable.

Proof.

Let $T$ be a completion of this theory, and let $X$ be a subset of some model of $T$ of cardinality $\kappa\geq 2^{\aleph_{0}}$ . We show that the number of $1$ -types over $X$ is at most $\kappa$ .

A type in a variable $x$ over $X$ will say one of the following:

1.

$x=t(a)$ for some $a\in X$ .
2.

For every term $t$ and every $a\in X$ , $x\not=t(a)$ , and $x$ satisfies a certain quantifier-free $1$ -type $p(x)$ .

From this, it follows that if $\kappa$ is the cardinality of $X$ , then the number of $1$ -types over $X$ is at most $\kappa+2^{\aleph_{0}}$ . Thus, for $\kappa\geq 2^{\aleph_{0}}$ , $T$ is $\kappa$ -stable. ∎

Remark.

If we drop the condition that $T$ is a completion of the theory of a generalized bijective variety, then there are theories in a language with finitely many unary function symbols that are unstable. We will not give an example here, although one is easily obtained taking the theory of a structure in the variety which we will study in Section 6.3.

4.4 Failure of the zero–one law

The next result gives conditions under which the zero–one law fails.

Theorem 4.35.

Let $L$ be a language with $n$ unary functions, including $f$ , where $n\geq 2$ . Let $V$ be a variety such that for some term $t$ involving a symbol apart from $f$ , the theory $T$ of $V$ contains the sentence $(\forall x)(\forall y)t(x)=t(y)$ . Consider presentations with an $m$ -tuple $\bar{a}$ of generators and a single identity, and suppose that in the free structure, $f(t(a))\not=t(a)$ . Then there is a sentence with limiting density neither $0$ nor $1$ .

Remark.

The sentence $(\forall x)(\forall y)t(x)=t(y)$ says that $t$ has a constant value. If $t$ involved just the symbol $f$ , then the free structure would satisfy the sentence $f(t(a))=t(f(a))=t(a)$ .

Proof of Theorem 4.35.

Let $\varphi$ be a sentence saying that $f$ fixes the constant given by $t$ . For instance, we may take $\varphi=(\forall x)f(t(x))=t(x)$ . We show that $\varphi$ does not have limiting density 0 or 1. We consider presentations with a tuple $\bar{a}$ of $m$ generators and a single identity. Let $A$ be the set of identities of the form $u(a_{i})=v(a_{j})$ , where $u(a_{i})=t(u^{\prime}(a_{i}))$ and $v(a_{j})=f(t(v^{\prime}(a_{j})))$ . In the resulting structures, $f$ fixes the constant, so $\varphi$ is true. Let $B$ be the set of identities of the form $u(a_{i})=v(a_{j})$ , where $u(a_{i})=t(u^{\prime}(a_{i}))$ and $v(a_{j})=t(v^{\prime}(a_{j}))$ . The resulting structure is free and $f$ does not fix the constant, so $\varphi$ is false. We show that neither $A$ nor $B$ has density $0$ . It follows that neither $\varphi$ nor $\neg{\varphi}$ has density $0$ .

The number of identities of length $\ell$ is $m^{2}n^{\ell}(\ell+1)$ . Then

	$\displaystyle P_{s}$	$\displaystyle=m^{2}\sum_{0\leq\ell\leq s}(\ell+1)n^{\ell}=m^{2}(1+2n+\ldots+(s+1)n^{s})$
		$\displaystyle=m^{2}\left(\frac{(s+2)n^{s+2}-(s+2)n^{s+1}+1}{(n-1)^{2}}\right)=m^{2}\left(\frac{n^{s+1}(s+2)(n-1)+1}{(n-1)^{2}}\right).$

Say that $t$ has length $r$ . Then the identities in $A$ have length at least $2r+1$ , and for $\ell=2r+1+\ell^{\prime}$ , the number of identities in $A$ of length $\ell$ is
$m^{2}n^{\ell^{\prime}}(\ell^{\prime}+1)$ . Then

	$\displaystyle P_{s}(A)$	$\displaystyle=m^{2}\sum_{2r+1+\ell^{\prime}\leq s}(\ell^{\prime}+1)n^{\ell^{\prime}}=m^{2}(1+2n+\ldots+(s-2r)n^{s-2r-1})$
		$\displaystyle=m^{2}\left(\frac{n^{s-2r}(s-2r+1)(n-1)+1}{(n-1)^{2}}\right),$

and

\frac{P_{s}(A)}{P_{s}}=\frac{1}{n^{2r+1}}\frac{(s-2r+1)(n-1)+1}{(s+2)(n-1)+1}\rightarrow\frac{1}{n^{2r+1}}\ .

The identities in $B$ have length at least $2r$ . For $\ell=2r+\ell^{\prime}$ , the number of identities in $B$ of length $\ell$ is $m^{2}n^{\ell^{\prime}}(\ell^{\prime}+1)$ . Then

	$\displaystyle P_{s}(B)$	$\displaystyle=m^{2}\sum_{2r+\ell^{\prime}\leq s}(\ell^{\prime}+1)n^{\ell^{\prime}}=m^{2}(1+2n+\ldots(s-2r+1)n^{s-2r})$
		$\displaystyle=m^{2}\left(\frac{n^{s-2r+1}(s-2r+2)(n-1)+1}{(n-1)^{2}}\right),$

and

\frac{P_{s}(B)}{P_{s}}=\frac{1}{n^{2r}}\frac{(s-2r+2)(n-1)+1}{(s+2)(n-1)+1}\rightarrow\frac{1}{n^{2r}}.

Since $n\geq 2$ , both of these limits are strictly between $0$ and $1$ . ∎

5 Naming the generators

5.1 A general result

Let $V$ be a variety in a language $L$ with axioms generating a theory $T$ . We consider presentations with a fixed generating tuple $\bar{a}$ , and $k$ identities. Let $L^{\prime}$ be the result of adding to $L$ constants for the generators. We ask when the $L^{\prime}$ -sentences true in the free structure have limiting density $1$ .

Proposition 5.1.

Let $T_{F}$ be the set of $L^{\prime}$ -sentences true in the free structure $F$ generated by $\bar{a}$ , and let $S$ be the set of $L^{\prime}$ -sentences with limiting density $1$ . Then the following are equivalent:

1.

$T_{F}\subseteq S$ ,
2.

$T_{F}=S$ ,
3.

$S$ has the following two properties:
1. (a)
  
  $S$ includes the sentences from $T_{F}$ of the form $t(\bar{a})\not=t^{\prime}(\bar{a})$ ,
2. (b)
  
  for any $L^{\prime}$ -formula $\varphi(x)$ with just $x$ free, if $\varphi(t(\bar{a}))\in S$ for all closed terms $t(\bar{a})$ , then $(\forall x)\varphi(x)\in S$ .

Proof.

We will prove $(1)\Rightarrow(2)\Rightarrow(3)\Rightarrow(1)$ . First, we assume (1) and prove (2). We must show that $S\subseteq T_{F}$ . Take $\varphi\in S$ . If $\varphi\notin T_{F}$ , then $\neg{\varphi}$ must be in $T_{F}$ , so it is in $S$ . Then $\varphi$ has limiting density $0$ , and we have a contradiction. Next, we assume (2) and prove (3). We can see that $T_{F}$ has properties (a) and (b), so $S$ does as well. Finally, we assume (3) and prove (1). The set $S$ has properties (a) and (b). Sentences that are logically equivalent have the same limiting density as well as the same truth value in the free structure $F$ . We show by induction on $\varphi(\bar{a})$ that if $\varphi(\bar{a})\in T_{F}$ , then $\varphi(\bar{a})\in S$ . We suppose that the negations in our formulas are brought inside, next to the atomic formulas.

1.

Suppose $\varphi$ has the form $t(\bar{a})=t^{\prime}(\bar{a})$ . If $F\models\varphi$ , then $T_{F}\vdash\varphi$ , so the limiting density is $1$ .
2.

Suppose $\varphi$ has the form $t(\bar{a})\not=t^{\prime}(\bar{a})$ . By (a), if $F\models\varphi$ , then $\varphi$ has limiting density $1$ .
3.

Suppose $\varphi=(\varphi_{1}\ \&\ \varphi_{2})$ . If $F\models\varphi$ , then both conjuncts are true, so both have limiting density $1$ . Then $\varphi$ also has limiting density $1$ .
4.

Suppose $\varphi=(\varphi_{1}\ \vee\ \varphi_{2})$ . If $F\models\varphi$ , then at least one disjunct is true, so it has limiting density $1$ . Then $\varphi$ also has limiting density $1$ .
5.

Suppose $\varphi=(\exists x)\psi(x)$ . If $F\models\varphi$ , then $F\models\psi(t(\bar{a}))$ for some $t(\bar{x})$ . Then this sentence has limiting density $1$ , so $\varphi$ also has limiting density $1$ .
6.

Suppose $\varphi=(\forall x)\psi(x)$ . If $F\models\varphi$ , then $F\models\psi(t(\bar{a}))$ for all closed terms $t(\bar{a})$ . Then the sentence $\psi(t(\bar{a}))$ has limiting density $1$ for all $t(\bar{a})$ , and by (b), $(\forall x)\psi(x)\in S$ .

∎

Consider the following further property.

Property (c): If $(\exists x)\psi(x)\in S$ , then $\psi(t(\bar{a}))\in S$ for some $t(\bar{a})$ .

Lemma 5.2.

If $S$ is complete (i.e., we have the zero–one law), then (b) and (c) are equivalent.

Proof.

First, suppose that (b) holds and that $(\exists x)\psi(x)\in S$ . If there is no $t(\bar{a})$ such that $\psi(t(\bar{a}))\in S$ , then $\neg{\psi(t(\bar{a}))}\in S$ for all $t(\bar{a})$ , and $(\forall x)\neg{\psi(x)}\in S$ for a contradiction. Now, suppose (c) holds and that $\psi(t(\bar{a}))\in S$ for all $t(\bar{a})$ . If $\neg{(\forall x)\psi(x)}\in S$ , then $(\exists x)\neg{\psi(x)}\in S$ . By (c), $\neg{\psi(t(\bar{a}))}\in S$ for some $t(\bar{a})$ for a contradiction, so $(\forall x)\psi(x)\in S$ . ∎

Lemma 5.3.

Suppose $S$ satisfies (a) and (b). Then for all formulas $\varphi(x,y)$ with free variables $x,y$ , if $\varphi(t(\bar{a}),t^{\prime}(\bar{a}))\in S$ for all terms $t(\bar{a}),t^{\prime}(\bar{a})$ , then
$(\forall x)(\forall y)\varphi(x,y)\in S$ .

Proof.

For a fixed term $t(\bar{a})$ , suppose $\varphi(t(\bar{a}),t^{\prime}(\bar{a}))\in S$ for all $t^{\prime}(\bar{a})$ . By (b),
$(\forall y)\varphi(t(\bar{a}),y)\in S$ for all $t(\bar{a})$ . So, by (b), $(\forall x)(\forall y)\varphi(\bar{a},x,y)\in S$ . ∎

If the orbit of $\bar{a}$ in $\mathcal{A}$ is defined by an $L$ -formula $\psi(\bar{x})$ , then for each $L^{\prime}$ -sentence $\varphi$ , we have $\mathcal{A}\models\varphi(\bar{a})$ iff $\mathcal{A}$ satisfies the $L$ -sentences $(\exists\bar{x})(\psi(\bar{x})\ \&\ \varphi(\bar{x}))$ and $(\forall\bar{x})(\psi(\bar{x})\rightarrow\varphi(\bar{x}))$ .

Proposition 5.4.

Let $F$ be the free structure generated by $\bar{a}$ . Suppose that the orbit of $\bar{a}$ is defined by the $L$ -formula $\psi(\bar{x})$ and the $L^{\prime}$ -sentence $\psi(\bar{a})$ has limiting density $1$ . Suppose also that for all $L$ -sentences $\varphi$ , $\varphi$ is true in $F$ if and only if it has limiting density $1$ . Then the same is true for all $L^{\prime}$ -sentences.

Proof.

Take an $L^{\prime}$ -sentence $\varphi(\bar{a})$ that is true in $F$ . In $F$ , this is equivalent to the $L$ -sentence $\varphi^{*}=(\forall\bar{x})(\psi(\bar{x})\rightarrow\varphi(\bar{x}))$ . The sentence $\varphi^{*}$ is true in $F$ , so it has limiting density $1$ . The set of sentences with limiting density $1$ is closed under logical consequence, so since $\psi(\bar{a})$ has limiting density $1$ , it follows that $\varphi(\bar{a})$ has limiting density $1$ . ∎

5.2 Generalized bijective structures and sentences with constants

We have seen that for the basic bijective variety and for the broader class of commutative generalized bijective varieties, when we consider presentations with a single generator and a single identity, the sentences (in the language of the variety) true in the free structure have density $1$ . We can apply Proposition 5.1 to extend this to sentences with a constant naming the generator.

Example 5.5.

Let $V$ be a commutative generalized bijective variety in the language $L$ . Consider presentations with a single generator $a$ and a single identity, and let $L^{\prime}$ be the extension of $L$ with a constant for the generator. Suppose that the free structure $F$ generated by $a$ is infinite. In $F$ , all elements are automorphic. In particular, $a$ and $t(a)$ are automorphic via the automorphism $x\mapsto t(x)$ . Preparing to apply Proposition 5.1, we take $\psi(x)$ to be $x=x$ . Clearly, $\psi(a)$ has limiting density $1$ . By Theorem 4.26, the $L$ -sentences true in the free structure have limiting density $1$ . Then Proposition 5.1 says that this holds also for the $L^{\prime}$ -sentences (involving $a$ ).

For a generating tuple $\bar{a}$ , the sentences $\varphi(\bar{a})$ true in the free structure on $\bar{a}$ have density $1$ . To establish this, we need to take a closer look at the formulas $C_{r,n}$ from Section 4.2 in the expanded language $L^{\prime}$ with constants naming the generators. We consider the unary functions $f_{i}$ as binary relations and the constants $a_{i}$ as unary relations. Thus, we have atomic formulas with the meanings $x=a_{i}$ and $f_{i}(x)=y$ .

Lemma 5.6.

Let $V$ be a commutative generalized bijective variety, and consider presentations with a generating $m$ -tuple $\bar{a}$ and a single identity. Let $F$ be the free structure. If $F$ is infinite, then for every $r\in\omega$ , there is a set $S$ of presentations such that

1.

$S$ has limiting density $1$ , and
2.

for $\alpha(\bar{x})\in C_{r,m}$ , the following are equivalent:
1. (a)
  
  $\alpha(\bar{a})$ holds in $F$ ,
2. (b)
  
  $\alpha(\bar{a})$ holds in some structure given by a presentation in $S$ ,
3. (c)
  
  $\alpha(\bar{a})$ holds in all structures given by a presentation in $S$ .

If $\mathcal{A}$ is the structure given by the identity $t_{1}(a_{i})=t_{2}(a_{j})$ , then for each $r$ , we get an isomorphism $p$ from $B_{2r}(\bar{a})$ in $F$ to $B_{2r}(\bar{a})$ in $\mathcal{A}$ , given by $p(u(a_{k}))=u(a)$ .

Proof.

We use notation from the proof of Theorem 4.26. Recall that $\Pi_{1}$ is the projection onto the copy of $\mathbb{Z}$ generated by $b_{1}$ , where $b_{1}$ is an element of infinite order in the abelian group $G(V)$ associated with the variety $V$ . Let $S$ be the set of presentations in which the identity $t_{1}(a_{i})=t_{2}(a_{j})$ satisfies $|\Pi_{1}(t_{1})-\Pi_{1}(t_{2})|>e_{0}r$ , where $e_{0}=\max_{i}|\Pi_{1}(f_{i})|+4$ . The fact that $S$ has limiting density $1$ follows from the proof of Theorem 4.26.

Since the formulas in $C_{r,m}$ uniquely describe the isomorphism type of $B_{r}(\bar{x})$ , it suffices to show that $p$ is an isomorphism. We know that $p$ is surjective since $\mathcal{A}$ is a quotient of $F$ . As in the proof of Theorem 4.26, it is also injective. Indeed, if $i=j$ , then the projection from $F$ to $\mathcal{A}$ is injective on the substructure generated by $a_{1},\cdots,a_{i-1},a_{i+1},\cdots,a_{m}$ . On the substructure generated by $a_{i}$ , if $u(a_{i})=u^{\prime}(a_{i})$ , then we can apply Lemma 4.25, and we see that (the equivalence class of) $u^{-1}u^{\prime}$ is in $\langle t_{1}t_{2}^{-1}\rangle$ as an element of $G(V)$ . Since $|\Pi_{1}(t_{1})-\Pi_{1}(t_{2})|>e_{0}r$ and the length of $u^{-1}u^{\prime}$ is at most $4r$ , this is not possible. If $i\neq j$ , then, as in Corollary 4.32, the substructure generated by $a_{i}$ and that generated by $a_{j}$ are identified via $a_{j}=t_{1}t_{2}^{-1}(a_{i})$ , while the projection map is injective on the substructure generated by the further generators $a_{k}$ . Thus, if $u(a_{i})=u^{\prime}(a_{j})$ , then we must have $uu^{\prime-1}=t_{1}t_{2}^{-1}$ , but since $|\Pi_{1}(t_{1})-\Pi_{1}(t_{2})|>e_{0}r$ and the length of $u^{-1}u^{\prime}$ is at most $4r$ , this is again impossible.

Recall that we are thinking of the language as relational and we have atomic formulas with the meanings $x=a_{i}$ and $f_{i}(x)=y$ . The formula saying $x=a_{i}$ holds exactly on $a_{i}$ , in either $B_{2r}(\bar{a};F)$ or $B_{2r}(\bar{a};\mathcal{A})$ . If the formula saying $f_{i}(x)=y$ holds in $B_{2r}(\bar{a};F)$ , then it holds of $p(x)$ and $p(y)$ in $B_{2r}(\bar{a};\mathcal{A})$ because $\mathcal{A}$ is a quotient of $F$ . Thus, we have $f_{i}(p(x))=p(y)$ . Finally, suppose that in $B_{2r}(\bar{a};\mathcal{A})$ , $f_{i}(p(x))=p(y)$ . Then $p(x)=u(a_{j})$ for some $j$ and $p(y)=f_{i}(u(a_{j}))$ . However, the map $p$ is bijective, so $F\models x=u(a_{j})$ and $F\models y=f_{i}(u(a_{j}))$ as well. Then $f_{i}(x)=y$ holds in $B_{2r}(\bar{a};F)$ . This shows that $p$ is an isomorphism, completing the proof. ∎

Theorem 5.7.

Let $V$ be a commutative generalized bijective variety in the language $L$ , and suppose that the free structure on one generator is infinite. Consider presentations with a fixed generating $m$ -tuple $\bar{a}$ and a single identity. Let $F$ be the free structure on $\bar{a}$ . Let $L^{\prime}$ be the result of adding constants for the elements of $\bar{a}$ to $L$ . Then an $L^{\prime}$ -sentence is true in $F$ iff it has limiting density $1$ .

Proof.

Let $\varphi^{\prime}$ be an $L^{\prime}$ -sentence that is true in $F$ , so $\varphi^{\prime}=\varphi(\bar{a})$ for some $L$ -formula $\varphi$ . By Theorem 4.10, $\varphi(\bar{x})$ can be expressed as a finite disjunction $\bigvee_{i}\varphi_{i}(\bar{x})$ for $\varphi_{i}(\bar{x})$ of the form $\rho_{i}(\bar{x})\ \&\ \chi_{i}$ , where $\rho_{i}(\bar{x})\in C_{r,m}$ and $\chi_{i}$ is a conjunction of special sentences and negations of special sentences. Recall that the special sentences have the form $(\exists v_{1},\cdots,v_{s})\left(\bigwedge\limits_{i}\alpha_{i}(v_{i})\ \&\bigwedge\limits_{i<j}d^{>2r}(v_{i},v_{j})\right)$ , where $\alpha_{i}(v_{i})\in C_{r,1}$ and $d^{>2r}(v_{i},v_{j})$ is the formula saying that the distance between $v_{i}$ and $v_{j}$ in the Gaifman graph is greater than $2r$ . Since $F\models\varphi(\bar{a})$ , we have $F\models\varphi_{i}(\bar{a})$ for all $i$ , i.e., $F\models\rho_{i}(\bar{a})\ \&\ \chi_{i}$ . Since $\chi_{i}$ is an $L$ -sentence true in $F$ , it has limiting density $1$ by Theorem 4.26. On the other hand, $\rho_{i}(\bar{x})\in C_{r,m}$ , and by Lemma 5.6, $\rho_{i}(\bar{a})$ also has limiting density $1$ . Thus, $\varphi^{\prime}$ has limiting density $1$ . ∎

6 More examples

In Section 3, we gave some examples illustrating different possible behaviors of limiting density. We considered sentences with no constants. In Section 4, we gave conditions guaranteeing that the sentences with limiting density $1$ are those true in the free structure. In Section 5, we gave some results for sentences with constants naming the generators. In the current section, we look again at some of the examples from Section 3 in light of the results from Sections 4 and 5. We we also give some further examples, illustrating more subtle points suggested by these results.

6.1 Examples of Proposition 5.1

Let $V$ be a variety in the language $L$ . Consider presentations with a fixed tuple $\bar{a}$ of generators, and some number of identities, and let $L^{\prime}$ be the result of adding to $L$ constants for the generators. Here, for reference, is the statement of Proposition 5.1.

Proposition 5.1. Let $T_{F}$ be the set of $L^{\prime}$ -sentences true in the free structure $F$ generated by $\bar{a}$ , and let $S$ be the set of $L^{\prime}$ -sentences with limiting density $1$ . Then the following are equivalent:

1.

$T_{F}\subseteq S$ ,
2.

$T_{F}=S$ ,
3.

$S$ has the following two properties:
1. (a)
  
  $S$ includes the sentences from $T_{F}$ of the form $t(\bar{a})\not=t^{\prime}(\bar{a})$ ,
2. (b)
  
  for any $L^{\prime}$ -formula $\varphi(x)$ with just $x$ free, if $\varphi(t(\bar{a}))\in S$ for all closed terms $t(\bar{a})$ , then $(\forall x)\varphi(x)\in S$ .

The proposition says that conditions (a) and (b) are necessary and sufficient for the $L^{\prime}$ -sentences true in $F$ to have density $1$ . We revisit some examples and see what the result says about them.

6.1.1 Generalized bijective structures

In Theorem 5.7, we saw that for the variety of generalized bijective structures and presentations with a single generator $a$ and a single identity, any sentence, possibly involving the constant $a$ , has limiting density $1$ iff it is true in the free structure on $a$ . Hence, we must have both properties (a) and (b) from Proposition 5.1.

6.1.2 Abelian groups

In Section 3.4, we saw that for abelian groups and presentations with a single generator and a single relator, the zero–one law fails.

Proposition 6.1.

For abelian groups and presentations with a single generator and a single relator, Property (a) holds and Property (b) fails, witnessed by the formulas $\varphi(a,x)$ saying that $p^{n+1}x\not=p^{n}a$ , where $p$ is an odd prime.

Proof.

The free structure is $\mathbb{Z}$ . Take a sentence of the form $ma\not=0$ . This is true in $\mathbb{Z}$ , and the sentence is in $S$ since all relators longer than $|m|$ make it true. Thus, Property (a) holds. Now fix $n$ and an odd prime $p$ . For all closed terms $t(a)$ , the sentence $p^{n+1}t(a)\not=p^{n}(a)$ is in $S$ . We have $p^{n+1}t(a)\not=p^{n}a$ for all terms $t(a)=ma$ . By Property (a), the sentences $p^{n+1}t(a)\not=p^{n}a$ are all in $S$ . If we had Property (b), then the sentence $(\forall x)p^{n+1}x\not=p^{n}a$ would be in $S$ . However, recall from Section 3.4 that the sentence $\beta(p,n,1)$ says that there is an element divisible by $p^{n}$ and not by $p^{n+1}$ . This is true in $\mathbb{Z}$ but not in $S$ —by Lemmas 3.28 and 3.32, the limiting density is $\frac{1}{p^{n+1}}$ . Since $p^{n}a=p^{n}a$ is logically valid, $p^{n}a$ is divisible by $p^{n}$ in all models. Thus, if the sentence $(\forall x)p^{n+1}x\not=p^{n}a$ is in $S$ , then $\beta(p,n,1)$ is in $S$ —in the models satisfying $(\forall x)p^{n+1}x\not=p^{n}a$ , $p^{n}a$ is not divisible by $p^{n+1}$ . This is a contradiction. ∎

6.1.3 Structures with a single unary function, one generator, and one identity

The next example is from Section 3.2. The variety of unary functions has a single unary function symbol $f$ and no axioms.

Proposition 6.2.

For the variety of unary functions and presentations of the form $a|f^{m}(a)=f^{n}(a)$ , Property (a) holds and Property (b) fails.

Proof.

To show that Property (a) holds, consider a sentence of the form $f^{i}(a)\neq f^{j}(a)$ . Note that the set of presentations with identity $f^{n}(a)=f^{m}(a)$ for $n,m>i+j+1$ has limiting density $1$ . Moreover, for any such presentation, we recall from Section 3.2 that the resulting structure is a finite chain leading to a finite cycle where the chain is longer than both $i+1$ and $j+1$ . Then we have $f^{i}(a)\neq f^{j}(a)$ in the structure. Thus, $f^{i}(a)\neq f^{j}(a)$ is in $S$ .

To show that Property (b) fails, let $\varphi(x)=(\forall y)(x\not=y\rightarrow f(x)\not=f(y))$ . We will show that this witnesses the failure of (b). For any fixed $x=f^{i}(a)$ , note that the set of presentations with identity $f^{n}(a)=f^{m}(a)$ for $n,m>i+1$ has limiting density $1$ . In any such presentation, we have $f^{j+1}(a)\neq f(y)$ unless $y=f^{j}(a)$ . Thus, the sentence saying that the formula

\varphi(x)=(\forall y)(x\not=y\rightarrow f(x)\not=f(y))

holds for $x=f^{i}(a)$ is in $S$ for any closed term $f^{i}(a)$ . On the other hand, the sentence $(\forall x)(\forall y)(x\not=y\rightarrow f(x)\not=f(y))$ saying that $f$ is injective has limiting density $0$ , as shown in Section 3.2. Thus Property (b) fails in this variety. ∎

6.1.4 A new example

In the next example, we modify the variety of bijective structures to obtain an example in which Property (b) holds but Property (a) fails.

Example 6.3.

Let $L_{c}$ be the language that consists of unary function symbols $S$ , $S^{-1}$ and a constant $c$ , and let $V$ be the variety with axioms saying that $S$ and $S^{-1}$ are inverse functions and $S^{3}(c)=c$ . Consider presentations with a single generator $a$ and one identity. For the resulting structure $A$ , let $A_{a}$ be the cycle generated by $a$ and let $A_{c}$ be the cycle generated by $c$ . We describe the structures obtained from all possible identities, and we give some limiting densities.

1.

Let $S_{1}$ be the set of identities of the form $S^{n}(a)=S^{m}(a)$ . This has density $\frac{1}{4}$ . If $k=|m-n|$ , then $A_{a}$ is a $k$ -cycle if $k>0$ and a $\mathbb{Z}$ -chain if $k=0$ . $A_{c}$ is always a $3$ -cycle in this case.
2.

Let $S_{2}$ be the set of identities of the form $S^{n}(c)=S^{m}(c)$ . This has density $\frac{1}{4}$ . Then $A_{a}$ is a $\mathbb{Z}$ -chain (always the same), and $A_{c}$ is a $3$ -cycle or a $1$ -cycle.
3.

Let $S_{3}$ be the set of identities of the form $S^{n}(a)=S^{m}(c)$ or $S^{n}(c)=S^{m}(a)$ . This set has density $\frac{1}{2}$ . In the resulting structure, $A_{a}=A_{c}$ is a $3$ -cycle.

To see that Property (a) fails, consider the sentence $c\neq S(c)$ . This is true in the free structure but fails exactly in the subset of $S_{2}$ where $A_{c}$ is a 1-cycle, which has limiting density $\frac{1}{6}$ .

To show that Property (b) holds, assume for some $\varphi(x)$ , $\varphi(t)$ has limiting density 1 for all closed terms $t=t(c,a)$ . We will show that for $i=1,2,3$ , the set of identities in $S_{i}$ for which the resulting structure satisfies $(\forall x)\varphi(x)$ has the same density as $S_{i}$ . For any finite set $\sigma$ of closed terms $t=t(c)$ or $t=t(a)$ , the sentence $\psi(c,a)=\bigwedge_{t\in\sigma}\varphi(t)$ has density $1$ . This makes the case $S_{3}$ easy. For the structures given by identities in $S_{3}$ , all $x$ are named by terms $c$ , $S(c)$ , or $S^{2}(c)$ .

For the remaining cases, we use Gaifman’s Locality Theorem. Consider a formula $\varphi^{\prime}(u,v,x)$ in the language of bijective structures such that $\varphi(x)=\varphi^{\prime}(c,a,x)$ . By Gaifman, $\varphi^{\prime}(u,v,x)$ is equivalent in bijective structures to a formula $\bigvee_{i}(\alpha_{i}(u,v,x)\ \&\ \beta_{i})$ , where for some $r$ , $\beta_{i}$ is a conjunction of local sentences and negations, each $r^{\prime}$ -local for $r^{\prime}\leq r$ , and $\alpha_{i}(u,v,x)$ is an $r$ -local formula that describes the union of the $r$ -balls around $u,v,x$ .

For identities in $S_{2}$ , $A_{c}$ may have one element or three, and $A_{a}$ is fixed. Let $\sigma$ be a finite set with closed terms naming the elements of $A_{c}$ and the elements of $A_{a}$ that are not far from $a$ , with $d(x,a)\leq 2r$ , plus one more element $x=t^{*}(a)$ where $d(x,a)>2r$ . The sentence $\psi(a,c)$ saying that $\varphi(t)$ holds for all of these terms has density $1$ . For the other $x\in A_{a}$ , the ones far from $a$ , the balls $B_{r}(x)$ are isomorphic. If $t^{*}(a)$ satisfies $\alpha_{i}(c,a,x)$ , then all elements do, so $\varphi(x)$ holds. Then $(\forall x)\varphi(x)$ has density $1$ .

Finally, for an identity in $S_{1}$ , $A_{c}$ is a fixed $3$ -cycle, while $A_{a}$ varies with the identity. Consider the disjuncts $\alpha_{i}(c,a,x)\ \&\ \beta_{i}$ that might be satisfied by some $x\in A_{a}$ . The same identities also yield plain bijective structures $A_{a}$ . Let $\alpha^{\prime}_{i}(a,x)$ be the part of $\alpha_{i}(c,a,x)$ describing the $r$ -balls around $a$ and $x$ . For $x=t(a)$ , $x$ satisfies $\alpha_{i}(c,a,x)\ \&\ \beta_{i}$ , iff $\beta_{i}$ holds in $A_{a}\cup Z_{3}$ and $\alpha^{\prime}_{i}(a,x)$ holds in $A_{a}$ . For each $\beta_{i}$ , there is a sentence $\beta^{\prime}_{i}$ such that for the structures $A$ given by an identity in $S_{1}$ , $A\models\beta_{i}$ iff $A_{a}\models\beta_{i}^{\prime}$ . We may take $\beta^{\prime}_{i}$ to be a finite disjunction of conjunctions of sentences that, in the setting of bijective structures, are $r^{\prime}$ -local sentences or negations. We can see this by using the Feferman-Vaught Theorem or, less formally, just by thinking about what $\beta_{i}$ says. Let $\varphi^{\prime}(a,x)=\bigvee_{i}\alpha^{\prime}_{i}(a,x)\ \&\ \beta^{\prime}_{i}$ . For $x=t(a)$ , the formula $\varphi^{\prime}(a,x)$ has density $1$ in the bijective structure $A_{a}$ . Then by our earlier result, $(\forall x)\varphi^{\prime}(x)$ has density $1$ . For each bijective structure generated by $a$ in which $(\forall x)\varphi^{\prime}(x)$ is true, we consider the structure to be $A_{a}$ , and in the variety we are currently considering, $A=A_{a}\cup Z_{3}$ satisfies $(\forall x)\varphi(x)$ , so this has density $1$ .

This example also shows that the zero–one law may fail if we allow constants in the language, giving an obstacle for generalizing Theorem 4.26 (on the zero–one law for generalized bijective structures) to varieties in a language with constants. Note that in Section 5, we did add constants naming a tuple of generators. However, these constants were not part of the language of the variety—they did not appear in the axioms.

6.2 Structures with a single unary function and more generators and identities

For the language with a single unary function symbol $f$ and the variety with no axioms, we saw in Section 3.2 that for presentations with a single generator and a single identity, the zero–one law holds, but the limiting theory is not that of the free structure. In particular, the sentence $\varphi$ saying that $f$ is not injective has density $1$ , but it is false in the free structure. We now consider presentations with multiple generators and identities.

Proposition 6.4.

Let $L$ be the language with a single unary function symbol $f$ , and let $V$ be the variety with no axioms. For presentations with $m$ generators and $k$ identities, the sentence $\varphi=(\exists x,y)(f(x)=f(y)\ \&\ x\neq y)$ has density $1$ .

Let the generators be $a_{1},\ldots,a_{m}$ . The identities have the form $f^{p}(a_{i})=f^{q}(a_{j})$ . As before, the sentence $\varphi$ is true if the chosen identities all satisfy that $p,q$ are both non-zero and $p\neq q$ . Indeed, in this case, without loss of generality, we may take $n,i$ such that $f^{n}(a_{i})$ appears as one side of some identity and there is no $m<n$ such that $f^{m}(a_{i})$ appears as one side of some identity. Suppose $f^{n}(a_{i})=f^{q}(a_{j})$ is one of the identities. Then $x=f^{n-1}(a_{i})$ and $y=f^{q-1}(a_{j})$ witness $\varphi$ .

We can show that $\varphi$ has density $1$ . The number of identities of length $r$ is $m^{2}(r+1)$ . The number of length at most $s$ is $m^{2}(1+2+\cdots+(s+1))=m^{2}\frac{(s+2)(s+1)}{2}$ . The number of unordered sets of $k$ identities of length at most $s$ is $P_{s}=\left(\begin{array}[]{c}m^{2}\frac{(s+2)(s+1)}{2}\\ k\end{array}\right)$ . We count the identities of length $r$ such that $p,q\neq 0$ and $p\neq q$ . If $r$ is even, then there are at most $3m^{2}$ identities of length $r$ for which the condition fails; namely, $f^{r}(a_{i})=a_{j}$ , $a_{i}=f^{r}(a_{j})$ , and $f^{r/2}(a_{i})=f^{r/2}(a_{j})$ . (If $r$ is odd, then the number is at most $2m^{2}$ .)

Thus, there are at least $m^{2}(r-2)$ identities of length $r$ satisfying the condition, and there are at least $m^{2}\frac{(s-1)(s-2)}{2}$ identities of length at most $s$ satisfying the condition. Let $A$ be the set of presentations with all identities satisfying the condition. Then $P_{s}(A)\geq\left(\begin{array}[]{c}m^{2}\frac{(s-1)(s-2)}{2}\\ k\end{array}\right)$ .

It is now a calculus exercise to show that $\frac{P_{s}(A)}{P_{S}}\to 1$ , and the proposition follows.

Remark.

We saw that when $m=k=1$ , the zero–one law holds. However, it does not hold in the case where $m=1$ and $k=2$ . Suppose the two identities are $f^{p}(a)=f^{q}(a)$ , $f^{p^{\prime}}(a)=f^{q^{\prime}}(a)$ , and consider the sentence $\psi=(\exists x)f(x)~=~x$ . This case is similar to the case of bijective structures with two identities in Section 3.3. The sentence $\psi$ is true if and only if $GCD(p-q,p^{\prime}-q^{\prime})=1$ . An argument like that in Section 3.3 shows that $\psi$ has density strictly between $0$ and $1$ . We omit the proof here.

6.3 Structures with multiple unary functions

We turn our attention to a more complicated case. Take the language with $n$ function symbols $f_{1},\ldots,f_{n}$ and the variety with no axioms, and consider presentations with $m$ generators and $k$ identities. We begin with the case where $k=1$ .

Proposition 6.5.

Let $\varphi$ be the sentence

(\exists x)(\exists y)\left(x\not=y\ \&\ \bigvee_{1\leq i,j\leq n}f_{i}(x)=f_{j}(y)\right)\ .

This sentence is false in the free structure, but it has limiting density $1$ among structures given by presentations with generators $a_{1},\ldots,a_{m}$ and a single identity of the form $t(a_{i})=t^{\prime}(a_{j})$ .

Proof.

For $m$ generators $a_{1},\ldots,a_{m}$ , the free structure $F$ is the join of disjoint substructures generated by the separate $a_{j}$ . In $F$ , each element is uniquely expressed as $t(a_{i})$ , where the term $t$ is built up out of the functions $f_{j}$ . The terms are all distinct, and the sentence $\varphi$ is false. For an identity $t(a_{i})=t^{\prime}(a_{j})$ , the length is the sum of the lengths of $t,t^{\prime}$ . The number of identities of length $\ell$ is $m^{2}n^{\ell}(\ell+1)$ , so the number of identities of length at most $s$ , or $P_{s}$ , is $m^{2}\sum_{0\leq\ell\leq s}n^{\ell}(\ell+1)$ , which is equal to $m^{2}\frac{(n-1)(s+2)n^{s+1}-(n^{s+2}-1)}{(n-1)^{2}}$ .

Let $A$ be the set of identities $t(a_{i})=t^{\prime}(a_{j})$ such that $t$ has length $0$ . We show that $A$ has limiting density $0$ . The number of identities in $A$ of length $\ell$ is $m^{2}n^{\ell}$ , so the number of length at most $s$ is $m^{2}(\sum_{0\leq\ell\leq s}n^{\ell})$ , or $m^{2}\frac{n^{s+1}-1}{n-1}$ . This is $P_{s}(A)$ . It is not difficult to verify that $\lim_{s\rightarrow\infty}\frac{P_{s}(A)}{P_{s}}=0$ . Similarly, let $B$ be the set of identities $t(a_{i})=t^{\prime}(a_{j})$ such that $t^{\prime}$ has length $0$ . Then $B$ also has limiting density $0$ . Therefore, the limiting density of $A\cup B$ is $0$ . Let $C$ be the set of identities not in $A\cup B$ . This will have limiting density $1$ . The identities in $C$ have the form $t(a_{i})=t^{\prime}(a_{j})$ where $t,t^{\prime}$ both have length at least $1$ . Say that $t(a_{i})=f_{i^{\prime}}(t^{*}(a_{i}))$ and $t^{\prime}(a_{j})=f_{j^{\prime}}(t^{\prime*}(a_{j}))$ for terms $t^{*}$ and $t^{\prime*}$ . In the model given by the identity $t(a_{i})=t^{\prime}(a_{j})$ , we have $t^{*}(a_{i})\not=t^{\prime*}(a_{j})$ . The elements $x=t^{*}(a_{i})$ and $y=t^{\prime*}(a_{j})$ witness that the sentence $\varphi$ is true. ∎

Now, we consider presentations with more than one identity. We let $\varphi$ be as in Proposition 6.5.

Proposition 6.6.

For the language with $n$ unary function symbol $f_{1},\dots,f_{n}$ , let $V$ be the variety with no axioms. For presentations with a fixed $m$ -tuple of generators and $k$ identities, where $k\geq 2$ , the sentence $\varphi$ has limiting density $1$ .

Proof.

Let $I_{s}$ be the number of identities of length at most $s$ . Then the number of presentations in which all identities have length at most $s$ is $P_{s}=\left(\begin{array}[]{c}I_{s}\\ k\end{array}\right)$ . Consider the identities $t(a_{i})=t^{\prime}(a_{j})$ in which neither side has length $0$ . The number of these identities of length $\ell$ , where $\ell\geq 2$ , is $m^{2}n^{\ell}(\ell-1)$ , so the number of length at most $s$ is $m^{2}\sum_{2\leq\ell\leq s}n^{\ell}(\ell-1)=m^{2}\frac{(n-1)sn^{s-1}-(n^{s}-1))}{(n-1)^{2}}$ . For convenience, we call this $J_{s}$ . Let $C$ be the set of presentations with $k$ identities in which neither side has length $0$ . Then $P_{s}(C)=\left(\begin{array}[]{c}J_{s}\\ k\end{array}\right)$ .

We show by induction on $k$ that $\lim_{s\rightarrow\infty}\frac{P_{s}(C)}{P_{s}}=1$ . We write $P_{s}^{k}$ and $P_{s}^{k}(C)$ to indicate the value of $k$ under consideration. For $k=2$ , $\frac{P_{s}^{2}(C)}{P_{s}^{2}}=\frac{J_{s}(J_{s}-1)}{I_{s}(I_{s}-1)}$ . We know that $\frac{J_{s}}{I_{s}}\rightarrow 1$ . For the expression $\frac{P_{s}^{2}(C)}{P_{s}^{2}}$ , we divide top and bottom both by $I_{s}$ and get a new numerator $\frac{J_{s}}{I_{s}}-\frac{1}{I_{s}}$ that goes to $1$ and a new denominator $\frac{I_{s}}{I_{s}}-\frac{1}{I_{s}}$ that also goes to $1$ . Now, supposing that the statement holds for $k$ , we show that it holds for $k+1$ . We have $\frac{P_{s}^{k+1}(C)}{P_{s}^{k}}=\left(\frac{P_{s}^{k}(C)}{P_{s}^{k}}\right)\left(\frac{J_{s}-k}{I_{s}-k}\right)$ . By the Induction Hypothesis, the first factor goes to $1$ . For the second factor, we again divide top and bottom by $I_{s}$ . The new numerator is $\frac{J_{s}}{I_{s}}-\frac{k}{I_{s}}$ , which has limit $1$ . The new denominator is $\frac{I_{s}}{I_{s}}-\frac{k}{I_{s}}$ , which also has limit $1$ .

We claim that the sentence $\varphi$ is true in all structures obtained from presentations in $C$ . Take any presentation in $C$ and consider the resulting model $\mathcal{A}$ . No $a_{i}$ is in the range of any function in any model of this sort. The given identities all take us from a non-trivial term in some $a_{i}$ to a non-trivial term in some $a_{j}$ and do not force us to assign values $a_{i}$ , so we can fill out the rest of the function values without ever using these values $a_{i}$ . Thus, all nontrivial identities true in $\mathcal{A}$ are in all structures with presentations in $C$ . Take an identity of shortest length, say $t(a_{i})=t^{\prime}(a_{j})$ , and proceed as for a single identity. Say that $t(a_{i})=f_{i^{\prime}}(t^{*}(a_{i}))$ and $t^{\prime}(a_{j})=f_{j^{\prime}}(t^{\prime*}(a_{j}))$ for terms $t^{*}$ and $t^{\prime*}$ . By the minimality of the length of $t(a_{i})=t^{\prime}(a_{j})$ , we have $t^{*}(a_{i})\not=t^{\prime*}(a_{j})$ . This witnesses the truth of $\varphi$ . ∎

References

[1] G. N. Arzhantseva. On groups in which subgroups with a fixed number of generators are free. Fundam. Prikl. Mat., 3(3):675–683, 1997.
[2] G. N. Arzhantseva and A. Yu. Ol’shanskiĭ. Generality of the class of groups in which subgroups with a lesser number of generators are free. Mat. Zametki, 59(4):489–496, 638, 1996.
[3] Stanley Burris and H. P. Sankappanavar. A course in universal algebra, volume 78 of Graduate Texts in Mathematics. Springer-Verlag, New York-Berlin, 1981.
[4] Matthew Cordes, Moon Duchin, Yen Duong, Meng-Che Ho, and Andrew P. Sánchez. Random nilpotent groups I. Int. Math. Res. Not. IMRN, (7):1921–1953, 2018.
[5] Shagnik Das. A brief note on estimates of binomial coefficients. http://page.mi.fu-berlin.de/shagnik/notes/binomials.pdf. Accessed March 1, 2022.
[6] Paul C. Eklof and Edward R. Fischer. The elementary theory of abelian groups. Ann. Math. Logic, 4:115–171, 1972.
[7] Haim Gaifman. On local and nonlocal properties. In Proceedings of the Herbrand symposium (Marseilles, 1981), volume 107 of Stud. Logic Found. Math., pages 105–135. North-Holland, Amsterdam, 1982.
[8] M. Gromov. Hyperbolic groups. In Essays in group theory, volume 8 of Math. Sci. Res. Inst. Publ., pages 75–263. Springer, New York, 1987.
[9] Matthew Harrison-Trainor, Bakh Khoussainov, and Daniel Turetsky. Effective aspects of algorithmically random structures. Computability, 8(3-4):359–375, 2019.
[10] Meng-Che Ho. Randomizing and Describing Groups. ProQuest LLC, Ann Arbor, MI, 2017. Thesis (Ph.D.)–The University of Wisconsin - Madison.
[11] Ilya Kapovich and Paul Schupp. Genericity, the Arzhantseva-Olshanskii method and the isomorphism problem for one-relator groups. Math. Ann., 331(1):1–19, 2005.
[12] Olga Kharlampovich and Alexei Myasnikov. Elementary theory of free non-abelian groups. J. Algebra, 302(2):451–552, 2006.
[13] Olga Kharlampovich and Rizos Sklinos. First-order sentences in random groups. arXiv preprint arXiv:2106.05461, 2022.
[14] Phokion G. Kolaitis. On the expressive power of logics on finite models. In Finite model theory and its applications, Texts Theoret. Comput. Sci. EATCS Ser., pages 27–123. Springer, Berlin, 2007.
[15] Leonid Libkin. Elements of finite model theory. Texts in Theoretical Computer Science. An EATCS Series. Springer-Verlag, Berlin, 2004.
[16] David Marker. Model theory, volume 217 of Graduate Texts in Mathematics. Springer-Verlag, New York, 2002. An introduction.
[17] Yann Ollivier. A January 2005 invitation to random groups, volume 10 of Ensaios Matemáticos [Mathematical Surveys]. Sociedade Brasileira de Matemática, Rio de Janeiro, 2005.
[18] A. Yu. Ol’shanskiĭ. Almost every group is hyperbolic. Internat. J. Algebra Comput., 2(1):1–17, 1992.
[19] Laurent Saloff-Coste. Random walks on finite groups. In Probability on discrete structures, volume 110 of Encyclopaedia Math. Sci., pages 263–346. Springer, Berlin, 2004.
[20] Z. Sela. Diophantine geometry over groups. VI. The elementary theory of a free group. Geom. Funct. Anal., 16(3):707–730, 2006.
[21] W. Szmielew. Elementary properties of Abelian groups. Fund. Math., 41:203–271, 1955.
[22] A. W. van der Vaart. Asymptotic statistics, volume 3 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge, 1998.