Sum and Difference Sets in Generalized Dihedral Groups
Abstract.
Given a group , we say that a set has more sums than differences (MSTD) if , has more differences than sums (MDTS) if , or is sum-difference balanced if . A problem of recent interest has been to understand the frequencies of these type of subsets.
The seventh author and Vissuet studied the problem for arbitrary finite groups and proved that almost all subsets are sum-difference balanced as . For the dihedral group , they conjectured that of the remaining sets, most are MSTD, i.e., there are more MSTD sets than MDTS sets. Some progress on this conjecture was made by Haviland et al. in 2020, when they introduced the idea of partitioning the subsets by size: if, for each , there are more MSTD subsets of of size than MDTS subsets of size , then the conjecture follows.
We extend the conjecture to generalized dihedral groups , where is an abelian group of size and the nonidentity element of acts by inversion. We make further progress on the conjecture by considering subsets with a fixed number of rotations and reflections. By bounding the expected number of overlapping sums, we show that the collection of subsets of the generalized dihedral group of size has more MSTD sets than MDTS sets when for , where is the number of elements in with order at most . We also analyze the expectation for and for , proving an explicit formula for when is prime.
Key words and phrases:
More Sums Than Differences, Dihedral Group, Generalized Dihedral Group2020 Mathematics Subject Classification:
11P99, 05B101. Introduction and Main Results
Given a set of integers, the sumset and difference set of are defined as
(1) |
These elementary operations are fundamental in additive number theory. A natural problem of recent interest has been to understand the relative sizes of the sum and difference sets of sets .
Definition 1.1.
We say that a set has more sums than differences (MSTD) if ; has more differences than sums (MDTS) if ; or is sum-difference balanced if .
We intuitively expect most sets to be MDTS since addition is commutative and subtraction is not. Nevertheless, MSTD subsets of integers exist. Nathanson detailed in [Nat07] the history of the problem, and attributed to John Conway the first recorded example of an MSTD subset of integers, . Martin and O’Bryant proved in [MO07] that the proportion of the subsets of which are MSTD is bounded below by a positive value for all . They proved this by controlling the “fringe” elements of , those close to and , which have the most influence over whether elements are missing from the sum and difference sets. In [Zha11], Zhao gave a deterministic algorithm to compute the limit of the ratio of MSTD subsets of as goes to infinity and found that this ratio is at least . For more on the problem of MSTD sets in the integers, see also [Heg07] and [Nat07a] for constructive examples of infinite families of MSTD sets, [MOS10] and [Zha10] for non-constructive proofs of existence of infinite families of MSTD sets, and [HM09] and [HM13] for an analysis of sets with each integer from to included with probability .
More recently, several authors have examined analogous problems for groups other than the integers. For example, Do, Kulkarni, Moon, Wellens, Wilcox, and the seventh author studied in [DKMMWW15] the analogous problem for higher-dimensional integer lattices.
For finite groups, although the usual notation for the operation of the group is multiplication, we match the notation from previous work and define, for a subset , its sumset and difference set as
(2) |
Definition 1.1 of MSTD, MDTS, and sum-difference balanced sets apply in this context.
The approaches used to study MSTD subsets of integers do not generalize for MSTD subsets of finite groups due to the lack of fringes. Zhao proved asymptotics for numbers of MSTD subsets of finite abelian groups as the size of the group goes to infinity in [Zha10a]. The seventh author and Vissuet examined the problem for arbitrary finite groups , also with the size of the group going to infinity, and proved Theorem 1.2.
Theorem 1.2 ([MV14]).
Let be a sequence of finite groups, not necessarily abelian, with . Let be a uniformly chosen random subset of . Then as . In other words, as the size of the finite groups increases without bound, almost all subsets are balanced (with sumset and difference set equalling the entire group).
Furthermore, for the case of dihedral groups , they proposed Conjecture 1.3.
Conjecture 1.3 ([MV14]).
Let be an integer. There are more MSTD subsets of than MDTS subsets of .
Given a set , define (resp. ) as the set of elements of of the form (resp. ), called rotation elements (resp. flip elements). Hence, . Then, we can write
(3) |
Intuition for Conjecture 1.3 comes from noting that and contribute to both and ; and contribute only to ; and contributes only to .
In 2020, Haviland, Kim, Lâm, Lentfer, Trejos Suáres, and the seventh author made progress towards Conjecture 1.3 by partitioning subsets of by size. They proposed Conjecture 1.4 as a means of proving Conjecture 1.3.
Conjecture 1.4 ([HKLLMT20]).
Let be an integer, and let denote the collection of subsets of of size . For any , has at least as many MSTD sets as MDTS sets.
They showed that Conjecture 1.4 holds for , which we reproduce in this paper, and we also extend their approach to . They also showed that Conjecture 1.4 holds for by showing that all sets in are sum-difference balanced. We prove this result in this paper, using Lemma 1.5. These results are proved in Section 2.
Lemma 1.5.
Let be an integer, and let . Let (resp. ) be the subset of rotations (resp. flips) in . Suppose that or . Then, cannot be MDTS.
Furthermore, we extend Conjecture 1.3 as follows. A generalized dihedral group is given by , where is any abelian group and where the nonidentity element of acts on by inversion.
Conjecture 1.6.
Let be an abelian group with at least one element of order or greater, and let be the corresponding generalized dihedral group. Then, there are more MSTD subsets of than MDTS subsets of .
Conjecture 1.3 is a special case of Conjecture 1.6, with . We also state Conjecture 1.7, analogous to Conjecture 1.4.
Conjecture 1.7.
Let be a generalized dihedral group of size , and let denote the collection of subsets of of size . For any , has at least as many MSTD sets as MDTS sets.
In Section 3, we prove our main theorem, verifying Conjecture 1.7 for the case of , where is a constant (independent of ) depending only on the quantity , the number of elements of order at most in the abelian group . More explicitly, we show the following.
Theorem 1.8.
Let be a generalized dihedral group of size . Let denote the collection of subsets of of size , and let denote the number of elements in with order at most . If , where , then there are more MSTD sets than MDTS sets in .
See Section 3 the proof of this result and two related theorems. We also extend these results to the dihedral group on finitely generated abelian groups in Section 3.3.
Next, in Section 4, we discuss the following result about the expected size of when is a randomly chosen set in , the collection of subsets of of size .
Theorem 1.9.
If is prime, and is chosen uniformly at random from , then
(4) |
Finally, in Section 5, we discuss directions for further research.
2. Direct Analysis
2.1. Small Subsets
For the case of the usual dihedral group , we have the following two results.
Lemma 2.1 ([HKLLMT20]).
Let , and let denote the collection of subsets of of size . Then, has strictly more MSTD sets than MDTS sets.
Lemma 2.2.
Let , and let denote the collection of subsets of of size . Then, has strictly more MSTD sets than MDTS sets.
The proofs for both these lemmas use basic and somewhat tedious casework; they can be found in Appendix A.
Similar results for the generalized dihedral group likely follow from similar arguments.
2.2. Large Subsets
We consider what happens when gets close to . Here we can prove a result for any generalized dihedral group. For the rest of this section, let be a finite abelian group of size . Recall that the generalized dihedral group is given by , where the nonidentity element of acts on by inversion. Note that . Writing an element of the group as where and , we write to mean the subset of consisting of elements with and to mean the subset of consisting of elements with . Note that . For the case where is the usual dihedral group, and are the sets of rotations and flips, respectively; out of convenience, we will use these terms for the general case as well.
It turns out that having ensures that is balanced.
Lemma 2.3.
Let be a generalized dihedral group of size . Let , and let and . If , then and .
Proof.
Let be the larger of and , and define if and if .
For each rotation , define and . Note that , and , , are subsets of the set which has size . Hence, by the inclusion–exclusion principle, and are nonempty. Thus, and .
Therefore, as desired, and . ∎
Remark 1.
Note Lemma 2.3 implies if , then is not MDTS. This follows from the discussion after the statement of Conjecture 1.3 (which we explicitly extend to the general dihedral group case in Section 3). For a set to be MDTS, must contribute rotations to that the set does not have. But here we have shown that if , then has all the rotations.
Lemma 2.4.
Let be a generalized dihedral group of size , and let with . If , then .
Proof.
Let (resp. ) be the subset of rotations (resp. flips) in ; hence . Let be the larger and smaller of and , respectively. Define for . Thus, .
By Lemma 2.3, we have that and .
For each flip , define and .
Note that , , and are subsets of , which has size . Hence, by the inclusion–exclusion principle, and are nonempty. Thus, and . Therefore, and .
Thus, we have and , which imply . ∎
3. Collision Analysis
Let be a finite abelian group of size , written multiplicatively. Recall that the generalized dihedral group is given by , where the nonidentity element of acts on by inversion.
This section is dedicated to proving the following.
See 1.8
Note that the theorem is only useful when , or .
If is arbitrarily large and is a constant compared to , we can make a stronger statement: we can replace in the above theorem with a constant arbitrarily close to . Specifically, we have the following.
Theorem 3.1.
For fixed and , there exists with the following property. Let be an abelian group of size with at most elements of order or . Then with and defined as in Theorem 1.8, we have that if , then there are more MSTD sets than MDTS sets in .
We can give a stronger, more general statement on the proportion of MSTD sets in if is large and also bounded above by a (smaller) constant times .
Theorem 3.2.
Let , and be defined as in Theorem 1.8. For any , there exist and such that if , the proportion of MSTD sets in is at least .
Here and are independent of , but similarly to before, this theorem is only useful when .
Remark 2.
In practice, Theorems 1.8 and 3.2 are most useful if is essentially a constant compared to . This is indeed the case for the original dihedral group , where when is odd and when is even, yielding . The family of original dihedral groups is also a good example of how to use Theorem 3.1: we can apply that theorem with and arbitrarily small to get that for large enough dihedral groups, we can get the coefficient of the in the theorem to be very close to , which is a significant improvement over .
Recall the definitions of and from Section 2.2. Note that any element in has order in . Furthermore, any element in has order at most in if and only if it has order at most in .
We begin with a set with size and count the number of elements in and . In this count, we will make a naive assumption: there are no overlaps between sums and differences that we do not expect to overlap. Decompose into the union of the set of rotations of and the set of flips , and define . We have:
(5) |
Consider first the flips in and . In , these are in and . Note that we do not expect a lot of overlap in general; for a rotation and a flip , does not equal unless has order or in . On the other hand, for the flips in , we have . This is because , and , so these two are the same. There are rotations and flips in , so we thus expect the flips to contribute to but only to .
Next, consider the rotations in and . Begin with and . Since all flips have order , these are in fact the same set and thus always contribute equally to and to . Also note that if (we will treat the case later), the identity is contained in and .
Next consider and . Adding rotations is commutative, so we expect to contribute to the size of , where the term comes from the sum of each rotation in with itself. On the other hand, in general , so is expected to contribute . Here there is no additional term since when , we have , and was already counted in from .
We now put this all together. For , we need
(6) |
Note that when the set is necessarily balanced as . Further, one can now see why we may assume : when is smaller than , we expect the set to be MDTS, and indeed we will assume this is the case.
To use this naive estimate to prove our theorem, we first formalize our assumption that we have minimal overlaps within the sumset.
Definition 3.3.
Let such that , and let . We say that represents a collision if .
Every collision that occurs in has the potential to make smaller relative to our naive estimate, unless the quadruple is of a form that we already took into account. For example, if , then this is not a collision we need to count as trivially. Similarly, collisions of the form with and both rotations do not subtract from in Equation (3) as we already accounted for commutativity of addition for rotations. And finally, if is a collision where , , , and are all flips, then does not impact Equation (3) as do not affect the relative sizes of the sum and difference sets. We refer to these three kinds of quadruples as redundant.
Every non-redundant collision of , together with , decreases the size of by at most from our naive estimate. Let be half the total number of non-redundant collisions of . Then combining the above analysis with Equation (3), we are guaranteed to have that is MSTD when
(7) |
We use the quadratic equation to solve for when the above quantity equals and obtain . Thus Equation (7) is satisfied when:
(8) |
Remark 3.
We are assuming that no “collisions” of the form happen to lower the size of . Because our objective is to guarantee for a large proportion of , this assumption still gives a sufficient condition on and .
One therefore sees that for most values of , when the number of collisions is not too large, is MSTD. More formally, suppose that is chosen randomly out of the subsets of with size . Suppose that the expectation value of is bounded above by , where . Then the actual value of exceeds at most of the time by Markov’s inequality. Of sets with , the actual value of exceeds at most of the time. Thus when , Equation (8) is true at least of the time since when , the equation reads
(9) |
Now, we need to make sure that for our values of , more than proportion of sets in have . This will ensure that a proportion greater than of sets in satisfy Equation (8) and are therefore MSTD.
More formally speaking, we require
(10) | ||||
(11) |
where we may take the limit as because the left hand side of Equation (10) decreases with increasing . Equation (11) can be verified numerically to be true for .111In fact, when is large, we expect almost all of the sets in to have ; see Subection 3.1 for further discussion on this fact. For smaller values of , one can verify Equations (10) and (11) using the following Desmos link: https://www.desmos.com/calculator/e4zqwbmmcr.
The problem has therefore been reduced to placing an upper bound on the values of such that the expected value of is at most when is chosen uniformly at random from . The following lemma gives us the result we need.
Lemma 3.4.
When is chosen uniformly at random from , we have
(12) |
Much of the machinery of this proof lies in Lemma 3.4; its proof is rather technical and can be found in Subsection 3.2.
When , Lemma 3.4 implies that under the hypothesis of the lemma,
(13) |
We are ready to complete the proof of Theorem 1.8. Recall that we wanted to have to ensure most subsets of size would be MSTD. Thus the requisite upper bound on is determined as
(14) |
where .
This concludes the proof of Theorem 1.8.
3.1. Proof of Theorems 3.1 and 3.2
In this section we prove Theorem 3.1, and in the process we outline the steps needed to prove Theorem 3.2. We now assume that , where is a constant upper bound on the number of elements of order at most 2 in the group and is sufficiently large.
Having proved Theorem 1.8 for , we may now assume ; that is, is now large. We will follow similar steps to the previous proof, but using this assumption, we will increase to be arbitrarily close to , and we will decrease to be arbitrarily close to . Then we will have that the coefficient of the , which is (as discussed in the previous proof), is arbitrarily close to .
Take small . Note that inside of , the distribution of values of is a hypergeometric distribution. This is because one can construct a random set in by taking random elements of the group without replacement, one at a time; to begin with there is a chance each time that we choose a flip. Thus since is very large and is fixed, having is sufficient for a proportion at least of sets in to have .
Going back to Equation (8), we thus see that we just need to be at least , or , slightly more than half the time when is in the relevant interval. More specifically, we need
(15) |
Then, among sets with (which, recall, form a proportion of at least of sets in ), at least a proportion of satisfy Equation (8). This means that a proportion of at least of sets in are MSTD.
For Equation (15) to hold, we claim that it suffices to have the following probability bound, not conditioned on the size of :
(16) |
To see why Equation (16) implies Equation (15), call the event that and the event that . Then, we manipulate conditional probabilities as follows.
(17) |
Since and Equation (16) says that , we have that if Equation (16) is true, then
(18) |
and the claim is shown.
To ensure that Equation (16) is true, we require
(19) |
Then by Markov’s inequality, the probability that exceeds is at most , which is equivalent to Equation (16).
We may now choose a small value such that Equation (19) is true if
(20) |
Notice that in the limit , Equation (19) boils down to the statement that , so we can make be as small as desired by making be small.
We now revisit Lemma 3.4. We are now assuming that is large and is a constant, so only the first term dominates:
(21) |
We wanted to have . Thus the upper bound on is now determined as
(22) |
where
(23) |
If is arbitrarily large and is constant compared to , we can choose and to be very small, so that is arbitrarily close to .
This completes the proof of Theorem 3.1. To prove Theorem 3.2, we follow a very similar method. We choose large enough that almost all of the sets in have . The only difference is that now, in Equation (15) we replace the right-hand side with , so that the proportion of MSTD sets is at least . (We may choose so that equals the desired ). Then in Equation (16) we replace the right-hand side with , leading to the analog of Equation (20) being that for some small depending on . The rest of the proof continues as before, leading to a coefficient proportional to . This completes the proof.
3.2. Proof of Lemma 3.4
We now prove Lemma 3.4. Recall that is defined to be half the number of non-redundant collisions in the set , and we are interested in bounding above the expectation value of when is chosen uniformly at random from .
To more easily count the collisions in , we make the following definition.
Definition 3.5.
A redundant triple is a triple such that the quadruple is redundant. That is, a triple is redundant if , or if and are all flips, or if and and are both rotations. Denote to be the set of non-redundant triples.
Define the function by if for the non-redundant triple , the element is in , and otherwise.
For a fixed set , the set is the set of non-redundant triples with all three elements contained in . Notice that we have
(24) |
That is, the number of non-redundant collisions in is the same as the number of non-redundant triples such that the element is in , forming a quadruple representing a collision .
By definition of expectation value, we write
(25) |
Since is chosen uniformly at random from the sets in , we have . Thus,
(26) |
We swap the order of the sums.
(27) |
To compute the inner sum, we must simply count the number of sets with such that contains . That is,
(28) |
We now break this sum into seven pieces for different kinds of triples . These are:
-
•
-
•
-
•
-
•
-
•
-
•
-
•
We have , for these seven cases cover all the cases of possible equalities between the four elements except for those where , or equivalently, , since those cases are redundant triples. Furthermore, this union is disjoint.
Note that for triples , the quantity is given by since we are requiring four distinct elements to be in , and we have choices for the remaining elements. For triples in and , we are requiring three distinct elements to be in , so we have . Finally, for triples in and we have .
Therefore, from Equation (28), we may write:
(29) |
where in the second line we used the fact that
(30) |
and similarly for the other two terms.
Now, to use Equation (29) to find an upper bound on , all that remains to be done is find an upper bound on each of the ’s. We do so next.
We bound using the trivial inequality . There are total triples in , but we may subtract the redundant triples, including the triples consisting of three flips. Thus we obtain
(31) |
We bound and next. We have choices for . In , must equal , and in , must equal , and in both, we have at most choices for the remaining value of or . The extra condition that is distinct from the others only lowers and , so we do not have to take it into account to obtain an upper bound. Thus,
(32) |
For and , we have choices for and choices for , but must be the element for or for . Thus we have
(33) |
When considering , we note that since , the triple is redundant if and are both rotations, or if both are flips. So, one must be a rotation and the other must be a flip, and we require that , or . Thus to bound we must count the number of pairs of elements with , that is, . This happens if and only if , or . Recalling that there are elements in with order or less, there are therefore choices for the rotation element, and choices for the flip element. We multiply by since can be either the flip or the rotation, and is the other of the two. Thus,
(34) |
Finally, for , we again split into two cases: firstly where is a rotation, and secondly where is a flip, so is a rotation to avoid redundancy. Here we require , or . For the first case, we first consider the number of pairs with . Since must be a rotation, there are only choices for ; can be a flip or a rotation, so there are ways to have . So, there are at most pairs of this kind. If is a rotation and , then must also be a rotation since otherwise . Thus and commute, so . There are choices of with , and for each, there are choices of which have . Thus there are at most pairs of this kind.
3.3. Finitely Generated Abelian Groups
We transition to a discussion of finitely generated abelian groups . When Theorems 1.8, 3.1, and 3.2 hold, so the remaining case is when is an infinite group. However, we must make some restriction as to ensure taking subsets uniformly at random is well defined. By the fundamental theorem of finitely generated abelian groups,
(37) |
where are powers of (not necessarily distinct) primes. We denote elements of as a tuple , where and for . Since this section will occasionally require us to deal with multiple groups simultaneously, we update our notation of as the generalized dihedral group of to the standard . We still denote elements of as , where .
For some fixed , we will consider taking subsets uniformly at random from the finite
(38) |
Our goal is to leverage Theorems 1.8, 3.1, and 3.2, and, to do so, we will refer to , where
(39) |
To adhere to prior notation, let be the number of elements in that are at most order and let denote the collection of subsets of that have size . Then we get the following corollaries of Theorems 1.8, 3.1, and 3.2:
Corollary 3.6.
If , then there are more MSTD than MDTS in .
Corollary 3.7.
If , then if , then there are more MSTD than MDTS sets in .
Corollary 3.8.
For any , there exist and such that if , the proportion of MSTD sets in is at least .
Proof.
We will establish a bijection such that, if is a collision, then is also a collision. Since we are still working with generalized dihedral groups, this immediately implies Corollaries 3.6, 3.7, and 3.8, as the number of non-degenerate collisions in is bounded above by the number of non-degenerate collisions in .
Let defined by As defined, this is clearly a bijection. Let be a collision. By definition, . Let . Define the binary operation by
(40) |
Then we get
(41) |
In other words, we are given the following system of equations:
Consider . For to be a collision in , we require the following system of equations:
which is implied by our given system. Therefore the result follows. ∎
We present an example of the number of collisions in another dihedral group . We show that the dihedral group possesses exactly the same number of possible collisions as the dihedral group if and only if is odd and there are more collisions in otherwise. This result only provides some intuition on the the expected number of collisions depending on , the number of elements of order within the particular abelian group.
Lemma 3.9.
The number of possible collisions within the two groups are equal if and only if is odd and there are more collisions in otherwise.
To do so, we make use of the following useful results:
Lemma 3.10.
The number of pairs where and and the number of pairs where are both equal to
Proof.
For any given and , there exist only a single and which satisfies the conditions respectively. Thus, there are such pairs for both cases. ∎
Lemma 3.11.
The number of pairs where and is as follows:
-
•
when is odd, there are such pairs
-
•
when is even and is odd or is odd, there are such pairs
-
•
when , , and are all even, there are such pairs
Proof.
When is odd, each choice of is paired with a single possible , with one such pair being . Similarly, the same holds for and . This gives as the number of pairs. However, we over-counted since swapping with and with does not yield a distinct pair. Thus, we divide by except for the single pair where and which we did not over-count to get pairs of .
When is even but is odd, we choose pairs first which gives distinct pairs where is never equal to . For each of these pairs, we can choose any choice of which forces without over-counting. Thus, we get pairs. Similar arguments hold for when is odd.
When all of and are even, we get that there are two pairs of identical number and pairs of different numbers which sum to and similar for in . We choose first. If , we have choices for choosing . If , we have that for any , we uniquely determines without over-counting and thus there are choices. In total, we have pairs of . ∎
Lemma 3.12.
The number of pairs where is as follows:
-
•
when is odd, there are such pairs
-
•
when is even but is odd, there are such pairs
-
•
when both and are even, there are such pairs
Proof.
When is odd, we have that each of is paired with another , with exactly one being paired with itself. Thus, there are pairs of and .
When is even but is odd, we have that each of is paired with another , necessarily distinct. Thus, there are pairs of and .
When is even and is even, we have that each of is paired with another , with exactly two being paired with themselves. Thus, there are pairs of and . ∎
Combining these results over the casework where elements of our pairs may be a rotation or a flip yield the desired result, with the details of the proof in Appendix B.
4. Expected Size of Sum and Difference Sets
In this section, we only consider the classical dihedral groups
(42) |
Throughout this section, we use to denote the set of subsets of size in .
The method of collision analysis will likely not be sufficient to prove that has more MSTD sets than MDTS sets for values of greater in order of magnitude than . The intuition for this comes from the fact that the sum and difference sets for should very roughly have size on the same order of magnitude as . Hence, one would expect to usually have when is much greater than . The analysis for relative numbers of MSTD and MDTS sets in for these larger values of should therefore be based on counting the number of missed sums and differences in , in direct analogy with the case of slow decay for the integers in [HM09].
We take the first steps toward such an analysis by proving the following special case.
See 1.9
This follows from the following straightforward yet useful lemma, which reduces the problem of computing the probability of missing a sum or difference to an analogous problem in .
Lemma 4.1.
Let be a subset in chosen uniformly at random. Then if is a rotation in , we have
(43) |
and
(44) |
If is a flip in , we have
(45) |
and
(46) |
Proof.
Partition into its set of rotations and flips , and suppose that and . The rotation element can appear in precisely as for or as for . Taking the probability of the negations of these events respectively give second and third probabilities appearing in the sum of Equation (43). Equation (44) follows similarly.
A number of these probabilities can be expressed explicitly in terms of , , and . To prove Theorem 1.9, we need to compute all probabilities appearing in Equation (44) and Equation (46), but we will also compute the probabilities appearing in Equation (43) for completeness.
Lemma 4.2.
Let be a subset of of size chosen uniformly at random, and let be any element of . Then
(47) |
Proof.
If and are both even, then there exist exactly elements of that give when added to themselves. The remaining elements of partition into pairs of distinct elements adding to , and any such that is obtained by selecting of these pairs and one element from each pair, hence the result. The case where is even and is odd is identical except that all of the elements of are now partitioned into pairs as there are no elements that give when added to themselves. Finally, if is odd, then there is always a unique element that gives when added to itself, and the remaining elements partition into pairs, after which the same analysis can be applied. ∎
Lemma 4.3.
Let be a subset of of size chosen uniformly at random, and let be any nonzero element of of order . Then
(48) |
where
(49) |
Remark 4.
Proof.
Partition into the additive cosets of the subgroup . Each of these cosets has size and has elements that can be cyclically ordered such that the difference between any element and its predecessor is . Choosing a set of size such that is then equivalent to partitioning into with each , and choosing non-adjacent elements from the coset to include in . This is precisely what is counted by Equation (48) if is equal to the number of ways to choose non-adjacent elements from a cyclically ordered set of size .
Indeed, this is clear for , so assume . There are choices for the first element to be included, and each selection of elements may be made in different ways by designating different elements to be this first choice. Once the first element has been selected, the number of ways to choose the remaining elements is equal to the number of ways to choose non-adjacent elements from a linearly ordered set of size without choosing the extremal elements. This is the classic stars and bars problem, which yields precisely the binomial coefficient in the definition of . ∎
Lemma 4.4.
Let and be subsets of of sizes and , respectively, chosen uniformly at random, and let be any element of . Then
(51) |
Proof.
For any fixed choice of , we have total choices for . For each of these , we have if and only if for all . Because , this leaves choices for such that . ∎
We now have all the necessary parts to prove Theorem 1.9. Simple combinatorics yields
(52) |
Combining this with Lemmas 4.1, 4.3, and 4.4 yields the result after some simplification. See Appendix C for complete details.
By plotting against for certain large values of , we can see evidence for our intuition from the beginning of this section that quickly becomes close to for on the order of magnitude of (Figure 1). Numerical evidence from many large primes suggests that the value of such that is about .222Some code written for this project can be found at https://github.com/ZeusDM/MSTD-experiments.

5. Future Work
An immediate future direction of research is to extend the bounds on to show Conjecture 1.7 for all and . However, it is unlikely that the methods we have used in this paper will be useful for larger values of . This is because our approach showed that for values of that we considered, the majority of subsets of the generalized dihedral group with size were MSTD. But this is simply not the case for larger : numerical evidence shows that for any , the vast majority of the sets are balanced. A new approach is required to show that out of the sets that are not balanced, more are MSTD.
To this end, it would also be productive to more carefully analyze how many elements of the group are not in (or not in ) for closer to . This follows the approach of [HM09] for the “slow decay” case. Such an analysis could be used to find explicit formulas for the expectation values of and . In this paper we found a formula for , but only when is prime. Further, to use these results, we would also need to bound the variance of and . Depending on the results, this could be enough to prove Conjecture 1.6 if we can deduce an upper bound on the number of MDTS sets and a lower bound on the number of MSTD sets such that .
Yet another possible approach to prove Conjecture 1.6 is to construct an injective map from MDTS sets to MSTD sets in the group. Such an approach has proven to be difficult, but has the potential advantage of working for both large and small values of .
Appendix A Proof of Lemmas 2.1 and 2.2
See 2.1
Proof.
There are only 3 possible cases to consider for .
-
•
contains two flip elements: in this case, adding and subtracting flips is an identical operation as each flip element has order 2. Thus, will necessarily be sum-difference balanced.
-
•
contains one flip and one rotation element: let . Here, . However, . When or , will be MSTD. In the case where both of these are actually equalities (), will simply be balanced.
-
•
contains two rotation elements: let . Here, . However, . Note that . Both the sum set and the difference set have 3 elements except in one special case. Suppose that . Then, we have that . Thus, when contains two rotation elements only, is always balanced.
Therefore, has strictly more MSTD sets in than MDTS sets. ∎
See 2.2
Proof.
There are 4 possible cases to consider for .
-
•
contains three flip elements: just as in the case, since addition and subtraction of two flips are equivalent, is balanced.
-
•
contains two flip elements and one rotation element: let for . Then the sumset is
(53) while the difference set is
(54) so . Moreover, is MSTD precisely when at least one of and is distinct from each of the flips in that lie in . We enumerate all the ways that this could fail to occur:
We have and if and only if . This happens if and only if or if is even and , and in each of these cases we have choices for and . On the other hand, we have and if and only if , which implies that . We also require ; because , these equations can occur if and only if is divisible by , or , and . We therefore have choices for , and choices for and one corresponding choice of for each . Finally, note that we cannot have because , so this completes the enumeration of all such sets that fail to be MSTD.
There are total sets containing exactly flips. Subtracting the balanced sets that we just enumerated, we obtain the following numbers of MSTD sets:
if , if , if . (55) -
•
contains one flip element and two rotation element: let for . Then the sumset is
(56) while the difference set is
(57) We show that cannot be MDTS by separately comparing the flips and the rotations in and . The flips in are contained in , so this comparison is trivial. The rotations in are and ; we will show that has at least as many rotations. Consider the rotations in : note that is different from and from since , so there are at least distinct rotations in . If , then in fact has at least three distinct rotations, and we have . On the other hand, if , then , so there are only at most distinct rotations in , so again .
-
•
contains three rotation elements: for this case, there are such subsets. We compare directly to Equation (• ‣ A), the case that yields the least possible number of MSTD sets in . Here we have
(58) or equivalently,
(59) which is true if . When , since , we compare to instead, and the result is verified once again.
Therefore, even if all of the subsets with three rotation elements are MDTS, which is never true to begin with, we still have strictly more MSTD subsets of size 3 than MDTS subsets of size 3. ∎
Appendix B Proof of Lemma 3.9
For any given , we calculate the number of pairs where . Similarly, we also calculate the number of pairs where . Note that these can be counted through counting the number of pairs in which results in as follows:
Case 1,
-
•
Case 1.1,
-
–
Case 1.1.1 and are not flips.
We get that this is equivalent to counting pairs where and
-
–
Case 1.1.2 and are both flips.
We get that this is equivalent to counting pairs where and
-
–
-
•
Case 1.2,
-
–
Case 1.2.1 and are not flips.
We get that this is equivalent to counting pairs where and
-
–
Case 1.2.2 and are both flips.
We get that this is equivalent to counting pairs where and
-
–
Case 2,
-
•
Case 2.1
-
–
Case 2.1.1 is a flip.
We get that this is equivalent to counting pairs where and
-
–
Case 2.1.2 is a flip.
We get that this is equivalent to counting pairs where and
-
–
-
•
Case 2.2
-
–
Case 2.2.1 is a flip.
We get that this is equivalent to counting pairs where and
-
–
Case 2.2.2 is a flip.
We get that this is equivalent to counting pairs where and
-
–
We get a similar result with where or as follows:
Case 1,
-
•
Case 1.1,
-
–
Case 1.1.1 and are not flips.
We get that this is equivalent to counting pairs where
-
–
Case 1.1.2 and are both flips.
We get that this is equivalent to counting pairs where
-
–
-
•
Case 1.2,
-
–
Case 1.2.1 and are not flips.
We get that this is equivalent to counting pairs where
-
–
Case 1.2.2 and are both flips.
We get that this is equivalent to counting pairs where
-
–
Case 2,
-
•
Case 2.1
-
–
Case 2.1.1 is a flip.
We get that this is equivalent to counting pairs where
-
–
Case 2.1.2 is a flip.
We get that this is equivalent to counting pairs where
-
–
-
•
Case 2.2
-
–
Case 2.2.1 is a flip.
We get that this is equivalent to counting pairs where
-
–
Case 2.2.2 is a flip.
We get that this is equivalent to counting pairs where
-
–
Note that the signs are the same between corresponding cases in and . And thus there are only four distinct cases we need to count, and it suffices to compare the number of 4-tuple collisions over all with the number of 4-tuple collisions over all
We will prove the main result by counting the collisions of each of the 3 types. When is odd, it is simple to see from Lemma 3.10, Lemma 3.11, and Lemma 3.12 that the number of collisions for each of the in is the same for the corresponding and thus there are an equal total number of collisions. We now assume that is even.
-
•
Suppose is odd, from Lemma 3.11 we have pairs for each for a total of collisions. From Lemma 3.12, we also get pairs for each for an equal total collisions.If is even, we have a half of yielding pairs and another half yielding pairs for a total of . Meanwhile in we have each yielding for an equal total of . Thus, there are strictly more collisions in in this case.
-
•
Suppose is odd, we have choices for and choices for for a total of . We also get the same results for .If is even, we have half of yielding pairs and another half yielding pairs for a total of . For , we have each yielding collisions for a total of
-
•
Both groups have the same number of collisions for each and the corresponding at . Thus, the total number of collisions in this case is .
Thus, if we look over all the cases where is even, we found that either there are strictly more collisions in or there are equal amount of collisions. Thus, we can conclude that there are strictly more collisions in when is even.
Appendix C Proof of Theorem 1.9
We have
(60) |
by the combinatorial identity
(61) |
References
- [DKMMWW15] Thao Do et al. “Sets characterized by missing sums and differences in dilating polytopes” In J. Number Theory 157, 2015, pp. 123–153 DOI: 10.1016/j.jnt.2015.04.027
- [Heg07] Peter V. Hegarty “Some explicit constructions of sets with more sums than differences” In Acta Arith. 130.1 Instytut Matematyczny Polskiej Akademii Nauk, 2007, pp. 61–77 DOI: 10.4064/aa130-1-4
- [HKLLMT20] John Haviland et al. “More Sums Than Differences Sets in Finite Non-Abelian Groups”, Presentation slides, Young Mathematicians Conference, The Ohio State University, 2020 URL: http://www-personal.umich.edu/~havijw/resources/talks/mstd_finite_nonabelian.pdf
- [HM09] Peter V. Hegarty and Steven J. Miller “When almost all sets are difference dominated” In Random Structures Algorithms 35.1 Wiley Online Library, 2009, pp. 118–136 DOI: 10.1002/rsa.20268
- [HM13] Virginia Hogan and Steven J. Miller “When Generalized Sumsets are Difference Dominated”, 2013 arXiv:1301.5703 [math.NT]
- [MO07] Greg Martin and Kevin O’Bryant “Many sets have more sums than differences” In Additive combinatorics 43, CRM Proc. Lecture Notes Amer. Math. Soc., Providence, RI, 2007, pp. 287–305 DOI: 10.1090/crmp/043/16
- [MOS10] Steven J. Miller, Brooke Orosz and Daniel Scheinerman “Explicit constructions of infinite families of MSTD sets” In J. Number Theory 130.5, 2010, pp. 1221–1233 DOI: 10.1016/j.jnt.2009.09.003
- [MV14] Steven J. Miller and Kevin Vissuet “Most subsets are balanced in finite groups” In Combinatorial and additive number theory 101, Springer Proc. Math. Stat. Springer, 2014, pp. 147–157 DOI: 10.1007/978-1-4939-1601-6“˙11
- [Nat07] Melvyn B. Nathanson “Problems in additive number theory. I” In Additive combinatorics 43, CRM Proc. Lecture Notes Amer. Math. Soc., Providence, RI, 2007, pp. 263–270 DOI: 10.1090/crmp/043/13
- [Nat07a] Melvyn B. Nathanson “Sets with more sums than differences” In Integers 7, 2007, pp. A5\bibrangessep24 arXiv: https://www.emis.de/journals/INTEGERS/papers/h5/h5.pdf
- [Zha10] Yufei Zhao “Constructing MSTD sets using bidirectional ballot sequences” In J. Number Theory 130.5, 2010, pp. 1212–1220 DOI: 10.1016/j.jnt.2009.11.005
- [Zha10a] Yufei Zhao “Counting MSTD sets in finite abelian groups” In J. Number Theory 130.10 Elsevier, 2010, pp. 2308–2322 DOI: 10.1016/j.jnt.2010.06.001
- [Zha11] Yufei Zhao “Sets characterized by missing sums and differences” In J. Number Theory 131.11, 2011, pp. 2107–2134 DOI: 10.1016/j.jnt.2011.05.003