On computing sets of integers with maximum number of pairs summing to powers of 2

Max A. Alekseyev The George Washington University, Washington, DC, USA. Email: maxal@gwu.edu

Abstract

We address the problem of finding sets of integers of a given size with the maximum number of pairs summing to powers of $2$. By fixing particular pairs, this problem reduces to finding a labeling of the vertices of a given graph with pairwise distinct integers such that the endpoint labels of each edge sum to a power of $2$. We propose an efficient algorithm for this problem, which we use to determine the maximum size of graphs of order $n$ that admit such a labeling for all $n\leq 18$. We also identify the minimal forbidden subgraphs of order $\leq 11$, whose presence prevents graphs from having such a labeling.

1 Introduction

In April 2021, Dan Ullman and Stan Wagon, in the “Problem of the Week 1321” [3], defined a function $f(A)$ of a finite set $A$ of integers as the number of $2$-element subsets of $A$ that sum to a power of $2$. They gave the example $f\big(\{-1,3,5\}\big)=3$ and further defined a function $g(n)$ as the maximum of $f(A)$ over all $n$-element sets $A$. The problem asked for a proof that $g(10)\geq 14$, which was quickly improved to $g(10)\geq 15$ by the readers.
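The definition of $f(A)$ can be checked directly on small sets. The following is a minimal sketch, not code from the paper; the helper names are illustrative, and it assumes that $1=2^{0}$ counts as a power of $2$, consistent with the nonnegative exponents used later in the text.

```python
from itertools import combinations

def is_power_of_two(k: int) -> bool:
    """True for 1, 2, 4, 8, ... (assumption: 2**0 = 1 is included)."""
    return k > 0 and k & (k - 1) == 0

def f(A) -> int:
    """Count the 2-element subsets of A whose sum is a power of 2."""
    return sum(is_power_of_two(a + b) for a, b in combinations(set(A), 2))

print(f({-1, 3, 5}))  # 3, matching the example in the text
```

Here all three pair sums $2$, $4$, and $8$ are powers of $2$, which is why the example set attains $f=3$.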

It was noted that the problem has a natural interpretation as finding a maximal graph of order $n$, where the vertices are labeled with pairwise distinct integers and the sum of the endpoint labels of each edge is a power of $2$. In March 2022, M. S. Smith proved that such a graph cannot contain a cycle $C_{4}$, limiting the candidate graphs to the well-studied squarefree graphs [2]. Smith’s result made it easy to establish the values of $g(n)$ for all $n\leq 9$, and led to the creation of sequence A352178 in the On-Line Encyclopedia of Integer Sequences (OEIS) [8]. It further provided a nontrivial upper bound for $g(n)$, namely the maximum number of edges in a squarefree graph of order $n$ (given by sequence A006855 in the OEIS).

The problem received further attention after Neil Sloane presented it on the popular Numberphile YouTube channel in September 2022 [1]. This was followed by a few improvements, including the values $g(10)=15$ and $g(11)=17$ from Matthew Bolan [5] and Firas Melaih [4], respectively. These results were obtained via a graph-theoretical treatment of the problem, by manually analyzing a few candidate graphs.

At the same time, two methods were proposed for obtaining lower bounds for $g(n)$. The resulting bounds, covering at least all $n\leq 100$, are listed in sequences A347301 and A357574 in the OEIS.

In the present paper, we propose an algorithm for testing admissibility of a given graph, i.e., whether its vertices can be labeled with pairwise distinct integers such that the sum of the endpoint labels of each edge is a power of $2$. We use our algorithm to bound $g(n)$ from above, and together with the known lower bounds to establish the values of $g(n)$ for $n$ in the interval $[12,18]$. Also, interpreting Smith’s result as saying that the cycle $C_{4}$ is a minimal forbidden subgraph (MFS), we find larger MFSs and show that there are none on $5$, $6$, $8$, or $9$ vertices, while there are $2$ MFSs on $7$ vertices, $15$ MFSs on $10$ vertices, and $77$ MFSs on $11$ vertices. We have also enumerated all maximal admissible graphs of orders $14$, $15$, and $16$ and established that there are $4$, $28$, and $2$ of them, respectively.

2 Algorithm for testing graph admissibility

A given graph $G$ on $n$ vertices with $m$ edges is admissible if and only if the following matrix equation has a solution:

$$M\cdot L=X,\tag{1}$$

where

• $M$ is the $m\times n$ incidence matrix of $G$ with rows and columns indexed by the edges and vertices of $G$, and so $M$ is a $\{0,1\}$-matrix with each row containing exactly two $1$’s;
• $L=(l_{1},l_{2},\dots,l_{n})^{T}$ is a column vector of pairwise distinct integer vertex labels;
• $X=(x_{1},x_{2},\dots,x_{m})^{T}$ is a column vector formed by powers of $2$ representing the sums of the edges’ endpoint labels (the elements of $X$ are not required to be distinct).

Both $L$ and $X$ in this equation are unknown and have to be determined.

We start with solving (1) for $L$ in terms of $X$, that is, we compute a (partial) solution of the form $l_{i}=p_{i}(x_{1},\dots,x_{m})$, where $p_{i}$ are linear polynomials with rational coefficients ($i\in\{1,2,\dots,n\}$). Such a solution exists if and only if $K_{l}\cdot X=0$, where $K_{l}$ is a matrix with rows forming a basis of the left kernel of $M$. For better efficiency, we will assume that $K_{l}$ has integer elements and is LLL-reduced. Let $E$ be the set of elements of $K_{l}X$, which are linear homogeneous polynomials with integer coefficients representing linear equations in $x_{1},\dots,x_{m}$.
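As a small worked illustration (not taken from the source), consider the triangle $K_{3}$ with edges $\{1,2\}$, $\{1,3\}$, $\{2,3\}$ listed in this order. Its incidence matrix has determinant $-2$, so the left kernel is trivial, $E=\emptyset$, and equation (1) determines $L$ uniquely in terms of $X$:

```latex
M=\begin{pmatrix}1&1&0\\ 1&0&1\\ 0&1&1\end{pmatrix},\qquad
l_{1}=\frac{x_{1}+x_{2}-x_{3}}{2},\quad
l_{2}=\frac{x_{1}-x_{2}+x_{3}}{2},\quad
l_{3}=\frac{-x_{1}+x_{2}+x_{3}}{2}.
```

Taking $(x_{1},x_{2},x_{3})=(2,4,8)$ gives $(l_{1},l_{2},l_{3})=(-1,3,5)$, which is the example set $\{-1,3,5\}$ from the Introduction.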

Let $K_{r}$ be a matrix whose columns form a basis of the right kernel of $M$. For a connected graph $G$, it is known that $K_{r}$ has size $n\times t$, where $t=1$ if $G$ is bipartite and $t=0$ otherwise [9]. We find it convenient to view an $n\times 0$ matrix as composed of $n$ empty rows (and so all rows are equal). Adding a linear combination of the columns of $K_{r}$ to a solution $L$ of equation (1) turns it into another solution $L^{\prime}$ (with the same $X$), and furthermore all solutions can be obtained this way. The following theorem implies that for any set of pairwise distinct rows of $K_{r}$, we can find a linear combination of the columns of $K_{r}$ such that the corresponding elements of $L^{\prime}$ will also be pairwise distinct.
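For a connected bipartite graph, the right kernel of the unsigned incidence matrix is spanned by the vector assigning $+1$ to one side of the bipartition and $-1$ to the other. The sketch below computes this vector by breadth-first 2-colouring; the function name and the edge-list input format are illustrative assumptions, not part of the paper.

```python
from collections import deque

def right_kernel_vector(n, edges):
    """For a connected graph on vertices 0..n-1 (n >= 1), return a vector
    spanning the right kernel of its unsigned incidence matrix, or None
    when the kernel is trivial (the graph is not bipartite)."""
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    sign = [0] * n          # 0 means "not coloured yet"
    sign[0] = 1
    queue = deque([0])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if sign[v] == 0:
                sign[v] = -sign[u]
                queue.append(v)
            elif sign[v] == sign[u]:
                return None  # odd cycle found: t = 0
    return sign              # +1/-1 colouring: t = 1

print(right_kernel_vector(3, [(0, 1), (1, 2)]))          # [1, -1, 1]
print(right_kernel_vector(3, [(0, 1), (0, 2), (1, 2)]))  # None
```

In the bipartite case, two vertices on the same side have equal entries, so these are exactly the pairs with equal rows of $K_{r}$. In the non-bipartite case every pair of rows is (vacuously) equal.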

Theorem 1.

Let $v$ be an integer column vector of size $k\geq 0$, and $A$ be an integer $k\times s$ matrix with pairwise distinct rows. Then there exists an integer linear combination of the columns of $A$ such that adding it to $v$ results in a vector with pairwise distinct elements.

Proof.

If $s=0$, then all rows of $A$ are empty and hence equal, so the distinctness of the rows forces $k\leq 1$, and thus $v$ already has pairwise distinct elements.

Let us prove the statement for $s=1$ . In this case, $A$ represents a column vector with pairwise distinct elements. Let $t$ be the difference between the largest and the smallest elements of $v$ . It is easy to see that vector $v+A\cdot(t+1)$ has pairwise distinct elements.

In the case of $s>1$ , let $d$ be the difference between the largest and the smallest elements of $A$ . Then the $k\times 1$ matrix $A^{\prime}:=A\cdot(1,(d+1),(d+1)^{2},\dots,(d+1)^{s-1})^{T}$ has pairwise distinct elements, thus reducing the problem to the case $s=1$ considered above. ∎
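The proof is constructive, and its two steps can be combined into a single choice of coefficients. The following is a minimal sketch under the assumption that $v$ and $A$ are given as plain Python lists of integers; the function name is hypothetical.

```python
def make_distinct(v, A):
    """Return (coeffs, w) where w = v + A*coeffs has pairwise distinct entries.

    Assumes v is a list of k integers and A is a list of k integer rows
    that are pairwise distinct, as in the statement of Theorem 1."""
    k = len(v)
    s = len(A[0]) if k else 0
    if s == 0:
        return [], list(v)            # here k <= 1, nothing to do
    entries = [a for row in A for a in row]
    d = max(entries) - min(entries)   # spread of the entries of A
    t = max(v) - min(v)               # spread of the entries of v
    # Column j gets weight (t+1)*(d+1)**j: the powers of d+1 collapse A into
    # one column with distinct entries, and the factor t+1 separates those
    # entries by more than any difference occurring within v.
    coeffs = [(t + 1) * (d + 1) ** j for j in range(s)]
    w = [v[i] + sum(A[i][j] * coeffs[j] for j in range(s)) for i in range(k)]
    return coeffs, w

print(make_distinct([0, 0, 0], [[0, 1], [1, 0], [1, 1]]))  # ([1, 2], [2, 1, 3])
```

For $s=1$ the weight reduces to $t+1$, which is the multiplier used in the proof.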

Theorem 1 implies that we need to take care only of the pairs of elements of $L$ corresponding to equal rows of $K_{r}$. For any equal rows of $K_{r}$ with indices $i<j$, we compute $q(x_{1},\dots,x_{m}):=(p_{i}(x_{1},\dots,x_{m})-p_{j}(x_{1},\dots,x_{m}))c$, where $c$ is a positive integer factor making all coefficients of $q$ integer. If $q$ is the zero polynomial, then the condition $l_{i}\neq l_{j}$ is unattainable, and thus the graph $G$ is inadmissible. On the other hand, if $q$ consists of just a single term with a nonzero coefficient, then the condition $l_{i}\neq l_{j}$ always holds, and we ignore such $q$. In the remaining case, when $q$ contains two or more terms with nonzero coefficients, we add $q$ to the set $N$ of inequations.

Our next goal is to solve the system of equations $E$ and inequations $N$ in powers of $2$, which we describe in the next section. (We deliberately use the term inequation to denote the relationship $p\neq 0$, to avoid confusion with inequalities, which traditionally denote the relationships $\geq$, $\leq$, $>$, or $<$.) For each solution $X=X_{0}$, we substitute it into system (1), turning it into a standard matrix equation, which we solve for $L$ composed of pairwise distinct integers $l_{i}$ (such a solution is guaranteed to exist). We outline the above description in Algorithm 1.

Algorithm 1. An algorithm for solving system (1) for a given graph $G$.

1: function GraphSolve($G$)
2:   Set $E:=\emptyset$ and $N:=\emptyset$
3:   Construct the incidence matrix $M$ of $G$ with rows and columns indexed by edges and vertices of $G$
4:   Compute an LLL-reduced basis $K_{l}$ of the left kernel of $M$   $\triangleright$ We have $K_{l}\cdot M=0$
5:   for each row $r$ of $K_{l}$ do
6:     Add polynomial $r\cdot X$ to $E$
7:   end for
8:   Solve $ML=X$ for $L$ in terms of $X$; let $(p_{1},\dots,p_{n})$ be any particular solution
9:   Compute $K_{r}$ whose columns form a basis of the right kernel of $M$   $\triangleright$ We have $M\cdot K_{r}=0$
10:   for each $\{i,j\}\subset\{1,2,\dots,n\}$ do
11:     if $i$th and $j$th rows of $K_{r}$ are not equal then
12:       continue to next subset $\{i,j\}$   $\triangleright$ Per Theorem 1
13:     end if
14:     Set $q$ equal to a multiple of $p_{i}-p_{j}$ with integer coefficients
15:     if $q=0$ then
16:       return $\emptyset$   $\triangleright$ No solutions with $l_{i}\neq l_{j}$
17:     end if
18:     if $q$ contains two or more terms then
19:       Add $q$ to $N$
20:     end if
21:   end for
22:   $S:=\emptyset$
23:   for each $s$ in SolveInPowers($E,\ N$) do   $\triangleright$ $s$ is a map from $Y$ to linear polynomials in $Y$
24:     Set $x_{i}:=2^{s[y_{i}]}$ for each component $x_{i}$ of $X$
25:     Solve $ML=X$ for $L$ composed of pairwise distinct integers, and add the solution to $S$
26:   end for
27:   return $S$
28: end function

3 Solving a system of (in)equations in powers of $2$

For given finite sets $E$ and $N$ of nonzero linear polynomials in $x_{1},x_{2},\dots,x_{m}$, our goal is to find all $m$-tuples of nonnegative integers $(y_{1},y_{2},\dots,y_{m})$ such that

$$\begin{aligned}\forall p\in E:&\quad p(2^{y_{1}},2^{y_{2}},\dots,2^{y_{m}})=0,\\ \forall p\in N:&\quad p(2^{y_{1}},2^{y_{2}},\dots,2^{y_{m}})\neq 0.\end{aligned}$$

As simple as it sounds, the following theorem provides a foundation for our algorithms.

Theorem 2.

In any multiset of nonzero integers that sum to $0$, there exist two elements with equal $2$-adic valuations. (Recall that the $2$-adic valuation of an integer $k\neq 0$, denoted by $\nu_{2}(k)$, is the exponent of $2$ in the prime factorization of $k$, while $\nu_{2}(0)=\infty$.)

Proof.

Let $S$ be a multiset of nonzero integers summing to $0$, and let $k$ be an element of $S$ with the smallest $2$-adic valuation, say $q:=\nu_{2}(k)$. If every other element of $S$ has valuation greater than $q$, then the sum of all elements (which is $0$) has valuation $q$, which is impossible. Hence, there exist at least two elements in $S$ having $2$-adic valuation equal to $q$. ∎
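The valuation argument is easy to check numerically. The sketch below uses a standard bit trick for $\nu_{2}$; the helper names and the sample multiset are illustrative and not taken from the paper.

```python
def nu2(k: int) -> int:
    """2-adic valuation of a nonzero integer k (works for negative k too)."""
    if k == 0:
        raise ValueError("nu2(0) is infinite")
    return (k & -k).bit_length() - 1  # k & -k isolates the lowest set bit

def min_valuation_count(S):
    """Number of elements of S attaining the smallest 2-adic valuation."""
    vals = [nu2(k) for k in S]
    return vals.count(min(vals))

S = [12, -20, 6, 2]                 # a multiset of nonzero integers summing to 0
print([nu2(k) for k in S])          # [2, 2, 1, 1]
print(min_valuation_count(S))       # 2, as Theorem 2 requires at least two
```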

Applying Theorem 2 to an equation $c_{1}x_{1}+\dots+c_{m}x_{m}\in E$, we conclude that if only one of the coefficients $c_{1},c_{2},\dots,c_{m}$ is nonzero, then the system $(E,N)$ has no solutions. Otherwise, if there are two or more nonzero coefficients among $c_{1},c_{2},\dots,c_{m}$, then there exists a pair of indices $i<j$ such that $c_{i}\neq 0$, $c_{j}\neq 0$, and $\nu_{2}(c_{i}x_{i})=\nu_{2}(c_{j}x_{j})$, implying that we can make a substitution $x_{i}=2^{\nu_{2}(c_{j})-\nu_{2}(c_{i})}x_{j}$ or $x_{j}=2^{\nu_{2}(c_{i})-\nu_{2}(c_{j})}x_{i}$ (we pick the one with an integer coefficient). We then make this substitution in $E$ and $N$, reducing the number of indeterminates, and if it does not make any element of $N$ evaluate to zero, we proceed with solving the reduced system recursively. After the pair $(i,j)$ is explored, we add a new inequation $2^{\nu_{2}(c_{i})}x_{i}-2^{\nu_{2}(c_{j})}x_{j}$ to $N$ (to prevent obtaining the same solutions again in the future), and proceed to the next pair of indices.
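To illustrate the branching on a small case (a worked example in the notation above, not taken from the source), label the vertices of the cycle $C_{4}$ with $l_{1},l_{2},l_{3},l_{4}$ and let the edge sums be $x_{1}=l_{1}+l_{2}$, $x_{2}=l_{2}+l_{3}$, $x_{3}=l_{3}+l_{4}$, $x_{4}=l_{4}+l_{1}$. The left kernel of $M$ yields a single equation, and the label differences are expressed through the same variables:

```latex
x_{1}-x_{2}+x_{3}-x_{4}=0,
\qquad
l_{1}-l_{3}=x_{1}-x_{2}=x_{4}-x_{3},
\qquad
l_{2}-l_{4}=x_{1}-x_{4}=x_{2}-x_{3}.
```

By Theorem 2, two of the four terms have equal valuations, so two of the $x_{i}$ are equal. The equalities $x_{1}=x_{2}$ and $x_{3}=x_{4}$ force $l_{1}=l_{3}$, while $x_{1}=x_{4}$ and $x_{2}=x_{3}$ force $l_{2}=l_{4}$. In the remaining cases, $x_{1}=x_{3}$ gives $2x_{1}=x_{2}+x_{4}$ and $x_{2}=x_{4}$ gives $2x_{2}=x_{1}+x_{3}$, which for powers of $2$ is possible only when all four values coincide. Every branch therefore violates the distinctness of the labels, in agreement with Smith’s result that $C_{4}$ is inadmissible.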

We outline the above description in Algorithm 2. For given sets $E$ and $N$ of linear equations and inequations in $x_{1},\dots,x_{m}$, function SolveInPowers$(E,\ N)$ computes the set of their solutions in powers of $2$. Each solution is given in the form of a map $s$ from the set of variables $Y:=\{y_{1},y_{2},\dots,y_{m}\}$ to linear polynomials in these variables, representing the exponents in the powers of $2$. Namely, $s$ sends every variable from $Y$ either to itself (when it is a free variable), or to a linear polynomial of the free variables. For example, the map $\{y_{1}\to y_{2}+1,\ y_{2}\to y_{2},\ y_{3}\to y_{2}+y_{4}+3,\ y_{4}\to y_{4}\}$ corresponds to the solution $(x_{1},x_{2},x_{3},x_{4})=(2^{y_{2}+1},2^{y_{2}},2^{y_{2}+y_{4}+3},2^{y_{4}})$, where $y_{2}$ and $y_{4}$ are free variables taking nonnegative integer values.
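One possible way to store such a map and turn it into concrete powers of $2$ is sketched below. The encoding of each linear polynomial as a constant plus a dictionary of coefficients is an assumption made for illustration, not the representation used by the author.

```python
# Each exponent is stored as (constant, {free variable: coefficient}).
solution = {
    "y1": (1, {"y2": 1}),           # y1 -> y2 + 1
    "y2": (0, {"y2": 1}),           # y2 -> y2        (free)
    "y3": (3, {"y2": 1, "y4": 1}),  # y3 -> y2 + y4 + 3
    "y4": (0, {"y4": 1}),           # y4 -> y4        (free)
}

def evaluate(solution, free_values):
    """Return (x_1, ..., x_m) for the given values of the free variables."""
    xs = []
    for name in sorted(solution):
        const, coeffs = solution[name]
        exponent = const + sum(c * free_values[v] for v, c in coeffs.items())
        xs.append(2 ** exponent)
    return tuple(xs)

print(evaluate(solution, {"y2": 0, "y4": 2}))  # (2, 1, 32, 4)
```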

Algorithm 2. An algorithm for solving a given system of linear equations $E$ and inequations $N$ for $x_{1},\dots,x_{m}$ in powers of $2$. It returns a set of maps $s$ from variables $Y:=\{y_{1},y_{2},\dots,y_{m}\}$ to linear polynomials in these variables such that $(x_{1},\dots,x_{m})=(2^{s[y_{1}]},2^{s[y_{2}]},\dots,2^{s[y_{m}]})$ is a solution.

1: function SolveInPowers($E,\ N$)
2:   if $E=\emptyset$ then
3:     return $\{\text{the identity map}\}$   $\triangleright$ Every variable in $Y$ is free
4:   end if
5:   Pick $c_{1}x_{1}+\dots+c_{m}x_{m}\in E$ with the smallest number of nonzero coefficients
6:   Let $I:=\{i\mid 1\leq i\leq m,\ c_{i}\neq 0\}$ be the set of indices of nonzero coefficients
7:   if $|I|=1$ then
8:     return $\emptyset$   $\triangleright$ Such an equation has no solutions
9:   end if
10:   Set $S:=\emptyset$   $\triangleright$ We accumulate solutions in $S$
11:   for each $\{i,j\}\subseteq I$ do   $\triangleright$ We iterate over all 2-element subsets of $I$
12:     Possibly exchanging the values of $i$ and $j$, ensure that $d:=\nu_{2}(c_{i})-\nu_{2}(c_{j})\geq 0$
13:     Construct $N^{\prime}$ from $N$ by substituting $x_{j}\leftarrow 2^{d}x_{i}$
14:     if $0\in N^{\prime}$ then
15:       continue to the next pair $\{i,j\}$
16:     end if
17:     Add $x_{j}-2^{d}x_{i}$ to $N$   $\triangleright$ In future iterations we disallow the equality $x_{j}=2^{d}x_{i}$
18:     Construct $E^{\prime}$ from $E$ by substituting $x_{j}\leftarrow 2^{d}x_{i}$ and excluding zero polynomials
19:     for each $s$ in SolveInPowers($E^{\prime},\ N^{\prime}$) do   $\triangleright$ $s$ is a map from $Y$ to linear polynomials in $Y$
20:       Redefine $s[y_{j}]:=s[y_{i}]+d$
21:       Add $s$ to set $S$
22:     end for
23:   end for
24:   return $S$
25: end function
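The recursion of Algorithm 2 can be sketched in plain Python as follows. This is an illustrative reading of the pseudocode rather than the author's implementation: polynomials are assumed to be dictionaries mapping a 0-based variable index to a nonzero integer coefficient, and each exponent $s[y_{v}]$ is encoded as a pair `(rep, offset)` meaning $y_{v}=y_{\text{rep}}+\text{offset}$, which suffices because every substitution has the form $x_{j}\leftarrow 2^{d}x_{i}$.

```python
from itertools import combinations

def nu2(k):
    """2-adic valuation of a nonzero integer."""
    return (k & -k).bit_length() - 1

def substitute(p, j, i, d):
    """Return p with x_j replaced by 2**d * x_i (zero coefficients dropped)."""
    q = dict(p)
    cj = q.pop(j, 0)
    if cj:
        c = q.get(i, 0) + cj * 2 ** d
        if c:
            q[i] = c
        else:
            q.pop(i, None)
    return q

def solve_in_powers(E, N, m):
    """Return a list of maps v -> (rep, offset), i.e. y_v = y_rep + offset."""
    if not E:
        return [{v: (v, 0) for v in range(m)}]   # every variable is free
    p = min(E, key=len)                # equation with the fewest terms
    I = sorted(p)
    if len(I) == 1:
        return []                      # c*x = 0 has no solution in powers of 2
    N = list(N)                        # local copy, extended as pairs are explored
    S = []
    for a, b in combinations(I, 2):
        i, j = (a, b) if nu2(p[a]) >= nu2(p[b]) else (b, a)
        d = nu2(p[i]) - nu2(p[j])      # d >= 0 by the choice of i and j
        N1 = [substitute(q, j, i, d) for q in N]
        if any(not q for q in N1):
            continue                   # some inequation became 0 != 0
        N.append({j: 1, i: -2 ** d})   # forbid x_j = 2**d * x_i from now on
        E1 = [q for q in (substitute(e, j, i, d) for e in E) if q]
        for s in solve_in_powers(E1, N1, m):
            rep, off = s[i]
            s[j] = (rep, off + d)      # y_j = y_i + d
            S.append(s)
    return S

# x0 + x1 - x2 = 0 forces x0 = x1 and x2 = 2*x0.
print(solve_in_powers([{0: 1, 1: 1, 2: -1}], [], 3))
# [{0: (0, 0), 1: (0, 0), 2: (0, 1)}]
```

Adding the inequation $x_{0}-x_{1}\neq 0$ to this example leaves no solutions, since the only branch that survives without it is the one with $x_{0}=x_{1}$.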

4 Minimal forbidden subgraphs

We used the results of the previous sections to find minimal forbidden subgraphs (MFSs) of small order, i.e., inadmissible graphs in which every proper subgraph is admissible. It is easy to see that each MFS must be connected. It is also an almost trivial task to verify that $C_{4}$ is the smallest MFS and the only one on $4$ vertices. Therefore, for $n>4$ we can restrict our attention to connected squarefree graphs as candidates, which we generate in SageMath [7] with the function nauty_geng() based on the nauty tool [6], which supports generating both connected (option -c) and squarefree (option -f) graphs. This significantly speeds up the algorithm and eliminates the need to test for the presence of the MFS $C_{4}$ as a subgraph.
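The squarefree condition itself is simple to test: a graph contains a $4$-cycle exactly when some two distinct vertices have at least two common neighbours. The following standalone sketch illustrates this check in plain Python; it is not how nauty generates such graphs, and the function name and edge-list format are assumptions.

```python
from itertools import combinations

def is_squarefree(n, edges):
    """True if the graph on vertices 0..n-1 has no 4-cycle as a subgraph,
    i.e. no two distinct vertices share two or more common neighbours."""
    adj = [set() for _ in range(n)]
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return all(len(adj[u] & adj[v]) <= 1 for u, v in combinations(range(n), 2))

c4 = [(0, 1), (1, 2), (2, 3), (3, 0)]
c5 = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
print(is_squarefree(4, c4), is_squarefree(5, c5))  # False True
```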

We look for MFSs other than $C_{4}$, iteratively increasing their order and accumulating the found MFSs in a set $S$ (initially empty). For each candidate graph $G$, we check if $G$ contains any graph from $S$ as a subgraph using the SageMath function is_subgraph(). If $G$ contains any of the graphs from $S$, we go to the next candidate graph $G$. Otherwise, we test admissibility of $G$ by calling $\textsc{GraphSolve}(G)$. If $G$ is inadmissible, then it represents an MFS and we add it to $S$. The described algorithm is outlined in Algorithm 3.

Algorithm 3. An algorithm for iteratively computing minimal forbidden subgraphs, other than $C_{4}$, of order up to $u$.

1: function FindMFS($u$)
2:   $S:=\emptyset$
3:   for $n=5,\dots,u$ do
4:     for each connected squarefree graph $G$ of order $n$ do
5:       if $G$ contains any graph $H$ from $S$ as a subgraph then
6:         continue to next $G$
7:       end if
8:       if GraphSolve($G$) is empty then
9:         Add $G$ to the set $S$
10:       end if
11:     end for
12:   end for
13:   return $S$
14: end function

We confirmed that the smallest MFS is the cycle $C_{4}$, as originally proved by Smith, and the next two have order $7$ (Fig. 1). It happens that one of these graphs was previously proved to be inadmissible by Bolan while showing that $g(10)=15$ [5].

Figure 1: Minimal forbidden subgraphs of order $7$.

There are no MFSs of order $8$ or $9$, but there are $15$ of them of order $10$ (Fig. 2), and there are $77$ MFSs of order $11$. We use the MFSs of order $\leq 10$ for quickly filtering out some inadmissible graphs.

5 Computing values of $g(n)$

We also used the proposed algorithms for computing values of $g(n)$ (sequence A352178 in the OEIS) for $n$ in $[12,18]$ using the known lower and upper bounds:

$n$	12	13	14	15	16	17	18
lower bound (A347301)	19	21	24	26	29	31	34
upper bound (A006855)	21	24	27	30	33	36	39

In all these cases, the value of $g(n)$ happens to coincide with the lower bound, and thus the problem can be posed as verifying that larger values (up to the upper bound) are not possible. For $n\leq 13$ this can be done directly by generating all connected squarefree graphs with more edges than the lower bound and testing them for admissibility. For example, there are only $957$ such graphs of order $13$ with $22$ or more edges.

We find the following statements helpful:

Theorem 3.

For any $n>2$:
• if an admissible graph of order $n$ with $e$ edges exists, then each of its vertices has degree at least $e-g(n-1)$;
• $g(n)\leq\left\lfloor\frac{n\cdot g(n-1)}{n-2}\right\rfloor.$

Proof.

Suppose that there exists an admissible graph $G$ of order $n$ with $e$ edges, and that it has a vertex $v$ of degree smaller than $e-g(n-1)$. Then removing $v$ from $G$ results in an admissible graph of order $n-1$ with more than $g(n-1)$ edges. The contradiction proves that the degree of any vertex of $G$ is at least $e-g(n-1)$. This further implies that $G$ has at least $\tfrac{n(e-g(n-1))}{2}$ edges, that is, $e\geq\tfrac{n(e-g(n-1))}{2}$, implying that $e\leq\big\lfloor\frac{n\cdot g(n-1)}{n-2}\big\rfloor$. Taking an admissible graph of order $n$ with $e=g(n)$ edges proves that $g(n)\leq\big\lfloor\tfrac{n\cdot g(n-1)}{n-2}\big\rfloor$. ∎

From $g(13)=21$ , Theorem 3 implies that $g(14)\leq 24$ , which matches the lower bound. Hence, we obtain $g(14)=24$ without any computation.

For $n=15$, Theorem 3 implies that $g(15)\leq 27$. If an admissible graph with $27$ edges exists, its minimum degree should be at least $3$. This can be enforced with the option -d3 of nauty_geng(), which generates $8280$ such candidate graphs, but our check shows that none of them is admissible. Hence, $g(15)=26$.

For $n=16$ , Theorem 3 implies that $g(16)\leq 29$ and thus $g(16)=29$ .

For $n=17$, Theorem 3 implies that $g(17)\leq 32$. If an admissible graph with $32$ edges exists, its minimum degree should be at least $3$. There are $1023100$ such candidate graphs, which is possible to check directly, although it would be quite time consuming. Instead, we approached this problem from another side, by constructing candidate graphs from the maximal admissible graphs of order $16$ as explained below. This way we established that there is no admissible graph of order $17$ with $32$ edges, thus proving that $g(17)=31$.

For $n=18$ , Theorem 3 implies that $g(18)\leq 34$ and thus $g(18)=34$ .
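The arithmetic behind these bounds can be reproduced with a few lines of Python. This is a small sketch with hypothetical function names; the values of $g(n-1)$ passed in are the ones established in the text.

```python
def theorem3_bound(n, g_prev):
    """Upper bound floor(n * g(n-1) / (n-2)) on g(n) from Theorem 3."""
    return n * g_prev // (n - 2)

def min_degree(e, g_prev):
    """Lower bound e - g(n-1) on vertex degrees in an admissible graph
    of order n with e edges (first part of Theorem 3)."""
    return max(0, e - g_prev)

g = {13: 21, 14: 24, 15: 26, 16: 29, 17: 31}  # values stated in the text
for n in range(14, 19):
    print(n, theorem3_bound(n, g[n - 1]))
# 14 24
# 15 27
# 16 29
# 17 32
# 18 34
```

For $n=15$ and $n=17$ the bound exceeds the known lower bound by one, and `min_degree(27, 24)` and `min_degree(32, 29)` both give $3$, which is the minimum degree used to restrict the candidate graphs.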

6 Maximal admissible graphs

The maximal admissible graphs of each order $n\leq 14$ can be obtained directly from the candidate graphs generated by nauty_geng(). In particular, for $n=14$ we can restrict our attention to the $2184$ connected squarefree graphs with minimum degree $3$ , among which we identified only $4$ admissible graphs (Fig. 3).

To construct maximal admissible graphs of order $15$, we noticed that either they contain a vertex of degree $2$ whose removal results in a maximal admissible graph of order $14$, or their minimum degree is at least $3$. We obtained $20$ maximal admissible graphs of the first type (by adding a vertex of degree $2$ to the maximal admissible graphs of order $14$ in all possible ways, and testing admissibility of the resulting graphs), and $8$ maximal admissible graphs of the second type by testing $33608$ candidate graphs generated by nauty_geng().

Similarly, we further extended maximal admissible graphs of order $15$ to those of order $16$ by adding a vertex of degree $3$ , which resulted in just two maximal admissible graphs of order $16$ (Fig. 4). However, extending them to graphs of order $17$ with $32$ edges by adding a vertex of degree $3$ produced no admissible graphs, thus proving that $g(17)=31$ .

Acknowledgements

The author thanks Neil Sloane for his nice introduction to the problem [1] and proofreading of the earlier version of this paper.

References

[1] Brady Haran and N. J. A. Sloane. Problems with Powers of Two. Numberphile Youtube Channel, September 2022. https://youtu.be/IPoh5C9CcI8.
[2] C. R. J. Clapham, A. Flockhart, and J. Sheehan. Graphs without four-cycles. Journal of Graph Theory, 13(1):29–47, 1989.
[3] Dan Ullman and Stan Wagon. Problem 1321: Powers of Two. Macalester College Problem of the Week, April 2021. Available electronically at https://oeis.org/A347301/a347301_1.pdf.
[4] Firas Melaih. On The OEIS Sequence A352178. Memo, September 2022. Available electronically at https://oeis.org/A352178/a352178_3.pdf.
[5] Matthew Bolan. Stan Wagon 1321 Solution. Memo, September 2022. Available electronically at https://oeis.org/A352178/a352178.pdf.
[6] Brendan D. McKay and Adolfo Piperno. Practical graph isomorphism, II. Journal of Symbolic Computation, 60:94–112, 2014.
[7] SageMath. version 9.7, 2022. https://www.sagemath.org/.
[8] The OEIS Foundation. The On-Line Encyclopedia of Integer Sequences. http://oeis.org, 2023.
[9] C. Van Nuffelen. On the incidence matrix of a graph. IEEE Transactions on Circuits and Systems, 23(9):572–572, 1976.
