This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Discrete Richman-bidding Scoring Games

Urban Larsson National University of Singapore Neel Patel National University of Singapore  and  Ravi Kant Rai Indian Institute of Technology Bombay
Abstract.

We study zero-sum (combinatorial) games, within the framework of so-called Richman auctions (Lazarus et al. 1996) namely, we modify the alternating play scoring ruleset Cumulative Subtraction (CS) (Cohensius et al. 2019), to a discrete bidding scheme (similar to Develin and Payne 2010). Players bid to move and the player with the highest bid wins the move, and hands over the winning bidding amount to the other player. The new game is dubbed Bidding Cumulative Subtraction (BCS). In so-called unitary games, players remove exactly one item out of a single heap of identical items, until the heap is empty, and their actions contribute to a common score, which increases or decreases by one unit depending on whether the maximizing player won the turn or not. We show that there is a unique bidding equilibrium for a much larger class of games, that generalize standard scoring play in the literature. We prove that for all sufficiently large heap sizes, the equilibrium outcomes of unitary BCS are eventually periodic, with period 2, and we show that the periodicity appears at the latest for heaps of sizes quadratic in the total budget.

1. Introduction

Suppose that you are involved in a 2-player game, where the goal is to capture more objects than your opponent, but at each stage of play, you may only remove one item. You may remove an object if and only if you win a specially designed auction. Say that there are two objects in the heap (and therefore to win the game you need to collect both, and if you collect one of them the game is drawn), and assume that you and your opponent share $5; you have $1 and he has $4. In addition to your single dollar, you have a tie-breaking marker, which works in your favor. If a player wins the bid strictly, by bidding say dd dollars, then they get an item but must pay the bidding amount to the opponent. If the bid ends up equal, then the player with the tie breaking marker wins the round, gets an item, but has to hand over $dd and the marker to the other player. Clearly, you cannot win both bids with a single dollar. The question is whether you will be able to tie this game, and win one of the bids, or if the opponent can win both, and the answer is easy. Since the opponent has to bid $2 to win the bid, then you will win the next round. (If you had only the marker but no dollar you would lose both bids, and if you had only a dollar but no marker you would lose both bids.) From this gentle introduction, we already have an intuition that the marker has a non-trivial impact on such games, and it could be worth sometimes a dollar. The tie-breaking is necessary to play the game, and to understand such games, it seems an unavoidable issue to resolve the worth of this ‘marker’ in every game situation. We will formalize this ‘every’ setting and present some further major issues to solve. But first, we provide some more background.

Richman auctions (Lazarus et al., 1996) are designed for any standard combinatorial 2-player game (Berlekamp et al., 2004), to resolve who is to play next. Instead of alternating play, for each stage of game, the 2 players, called Left and Right,111These names are adapted from standard literature on combinatorial games, Left is ‘she’ and Right is ‘he’. resolve this crucial moment by a type of auction where the winning player must pay the losing player their bid amount.

Richman bidding can be adapted to any standard combinatorial game, and thus offers a way to extend classical Combinatorial Game Theory (CGT) (Siegel, 2013; Berlekamp et al., 2004; Conway, 1976) to a more economic style of gameplay. Moreover, Richman auctions are a perfect fit for combinatorial games, as they offer an unlimited number of bidding rounds. This is useful, since many popular board games can be played on arbitrarily large game boards, and various kinds of Nim-type removal games (a.k.a. heap games, take-away games, etc) (Berlekamp et al., 2004) can contain arbitrarily many objects in a starting position.

In this paper, we study discrete bidding in a manner similar to (Develin and Payne, 2010). We adapt the setting to a class of games known as Cumulative Subtraction (CS) (Cohensius et al., 2019). CS is played on a finite heap of pebbles, and players take turns removing pebbles from the heap, but are only allowed to remove a number of pebbles from a fixed set of values, called the subtraction set SS\subset\mathbb{N}. For example, if S={3,5,9}S=\{3,5,9\}, then players are allowed to remove either 33, 55 or 99 pebbles at each turn, as long as the heap size stays non-negative. In the standard zero-sum variation, the final score is the difference between the number of pebbles accumulated by each player by the end of the game. Player Left is the maximizer, whereas player Right is the minimizer; the final score is the total number of pebbles collected by Left during play minus Right’s total number. In order not to obscure the main ideas of our bidding setting, we will simplify the ‘subtraction-part’ of the games, and focus on the simplest possible subtraction set, namely when S={1}S=\{1\} (although several results hold also for more general subtraction sets).

Let ={1,2,}\mathbb{N}=\{1,2,\ldots\}, and let 0={0}\mathbb{N}_{0}=\mathbb{N}\cup\{0\}. Discrete bidding here means that there is a given total budget of TB0\mathrm{TB}\in\mathbb{N}_{0}. One of the players has a budget p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}, and the other player has the budget q=TBpq=\mathrm{TB}-p. At each turn, players submit a (closed) bid. The player with the higher bid gets to play, and pays their bid to the player with the lower bid. Thus, if player Left wins by bidding \ell, she pays this bid to Right, and makes her desired move. The new budget partition becomes (p,q+)(p-\ell,q+\ell). Ties are resolved using a tie-breaking marker: one of the players has the marker, and this player wins the turn in case of equal bids. They makes their desired move, pays the other player the bidding amount, and passes them the marker.

By adapting the bidding mechanism to CS, we get a new game, called Bidding Cumulative Subtraction (BCS).

The discrete setting gives new challenges and problems to Richman games. For example, the marker, which is a necessary and convenient tool to resolve a tie, can sometimes be worth a dollar (but not more).222In the continuous setting, any positive bidding amount is worth more than the marker. Perhaps this seems like an unfortunate side effect of the discrete setting, but it turns out that it is quite interesting to resolve the accompanying asymmetry of the game. In many situations, the player with the marker has a slight advantage, and it appears non-trivial to find out exactly what that means, even in the case of the simplest subtraction set, S={1}S=\{1\}, which will also be called the unitary setting.

1.1. Related Work

In the classical Richman setting (Lazarus et al., 1996) (Lazarus et al., 1999), the auction is continuous, say with total budget $1, and the players split that budget to, say pp and qq, with p+q=1p+q=1. Optimal bids have been resolved for the game tic-tac-toe (Develin and Payne, 2010), but very little is known for the game of chess (Larsson and Wästlund, 2019).

The main theorem of Richman games considers one additional setting, where the turn function is a Bernoulli trial: for each stage of play, an rr-biased coin is tossed to decide whose turn it is. A player wins a given coin tossing game if and only if the player with budget $(1r)\$(1-r) wins the corresponding Richman game. A rule to resolve ties is required, but since ties occur with probability 0, in the continuous setting, the main result holds for any standard tie breaking rule, with open or sealed (simultaneous) bids. In games with zugzwangs (games where no player wants to move first), it is customary to let the winner of a bid decide who is to play next. In games with a nonnegative incentive (such as the games we will study), it suffices to have the players bidding for the opportunity to take the next turn.

Develin and Payne (Develin and Payne, 2010) study win-loss games, with discrete Richman bidding, and they use a modified tie resolution method: the player with the marker may either keep the marker and let the other player pay the bid and move in the game, or give away the marker together with the bid and make their desired move in the game. This is not just a subtle difference in approach; namely, we emphasize that our simpler approach has the benefit of generalizing alternating play. Namely, in our setting, alternating play corresponds to TB=0\mathrm{TB}=0 (with both players having a respective individual budget of 0 as a result). Another obvious difference is that we study games with numeric results, in contrast to the traditional CGT win-loss situation. This leads to an interesting question about uniqueness of bidding equilibrium.

Combinatorial games are often viewed in the light of so-called normal play, where it is always good to be able to move (an idea, which recently leads to an ‘absolute combinatorial game theory’ (Larsson et al., 2016a, 2018a)). Here, we may have a situation where the player who wins the bid is the normal play loser. Thus, for our most general setting, we are motivated to include a certain penalty function, τ\tau, with input Left and Right terminal positions. If a player wins a bid that has no move, there is some consequence of this final auction; indeed in Guaranteed Scoring games (Larsson et al., 2018b, 2016b) a ‘terminal penalty’ is invoked for a player who cannot move (and this penalty corresponds to the maximum score that the other player can still achieve by playing out all remaining move options). We include this short discussion here, since we want to emphasize that the proposed move convention in this work generalizes game settings in the literature.

All literature on combinatorial games assumes a unique equilibrium, which defines a game value under optimal play; see for example (Milnor, 1953; Hanner et al., 1959; Ettinger, 1996; Larsson et al., 2018b, 2016b; Johnson, 2014; Siegel, 2013).

1.2. Our Contribution

In this paper, we first analyze a general setting of discrete Richman bidding adapted for any standard combinatorial scoring game. We show that if the ruleset satisfies some basic and intuitive monotonicity properties, then each such game exhibits a unique equilibrium. Then in the rest of the paper, we focus on a simple class of Bidding Cumulative Games (defined in Section 3) named unitary games. Our theoretical contribution can be summarized in three main results.

In Section 3.1, we show that every unitary game has a unique equilibrium described recursively by maximin functions of the game positions (Theorem LABEL:thm:unitary). We prove this result by showing that the ruleset of unitary games satisfies sufficient properties for uniqueness. Then we prove further monotonicity properties of unitary games, which helps us to prove a convergence result. In Section 3.3, we prove that the equilibrium outcomes of unitary games converge for heap sizes of the same parity, i.e. the sequence is eventually periodic with period 2 (Theorem LABEL:thm:const). Moreover, aided by a certain ‘bidding automaton’, in Section 3.4, we show that the convergence to this periodicity is quadratic in the size of the total budget (Theorem LABEL:thm:convbou). In addition, we provide a conjecture on the corresponding closed formula expression for the equilibrium outcomes in the limit (Conjecture 28).

2. Equilibrium properties

Let us first elaborate on the fundamental properties of discrete Richman bidding, for any combinatorial scoring (zero-sum) game. For a given total budget TB0\mathrm{TB}\in\mathbb{N}_{0}, let TB=p+q\mathrm{TB}=p+q, where pp is Left’s part of the budget, and unless otherwise stated, Left has the tie-breaking marker. Moreover, let {0,TB}\varnothing\neq\mathcal{B}\subseteq\{0,\ldots\mathrm{TB}\}, be the set of possible Richman bids.333We let the bidding set be symmetric to simplify notation. A player may not be able to bid on their turn; for example, if 00\not\in\mathcal{B} and the player’s budget has been exhausted. In this case, the player who is able to place a bid moves and transfers a valid bid amount bb\in\mathcal{B} to the other player. In case there is a position such that no player can bid, the ruleset is invalid. We will assume valid rulesets, i.e. some player is able to place a bid at every turn.

In this section, we consider acyclic combinatorial rulesets of the form Γ=(X,,,)\Gamma=(X,\mathcal{L},\mathcal{R},\mathcal{B}), where XX is a set of positions (nodes), and ,:X2X\mathcal{L},\mathcal{R}:X\rightarrow 2^{X} are the move functions for players Left and Right respectively, with discrete Richman \mathcal{B}-bidding, and with given weight functions wL,wR:X×Xw_{L},w_{R}:X\times X\rightarrow\mathbb{R} on the set of move edges, for each player. Here XX denotes a possibly infinite set of ‘starting’ positions, in a usual sense of CGT: any game state may be considered as a starting position, but each play sequence is finite. A Left move is of the form xXy(x)x\in X\rightarrow y\in\mathcal{L}(x), and similarly for Right. The final score (or utility) of a terminating play sequence σ\sigma is

u(σ)=τ(t)+wL(eL)wR(eR),u(\sigma)=\tau(t)+\sum w_{L}(e_{L})-w_{R}(e_{R}),

where Left played the moves eLe_{L} and Right played the moves eRe_{R}. The weighted move edges are chosen at each stage of play by the winner of the Richman bidding. The terminal penalty scores are given by a function τ:TLTR\tau:T_{L}\cup T_{R}\rightarrow\mathcal{R}, where TL,TRXT_{L},T_{R}\subset X is the set of terminal positions for Left and Right respectively. The game ends when the player who wins the Richman bidding cannot move.

Throughout the paper, given a total budget TB\mathrm{TB}, for a given position xXx\in X, a Richman-position is denoted by (x,p^)(x,\widehat{p}), where Left has pp dollars and the tie-break marker, or by (x,p)(x,p), where Left has pp dollars but Right has the marker. A note on terminology: we will abbreviate ‘Richman-position’ to position, since the generalized CGT ‘move-flag’, the current budget partition (sometimes denoted simply by p^\widehat{p} or pp), is mandatory information to play.444This is consistent with standard CGT terminology, which often omits the notion of who is to move, since the theory requires analysis of all players as starting players.

A player may have move options, but not being able to access them even if the other player does not have any move options. For example, if Left has no remaining budget, and Right has the marker, then if Left is the only player with move options, the game will end, because Right will win that bid. Combinatorial games are often viewed in the light of so-called normal play, where it is always good to be able to move. Here, we may have a situation where the player who wins the bid is the normal play loser.

This is the motivation for the penalty function, τ\tau; if a player wins a bid that has no move, there is some consequence of this final auction.

For symmetric games, where =\mathcal{L}=\mathcal{R} and wL=wRw_{L}=w_{R}, this discussion is obsolete, and we may set τ(t)=0\tau(t)=0, for all terminal positions tt. In the symmetric case, there is no bidding when one of the players runs out of options, because then both players run out of options, and there is no possibility that the game may continue.

Our games generalize standard combinatorial scoring games; namely, by taking TB=0\mathrm{TB}=0, we obtain the standard alternating play mechanics. As mentioned, combinatorial games have unique equilibria, which defines game values under optimal play, and depending on who starts the game. We will demonstrate that we do have uniqueness of equilibrium in our generalization, under a natural set of axioms. In those cases, we may refer to unique game values.

Let us first define the relevant maximin functions. The maximin function induces a bid-action pair that offers the greatest value of a set of minima: Left declares bids and notes Right’s bid-action responses in each case. Then Left chooses the bid that maximizes the value, and given that Left can choose action if she wins the bid. Minimax is the reverse situation.

Definition 1 (Maximin Functions).
\thlabel

def:maximin Consider a a total budget TB0\mathrm{TB}\in\mathbb{N}_{0} and a ruleset (X,,,)(X,\mathcal{L},\mathcal{R},\mathcal{B}). The maximin functions ν^,ν:XTB+1\widehat{\nu},\nu:X\rightarrow\mathbb{R}^{\mathrm{TB}+1} are defined on positions (x,p^)(x,\widehat{p}) and (x,p)(x,p), respectively.

For all p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}, for terminal positions tL,tRXt_{L},t_{R}\in X, ν^p(tL)=τ(tL)\widehat{\nu}_{p}(t_{L})=\tau(t_{L}), νp(tR)=τ(tR)\nu_{p}(t_{R})=\tau(t_{R}), and recursively, for non-terminal positions xXx\in X,

ν^p(x)=max,y(x)minr,z(x){ν^p(y)|>r+wL(x,y),νp(y)|=r+wL(x,y),ν^p+r(z)|<r+wR(x,z)}\widehat{\nu}_{p}(x)=\max_{\ell,y\in\mathcal{L}(x)}\;\min_{r,z\in\mathcal{R}(x)}\{\widehat{\nu}_{p-\ell}(y)|_{\ell>r}+w_{L}(x,y),\nu_{p-\ell}(y)|_{\ell=r}+w_{L}(x,y),\widehat{\nu}_{p+r}(z)|_{\ell<r}+w_{R}(x,z)\}

and

νp(x)=max,y(x)minr,z(x){νp(y)|>r+wL(x,y),ν^p+r(z)|=r+wR(x,z),νp+r(z)|<r+wR(x,z)},\nu_{p}(x)=\max_{\ell,y\in\mathcal{L}(x)}\;\min_{r,z\in\mathcal{\mathcal{R}}(x)}\{\nu_{p-\ell}(y)|_{\ell>r}+w_{L}(x,y),\widehat{\nu}_{p+r}(z)|_{\ell=r}+w_{R}(x,z),\nu_{p+r}(z)|_{\ell<r}+w_{R}(x,z)\},

where {0,,p}\ell\in\mathcal{B}\cap\{0,\ldots,p\} and r{0,,q}r\in\mathcal{B}\cap\{0,\ldots,q\}.

The minimax functions are defined analogously.

The classical ‘CGT outcome’ in alternating play from position xx corresponds to TB=0\mathrm{TB}=0, and it is the ordered pair o(x)=(ν^0(x),ν0(x))o(x)=(\widehat{\nu}_{0}(x),\nu_{0}(x)), the optimal play score when Left and Right starts, respectively. Note that, since the bidding is trivial, for this case, there is a unique equilibrium. In general, if, for all positions xx, each entry has a unique equilibrium, the generalized CGT-outcome, for TB\mathrm{TB}\in\mathbb{N}, is the vector o(x)=(ν^TB(x),,ν^0(x),ν0(x),,νTB(x))o(x)=(\widehat{\nu}_{\mathrm{TB}}(x),\ldots,\widehat{\nu}_{0}(x),\nu_{0}(x),\ldots,\nu_{\mathrm{TB}}(x)). Whenever applicable, we call this vector o(x)o(x) the (optimal play) outcome of xx. For symmetric games, i.e. when the move sets =\mathcal{L}=\mathcal{R} are the same, it suffices to store half this vector, and so we drop the marker notation, and write o(x)=(ν^TB(x),,ν^0(x))o(x)=(\widehat{\nu}_{\mathrm{TB}}(x),\ldots,\widehat{\nu}_{0}(x)) (i.e. symmetric outcomes assume that Left has the marker).

In Definition LABEL:def:U, we provide intuitive properties of the ruleset Γ\Gamma and we prove that the following properties suffice to establish uniqueness of equilibria in Theorem LABEL:thm:unique.

Definition 2 (Uniqueness Properties).
\thlabel

def:U A ruleset Γ𝒰\Gamma\in\mathcal{U} if the pair of maximin functions (ν^(\widehat{\nu}, ν)\nu) satisfies, for each position xx, for all p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}:

  • (A)

    ‘Budget Monotonicity.’ ν^p(x)ν^π(x)\widehat{\nu}_{p}(x)\geq\widehat{\nu}_{\pi}(x) and νp(x)νπ(x)\nu_{p}(x)\geq\nu_{\pi}(x), if p>πp>\pi.

  • (B)

    ‘Marker Monotonicity.’ ν^p(x)νp(x)\widehat{\nu}_{p}(x)\geq\nu_{p}(x).

  • (C)

    ‘Marker Worth.’ ν^p(x)νp+1(x)\widehat{\nu}_{p}(x)\leq\nu_{p+1}(x).

If a ruleset Γ𝒰\Gamma\in\mathcal{U}, then, by (A) the players weakly prefer to win by smaller bids, by (B) Right weakly prefers to tie a Left winning bid, and by (C), given a Right bid, Left weakly prefers to tie before winning strictly (the marker is worth at most a dollar). These properties are fairly straight forward and intuitive in our setting. We now show in Theorem LABEL:thm:unique that if ruleset Γ\Gamma satisfies such properties, then it exhibits unique equilibria.

Theorem 3 (Uniqueness).
\thlabel

thm:unique If a ruleset Γ𝒰\Gamma\in\mathcal{U}, then for all positions and for all budget partitions, the game has a unique equilibrium.

Proof.

We assume with no loss of generality that Left has the marker: a symmetric argument holds when Right has the marker.

Suppose that Left bids ‘0’. Then, if Right benefits by deviating, by (A) and the zero-sum property, we may assume that he bids ‘1’ (perhaps he bids more if the same value). Suppose next that Left bids 1. Then similarly Right might benefit by bidding 2, and so on.

Item (C) implies that Left does not benefit by winning strictly at bid \ell, if she can win by a tie at bid 1\ell-1. Item (B) implies that Left does not prefer a Right tie at a given Left bid. By combining these two statements, Left weakly prefers a tie bid at 0\ell\geq 0, before any larger tie.

Altogether, this deviation scheme must terminate at some tie bid p\ell\leq p or possibly Right wins at +1q\ell+1\leq q, and in either case the unique equilibrium is given by Definition LABEL:def:maximin. ∎

For the special case of symmetric games, we simplify the notations, and denote ==\mathcal{L}=\mathcal{R}=\mathcal{M}. In Observation LABEL:obs:maximinsym, we define maximin function for symmetric games which is a special case of the maximin function defined in the Definition LABEL:def:maximin.

Proposition 4 (Symmetric Maximin).
\thlabel

obs:maximinsym Consider a symetric ruleset. For all terminal positions tXt\in X, for all pp, ν^p(t)=νp(t)=0\widehat{\nu}_{p}(t)=\nu_{p}(t)=0. For non-terminal xx,

ν^p(x)=max0p,y(x)min0rq,y(x){ν^p(y)|>r+w(x,y),νp(y)|=r+w(x,y),ν^p+r(y)|<rw(x,y)},\widehat{\nu}_{p}(x)=\max_{0\leq\ell\leq p,y\in\mathcal{M}(x)}\;\min_{0\leq r\leq q,y\in\mathcal{M}(x)}\{\widehat{\nu}_{p-\ell}(y)|_{\ell>r}+w(x,y),\nu_{p-\ell}(y)|_{\ell=r}+w(x,y),\widehat{\nu}_{p+r}(y)|_{\ell<r}-w(x,y)\},

where, for all xx, pp,

(1) νp(x)=ν^q(x).\displaystyle\nu_{p}(x)=-\widehat{\nu}_{q}(x).
Proof.

We express the equality (1) as

νp(x)\displaystyle\nu_{p}(x) =ν^q(x)\displaystyle=-\widehat{\nu}_{q}(x)
=max0qmin0rp{ν^q(y)|>r+w,νq(y)|=r+w,ν^q+r(y)|<rw}\displaystyle=-\max_{0\leq\ell\leq q}\;\min_{0\leq r\leq p}\{\widehat{\nu}_{q-\ell}(y)|_{\ell>r}+w,\nu_{q-\ell}(y)|_{\ell=r}+w,\widehat{\nu}_{q+r}(y)|_{\ell<r}-w\}
=max0qmin0rp{νp+(y)|>r+w,ν^p+(y)|=r+w,νpr(y)|<rw}\displaystyle=-\max_{0\leq\ell\leq q}\;\min_{0\leq r\leq p}\{-\nu_{p+\ell}(y)|_{\ell>r}+w,-\widehat{\nu}_{p+\ell}(y)|_{\ell=r}+w,-\nu_{p-r}(y)|_{\ell<r}-w\}
=max0qmin0rp{νp+r(y)|r>+w,ν^p+r(y)|=r+w,νp(y)|r<w}\displaystyle=\max_{0\leq\ell\leq q}\;\min_{0\leq r\leq p}\{\nu_{p+r}(y)|_{r>\ell}+w,\widehat{\nu}_{p+r}(y)|_{\ell=r}+w,\nu_{p-\ell}(y)|_{r<\ell}-w\}

where, at each instance, y(x)y\in\mathcal{M}(x), and where the middle equality is by (1). Since the maximin is now applied on the negative maximin values, we must swap the players inside the brackets, and hence the last equality follows when we remove the signs. This coincides with \threfdef:maximin, whenever ==\mathcal{L}=\mathcal{R}=\mathcal{M}. ∎

And even simpler, by the restriction to symmetric games, and in view of (1), we may leave out the function ν\nu.

Observation 5 (Simplified Maximin).
\thlabel

obs:maximinsim The tuple of maximin functions ν^:X×{0,1}TB+1\widehat{\nu}:X\times\{0,1\}\rightarrow\mathbb{R}^{\mathrm{TB}+1} is, for all p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}, if tXt\in X is terminal, ν^p(t)=0\widehat{\nu}_{p}(t)=0, and, for non-terminal xx, given ,r\ell,r\in\mathcal{B},

ν^p(x)=max0p,y(x)min0rq,y(x){ν^p(y)|>r+w(x,y),ν^q+(x)|=r+w(x,y),ν^p+r(y)|<rw(x,y)}.\widehat{\nu}_{p}(x)=\max_{0\leq\ell\leq p,y\in\mathcal{M}(x)}\;\min_{0\leq r\leq q,y\in\mathcal{M}(x)}\{\widehat{\nu}_{p-\ell}(y)|_{\ell>r}+w(x,y),-\widehat{\nu}_{q+\ell}(x)|_{\ell=r}+w(x,y),\widehat{\nu}_{p+r}(y)|_{\ell<r}-w(x,y)\}.

If a ruleset satisfies the uniqueness properties, by the proof of \threfthm:unique, we may simplify the maximin function a bit further. For example, if the game is symmetric, we get the following convenient simplification of Observation LABEL:obs:maximinsim. In Corollary LABEL:cor:U, we show a simpler form of maximin function for symmetric games whenever ruleset Γ𝒰\Gamma\in\mathcal{U}.

Corollary 6.
\thlabel

cor:U If the ruleset Γ𝒰\Gamma\in\mathcal{U} is symmetric, then for all non-terminal game positions and for all budget partitions, the unique equilibrium value is given by: for all p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}, if tXt\in X is terminal, ν^p(t)=0\widehat{\nu}_{p}(t)=0, and, for non-terminal xx, given ,r\ell,r\in\mathcal{B},

(2) ν^p(x)=max0p,y(x)min0rq,z(x){ν^q+(y)|=r+w(x,y),ν^p+r(z)|<rqw(x,z)}.\displaystyle\widehat{\nu}_{p}(x)=\max_{0\leq\ell\leq p,y\in\mathcal{M}(x)}\;\min_{0\leq r\leq q,z\in\mathcal{M}(x)}\{-\widehat{\nu}_{q+\ell}(y)|_{\ell=r}+w(x,y),\widehat{\nu}_{p+r}(z)|_{\ell<r\leq q}-w(x,z)\}.
Proof.

This follows by combining Observation LABEL:obs:maximinsim with \threfthm:unique, since given any Right bid, by (C), Left prefers a tie, before winning strictly. ∎

This simplifies the analysis of equilibrium, because if a ruleset satisfies 𝒰\mathcal{U} then the cases where Left wins strictly need not be considered because Right always prefers to tie a Left winning bid. In general, we have the following conjecture.

Conjecture 7.

Consider a ruleset Γ\Gamma. If all weights are nonnegative for Left and all weights nonpositive for Right, and 00\in\mathcal{B}, then ΓU\Gamma\in U.

Moreover, in the following examples, we show that if a ruleset has negative Left-weights and/or positive Right-weights, then there exist zugzwang games for which property 𝒰\mathcal{U} does not hold. However, when 00\in\mathcal{B}, we have not found any game that violates property 𝒰(A)\mathcal{U}(A) or 𝒰(C)\mathcal{U}(C).

Example 2.1.
\thlabel

ex:B Consider a ruleset Γ\Gamma, where X={x1,x2}X=\{x_{1},x_{2}\}, (x1)={x2}\mathcal{R}(x_{1})=\{x_{2}\}, (x1)=(x1)=(x2)=\mathcal{R}(x_{1})=\mathcal{L}(x_{1})=\mathcal{L}(x_{2})=\varnothing and 00\in\mathcal{B}. Suppose that wR(x1,x2)=1w_{R}(x_{1},x_{2})=1, and τ(x2)=0\tau(x_{2})=0, i.e. there is no penalty for not being able to move from the terminal position x2x_{2}. If the current score c(x1)=0c(x_{1})=0 and Right has the marker but no budget, then if Left bids 0 at position x1x_{1}, Right must move to the terminal position x2x_{2}, and the game ends at the final score c(x2)=1c(x_{2})=1, which is good for Left. If Right does not have the marker (independently of the budget partition), he is better off, because he will bid 0, and the game will end at c(x1)=0c(x_{1})=0 (if there were a penalty at this terminal that would have been a Left penalty, but Right would not have been able to take benefit). Thus, 0=ν^1(x1)<ν1(x1)=10=\widehat{\nu}_{1}(x_{1})<\nu_{1}(x_{1})=1, and property 𝒰\mathcal{U}(B) is not satisfied.

Example 2.2.

Suppose that 00\not\in\mathcal{B}, and otherwise with Γ\Gamma as in \threfex:B. Analogously, a player does not want to have any budget, and thus 𝒰\mathcal{U}(A) may not hold. Similarly, one can see that the marker can be worth more than a dollar; if TB=1\mathrm{TB}=1 and you have the marker but not the dollar, then if you exchange the marker for the dollar, you are worse off. Hence 𝒰\mathcal{U}(C) may not be satisfied if 00\not\in\mathcal{B}.

3. Bidding Cumulative Subtraction

In this section, games are symmetric, i.e. the move options are the same for both players; see (Cohensius et al., 2019) for the motivation on similar alternating play games. We now define the ruleset Bidding Cumulative Subtraction.

Definition 8 (Bidding Cumulative Subtraction, BCS).

There is a subtraction set 𝒮\mathcal{S}\subset\mathbb{N}, a total budget TB0\mathrm{TB}\in\mathbb{N}_{0}, a bidding set {0,TB}\mathcal{B}\subset\{0,\ldots\mathrm{TB}\}, and a heap of finitely many, x0x\in\mathbb{N}_{0}, objects (pebbles). There are two players, Left and Right, who take turns removing objects from the heap. The total budget TB0\mathrm{TB}\in\mathbb{N}_{0} (a game constant) is partitioned between the players, as (p,q)(p,q), with p+q=TBp+q=\mathrm{TB}. Exactly one of the players has a tie-break marker m{0,1}m\in\{0,1\}, where m=1m=1 if Left has the marker. A complete game configuration is of the form (𝒮,;x,p,m,c)(\mathcal{S},\mathcal{B};x,p,m,c), where cc\in\mathbb{Z} is the current score. If Left has the marker and c=0c=0, we abbreviate a position by (x,p^)(x,\widehat{p}), and otherwise, when Right has the marker, we write (x,p)(x,p). At each stage of play, the players (make closed) bid of who is to take an action, Left bids \ell and Right bids rr. The player with the highest bid wins the move. If the bids are equal, the player with the marker wins the move. The winning bidder transfers the bidding amount (together with the marker in case of a tie) to the other player. If Left has the marker, a typical bid is (^,r)(\widehat{\ell},r). A player who wins the bid acts by collecting s𝒮s\in\mathcal{S} objects, which adds ss to a current score cc, if Left wins the bid, and otherwise, it subtracts ss. The game ends when the number of objects in the heap is smaller than minS\min S. Left seeks to maximize the final score (utility) whereas Right seeks to minimize it. A removal of sSs\in S, in case Left has the marker, is of one of the forms:

  • (TB;x,p,1;c)(TB;xs,p,1;c+s)(\mathrm{TB};x,p,1;c)\rightarrow(\mathrm{TB};x-s,p-\ell,1;c+s), sSs\in S, if Left bids p\ell\leq p and wins a non-tie.

  • (TB;x,p,1;c)(TB;xs,p,0;c+s)(\mathrm{TB};x,p,1;c)\rightarrow(\mathrm{TB};x-s,p-\ell,0;c+s), sSs\in S, if Left bids p\ell\leq p and wins a tie.

  • (TB;x,p,1;c)(TB;xs,p+r,1;cs)(\mathrm{TB};x,p,1;c)\rightarrow(\mathrm{TB};x-s,p+r,1;c-s), sSs\in S, if Right wins by bidding rqr\leq q.

In view of Section 2, we have the ruleset Γ=(0,)\Gamma=(\mathbb{N}_{0},\mathcal{M}), where , for all xminSx\geq\min S, (x)={xs0sS}\mathcal{M}(x)=\{x-s\geq 0\mid s\in S\}. In rest of this section, we focus on unitary games, which is a BCS with subtraction set S={1}S=\{1\}. We illustrate bidding in such games in Section 5.

3.1. Unitary Games

If 𝒮={1}\mathcal{S}=\{1\} and ={0,,TB}\mathcal{B}=\{0,\ldots,\mathrm{TB}\}, we call BCS unitary. In this section, we prove that for each heap size and each budget partition, there is a unique equilibrium outcome, given by the maximin function (\threfthm:unique). That is, by using the notation in Section 2, we prove that the symmetric ruleset Γ=(0,)𝒰\Gamma=(\mathbb{N}_{0},\mathcal{M})\in\mathcal{U}, if for all x,(x)={x1}x\in\mathbb{N},\mathcal{M}(x)=\{x-1\}. In the spirit of Definition 8, for unitary games, we write (TB;x,p,m;c):=({1},{0,,TB};x,p,m;c)(\mathrm{TB};x,p,m;c):=(\{1\},\{0,\ldots,\mathrm{TB}\};x,p,m;c). Unitary games simplify quite a lot and allows us to prove simple properties on the bound game values, uniqueness of equilibrium and convergence bounds. We start with a simple result in Lemma LABEL:lem:pari.

Lemma 9 (Parity).
\thlabel

lem:pari For unitary games, the utility of any sequence of play from a heap is odd if and only if the size of the heap is odd.

Proof.

For an odd heap size xx, if we divide the total wins between Left and Right, it has to be even for one player and odd for the other. If xx is even, the number of wins for the players will either both be even or both will be odd. Thus, the difference is even. ∎

Since unitary games are symmetric, we may assume that Left has the marker, unless otherwise stated, so m=1m=1 will be the default. We define maximin function for unitary games in Defination 10 which is a special case of Observation LABEL:obs:maximinsim.

Definition 10 (Unitary Maximin).

The maximin function ν^:X×{0,1}TB+1\widehat{\nu}:X\times\{0,1\}\rightarrow\mathbb{R}^{\mathrm{TB}+1} is, for all p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}, ν^p(0)=0\widehat{\nu}_{p}(0)=0, and for x>0x>0,

ν^p(x)=max0pmin0rq{ν^p(x1)|>r+1,ν^q+(x1)|=r+1,ν^p+r(x1)|<r1}.\widehat{\nu}_{p}(x)=\max_{0\leq\ell\leq p}\;\min_{0\leq r\leq q}\{\widehat{\nu}_{p-\ell}(x-1)|_{\ell>r}+1,-\widehat{\nu}_{q+\ell}(x-1)|_{\ell=r}+1,\widehat{\nu}_{p+r}(x-1)|_{\ell<r}-1\}.

We are now ready to prove the first main result of the paper in Theorem LABEL:thm:unitary, which establishes the unique equilibrium in unitary games. However, before proving Theorem LABEL:thm:unitary, we show the following examples to motivate the result, \threfthm:unitary, by maximin play in simple unitary games. Let us first revisit the example in the first paragraph, in a more formal manner.

Example 11 (Maximin Play).
\thlabel

ex:? Let us illustrate Definition 10 with an example where TB=5\mathrm{TB}=5, and x=2x=2, displaying the values op(x)o_{p}(x) for TB=5\mathrm{TB}=5 and the heap sizes x=0,1,2x=0,1,2. We explain the entry with the question mark in the table.

xp^x\setminus\widehat{p} 5 4 3 2 1 0
0 0 0 0 0 0 0
1 1 1 1 -1 -1 -1
2 2 2 0 0 ‘?’ -2

If the position is (2,1^)(2,\widehat{1}), then {0,1}\ell\in\{0,1\} and r{0,1,2}r\in\{0,1,2\}, by the reduced form equivalent. If =0\ell=0, then Rights minimum is min{ν^4+0(1)+1,ν^1+1(1)1}=2\min\{-\widehat{\nu}_{4+0}(1)+1,\widehat{\nu}_{1+1}(1)-1\}=-2. If =1\ell=1, then Right’s minimum is min{ν^11(1)+1,ν11(1)+1,ν^1+2(1)1}=min{ν^0(1)+1,ν^5(1)+1,ν^3(1)1}=0\min\{\widehat{\nu}_{1-1}(1)+1,\nu_{1-1}(1)+1,\widehat{\nu}_{1+2}(1)-1\}=\min\{\widehat{\nu}_{0}(1)+1,-\widehat{\nu}_{5}(1)+1,\widehat{\nu}_{3}(1)-1\}=0. Hence, Left will bid =1\ell=1 and ν^1(2)=max{2,0}=0\widehat{\nu}_{1}(2)=\max\{-2,0\}=0.

Example 12.

Consider TB=9\mathrm{TB}=9. We computed maximin values for heap sizes x8x\leq 8, and gain the following bidding tables for heap size x=9x=9. Left bids \ell and Right bids rr. The lower right corners show the equilibrium values, i.e. the maximin == minimax value; see \threfthm:unique. Note that everything to the right of the bold is smaller than or equal the diagonal in bold, but by the second table, the bold does not bound the below area (as in the proof of \threfthm:unique).

rr\setminus\ell 0 1 2 3 4 5 6 max\max
0 1 1 1 1 -1 -1 -3 1
1 1 1 1 1 -1 -1 -3 1
2 3 3 1 1 -1 -1 -3 3
3 3 3 3 -1 -1 -1 -3 3
min\min 1 1 1 -1 -1 -1 -3 1
rr\setminus\ell 0 1 2 3 4 max\max
0 1 1 -1 -1 -1 1
1 -1 1 -1 -1 -1 1
2 -1 -1 -1 -1 -1 -1
3 1 1 1 -1 -1 1
4 1 1 1 1 -3 1
5 3 3 3 3 3 3
min\min -1 -1 -1 -1 -3 -1

For unitary games, we aim to prove

Theorem 13 (Equilibrium for Unitary Games).
\thlabel

thm:unitary If BCS is unitary, then for all positions, it has a unique equilibrium. It is given by, for all pp, op(0)=0o_{p}(0)=0, and if x>0x>0, then

(3) op(x)=max0pmin0rq{oq+(x1)|=r+1,op+r(x1)|<rq1}.\displaystyle o_{p}(x)=\max_{0\leq\ell\leq p}\;\min_{0\leq r\leq q}\{-o_{q+\ell}(x-1)|_{\ell=r}+1,o_{p+r}(x-1)|_{\ell<r\leq q}-1\}.

As demonstrated by Corollary LABEL:cor:U, the proof of the theorem will follow, by establishing the properties in Definition LABEL:def:U for unitary games. In Section 3.2, we prove the properties in Definition LABEL:def:U for unitary games.

After this, in Section 3.3, we prove a certain monotonicity result on increasing heap sizes (\threflem:rowsplitmon). This will lead to the second main result of this paper, an eventual period 2 of equilibrium outcomes in case of unitary games.

In Figures 2 and 2, we show the equilibrium outcomes for all sufficiently large heap sizes for the special cases of TB=8\mathrm{TB}=8 and 99 respectively. This eventual stabilization of the outcomes and bids is later formalized via an independent bidding automaton, and we conjecture that these are generic examples.

\boldsymbol{-}(x,8^)(x,\widehat{8})(x,7^)(x,\widehat{7})(x,6^)(x,\widehat{6})(x,5^)(x,\widehat{5})(x,4^)(x,\widehat{4})(x,3^)(x,\widehat{3})(x,2^)(x,\widehat{2})(x,1^)(x,\widehat{1})(x,0^)(x,\widehat{0})xx even+5+5+3+3+3+3+1+1+1+11-11-13-33-3xx odd+4+4+4+4+2+2+2+2002-22-24-4
Figure 1. The value table for TB=8\mathrm{TB}=8 and sufficiently large heap sizes xx, even and odd respectively. The blue arrow indicates a shift of sign in the representation of (x,8^)(x1,8)(x,\widehat{8})\rightarrow(x-1,8), xx odd; that is, both players bid 0, and Left wins the bid by tie-breaking. This bidding is equilibrium play, since 5=1(4)5=1-(-4).
(x,9^)(x,\widehat{9})(x,8^)(x,\widehat{8})(x,7^)(x,\widehat{7})(x,6^)(x,\widehat{6})(x,5^)(x,\widehat{5})(x,4^)(x,\widehat{4})(x,3^)(x,\widehat{3})(x,2^)(x,\widehat{2})(x,1^)(x,\widehat{1})(x,0^)(x,\widehat{0})xx even+5+5+5+5+3+3+3+3+1+11-11-13-33-35-5xx odd+6+6+4+4+4+4+2+2+2+202-22-24-44-4
Figure 2. The value table for TB=9\mathrm{TB}=9 and sufficiently large heap sizes xx, even and odd respectively. The arrow indicates a Left win by the bid =3\ell=3, from (x,9^)(x,\widehat{9}), on a large even heap size xx. This bid is not in equilibrium. In fact, the bid is dominated by Left bidding 1, which is in equilibrium, since 5+1=65+1=6. Note also that bidding zero is in equilibrium. Indeed 0-ties are in equilibrium for all sufficiently large heap sizes.

3.2. Property 𝒰\mathcal{U} for Unitary Games

We begin by proving monotonicity of maximin values for fixed heap sizes, i.e. property 𝒰\mathcal{U}(A), ‘budget monotonicity’ in Lemma LABEL:lem:col_mon.

Lemma 14 (Budget Monotonicity, 𝒰\mathcal{U}(A)).
\thlabel

lem:col_mon For all games, ν^p(x)ν^π(x)\widehat{\nu}_{p}(x)\geq\widehat{\nu}_{\pi}(x) if pπp\geq\pi.

Proof.

The proof is by induction over the heap sizes, and note that νp(0,1)=00=νπ(0,1)\nu_{p}(0,1)=0\geq 0=\nu_{\pi}(0,1). Suppose that the statement holds for all heap sizes smaller than x>0x>0. In row xx, Left has all the bidding options with budget pp as she has with budget π\pi. Since, by induction, the values are non-decreasing in row x1x-1, if she can force a win (perhaps using a tie) with budget π\pi, she is assured at least as good value in column pp as in column π\pi. If she cannot force a win with budget π\pi, the only difference is that with budget pp she could perhaps force a win, and again, the budget monotonicity of the smaller heaps imply the result. ∎

Remark 15.

By combining (1) with Lemma  LABEL:lem:col_mon, we get νp(x)νπ(x)\nu_{p}(x)\geq\nu_{\pi}(x) if pπp\geq\pi.

Thus, by combining Lemma LABEL:lem:col_mon and Remark 15, we show that unitary games satisfies \threfdef:U (A) property. However, we have some direct consequences of budget monotonicity. We may condition maximin values on given bids. If the players tie \ell from position (x,p^)(x,\widehat{p}), we write ν^p(x)|T()=ν^q+(x1)+1\widehat{\nu}_{p}(x)|_{T(\ell)}=-\widehat{\nu}_{q+\ell}(x-1)+1, and so on, for the result, given an \ell-tie at (x,p^)\left(x,\widehat{p}\,\right) and subsequent maximin play.

In Lemma LABEL:lem:tie prove the tie monotonicity property of for unitary games which shows that Left weakly prefers the tie (1^,1)(\widehat{\ell-1},\ell-1) over (^,)(\widehat{\ell},\ell). This property will be useful to prove that unitary games satisfies 𝒰(B)\mathcal{U}(B) property.

Lemma 16 (Tie Monotonicity).
\thlabel

lem:tie At any position, with x,p,>0x,p,\ell>0, the tie (^,)(\widehat{\ell},\ell) is maximin weakly worse for player Left, than the tie (1^,1)(\widehat{\ell-1},\ell-1).

Proof.

The proof follows by heap monotonicity (\threflem:col_mon). In particular, we have ν^q+(x)ν^q+1(x)\widehat{\nu}_{q+\ell}(x)\geq\widehat{\nu}_{q+\ell-1}(x). On the other hand ν^p(x)|T()=1ν^q+(x1)\widehat{\nu}_{p}(x)|_{T(\ell)}=1-\widehat{\nu}_{q+\ell}(x-1) and ν^p(x)|T(1)=1ν^q+1(x1)\widehat{\nu}_{p}(x)|_{T(\ell-1)}=1-\widehat{\nu}_{q+\ell-1}(x-1). Hence, ν^p(x)|T()ν^p(x)|T(1)\widehat{\nu}_{p}(x)|_{T(\ell)}\leq\widehat{\nu}_{p}(x)|_{T(\ell-1)}. ∎

This innocent consequence of heap monotonicity implies that Right can assure the outcome of the 0-tie, namely all entries in the upper right area weakly bounded by the main diagonal (in the game value matrix described in Example 12), are weakly smaller than the value at the 0-tie. (This property is implicit in the proof of \threfthm:unique.)

In the next Lemma LABEL:lem:marker, we prove a natural bounds on game value of the equilibrium outcomes when Left has marker in term of the game value at equilibrium when Left does not have a marker. This result precisely prove that unitary games satisfies Definition LABEL:def:U (B). As a bonus, we get a bound on how good the marker can be.

Lemma 17 (Marker Monotonicity, 𝒰\mathcal{U}(B)).
\thlabel

lem:marker Consider any unitary ruleset. Then, for all heap sizes xx and all Left budgets pp,

(4) νp(x)ν^p(x)νp(x)+2.\displaystyle\nu_{p}(x)\leq\widehat{\nu}_{p}(x)\leq\nu_{p}(x)+2.
Proof.

The proof is by induction on the inequalities (4) (and they clearly hold for x=0x=0). Suppose that there is a position such that νp(x)>ν^p(x)\nu_{p}(x)>\widehat{\nu}_{p}(x), i.e. Left does not want to win a tie at (x,p^)\left(x,\widehat{p}\,\right). By \threflem:tie, we assume that Left bids 0. Then, if Right accepts the 0-tie, induction gives a contradiction, namely

1+νp(x1)<ν^p(x1)1,1+\nu_{p}(x-1)<\widehat{\nu}_{p}(x-1)-1,

which is equivalent with

2+νp(x1)<ν^p(x1).2+\nu_{p}(x-1)<\widehat{\nu}_{p}(x-1).

Suppose next that 2+νp(x)<ν^p(x)2+\nu_{p}(x)<\widehat{\nu}_{p}(x), which means that a tie-win is worth more than 2 points. Suppose that Left bids 0. Then 2+ν^p(x1)1<νp(x1)+12+\widehat{\nu}_{p}(x-1)-1<\nu_{p}(x-1)+1, which contradicts the first inequality in (4). ∎

The next lemma is a consequence of Marker Monocity.

Lemma 18.
\thlabel

lem:markdom For all heap sizes xx, all Left budgets pp and all Left bids 0p0\leq\ell\leq p, ν^p(x)ν^q+(x)\widehat{\nu}_{p}(x)\geq-\widehat{\nu}_{q+\ell}(x).

Proof.

We have that, for all xx, and all pp,

ν^(x,p)\displaystyle\widehat{\nu}(x,p) ν(x,p)=ν^q(x)ν^q+(x),\displaystyle\geq\nu(x,p)=-\widehat{\nu}_{q}(x)\geq-\widehat{\nu}_{q+\ell}(x),

by \threflem:marker, because \threflem:col_mon gives ν^q+(x)ν^q(x)\widehat{\nu}_{q+\ell}(x)\geq\widehat{\nu}_{q}(x). ∎

In the next Lemma LABEL:lem:markworth, we show that unitary games (rules) satisfies 𝒰(C)\mathcal{U}(C). That is, the marker is never worth more than $1. This will complete our argument for the proof of the main Theorem LABEL:thm:unitary

Lemma 19 (Marker Worth, 𝒰\mathcal{U}(C)).
\thlabel

lem:markworth Consider a unitary ruleset. Then, for all heap sizes xx and all Left budgets pp,

ν^p(x)νp+1(x)\widehat{\nu}_{p}(x)\leq\nu_{p+1}(x)
Proof.

We prove this theorem by an induction argument on heaps of size xx, and analyze the values for all budget partitions. Consider the base case when x=1x=1. If Left can force a win with budget pp and the marker, then pqp\geq q, and hence p+1>q1p+1>q-1 gives that ν^p(1)=1\widehat{\nu}_{p}(1)=1 implies νp+1(1)=1\nu_{p+1}(1)=1. Therefore ν^p(1)νp+1(1)\widehat{\nu}_{p}(1)\leq\nu_{p+1}(1).

Suppose next that the theorem is true for all heap sizes smaller than xx. To complete the induction, we need to show that ν^p(x)νp+1(x)\widehat{\nu}_{p}(x)\leq\nu_{p+1}(x). We will prove the induction step for three different cases of maximin ν^p(x)\widehat{\nu}_{p}(x):

  • Case 1. Left wins with tie at (x,p^)\left(x,\widehat{p}\,\right) by bidding \ell;

  • Case 2. Left wins without tie at (x,p^)\left(x,\widehat{p}\,\right) by bidding \ell;

  • Case 3. Right wins at (x,p^)\left(x,\widehat{p}\,\right) by bidding rr.

Consider Case 1. Right maximin weakly prefers losing an item at tie \ell rather than winning it by bidding +1\ell+1. Therefore,

(5) 1+ν^p++1(x1)1+νp(x1)-1+\widehat{\nu}_{p+\ell+1}(x-1)\geq 1+\nu_{p-\ell}(x-1)

There are two subcases.

The first subcase is whenever 0q10\leq\ell\leq q-1. We claim that Right does not maximin prefer winning if Left bids +1\ell+1 at position (x,p+1)\left(x,p+1\right). If Right wins at (x,p+1)\left(x,p+1\right) by bidding +1\ell+1 with marker, then

1+ν^p++2(x1)1+ν^p++1(x1)1+νp(x1)-1+\widehat{\nu}_{p+\ell+2}(x-1)\geq-1+\widehat{\nu}_{p+\ell+1}(x-1)\geq 1+\nu_{p-\ell}(x-1)

The first inequality holds by heap monotonicity and the second follows from inequality (5).

Now assume that Right bids r>+1r>\ell+1 at (x,p+1)\left(x,p+1\right), and wins. Then,

1+νp+r+1(x1)1+νp++2(x1)1+ν^p++1(x1)1+νp(x1)-1+\nu_{p+r+1}(x-1)\geq-1+\nu_{p+\ell+2}(x-1)\geq-1+\widehat{\nu}_{p+\ell+1}(x-1)\geq 1+\nu_{p-\ell}(x-1)

The first inequality holds by heap monotonicity, the second by induction and the third follows from inequality (5).

Altogether this implies that, for this subcase, νp+1(x)1+νp(x1)=ν^p(x)\nu_{p+1}(x)\geq 1+\nu_{p-\ell}(x-1)=\widehat{\nu}_{p}(x).

Consider the other subcase, when q\ell\geq q. In this case, Left can secure a win by bidding \ell at (x,p+1)\left(x,p+1\right), as Right cannot over bid. This implies that

νp+1(x)1+νp+1(x1)1+νp(x1)=ν^p(x)\nu_{p+1}(x)\geq 1+\nu_{p-\ell+1}(x-1)\geq 1+\nu_{p-\ell}(x-1)=\widehat{\nu}_{p}(x)

(by heap monotonicity). This completes the proof for Case 1.

We now prove Case 2. Left wins the bid, without a tie, by bidding \ell, and therefore, at (x,p^)\left(x,\widehat{p}\,\right), Right maximin weakly prefers losing the bid. We get

(6) 1+ν^p++1(x1)1+ν^p(x1)=ν^p(x)-1+\widehat{\nu}_{p+\ell+1}(x-1)\geq 1+\widehat{\nu}_{p-\ell}(x-1)=\widehat{\nu}_{p}(x)

Consider first the subcase, q1\ell\leq q-1. We claim that, if Left bids \ell, then she maximin wins at position (x,p+1)\left(x,p+1\right). If not, then either Right maximin wins by bidding \ell with a marker or by bidding r>r>\ell. Now, if Right bids \ell and wins using a marker, we study the inequalities

1+ν^p++1(x1)1+ν^p(x1)1+νp(x1)-1+\widehat{\nu}_{p+\ell+1}(x-1)\geq 1+\widehat{\nu}_{p-\ell}(x-1)\geq 1+\nu_{p-\ell}(x-1)

The first inequality holds by (6) and the second inequality follows from marker monotonicity. Hence, at νp+1(x)\nu_{p+1}(x), Right would prefer losing if Left bids \ell over winning by bidding \ell with marker.

Now, if Right bids r>r>\ell and maximin wins the bid, we study the inequalities

1+νp+r+1(x1)\displaystyle-1+\nu_{p+r+1}(x-1) 1+ν^p++1(x1)\displaystyle\geq-1+\widehat{\nu}_{p+\ell+1}(x-1)
1+ν^p(x1)\displaystyle\geq 1+\widehat{\nu}_{p-\ell}(x-1)
1+νp(x1)\displaystyle\geq 1+\nu_{p-\ell}(x-1)

The first inequality holds by induction, the second follows from inequality (6), and the third follows from marker monotonicity. Hence, at νp+1(x)\nu_{p+1}(x), Right would prefer losing if Left bids \ell over winning by bidding r>r>\ell.

The subcase q\ell\geq q is similar to Case 1. Altogether,

νp+1(x)1+νp+1(x1)1+ν^p(x1)=ν^p(x),\nu_{p+1}(x)\geq 1+\nu_{p-\ell+1}(x-1)\geq 1+\widehat{\nu}_{p-\ell}(x-1)=\widehat{\nu}_{p}(x),

which completes the proof of Case 2.

We now prove Case 3. As Right maximin wins at (x,p^)\left(x,\widehat{p}\,\right) by bidding rr, we get

(7) ν^p(x)=ν^p+r(x1)1νpr+1(x1)+1\widehat{\nu}_{p}(x)=\widehat{\nu}_{p+r}(x-1)-1\leq\nu_{p-r+1}(x-1)+1

In other words, at (x,p^)\left(x,\widehat{p}\,\right), Right prefers winning, by bidding rr, over losing for marker, by bidding r1r-1.

In order to prove this case, we claim that if Left bids =r1\ell=r-1 at (x,p+1)(x,p+1), then Right prefers winning by a tie. This follows by the inequalities

1+νpr+2(x1)1+νpr+1(x1)ν^p+r(x1)11+\nu_{p-r+2}(x-1)\geq 1+\nu_{p-r+1}(x-1)\geq\widehat{\nu}_{p+r}(x-1)-1

The first inequality holds by budget monotonicity and the second follows from inequality (7). Hence, playing from (x,p+1)(x,p+1), Left can force at least the value where Right ties, i.e. νp+1(x)ν^p+r(x1)1=ν^p(x)\nu_{p+1}(x)\geq\widehat{\nu}_{p+r}(x-1)-1=\widehat{\nu}_{p}(x). This completes the proof of Case 3.

Thus, the lemma holds. ∎

We have established the three properties of Definition LABEL:def:U. Therefore we conclude with the proof of \threfthm:unitary.

Proof of \threfthm:unitary.

Combine \threflem:col_mon with \threflem:markdom and \threflem:markworth. Thus Definition LABEL:def:U applies: unitary games are in 𝒰\mathcal{U}, and we may apply \threfcor:U. ∎

From now onward we use outcome, i.e. oo-notation, instead of ν^\widehat{\nu}, and in proofs, we may fix the bid of either player to prove inequalities as required by a given context. By fixing the bid of player Left (Right), we may obtain a lower (upper) bound of the outcome, corresponding to maximin (minimax) evaluation.

3.3. A Sign Border and Bounded Game Values

In the last subsection, we have already shown that, for each position, unitary games exhibit a unique equilibrium. Now, in this subsection, we discuss the bounds on game values for given TB\mathrm{TB}, xx and budget partition. First, we establish the sign border of the game value for a given total budget of TB\mathrm{TB}. In Lemma LABEL:lem:signborder, we show that if Left has more money than Right, i.e.pTB/2p\geq\lceil\mathrm{TB}/2\rceil then the game value op(x)0o_{p}(x)\geq 0. Moreover, we show that the sign of the outcome is sensitive to the sign border TB/2\lceil\mathrm{TB}/2\rceil

Lemma 20 (Sign Border).
\thlabel

lem:signborder For a given TB\mathrm{TB} and all heap sizes xx, 2pTB2p\geq\mathrm{TB} implies op(x)0o_{p}(x)\geq 0, and 2p<TB2p<\mathrm{TB} implies op(x)0o_{p}(x)\leq 0.

Proof.

We prove that, if pTB/2p\geq\lceil\mathrm{TB}/2\rceil then op(x)0o_{p}(x)\geq 0. If p<TB/2p<\lceil\mathrm{TB}/2\rceil then op(x)0o_{p}(x)\leq 0.

By the 0-sum property (1) combined with the first inequality in Lemma LABEL:lem:marker, we have op(x)oq(x)o_{p}(x)\geq-o_{q}(x). Adding op(x)o_{p}(x) on both sides gives,

2op(x)op(x)oq(x)2o_{p}(x)\geq o_{p}(x)-o_{q}(x)

As pqp\geq q, using heap monotonicity lemma, 2op(x)02o_{p}(x)\geq 0, which proves op(x)0o_{p}(x)\geq 0.

For the second part when p<TB/2p<\mathrm{TB}/2, we use the second inequality in Lemma LABEL:lem:marker combined with the 0-sum property (1), we have op(x)oq(x)+2o_{p}(x)\leq-o_{q}(x)+2. Adding op(x)o_{p}(x) on both sides, we get

2op(x)op(x)oq(x)+2.2o_{p}(x)\leq o_{p}(x)-o_{q}(x)+2.

Hence,

op(x)(op(x)oq(x))/2+1.o_{p}(x)\leq(o_{p}(x)-o_{q}(x))/2+1.

If xx is even, this implies that op(x)0o_{p}(x)\leq 0, by heap monotonicity and Lemma LABEL:lem:pari; if xx is odd then op(x)1o_{p}(x)\leq 1.

Thus we may assume that, for some p<TB/2p<\mathrm{TB}/2, op(x)=1o_{p}(x)=1. If op1(x1)=0o_{p-1}(x-1)=0, then if Left wins by bidding 1, Right increases to either get a tie by bidding 1 or to win by bidding 2. More precisely, if oTB/2+2(x1)=0o_{\mathrm{TB}/2+2}(x-1)=0 then he 1-ties, and otherwise, he bids 2. In either case this gives op(x)=1o_{p}(x)=-1.

If Left 0-ties at (x,p)(x,p) and oTB/2(x1)=0o_{\lceil\mathrm{TB}/2\rceil}(x-1)=0, then Right bids 1 and obtains op(x)1o_{p}(x)\leq-1, with equality if p=TB/2p=\lfloor\mathrm{TB}/2\rfloor. ∎

Observation 21.

Note that if the heap size xx is odd, then, by \threflem:pari, the inequalities in \threflem:signborder are strict.

The previous Lemma LABEL:lem:signborder is extensively useful to obtain an upper and lower bound on the game value. In the next Lemma LABEL:lem:bound, by adapting \threflem:signborder, we prove an upper and a lower bound of the game values.

Lemma 22 (Bounded Outcome).
\thlabel

lem:bound For all xx and all pp, TB/2op(x)TB/2+1-\lfloor\mathrm{TB}/2\rfloor\leq o_{p}(x)\leq\lceil\mathrm{TB}/2\rceil+1.

Proof.

Consider first pTB/2p\geq\lceil\mathrm{TB}/2\rceil. We define a Right strategy, such that Left cannot get better than TB/2+1\lceil\mathrm{TB}/2\rceil+1. Right bids 0 until (possibly) the first time Left’s budget partition becomes smaller than TB/2\lceil\mathrm{TB}/2\rceil. In the case where Left has the marker and bids 0, the budget partition will stay the same, but Right gets the marker. Hence, unless Left bids 1 in the next bid, the change in score would be 11=01-1=0. When, at some point, Left bids 1, she keeps the marker, and gets a point, but the budget partition is one unit closer to the sign border. Since the sign border is at TB/2\lceil\mathrm{TB}/2\rceil, the result follows, by the similar bound for the case p<TB/2p<\lceil\mathrm{TB}/2\rceil. ∎

The following result in Proposition 23 proves the relation between change in the game value with added extra one dollar to Left’s budget. We prove this result, even though we do not explicitly use it in order to prove other results. We wonder how this result generalizes for arbitrary subtraction sets.

Proposition 23.

For all TB\mathrm{TB}, for all xx, for all pp, op(x)+2op+1(x)o_{p}(x)+2\geq o_{p+1}(x).

Proof.

Assume for a contradiction that Left can obtain

(8) op+1(x)=op(x)+4\displaystyle o_{p+1}(x)=o_{p}(x)+4

(or more), where the outcome at (x,p^)\left(x,\widehat{p}\,\right) is given. If Right wins at (x,p+1^)\left(x,\widehat{p+1}\,\right), then he will decrease bid, to possibly a tie. But Left prefers a smaller tie, so she can decrease and perhaps Right wins again. If so, if the gain is only 2 points we are done, so assume gain is 4 points. Then he decreased again and Left still prefers smaller tie, and so on.

By \threfthm:unique we do not need to consider cases where Left wins. Hence those cases where Right wins at (x,p+1^)\left(x,\widehat{p+1}\,\right) reduce to study of cases where Left wins by a 0-tie at position (x,p+1^)\left(x,\widehat{p+1}\,\right) and with (8). Thus, only two cases remain.

  1. (1)

    Left wins an \ell-tie at (x,p^)\left(x,\widehat{p}\,\right), and she wins a 0-tie at (x,p+1^)\left(x,\widehat{p+1}\,\right).

  2. (2)

    Right wins at (x,p^)\left(x,\widehat{p}\,\right) by bidding rr, and Left wins a 0-tie at (x,p+1^)\left(x,\widehat{p+1}\,\right).

For Case 1, we have op(x)=oq+(x1)+1o_{p}(x)=-o_{q+\ell}(x-1)+1 and op+1(x)=oq1(x1)+1o_{p+1}(x)=-o_{q-1}(x-1)+1, and hence, by (8)

(9) 4oq+(x1)=oq1(x1)\displaystyle 4-o_{q+\ell}(x-1)=-o_{q-1}(x-1)

Thus, by induction, >0\ell>0. This means that Left must avoid a Right win with 1r=11\leq r=\ell-1 at (x,p+1^)\left(x,\widehat{p+1}\,\right). Hence op+r(x1)<oq+(x1)+1=oq1(x1)3op+1(x1)3o_{p+r}(x-1)<-o_{q+\ell}(x-1)+1=-o_{q-1}(x-1)-3\leq o_{p+1}(x-1)-3, by (9) and marker monotonicity.

For Case 2, we have op(x)=op+r(x1)1o_{p}(x)=o_{p+r}(x-1)-1 and op+1(x)=oq1(x1)+1o_{p+1}(x)=-o_{q-1}(x-1)+1, and hence

(10) 4+op+r(x1)1=oq1(x1)+1,\displaystyle 4+o_{p+r}(x-1)-1=-o_{q-1}(x-1)+1,

That is, op+r(x1)+oq1(x1)=2o_{p+r}(x-1)+o_{q-1}(x-1)=-2. If q1q-1 is to the left of the sign border, then op+r(x1)op+1(x1)2o_{p+r}(x-1)-o_{p+1}(x-1)\leq-2, which is impossible, by budget monotonicity and since r1r\geq 1. If q1q-1 is to the right of the sign border, then p+rp+r is to the left, and we get oq1(x1)oqr(x1)2o_{q-1}(x-1)-o_{q-r}(x-1)\leq-2, which is impossible, by r1r\geq 1 and budget monotonicity. ∎

Next, in Lemma LABEL:lem:rowsplitmon we prove that the equilibrium outcomes are monotone (non-decreasing or non-increasing as reflected by the sign border) for heap sizes of the same parity. We show that whenever 2pTB2p\geq\mathrm{TB}, then the game value increases as heap size increases and similarly, for 2p<TB2p<\mathrm{TB}, game value is monotonically decreasing. We note that budget monotonicity does not imply this result as we do not change the parity but heap size.

{restatable}

[Heap Monotonicity]lemmalemmaheapmonotonisity\thlabellem:rowsplitmon Fix a total budget TB\mathrm{TB} and a Left budget p^\widehat{p}, and consider unitary games with heap sizes of the same parity. If 2pTB2p\geq\mathrm{TB}, then the value is monotonically non-decreasing, and otherwise it is monotonically non-increasing. This is impossible, by budget monotonicity, since r1r\geq 1.

Proof.

The base case concerns heap sizes x=0x=0 and 22. Since the outcomes are all 0 at x=0x=0, this case is covered by Lemma LABEL:lem:signborder (Sign Border).

We study the outcome at heap size xx. The value op(x2)o_{p}(x-2) satisfies (at least) one out of three definitions:

  • L:

    op(x2)=op(x3)+1o_{p}(x-2)=o_{p-\ell}(x-3)+1.555Note that by the proof of \threfthm:unique, it is not required to study the cases where Lefts wins a bid strictly. However, for the flow of the proof we find it somewhat nicer when those cases are included.

  • R:

    op(x2)=op+r(x3)1o_{p}(x-2)=o_{p+r}(x-3)-1.

  • T:

    op(x2)=oq+(x3)+1o_{p}(x-2)=-o_{q+\ell}(x-3)+1.

In case L, Left wins the bid; in case R, Right wins the bid, and in case T, there is a tie, so Left, who has the marker, wins the bid.

The proof splits into 10 distinct cases, depending on how the bid is won,

  1. (1)

    Tie and 2pTB,q+<TB/22p\geq\mathrm{TB},q+\ell<\lceil\mathrm{TB}/2\rceil

  2. (2)

    Tie and 2p<TB,q+TB/22p<\mathrm{TB},q+\ell\geq\lceil\mathrm{TB}/2\rceil

  3. (3)

    Right wins and 2pTB,2(p+r)TB2p\geq\mathrm{TB},2(p+r)\geq\mathrm{TB}

  4. (4)

    Left wins and 2pTB,2(p)TB2p\geq\mathrm{TB},2(p-\ell)\geq\mathrm{TB}

  5. (5)

    Left wins and 2p<TB,2(p)<TB2p<\mathrm{TB},2(p-\ell)<\mathrm{TB}

  6. (6)

    Right wins and 2p<TB,2(p+r)<TB2p<\mathrm{TB},2(p+r)<\mathrm{TB}

  1. A.

    Left wins and 2pTB,2(p)<TB2p\geq\mathrm{TB},2(p-\ell)<\mathrm{TB}

  2. B.

    Right wins and 2p<TB,2(p+r)TB2p<\mathrm{TB},2(p+r)\geq\mathrm{TB}

  3. C.

    Tie and 2pTB,q+TB/22p\geq\mathrm{TB},q+\ell\geq\lceil\mathrm{TB}/2\rceil

  4. D.

    Tie and 2p<TB,q+<TB/22p<\mathrm{TB},q+\ell<\lceil\mathrm{TB}/2\rceil

The case (D) cannot happen, because qTB/2q\geq\lceil\mathrm{TB}/2\rceil and 0\ell\geq 0.

In cases 1-6 and several subcases we use induction. Suppose for example that 2pTB2p\geq\mathrm{TB}, and we wish to prove, by induction, that for all x>1x>1,

(11) op(x)op(x2).\displaystyle o_{p}(x)\geq o_{p}(x-2).

Hence, we show that Right cannot do better in (x,p)(x,p) than in (x2,p)(x-2,p). As induction hypothesis, we may assume op+r(x1)op+r(x3)o_{p+r}(x-1)\geq o_{p+r}(x-3), and hence, if Right wins at xx by bidding rr, then inequality (11) holds. We will refer to similar situations by saying ‘by induction’. For some more detail, to contradict the inequality (11), Right must change strategy at xx. That is he must lower his bid to r1r-1 or smaller. If he lowers to r1r-1, then we may assume that this is now a tie, and this situation has to be considered. In case the decrease of bid is successful for Right, then Left might deviate, etc; the particular context will determine.

In cases of tie, the argument will be by induction in cases where the relevant budget partition crosses the Sign Border. For example in case (1), Left wins, and TBp+<TB/2\mathrm{TB}-p+\ell<\mathrm{TB}/2. Since 2pTB2p\geq\mathrm{TB}, we wish to prove a non-decreasing outcome, and induction thus applies when signs for non-decreasing outcomes change. Note that in case of tie bids, the player who tries to contradict the inequality, could do this either by lowering, or raising the bid, and thus several sub-cases may need to be considered. Since there are significant variations to why the contradicting player will not succeed, we will treat all cases.

Case 1. The players \ell-tie at heap size x2x-2 and where 2pTB2p\geq\mathrm{TB}. We have

(12) op(x2)=oq+(x3)+1,\displaystyle o_{p}(x-2)=-o_{q+\ell}(x-3)+1,

where q+<TB/2q+\ell<\lceil\mathrm{TB}/2\rceil, i.e. 2(p)>TB2(p-\ell)>\mathrm{TB}. We need to prove that

(13) op(x)op(x2).\displaystyle o_{p}(x)\geq o_{p}(x-2).

We use induction to assume that

(14) oq+(x1)oq+(x3)\displaystyle-o_{q+\ell}(x-1)\geq-o_{q+\ell}(x-3)

By way of contradiction, (14) and (13), Right deviates at xx:

  1. (1)

    If Right decreases his bid, Left wins by bidding \ell. Hence, by pTB/2p-\ell\geq\mathrm{TB}/2, induction gives op(x)|L()=1+op(x1)1+op(x3)o_{p}(x)|_{L(\ell)}=1+o_{p-\ell}(x-1)\geq 1+o_{p-\ell}(x-3). By \threflem:tie, Tie Monotonicity, op(x3)oq+(x3)o_{p-\ell}(x-3)\geq-o_{q+\ell}(x-3) and 1oq+(x3)=op(x2)1-o_{q+\ell}(x-3)=o_{p}(x-2). Thus, op(x)|L()op(x2)o_{p}(x)|_{L(\ell)}\geq o_{p}(x-2).

  2. (2)

    If Right increases his bid, he wins and we get op(x)|R(r)=op+r(x1)1o_{p}(x)|_{R(r)}=o_{p+r}(x-1)-1, with r=+1r=\ell+1. As Right does not benefit by increasing his bid at ‘x3x-3’, he cannot benefit by increasing his bid at ‘x1x-1’. This follows by induction, because, by assumption (12), op(x2)op+r(x3)1op+r(x1)1=op(x)|R(r)o_{p}(x-2)\leq o_{p+r}(x-3)-1\leq o_{p+r}(x-1)-1=o_{p}(x)|_{R(r)}.

Case 2. The players \ell-tie at heap size x2x-2 and where 2p<TB2p<\mathrm{TB}. We have

(15) op(x2)=oq+(x3)+1,\displaystyle o_{p}(x-2)=-o_{q+\ell}(x-3)+1,

where q+TB/2q+\ell\geq\lceil\mathrm{TB}/2\rceil, i.e. 2(p)TB2(p-\ell)\leq\mathrm{TB}. We need to prove that

(16) op(x)op(x2).\displaystyle o_{p}(x)\leq o_{p}(x-2).

We use induction to assume that

(17) oq+(x1)oq+(x3)\displaystyle-o_{q+\ell}(x-1)\leq-o_{q+\ell}(x-3)

By way of contradiction of (16), Left deviates at xx.

  1. (1)

    If Left decreases her bid, Right wins by bidding \ell. Hence, op(x)|R(l)=op+(x1)1o_{p}(x)|_{R(l)}=o_{p+\ell}(x-1)-1. If 2(p+)<TB2(p+\ell)<\mathrm{TB}, by induction, Left cannot contradict (16). Since the relative loss for Left is 2, when Right wins the bid, by heap monotonicity, Left requires a relative 4-gap in outcomes at ‘x3x-3’ and ‘x1x-1’, with 2(p+)TB2(p+\ell)\geq\mathrm{TB}. That is, Left requires

    (18) oq+(x3)+4op+(x1)\displaystyle-o_{q+\ell}(x-3)+4\leq o_{p+\ell}(x-1)

    By the assumption 2p<TB2p<\mathrm{TB}, we get p+<q+p+\ell<q+\ell. Therefore, by heap monotonicity, oq+(x3)2o_{q+\ell}(x-3)\geq 2. Moreover, if oq+(x3)=2o_{q+\ell}(x-3)=2, by heap monotonicity and (18), this forces 0=op+(x3)<op+(x1)=20=o_{p+\ell}(x-3)<o_{p+\ell}(x-1)=2. We will refer to this situation as the 4-gap principle.

    Hence, Right can decrease the bid to 1\ell-1, and still satisfy oq+1(x3)=2o_{q+\ell-1}(x-3)=2, and the argument can be repeated with 1\ell-1 instead of \ell, until at some point op+(x1)=0o_{p+\ell^{\prime}}(x-1)=0; this must happen for some 0\ell^{\prime}\geq 0, by 2p<TB2p<\mathrm{TB} and Sign Border. The case oq+(x3)4o_{q+\ell}(x-3)\geq 4 need not be considered, since 2(p+)TB2(p+\ell)\geq\mathrm{TB} and the Sign Border implies that Left should have decreased the bid to 1\ell-1 at ‘x3x-3’ (she would have been strictly better off losing the bid and if Right decreases then by Tie Monotonicity, she benefits by a smaller tie).

  2. (2)

    If Left increases her bid, by induction, she cannot increase the outcome. Namely, op(x)|L(+1)=op1(x1)+1op1(x3)+1oq+(x3)+1o_{p}(x)|_{L(\ell+1)}=o_{p-\ell-1}(x-1)+1\leq o_{p-\ell-1}(x-3)+1\leq o_{q+\ell}(x-3)+1, by assumption.

Case 3. Right wins at heap size x2x-2, by bidding rr, and where 2pTB2p\geq\mathrm{TB}. Then 2(p+r)>TB2(p+r)>\mathrm{TB}. We have op(x2)=op+r(x3)1o_{p}(x-2)=o_{p+r}(x-3)-1, and need to prove that

(19) op(x)op(x2).\displaystyle o_{p}(x)\geq o_{p}(x-2).

If Right keeps the same winning bid, by induction, he cannot contradict this inequality. Hence he decreases the bid to r1r-1 at xx. We may assume this is a tie, so the relative loss for Right is 2 points. By induction, we may assume that q+r1TB/2q+r-1\geq\lceil\mathrm{TB}/2\rceil, and that

(20) oq+r1(x1)<oq+r1(x3)\displaystyle-o_{q+r-1}(x-1)<-o_{q+r-1}(x-3)

Suppose first that op+r(x3)1=op(x2)=0o_{p+r}(x-3)-1=o_{p}(x-2)=0. Then (20) forces q+r1>p+rq+r-1>p+r. This contradicts the assumption 2pTB2p\geq\mathrm{TB}.

Suppose next that op+r(x3)1=op(x2)=1o_{p+r}(x-3)-1=o_{p}(x-2)=1. Then

(21) 2=op+r(x3)op+r(x1),\displaystyle 2=o_{p+r}(x-3)\leq o_{p+r}(x-1),

by induction.

Claim: p+r>q+r1p+r>q+r-1. Proof of Claim: If p+rq+r1p+r\leq q+r-1, then, by (21), heap monotonicity and the 4-gap principle, Right instead prefers to tie at ‘x3x-3’.

Therefore, by (20), (21) and heap monotonicity, we get 2=op+r1(x1)>op+r1(x3)=02=o_{p+r-1}(x-1)>o_{p+r-1}(x-3)=0.

But then, Left will decrease her bid at xx, below r1r-1, and Right will win. However, the argument gives the same outcome as when he wins by bidding rr. Thus, we may repeat the argument, and Right cannot contradict (19).

Case 4. Left wins by bidding >0\ell>0, 2pTB2p\geq\mathrm{TB} and 2(p)TB2(p-\ell)\geq\mathrm{TB}. Thus, op(x2)=op(x3)+1o_{p}(x-2)=o_{p-\ell}(x-3)+1. We need to prove that

(22) op(x)op(x2).\displaystyle o_{p}(x)\geq o_{p}(x-2).

If Right keeps the same bid at xx, by induction, he cannot contradict this inequality. If he can increase the bid to \ell at xx, there is a tie (otherwise we are done). If the tie remains to the left of the Sign Border, and there is an increase of outcome, i.e. 2(q+)TB2(q+\ell)\geq\mathrm{TB} and oq+(x1)>oq+(x3)o_{q+\ell}(x-1)>o_{q+\ell}(x-3), then he will gain the sufficient amount to contradict (19). But by the third assumption, this can only happen if 2(p)=TB2(p-\ell)=\mathrm{TB}. Hence TB\mathrm{TB} is even. Hence, 2=oq+(x1)>oq+(x3)=02=o_{q+\ell}(x-1)>o_{q+\ell}(x-3)=0, which would give outcome -1 at xx instead of +1. But, since the bids q+=pq+\ell=p-\ell, Left can deviate and let Right win the bid, to produce an outcome at least +1, since p+>pp+\ell>p-\ell.

Case 5. Left wins by bidding >0\ell>0 and 2p<TB2p<\mathrm{TB}. We have op(x2)=op(x3)+1o_{p}(x-2)=o_{p-\ell}(x-3)+1. We need to prove that

(23) op(x)op(x2).\displaystyle o_{p}(x)\leq o_{p}(x-2).

By induction,

op(x1)op(x3).\displaystyle o_{p-\ell}(x-1)\leq o_{p-\ell}(x-3).

Therefore, to contradict (23), Left must change her bid, and she can decrease to ‘1\ell-1’, to get a tie. But,

(24) op(x)|T(1)\displaystyle o_{p}(x)|_{T(\ell-1)} =oq+1(x1)+1oq+1(x3)+1\displaystyle=-o_{q+\ell-1}(x-1)+1\leq-o_{q+\ell-1}(x-3)+1
(25) op(x3)+1=op(x2),\displaystyle\leq o_{p-\ell}(x-3)+1=o_{p}(x-2),

since p<TB/2p<\lceil\mathrm{TB}/2\rceil implies that q+1TB/2q+\ell-1\geq\lceil\mathrm{TB}/2\rceil. If she tries to decrease further, to make Right win the bid to gain a higher outcome, then Right can respond by decreasing his bid, and the sequence of inequalities still holds.

Case 6. Right wins by bidding rr and 2(p+r)<TB2(p+r)<\mathrm{TB}. That is, op(x2)=op+r(x3)1<0o_{p}(x-2)=o_{p+r}(x-3)-1<0. We need to prove that

(26) op(x)op(x2).\displaystyle o_{p}(x)\leq o_{p}(x-2).

And assume, by induction,

op+r(x1)op+r(x3).\displaystyle o_{p+r}(x-1)\leq o_{p+r}(x-3).

Hence Left must change bid, to contradict (26). If Left can increase her bid to ‘rr’ at ‘xx’, we get

op(x)|T(r)\displaystyle o_{p}(x)|_{T(r)} =oq+r(x1)+1oq+r(x3)+1\displaystyle=-o_{q+r}(x-1)+1\leq-o_{q+r}(x-3)+1
op+r(x3)1=op(x2),\displaystyle\leq o_{p+r}(x-3)-1=o_{p}(x-2),

since p<TB/2p<\lceil\mathrm{TB}/2\rceil implies that q+rTB/2q+r\geq\lceil\mathrm{TB}/2\rceil. If she can increase her bid to ‘r+1r+1’, since 2p<TB2p<TB, right can also raise his bid to r+1r+1, and the sequence of inequalities still holds.

Case A. Left wins, by bidding \ell, 2pTB2p\geq\mathrm{TB} and 2(p)<TB2(p-\ell)<\mathrm{TB}. We have

(27) op(x2)=op(x3)+1,\displaystyle o_{p}(x-2)=o_{p-\ell}(x-3)+1,

and we must prove that

(28) op(x)op(x2).\displaystyle o_{p}(x)\geq o_{p}(x-2).

By induction,

(29) op(x1)op(x3).\displaystyle o_{p-\ell}(x-1)\leq o_{p-\ell}(x-3).

If the inequality is strict, then 2op(x1)-2\geq o_{p-\ell}(x-1), and Left must change her bid. Suppose she decreases her bid to a ‘0’ tie. Then

op(x)|T(0)=oq(x1)+11,\displaystyle o_{p}(x)|_{T(0)}=-o_{q}(x-1)+1\geq 1,

by the assumption pTB/2p\geq\lceil\mathrm{TB}/2\rceil, which suffices to justify (27). If Right instead wins by bidding 1, then this only contradicts (28) if op+1(x1)=0o_{p+1}(x-1)=0. In this case Left can raise the bid to get a 1-tie, and indeed, oq+1(x1)op+1(x1)=0o_{q+1}(x-1)\leq o_{p+1}(x-1)=0, by heap monotonicity, since qpq\leq p; this implies oq+1(x1)0-o_{q+1}(x-1)\geq 0. This argument can be repeated (Right instead wins by bidding 2 and perhaps Left raises to a 2-tie etc.) until one of the assumptions fails to hold.

Case B. Right wins, by bidding rr, 2p<TB2p<\mathrm{TB} and 2(p+r)TB2(p+r)\geq\mathrm{TB}. Thus,

(30) op(x2)=op+r(x3)11,\displaystyle o_{p}(x-2)=-o_{p+r}(x-3)-1\leq-1,

and we need to prove that

(31) op(x)op(x2).\displaystyle o_{p}(x)\leq o_{p}(x-2).

By induction,

(32) op+r(x1)op+r(x3)\displaystyle o_{p+r}(x-1)\geq o_{p+r}(x-3)

If this is a strict inequality, then Right must change his bid to satisfy (31). He decreases to r1r-1, and gets a tie:

op(x)|T(r1)=oq+r1(x1)+1\displaystyle o_{p}(x)|_{T(r-1)}=-o_{q+r-1}(x-1)+1

Note that q+r1p+rq+r-1\geq p+r. Therefore, by assumption of strict inequality in (31), Right obtains the desired 4-gap, which implies that the inequality (31) holds. Thus

Left could decrease, and let Right win by r1r-1, but this could only help him, and we could either repeat the argument, or go to Case 6. Suppose that Left can increase to win by bidding rr. But Right can ‘rr’-tie, and this is weakly better for him than the assumed ‘r1r-1’ tie.

Case C. Tie, 2pTB2p\geq\mathrm{TB} and q+TB/2q+\ell\geq\lceil\mathrm{TB}/2\rceil. We have

(33) op(x2)=oq+(x3)+1,\displaystyle o_{p}(x-2)=-o_{q+\ell}(x-3)+1,

We need to prove that

(34) op(x)op(x2).\displaystyle o_{p}(x)\geq o_{p}(x-2).

We use induction to assume that

(35) oq+(x1)oq+(x3)\displaystyle-o_{q+\ell}(x-1)\leq-o_{q+\ell}(x-3)

If the inequality is strict, then Left must change her bid. If she decreases so Right wins by bidding \ell, then

(36) op(x)|R()=op+(x1)1.\displaystyle o_{p}(x)|_{R(\ell)}=o_{p+\ell}(x-1)-1.

By the assumption of strict inequality, together with \threflem:markdom, we get 2oq+(x1)op+(x1)2\leq o_{q+\ell}(x-1)\leq o_{p+\ell}(x-1), which gives the desired 4-gap. Then, if Right decreases, \threflem:tie gives that Left cannot be worse of by tie ‘1\ell-1’. The argument can be repeated, or we are in Case 1.

If the inequality (35) is not strict, then Right must change his bid to contradict (34). Induction shows that he cannot benefit by increasing his bid. Suppose he decreases so Left wins by bidding \ell. By \threflem:markdom, op(x1)oq+(x1)o_{p-\ell}(x-1)\geq-o_{q+\ell}(x-1), which does not worsen Left’s result. ∎

We are now ready to prove the second main theorem of the paper. In Theorem LABEL:thm:const, we show that the game value of a given budget partition is constant for large heapsizes. The proof of the Theorem follows from Lemma LABEL:lem:bound and Lemma LABEL:lem:rowsplitmon.

Theorem 24 (Eventual period 2 of equilibrium outcomes).
\thlabel

thm:const The game value of a given budget partition is constant, for all sufficiently large heaps of the same parity.

Proof.

Fix any parity for the heap sizes. By \threflem:rowsplitmon, the game values are column-wise non-decreasing weakly to the left of TB/2\mathrm{TB}/2, and non-increasing to the right of TB/2\mathrm{TB}/2. Therefore, since, for each column, by \threflem:bound their absolute values are bounded, they converge to a finite constant. ∎

3.4. A Bidding Automaton and a Quadratic Bound

In this subsection, we analyze the convergence of the game value for unitary games. We will determine a quadratic bound in the total budget TB\mathrm{TB} for the demonstrated game value ‘convergence’. We will do this via explicit bounds of the outcome vector.

We make use of functions defined on the even and odd integers, that later will represent the possible budget partitions for even and odd total budgets, respectively. See \threflem:bidaut below. Define nearest integer functions αeven\alpha_{\mathrm{even}} and αodd\alpha_{\mathrm{odd}} (the indexes will correspond to the parities of the heap sizes) on the even integers (for TB\mathrm{TB} even inputs will correspond to pq=2pTBp-q=2p-\mathrm{TB}), by

αeven(δ)={δ+12,if δ0(mod4);δ+12,otherwise.\alpha_{\mathrm{even}}(\delta)=\begin{cases}\left\lfloor\frac{\delta+1}{2}\right\rfloor,&\text{if }\delta\equiv 0\pmod{4};\\ \\ \left\lceil\frac{\delta+1}{2}\right\rceil,&\text{otherwise}.\end{cases}
αodd(δ)={δ+12,if δ0(mod4);δ+12,otherwise.\alpha_{\mathrm{odd}}(\delta)=\begin{cases}\left\lceil\frac{\delta+1}{2}\right\rceil,&\text{if }\delta\equiv 0\pmod{4};\\ \\ \left\lfloor\frac{\delta+1}{2}\right\rfloor,&\text{otherwise}.\end{cases}

Let ι:{0,1}\iota:\mathbb{Z}\rightarrow\{0,1\} be the function ι(x)=1\iota(x)=1 if and only if x>0x>0. Define a nearest integer function, β\beta on the odd integers (for TB\mathrm{TB} odd inputs will correspond to pq=2pTBp-q=2p-\mathrm{TB}), by

β(δ)={δ2+ι(δ),if δ1(mod4).δ2+ι(δ),otherwise.\beta(\delta)=\begin{cases}\left\lfloor\frac{\delta}{2}\right\rfloor+\iota(\delta),&\text{if }\delta\equiv 1\pmod{4}.\\ \\ \left\lceil\frac{\delta}{2}\right\rceil+\iota(\delta),&\text{otherwise}.\end{cases}

For any fixed (budget) TB0\mathrm{TB}\in\mathbb{N}_{0}, we define an automaton 𝒜\mathcal{A} with TB+1\mathrm{TB}+1 nodes and 2 states per node, such that, for all states j{even,odd}j\in\{\mathrm{even},\mathrm{odd}\}, for all nodes p{0,,TB}p\in\{0,\ldots,\mathrm{TB}\}

(j,p)𝒜(jc,q),(j,p)\overset{\mathcal{A}}{\longrightarrow}(j^{c},q),

with updates, for all j,pj,p,

𝒜(j,p)=1𝒜(jc,q),\mathcal{A}(j,p)=1-\mathcal{A}(j^{c},q),

where jcj^{c} is the complement of jj, and where initial values are assigned to say all even states. Note that, by definition, independently of initial values, the even states are reflexive, and so are the odd ones. In Lemma LABEL:lem:bidaut, we show that the α\alpha and β\beta functions are automaton 𝒜\mathcal{A} duals in the following sense.

Lemma 25.
\thlabel

lem:bidaut For all pp, let 𝒜(even,p)=αeven(2pTB)\mathcal{A}(\mathrm{even},p)=\alpha_{\mathrm{even}}(2p-\mathrm{TB}). Then, for all pp, 𝒜(odd,p)=αodd(2pTB)\mathcal{A}(\mathrm{odd},p)=\alpha_{\mathrm{odd}}(2p-\mathrm{TB}). For all pp, let 𝒜(even,p)=β(2pTB)\mathcal{A}(\mathrm{even},p)=\beta(2p-\mathrm{TB}). Then, for all pp, 𝒜(odd,p)=β(2pTB)\mathcal{A}(\mathrm{odd},p)=\beta(2p-\mathrm{TB}).

Proof.

In case of even TB\mathrm{TB}, we want to justify that αeven(2pTB)=1αodd(TB2p)\alpha_{\mathrm{even}}(2p-\mathrm{TB})=1-\alpha_{\mathrm{odd}}(\mathrm{TB}-2p), which holds since, for all δ\delta, δ+12+δ+12=1\lceil\frac{\delta+1}{2}\rceil+\lfloor\frac{-\delta+1}{2}\rfloor=1. Namely, if δ\delta is odd, then we cancel the nearest integer functions and the equality holds; if δ=2m\delta=2m is even, then we get m+1/2m+1/2=1m+\lceil 1/2\rceil-m+\lfloor 1/2\rfloor=1.

In case of odd TB\mathrm{TB}, we want to justify that β(2pTB)=1β(TB2p)\beta(2p-\mathrm{TB})=1-\beta(\mathrm{TB}-2p). Suppose first that p>TB/2p>\mathrm{TB}/2. Then β(2pTB)+β(TB2p)=δ/2+1+δ/2\beta(2p-\mathrm{TB})+\beta(\mathrm{TB}-2p)=\lfloor\delta/2\rfloor+1+\lceil-\delta/2\rceil or δ/2+1+δ/2\lceil\delta/2\rceil+1+\lfloor-\delta/2\rfloor; in either case these expressions equal one. Because TB\mathrm{TB} is odd, the other case is i<TB/2i<\mathrm{TB}/2, and so the argument is analogous. ∎

A (generic) TB\mathrm{TB}-bidding automaton is a finite state machine with ‘TB+1\mathrm{TB}+1’ states, directed edges between the states, and an update rule for each directed edge. Each state has an outgoing edge corresponding to a feasible winning bid from one of the players. Clearly, for any TB\mathrm{TB}𝒜\mathcal{A} is a bidding automaton where the winning bid is 0, i.e. Left wins a 0-tie. We improve on \threfthm:const, by describing an explicit bound, and begin with a lemma.

Lemma 26.
\thlabel

lem:boundaut For any given total budget TB\mathrm{TB}, and any Left budget p^\widehat{p}, the entries of automaton 𝒜\mathcal{A} bound the outcome op()o_{p}(\cdot). If pTB/2p\geq\mathrm{TB}/2, then op(x)𝒜(j,p)o_{p}(x)\leq\mathcal{A}(j,p), where the parity of xx is jj, and otherwise op(x)𝒜(j,p)o_{p}(x)\geq\mathcal{A}(j,p).

Proof.

We prove, by induction that the values as prescribed by automaton 𝒜\mathcal{A} cannot be exceeded.

Consider first even TB\mathrm{TB}. In this case we are concerned with the α\alpha functions. For odd heap sizes xx, and 2pTB2p\geq TB, we show that

  1. (1)

    op+r(x)1αeven(2pTB)o_{p+r}(x)-1\geq\alpha_{\mathrm{even}}(2p-\mathrm{TB}), r>0r>0,

  2. (2)

    op(x)+1αeven(2pTB)o_{p-\ell}(x)+1\leq\alpha_{\mathrm{even}}(2p-\mathrm{TB}), >0\ell>0, and

  3. (3)

    oq+(x)+1αeven(2pTB)-o_{q+\ell}(x)+1\leq\alpha_{\mathrm{even}}(2p-\mathrm{TB}), 0\ell\geq 0.

That is, by induction, we show,

  1. (1)

    αodd(2(p+r)TB)1αeven(2pTB)\alpha_{\mathrm{odd}}(2(p+r)-\mathrm{TB})-1\geq\alpha_{\mathrm{even}}(2p-\mathrm{TB}), r>0r>0,

  2. (2)

    αodd(2(p)TB)+1αeven(2pTB)\alpha_{\mathrm{odd}}(2(p-\ell)-\mathrm{TB})+1\leq\alpha_{\mathrm{even}}(2p-\mathrm{TB}), >0\ell>0, and

  3. (3)

    αodd(2(q+))TB)+1αeven(2pTB)-\alpha_{\mathrm{odd}}(2(q+\ell))-\mathrm{TB})+1\leq\alpha_{\mathrm{even}}(2p-\mathrm{TB}).

Note that Case 3 has already been justified for =0\ell=0 in \threflem:bidaut, and when >0\ell>0 obviously the inequality still holds.

For Case 2, the tightest situation is when =1\ell=1 i.e. when 2(p1)TB2(mod4)2(p-1)-\mathrm{TB}\equiv 2\pmod{4}, i.e. if the outcome equals 2(p1)TB+12+1\left\lfloor\frac{2(p-1)-\mathrm{TB}+1}{2}\right\rfloor+1 and we see that it equals 2pTB+12\left\lfloor\frac{2p-\mathrm{TB}+1}{2}\right\rfloor, i.e., this is the case when 2pTB0(mod4)2p-\mathrm{TB}\equiv 0\pmod{4} and 2(p1)TB2(mod4)2(p-1)-\mathrm{TB}\equiv 2\pmod{4}. If =2\ell=2 then the outcome equals 2(p2)TB+12+1=2pTB+1212pTB+12\left\lceil\frac{2(p-2)-\mathrm{TB}+1}{2}\right\rceil+1=\left\lceil\frac{2p-\mathrm{TB}+1}{2}\right\rceil-1\leq\left\lfloor\frac{2p-\mathrm{TB}+1}{2}\right\rfloor. If >2\ell>2 the inequality is immediate.

The remaining case for even TB\mathrm{TB} and the cases for odd TB\mathrm{TB} are justified analogously. ∎

We say that a game converges at heap size xx if, for all pp, then op(x)=op(x+2)o_{p}(x)=o_{p}(x+2), but there is a pp such that op(x)op(x2)o_{p}(x)\neq o_{p}(x-2). Observe that convergence at xx implies that op(x+1)=op(x+3)o_{p}(x+1)=o_{p}(x+3), for all pp. We say that a game converges if it converges at some x<x<\infty.

Theorem 27 (Convergence Bound).
\thlabel

thm:convbou The upper bound for convergence is the sum of the entries in the 0-bidding automaton, and it is of order of magnitude O(TB2)O(\mathrm{TB}^{2}).

Proof.

We use the bounds of the outcomes for each budget partition as prescribed by the 0-bidding automaton 𝒜\mathcal{A}. By \threflem:boundaut if the game did not converge before the entries of 𝒜\mathcal{A} have been reached, it converges at the first occurrence of the 𝒜\mathcal{A} entries. We use \threfthm:budadv to see that (nearly) half the entires change by going from heap size 0 to heap size 2.

Thus, for an even total budget TB\mathrm{TB}, we bound the maximal number of rounds induced by the 0-bidding automaton 𝒜\mathcal{A}, as

1+δ{0,2,,TB}(δ+12+δ+12)TB=1+(TB/2+1)TB/2TB/2=O(TB2),1+\sum_{\delta\in\{0,2,\ldots,\mathrm{TB}\}}\left(\frac{\delta+1}{2}+\frac{\delta+1}{2}\right)-\mathrm{TB}=1+(\mathrm{TB}/2+1)\mathrm{TB}/2-\mathrm{TB}/2=O(\mathrm{TB}^{2}),

and a similar convergence bound holds for the odd sized total budgets,

1+TB/2+δ{1,3,,TB}(δ2+δ2)TB=1+TB/22TB/2=O(TB2),1+\mathrm{TB}/2+\sum_{\delta\in\{1,3,\ldots,\mathrm{TB}\}}\left(\frac{\delta}{2}+\frac{\delta}{2}\right)-\mathrm{TB}=1+\left\lceil\mathrm{TB}/2\right\rceil^{2}-\mathrm{TB}/2=O(\mathrm{TB}^{2}),

If the upper bounds are obtained, then this proves that the equilibrium bids are zero-ties. Namely, the edges of the automaton 𝒜\mathcal{A} correspond to 0-bids. (There may be other optimal bids as well, but we do not classify those here.) In the next section, we illustrate some feasible bids for automaton 𝒜\mathcal{A}.

Conjecture 28 (Automaton Game Correspondence).

Consider any unitary game. The entries of the corresponding automaton 𝒜\mathcal{A} are obtained as outcomes, for all heaps of size at least O(TB2)O(\mathrm{TB}^{2}).

3.5. The Number of Forced Wins

As an independent result, we count the number of forced wins a player with a larger budget can have.

Theorem 29 (Budget Advantage).
\thlabel

thm:budadv Consider any game (TB;x,p,m;c)(\mathrm{TB};x,p,m;c), with TB=p+q\mathrm{TB}=p+q. Suppose that Left has the marker. Then Left can force a win of the xx final moves if p(2x1)q+2x11p\geq(2^{x}-1)q+2^{x-1}-1. If Right has the marker, then Left can force a win of the xx final moves if p(2x1)(q+1)p\geq(2^{x}-1)(q+1).

Proof.

We take the case when we have the configuration of the game as game (TB;x,p,1;c)(\mathrm{TB};x,p,1;c), where p+q=TBp+q=\mathrm{TB} and pqp\geq q. To win the first move Left should at least bid qq dollars, i.e. pqp\geq q. So, Right has now at least 2q2q dollars and so Left to win the second round he must bid 2q+12q+1 dollars. He must have q+(2q+1)=3q+1q+(2q+1)=3q+1 dollars to win 22 consecutive moves. Right has now at least 4q+14q+1 dollars. Left must bid 4q+24q+2 dollars to win the third consecutive round and in total he must have at least q+(2q+1)+(4q+2)=7q+3q+(2q+1)+(4q+2)=7q+3 dollars. Similarly, he should bid 8q+48q+4 and 16q+816q+8 dollars to win the fourth and fifth consecutive moves. In total, he must have at least q+(2q+1)+(4q+2)+(8q+4)+(16q+8)=31q+15q+(2q+1)+(4q+2)+(8q+4)+(16q+8)=31q+15 dollars to win 5 consecutive moves.

We prove this by induction. We take the base case of x=1x=1 move and we get pqp\geq q which is true since if the budget is equal, Left wins by the marker. We assume it to be true for x=kx=k moves and prove it for x=k+1x=k+1 moves.

To win x=kx=k consecutive moves, Left’s budget should be at least (2k1)q+2k11(2^{k}-1)q+2^{k-1}-1. To win (k+1)st(k+1)^{\rm st} move, Left should bid at least ‘1’ more than that of Right budget after kk move which is (2k1)q+2k11+q(2^{k}-1)q+2^{k-1}-1+q, which is equal to 2kq+2k112^{k}q+2^{k-1}-1. Hence Left must bid 2kq+2k12^{k}q+2^{k-1}. Hence total budget which must be available with Left after k+1k+1 move is {(2k1)q+2k11}+{2kq+2k1}\{(2^{k}-1)q+2^{k-1}-1\}+\{2^{k}q+2^{k-1}\} = 22 = (2k+11)q+2k1(2^{k+1}-1)q+2^{k}-1. Hence induction holds.

When we have the configuration of the game (TB;x,p,0;c)(\mathrm{TB};x,p,0;c), Left should at least bid q+1q+1 dollars. Right now has 2q+12q+1 dollars. So, Left must have at least (q+1)+(2q+2)=3q+3(q+1)+(2q+2)=3q+3 dollars to win 22 consecutive moves. Right now has 4q+34q+3 dollars for the third round and so Left must have at least (q+1)+(2q+2)+(4q+4)=7q+7(q+1)+(2q+2)+(4q+4)=7q+7 dollars to win 33 consecutive moves.

We prove this by induction. We take the base case of x=1x=1 move and we get pq+1p\geq q+1 which is true since if budgets are equal, Left wins by bidding ‘1’ more than that of the bid of Right, which can be at most qq. We assume it to be true for x=kx=k moves and prove it for x=k+1x=k+1 moves.

To win x=kx=k consecutive moves, Left’s budget should be at least (2k1)(q+1)(2^{k}-1)(q+1). To win (k+1)st(k+1)^{st} move, Left should bid at least ‘1’ more than that of Right budget after kk move which is (2k1)(q+1)+q(2^{k}-1)(q+1)+q, which is equal to 2k(q+1)12^{k}(q+1)-1. Hence Left must bid 2k(q+1)2^{k}(q+1). Hence total budget which must be available with Left after k+1k+1 move is {2k(q+1)}+{(2k1)(q+1)}\{2^{k}(q+1)\}+\{(2^{k}-1)(q+1)\} = (2k+11)(q+1)(2^{k+1}-1)(q+1). Hence induction holds. ∎

4. Discussion

Note that one can deduce neat formulas for the upper bounds for the convergence of the outcomes, by using the explicit bounds of outcome values as prescribed by the α\alpha and β\beta functions. The remaining question is if this worst possible convergence bound is tight. We believe so, because of the elegance of the 0-bidding automaton 𝒜\mathcal{A}. But currently, we do not have an argument to show that earlier convergence could not happen, for some large total budget. There are other open questions: 1) Prove or disprove periodicity of outcomes for any TB\mathrm{TB}, but with an arbitrary finite subtraction set. Our methods indicate that if we restrict the allowed bids of the two players, then we still have convergence, but how do restricted bidding sets affect the strategies? In particular, what happens if 0-bids are not allowed? Classify asymmetric (partizan) bidding sets according to (asymptotic) player strength.

On another note, an interesting paper on ‘general sum’ Richman games has recently appeared (Meir et al., 2018). They show budget monotonicity for games on binary trees, but find a counterexample if a node may have three children (a threat provokes a situation where a player prefers a smaller budget). Our setting readily generalizes to general sum, or more specifically to so-called self interest games, by instead of the zero-sum definition, letting both players maximize their individual final scores. The discrete Richman bidding scheme would stay the same, and one would need to expand on various questions of monotonicity, and for example existence of a unique PSPE and Pareto efficiency.

Acknowledgements.
This work began with informal discussions at the (Chocolate Café, where you can order hot chocolate with your chocolate bar and not coffee, nearby the) Combinatorial Game Theory Colloquium III, Lisbon, 22-24 January, 2019, organized and hosted by Carlos P. dos Santos, Lisbon, Portugal. The main part of this work was done while Ravi Kant Rai was visiting National University of Singapore in Summer 2019 and while Urban Larsson was visiting Indian Institute of Technology Bombay, Fall 2019. We are grateful to our hosts Dr. Yair Zick and Prof. K.S. Mallikarjuna Rao for many valuable suggestions and discussions.

5. Illustration of feasible, dominated and relevant bids

Consider (symmetric) BCS. We illustrate the feasible bids for total budget 5, including a short discussion of dominated bids. Such bids are also feasible arrows for automaton 𝒜\mathcal{A}. In Figure 3, we show the possible tie bids (and here domination is never an issue). In Figure 4, we show the possible Left winning bids, and in Figure 5, we show the same bids, but without dominated bids. Note that whenever property 𝒰\mathcal{U} holds, then Left winning bids may be ignored, in optimal play. In Figure 6, we illustrate the corresponding situations for Right winning bids, and here the pictures (where dominated bids have been erased) are relevant. By relevant, we mean that, in general, one cannot exclude the possibility that a bid may be a unique equilibrium (in some specific setting).

0123450T0T0T
0123451T1T
0123452T2T
Figure 3. The pictures represent Left’s wins via tie bids, for TB=5\mathrm{TB}=5; Left has the marker together with the indicated number of dollars. Note that only in the case of both players bidding “0” all nodes have outgoing edges. In either case, they are all negative, indicated with the color blue in the picture. For example, if Left has $4, and wins by bidding 0, then the next state is that Right gets the marker and $1. Of course none of the bids are dominated in the case of a win by using the marker. Dominated bids only appear because the other player does not have enough budget to motivate such a bid. Compare this situation with Figures 4 and 5
0123451W1W1W1W1W
0123452W2W2W2W
0123453W3W3W
0123454W4W
0123455W
Figure 4. Non-reduced bidding, for TB=5\mathrm{TB}=5, where a right (left) pointing edge indicates that Left (Right) wins the bidding. As before the default is that Left has the marker together with the indicated number of dollars. Note the increasing number of nodes without outgoing edges.
0123451W1W1W1W1W
0123452W2W2W
0123453W
Figure 5. The remaining Left winning bids from Figure 4 when dominated bids have been erased, for TB=5\mathrm{TB}=5. These bids are not relevant whenever property 𝒰\mathcal{U} is satisfied.
0123451W1W1W1W1W
0123452W2W2W
0123453W
Figure 6. The remaining Right winning bids from Figure 4 when dominated bids have been erased, for TB=5\mathrm{TB}=5. These bids are relevant, even when property 𝒰\mathcal{U} holds.

References

  • (1)
  • Berlekamp et al. (2004) Elwyn R Berlekamp, John H Conway, and Richard K Guy. 2004. Winning Ways for Your Mathematical Plays, Volume 1. AK Peters/CRC Press.
  • Cohensius et al. (2019) Gal Cohensius, Urban Larsson, Reshef Meir, and David Wahlstedt. 2019. Cumulative subtraction games. The Electronic Journal of Combinatorics 26, P4.52 (2019).
  • Conway (1976) John H Conway. 1976. On Numbers And Games, Acad. Press, London, New York, San Francisco (1976).
  • Develin and Payne (2010) Mike Develin and Sam Payne. 2010. Discrete bidding games. The Electronic Journal of Combinatorics 17, 1 (2010), 85.
  • Ettinger (1996) J Mark Ettinger. 1996. Topics in Combinatorial Games. Ph.D. Dissertation. University of Wisconsin–Madison.
  • Hanner et al. (1959) Olof Hanner et al. 1959. Mean play of sums of positional games. Pacific J. Math. 9, 1 (1959), 81–99.
  • Johnson (2014) Will Johnson. 2014. The combinatorial game theory of well-tempered scoring games. International Journal of Game Theory 43, 2 (2014), 415–438.
  • Larsson et al. (2016b) Urban Larsson, Richard J Nowakowski, João P Neto, and Carlos P Santos. 2016b. Guaranteed scoring games. The Electronic Journal of Combinatorics 23, 3 (2016), P3–27.
  • Larsson et al. (2016a) Urban Larsson, Richard J Nowakowski, and Carlos P Santos. 2016a. Absolute combinatorial game theory. arXiv preprint arXiv:1606.01975 (2016).
  • Larsson et al. (2018a) Urban Larsson, Richard J Nowakowski, and Carlos P Santos. 2018a. Game comparison through play. Theoretical Computer Science 725 (2018), 52–63.
  • Larsson et al. (2018b) Urban Larsson, Richard J Nowakowski, and Carlos P Santos. 2018b. Games with guaranteed scores and waiting moves. International Journal of Game Theory 47, 2 (2018), 653–671.
  • Larsson and Wästlund (2019) Urban Larsson and Johan Wästlund. 2019. Endgames in bidding chess. Games of No Chance 5 70 (2019), 421.
  • Lazarus et al. (1999) Andrew J Lazarus, Daniel E Loeb, James G Propp, Walter R Stromquist, and Daniel H Ullman. 1999. Combinatorial games under auction play. Games and Economic Behavior 27, 2 (1999), 229–264.
  • Lazarus et al. (1996) Andrew J Lazarus, Daniel E Loeb, James G Propp, and Daniel Ullman. 1996. Richman games. Games of no chance 29 (1996), 439–449.
  • Meir et al. (2018) Reshef Meir, Gil Kalai, and Moshe Tennenholtz. 2018. Bidding games and efficient allocations. Games and Economic Behavior 112 (2018), 166–193.
  • Milnor (1953) John Milnor. 1953. Sums of positional games. Contributions to the Theory of Games II 28 (1953), 291–301.
  • Siegel (2013) Aaron N Siegel. 2013. Combinatorial Game Theory. Vol. 146. American Mathematical Soc.