Optimizing stakes in simultaneous bets

Robbert Fokkink, Ludolf Meester, and Christos Pelekis

Abstract.

We want to find the convex combination $S$ of iid Bernoulli random variables that maximizes $\mathbf{P}(S\geq t)$ for a given threshold $t$ . Endre Csóka conjectured that such an $S$ is an average if $t\geq p$ , where $p$ is the success probability of the Bernoulli random variables. We prove this conjecture for a range of $p$ and $t$ .

Key words and phrases:

bold play, intersecting family, stochastic inequality, tail probability.

2010 Mathematics Subject Classification:

60G50, 05D05

We study tail probabilities of convex combinations of iid Bernoulli random variables. More specifically, let $\beta_{1},\beta_{2},\ldots$ be an infinite sequence of independent Bernoulli random variables with success probability $p$ , and let $t\geq p$ be a real number. We consider the problem of maximizing $\mathbf{P}(\sum c_{i}\beta_{i}\geq t)$ over all sequences $c_{1},c_{2},\ldots$ of non-negative reals such that $\sum c_{i}=1$ . By the weak law of large numbers, the supremum of $\mathbf{P}(\sum c_{i}\beta_{i}\geq t)$ is equal to $1$ if $t<p$ . That is why we restrict our attention to $t\geq p$ .

As a motivating example, consider a venture capitalist who has a certain fortune $f$ to invest in any number of startup companies. Each startup has an (independent) probability $p$ of succeeding, in which case it yields a return $r$ on investment. If the capitalist divides his fortune into a (possibly infinite!) sequence $f_{i}$ of investments, then his total return is $\sum rf_{i}\beta_{i}$ . Suppose he wants to maximize the probability that the total return reaches a threshold $d$ . Then we get our problem with $t=\frac{d}{rf}$ .

The problem has a how-to-gamble-if-you-must flavor [6]: the capitalist places stakes $c_{i}$ on a sequence of simultaneous bets. There is no need to place stakes higher than $t$ . The way to go all out, i.e., bold play, is to stake $t$ on $\lfloor\frac{1}{t}\rfloor$ bets, but this is not a convex combination. That is why we say that staking $\frac{1}{k}$ on $k$ bets with $k={\lfloor\frac{1}{t}\rfloor}$ is bold play.

In a convex combination $\sum c_{i}\beta_{i}$ we order $c_{1}\geq c_{2}\geq c_{3}\geq\ldots$ . We denote the sequence $(c_{i})$ by $\gamma$ and write $S_{\gamma}=\sum c_{i}\beta_{i}$ . We study the function

(1)

\pi(p,t)=\sup\left\{\mathbf{P}\left(S_{\gamma}\geq t\right)\mid\gamma\right\}

for $0\leq p\leq t\leq 1$ . It is non-decreasing in $p$ and non-increasing in $t$ . The following has been conjectured by Csóka [5], who was inspired by some well known open problems in combinatorics:

Conjecture 1 (Csóka).

For every $p$ and $t$ there exists a $k\in\mathbb{N}$ such that $\pi(p,t)$ is realized by $c_{i}=\frac{1}{k}$ if $i\leq k$ and $c_{i}=0$ if $i>k$ for some $k\in\mathbb{N}$ . In other words, the maximal probability is realized by an average.

If the conjecture is true, then $\pi(p,t)$ is a binomial tail probability and we still need to determine the optimal $k$ . Numerical results of Csóka suggest that bold play is optimal for most parameter values.

We are able to settle the conjecture for certain parameter values, as illustrated in figure 1 below.

Refer to caption — Figure 1. The shaded region represents all $(p,t)$ for which we are able to settle the conjecture. In all these cases bold play is optimal. Our results can be divided into three parts: favorable odds $p>\frac{1}{2}$ , high threshold $t\geq\frac{1}{2}$ , and unfavorable odds $p<\frac{1}{2}$ .

It is natural to expect, though we are unable to prove this, that a gambler becomes bolder if the threshold goes up or if the odds go down. In particular, if $p^{\prime}\leq p$ and $t^{\prime}\geq t$ and if bold play is optimal for $(p,t)$ , then it is natural to expect that bold play is optimal for $(p^{\prime},t^{\prime})$ as well. This is clearly visible in the figure above, which is a union of rectangles with lower right vertices $(\frac{k}{k+1},\frac{k}{k+1})$ and $(\frac{1}{2k+1},\frac{1}{k+1})$ for $k\in\mathbb{N}$ .

Our paper is organized as follows. We first lay the groundwork by analyzing properties of $\pi(p,t)$ and prove that the supremum in equation 1 is a maximum. Then we cover the shaded region in figure 1 for the three separate parts of odds greater than one, threshold greater than half, and odds smaller than one. Finally, we recall an old result on binomial probabilities which would imply that (assuming Csóka’s conjecture holds and bold play is stable in the sense that we just explained) bold play is optimal if $p\leq\frac{1}{n}\leq t$ for all $n\in\mathbb{N}$ .

1. Related problems and results

According to Csóka’s conjecture, if the coin is fixed and the stakes vary, then the maximum tail probability is attained by a (scaled) binomial. If the stake is fixed and the coins vary, then Chebyshev already showed that the maximum probability is attained by a binomial:

Theorem 2 (Chebyshev, [21]).

For a given $s$ and $l$ , let $Z=\beta_{1}+\cdots+\beta_{l}$ be any sum of $l$ independent Bernoullis such that $\mathbf{E}[Z]=s$ . Then $\mathbf{P}(Z\geq t)$ is maximized by Bernoullis for which the success probabilities assume at most three different values, only one of which is distinct from $0$ and $1$ . In particular, the maximum $\mathbf{P}(Z\geq t)$ is a binomial tail probability.

Samuels considered a more general situation with fixed expectations and arbitrary random variables.

Conjecture 3 (Samuels, [20]).

Let $0\leq c_{1}\leq\cdots\leq c_{l}$ be such that $\sum_{i=1}^{l}c_{i}<1$ . Consider $\sup\mathbf{P}(X_{1}+\cdots+X_{l}\geq 1)$ over all collections of $l$ independent non-negative random variables such that $\mathbf{E}[X_{i}]=c_{i}$ . This supremum is a maximum which is attained by $X_{j}=c_{j}$ for $j\leq k$ and $X_{j}=(1-b)\beta_{j}$ for $j>k$ , where $k$ is an integer, the $\beta_{j}$ are Bernoulli random variables, and $b=\sum_{i=1}^{k}c_{i}$ . In other words, the gambler accumulates $b$ from small expectations before switching to bold play.

If all $c_{i}$ are equal, then the $\beta_{j}$ are identically distributed, and the conjecture predicts that the maximum probability is attained by a binomial.

If one assumes that the Samuels conjecture holds, then one still needs to determine the optimal $k$ . If $c_{1}=\ldots=c_{l}=\frac{1}{l+1}$ then the optimal $k$ is equal to zero [1]. This implies that another well-known conjecture is a consequence of Samuels’ conjecture, see also [18].

Conjecture 4 (Feige, [7]).

For all collections of $l$ independent non-negative random variables such that $\mathbf{E}[X_{i}]\leq 1$ it is true that

\mathbf{P}(X_{1}+\cdots+X_{l}<l+1)\geq\frac{1}{e}.

As a step towards solving this conjecture, Feige proved the remarkable theorem that there exists a $\delta>0$ such that $\mathbf{P}(X_{1}+\cdots+X_{l}<l+1)\geq\delta$ . His original value of $\delta=\frac{1}{13}$ has been gradually improved. The current best result is $0.1798$ by Guo et al [12].

2. Properties of $\pi(p,t)$

The function $\pi(p,t)$ is defined on a region bounded by a rectangular triangle. It is easy to compute its value on the legs of the triangle: $\pi(0,t)=0$ and $\pi(p,1)=p$ . It is much harder to compute the value on the hypothenuse.

Proposition 5.

$\frac{1}{2}<\pi(p,p)<1$ if $0<p<1$ .

Proof.

We follow the proof of [2, Lemma 1]. The following Paley-Zygmund type inequality for random variables of zero mean was proved in [13, lemma 2.2] and extended in [14]:

\mathbf{P}(X<0)\geq\left(2\sqrt{3}-3\right)\frac{\mathbf{E}[X^{2}]^{2}}{\mathbf{E}[X^{4}]}.

Applying this to $S_{\gamma}-p$ we have

\mathbf{P}(S_{\gamma}<p)\geq\left(2\sqrt{3}-3\right)\frac{\mathbf{E}[(S_{\gamma}-p)^{2}]}{\mathbf{E}[(S_{\gamma}-p)^{4}]}.

The second moment of $S_{\gamma}-p$ is equal to $p(1-p)\sum c_{i}^{2}$ and the fourth moment is equal to

\displaystyle 3p^{2}(1-p)^{2}\sum_{i\not=j}c_{i}^{2}c_{j}^{2}+(p(1-p)^{4}+p^{4}(1-p))\sum c_{i}^{4}

This can be bounded by

\displaystyle\max\left(3,\frac{1}{p(1-p)}-3\right)p^{2}(1-p)^{2}\left(\sum c_{i}^{2}\right)^{2}

The Paley-Zygmund type inequality produces a lower bound on $\mathbf{P}(S_{\gamma}<p)$ . Its complementary probability $\pi(p,p)$ is bounded by:

\pi(p,p)\leq 1-\frac{2\sqrt{3}-3}{\max\left(3,\frac{1}{p(1-p)}-3\right)}.

It is possible to improve on this bound for small $p$ by using Feige’s theorem. We write $S_{\gamma}=c_{1}\beta_{1}+S$ . Then $\mathbf{P}(S_{\gamma}<p)\geq\mathbf{P}(\beta_{1}=0)\mathbf{P}(S<p)=(1-p)\mathbf{P}(S<p)$ . Note that $\mathbf{E}[S]=p(1-c_{1})$ and we write $\mathbf{P}(S<p)=\mathbf{P}(S<\mathbf{E}[S]+pc_{1})$ . We can approximate this probability by a truncated sum $\mathbf{P}(S_{n}<\mathbf{E}[S]+pc_{1})$ , where $S_{n}=c_{2}\beta_{2}+\cdots+c_{n}\beta_{n}$ is a sum of independent random variables of expectation $\leq pc_{1}$ . By dividing by $pc_{1}$ and applying the bound $0.1798$ of [12] we find

\pi(p,p)\leq 0.8202+0.1798p.

We have two upper bounds. The first is more restrictive for large $p$ and the second is more restrictive for small $p$ . The lower bound follows from bold play. Let $k\in\mathbb{N}$ be such that $\frac{1}{k+1}<p\leq\frac{1}{k}$ . If $\bar{S}_{k}$ is the average of $k\geq 1$ Bernoullis, then $\mathbf{P}(\bar{S}_{k}\geq p)=\mathbf{P}(\bar{S}_{k}\geq\frac{1}{k})=1-(1-p)^{k}>1-(1-\frac{1}{k+1})^{k}$ . This is minimal and equal to $\frac{1}{2}$ if $k=1$ . ∎

We say that a sequence $\gamma$ is finite if $c_{i}=0$ for all but finitely many $i$ , and infinite otherwise.

Proposition 6.

$\pi(p,t)=\sup\left\{\mathbf{P}\left(S_{\gamma}\geq t\right)\mid\gamma\ \mathrm{is\ finite}\right\}$

Proof.

According to Jessen and Wintner’s law of pure type [3, Theorem 3.5], either $\mathbf{P}(S_{\gamma}=s)=0$ for each $s\in\mathbb{R}$ or there exists a countable set $\mathcal{C}$ such that $\mathbf{P}(S_{\gamma}\in\mathcal{C})=1$ . In other words, the random variable $S_{\gamma}$ is either non-atomic or discrete. If $X$ and $Y$ are independent, and if $X$ is non-atomic, then the convolution formula implies that $X+Y$ is non-atomic.

Suppose that $\gamma$ is infinite. We prove that $S_{\gamma}$ is non-atomic. Let $(c_{i_{j}})$ be a subsequence such that $c_{i_{j}}>2\sum_{k=j+1}^{\infty}c_{i_{k}}$ . Let $I$ be the set of all $i_{j}$ and let $J$ be its complement. Then both $S_{I}=\sum_{I}c_{i}\beta_{i}$ and $S_{J}=\sum_{J}c_{i}\beta_{i}$ are either discrete or non-atomic. By our choice of $c_{i_{j}}$ , $S_{I}$ has the property that its value determines the values of all $\beta_{i}$ for $i\in I$ . This implies that $S_{I}$ is non-atomic. Therefore, $S_{\gamma}=S_{I}+S_{J}$ is non-atomic. In particular $\mathbf{P}(S_{\gamma}\geq t)=\mathbf{P}(S_{\gamma}>t)$ .

Denote a truncated sum by $S_{\gamma,n}=\sum_{i\leq n}c_{i}\beta_{i}$ . By monotonic convergence $\mathbf{P}\left(S_{\gamma}>t\right)=\lim_{n\to\infty}\mathbf{P}(S_{\gamma,n}>t)$ . Therefore, for any infinite $\gamma$ , $\mathbf{P}(S_{\gamma}\geq t)$ can be approximated by tail probabilities of finite convex combinations. ∎

Csóka conjectures that the tail probability is maximized by an average. This would imply that the sup in proposition 6 is a max. We are unable to prove this, but we can prove that the sup in equation 1 is a max.

Theorem 7.

If $p<t$ then $\pi(p,t)=\mathbf{P}(S_{\gamma}\geq t)$ for some $\gamma$ . Furthermore, $\pi(p,t)$ is left-continuous in $t$ .

Proof.

We write $\pi(p,t^{-})=\lim_{s\uparrow t}\pi(p,s)$ . Since $\pi(p,t)$ is decreasing in $t$ , it suffices to show that there exists an $S_{\gamma}$ such that $\mathbf{P}(S_{\gamma}\geq t)\geq\pi(p,t^{-})$ . Let $\gamma_{n}=(c_{n,i})_{i}$ be such that $\mathbf{P}(S_{\gamma_{n}}\geq t_{n})$ converges to $\pi(p,t^{-})$ for an increasing sequence $t_{n}\uparrow t$ . By a standard diagonal argument we can assume that $(c_{n,i})_{n}$ is convergent for all $i$ . Let $c_{i}=\lim_{n\to\infty}c_{n,i}$ and let $\gamma=(c_{i})_{i}$ . Then $\gamma$ is a non-increasing sequence which adds up to $\sum c_{i}=1-c\leq 1$ . Observe that $\gamma$ cannot be the all zero sequence, since this would imply that $c_{n,1}\to 0$ and $\mathrm{Var}(S_{\gamma_{n}})=p(1-p)\sum c_{n,i}^{2}\leq p(1-p)c_{n,1}\to 0$ , so $S_{\gamma_{n}}$ converges to $p$ in distribution. Since we limit ourselves to $p<t$ , this means that $\mathbf{P}(S_{\gamma_{n}}\geq t)\to 0$ which is nonsense. Therefore, $1-c>0$ .

We first prove that $\pi(p,t^{-})\leq\mathbf{P}(S_{\gamma}\geq t-cp)$ . Fix an arbitrary $\epsilon>0$ . Let $i_{0}$ be such that $\sum_{j\geq i_{0}}c_{j}<\frac{\epsilon}{4}$ and $c_{i_{0}}<\epsilon^{4}$ . Let $n_{0}$ be such that $\sum_{j\leq i_{0}}|c_{n,j}-c_{j}|<\frac{\epsilon}{4}$ and $c_{n,i_{0}}<\epsilon^{4}$ for all $n\geq n_{0}$ . Now

\textstyle\{S_{\gamma_{n}}\geq t_{n}\}\subset\left\{\sum_{j\leq i_{0}}c_{n,j}\beta_{j}\geq t_{n}-cp-\epsilon\right\}\bigcup\left\{\sum_{j\geq i_{0}}c_{n,j}\beta_{j}\geq cp+\epsilon\right\}

so by our assumptions

\textstyle\begin{array}[]{rcl}\{S_{\gamma_{n}}\geq t_{n}\}&\subset&\left\{\sum_{j\leq i_{0}}c_{j}\beta_{j}\geq t_{n}-cp-2\epsilon\right\}\bigcup\left\{\sum_{j\geq i_{0}}c_{n,j}\beta_{j}\geq cp+\epsilon\right\}\\ \\ &\subset&\left\{S_{\gamma}\geq t_{n}-cp-2\epsilon\right\}\bigcup\left\{T_{n}\geq cp+\epsilon\right\}\end{array}

where we write $T_{n}=\sum_{j\geq i_{0}}c_{n,j}\beta_{j}$ . Observe that

\textstyle\mathbf{E}[T_{n}]=\mathbf{E}[S_{\gamma_{n}}]-p\sum_{j<i_{0}}c_{n,j}<p-p\left(\sum_{j<i_{0}}c_{j}-\frac{\epsilon}{4}\right)<p-p\left(1-c-\frac{\epsilon}{2}\right)<pc+\frac{\epsilon}{2}

and

\mathrm{Var}(T_{n})=p(1-p)\sum_{j\geq i_{0}}c_{n,j}^{2}\leq c_{n,i_{0}}\sum_{j\geq i_{0}}c_{n,j}<\epsilon^{4}.

By Chebyshev’s inequality, we conclude that $\mathbf{P}\left(T_{n}\geq cp+\epsilon\right)<\epsilon$ for sufficiently small $\epsilon$ . It follows that

\mathbf{P}(S_{\gamma_{n}}\geq t_{n})\leq\mathbf{P}\left(S_{\gamma}\geq t_{n}-cp-2\epsilon\right)+\epsilon.

By taking limits $n\to\infty$ and $\epsilon\to 0$ we conclude that

\pi(p,t^{-})\leq\mathbf{P}\left(S_{\gamma}\geq t-cp\right).

Let $\bar{\gamma}=\frac{1}{1-c}\gamma$ . Then $S_{\bar{\gamma}}=\frac{1}{1-c}S_{\gamma}$ is a convex combination such that

\mathbf{P}\left(S_{\bar{\gamma}}\geq t\right)=\mathbf{P}\left(S_{\gamma}\geq(1-c)t\right)\geq\mathbf{P}\left(S_{\gamma}\geq t-cp\right)\geq\pi(p,t^{-}).

Therefore, $\pi(p,t)=\mathbf{P}(S_{\bar{\gamma}}\geq t)$ and these inequalities are equalities. ∎

We now more or less repeat this proof to show that $\pi(p,t)$ is continuous in $p$ . Since we need to vary $p$ , we write $\beta^{p}$ for a Bernoulli with succes probability $p$ and $S_{\gamma}^{p}=\sum c_{i}\beta_{i}^{p}$ .

Theorem 8.

$\pi(p,t)$ is continuous in $p$ .

Proof.

For any $\epsilon>0$ choose a finite $\gamma$ such that $\mathbf{P}(S_{\gamma}^{p}\geq t)\geq\pi(p,t)-\epsilon$ . If $p_{n}$ converges to $p$ then $\beta^{p_{n}}$ converges to $\beta^{p}$ in probability. Since $\gamma$ is finite

\limsup_{n\to\infty}\pi(p_{n},t)\geq\lim_{n\to\infty}\mathbf{P}(S_{\gamma}^{p_{n}}\geq t)=\mathbf{P}(S_{\gamma}^{p}\geq t)\geq\pi(p,t)-\epsilon.

It follows that $\limsup_{n\to\infty}\pi(p_{n},t)\geq\pi(p,t)$ for any sequence $p_{n}\to p$ . Since $\pi(p,t)$ is increasing in $p$ , it follows that $\pi(p,t)$ is left-continuous in $p$ .

We need to prove right continuity, i.e., $\pi(p^{+},t)=\pi(p,t)$ . This is trivially true on the hypothenuse, because this is the right-hand boundary of the domain. Consider $p<t$ . Let $p_{n}\downarrow p$ and $\gamma_{n}$ be such that $\lim_{n\to\infty}\mathbf{P}(S_{\gamma_{n}}^{p_{n}}\geq t)=\pi(p^{+},t)$ . By a standard diagonal argument we can assume that $\gamma_{n}$ converges coordinatewise to some $\gamma$ , which may not sum up to one. It cannot be the all zero sequence, i.e., the sum is not zero, by the same argument as in the proof of theorem 7. The sequence $\gamma$ therefore sums up to $1-c$ for some $0\leq c<1$ . Again, we split $S^{p}_{\gamma}=H+T$ where $H=\sum_{j\leq i_{0}}c_{j}\beta_{j}^{p}$ and $T=\sum_{j>i_{0}}c_{j}\beta_{j}^{p}$ . We choose $i_{0}$ such that $\mathbf{E}[T]<\frac{\epsilon}{4}$ and $c_{i_{0}}<\epsilon^{4}$ . Similarly, $S_{\gamma_{n}}=H_{n}+T_{n}$ where $H_{n}$ converges to $H$ in probability, $\mathbf{E}[T_{n}]<pc+\frac{\epsilon}{2}$ and $\mathrm{Var}(T_{n})<\epsilon^{4}$ for sufficiently large $n$ . As in the previous proof, Chebyshev’s inequality and convergence in probability imply

\mathbf{P}(S_{\gamma_{n}}^{p_{n}}\geq t)\leq\mathbf{P}(H_{n}\geq t-cp-\epsilon)+\epsilon\leq\mathbf{P}(H\geq t-cp-2\epsilon)+\epsilon.

for sufficiently large $n$ . By taking limits $n\to\infty$ and $\epsilon\to 0$ it follows that $\pi(p^{+},t)\leq\mathbf{P}(S_{\gamma}^{p}\geq t-cp)$ . If we standardize $\gamma$ to a sequence $\bar{\gamma}$ so that we get a convex combination, we again find that $\pi(p^{+},t)\leq\mathbf{P}(S_{\bar{\gamma}}^{p}\geq t)$ . ∎

3. Favorable odds

We consider $\frac{1}{2}\leq p<t$ . In this case, bold play comes down to a single stake $c_{1}=1$ . We say that $I\subset\mathbb{Z}/n\mathbb{Z}$ is an interval of length $a<n$ if $I=[b,b+a)=\{b,b+1,\ldots,b+a-1\}$ for some $b$ , which we call the initial element. We say that two intervals $I$ and $J$ are separate if $I\cup J$ is not an interval. If $\mathcal{F}$ is a family of sets, then we write $\bigcup\mathcal{F}$ for the union of all these sets.

Lemma 9.

Let $\mathcal{F}$ be a family of $k$ intervals of length $a$ in $\mathbb{Z}/n\mathbb{Z}$ such that $\bigcup\mathcal{F}$ is a proper subset. Then $|\bigcup{\mathcal{F}}|\geq k+a-1$ .

Proof.

$\bigcup\mathcal{F}$ is a union of say $c\geq 1$ separate intervals, all of lengths $\geq a$ . Any interval of length $b\geq a$ contains $b-(a-1)$ intervals of length $a$ . Therefore, $\bigcup\mathcal{F}$ contains $|\bigcup\mathcal{F}|-c(a-1)$ intervals of length $a$ . It follows that $|\bigcup\mathcal{F}|-c(a-1)\geq k$ . ∎

Two families $\mathcal{F}$ and $\mathcal{G}$ are cross-intersecting if $I\cap J\not=\emptyset$ for all $I\in\mathcal{F}$ and $J\in\mathcal{G}$ .

Lemma 10.

Let $\mathcal{F}$ be a family of $k$ intervals of length $k$ in $\mathbb{Z}/n\mathbb{Z}$ . Let $\mathcal{G}$ be a family of intervals of length $a\leq n-k$ such $\mathcal{F}$ and $\mathcal{G}$ are cross-intersecting. Then $|\mathcal{G}|\leq a$ .

Proof.

Let $I=[b,b+k)$ be any element in $\mathcal{F}$ . An interval $[c,c+a)$ intersects $I$ if and only if $c\in[b-a+1,b+k)$ , which is an interval of length $k+a-1$ . Therefore, the set $\mathcal{I}$ of initial elements $c$ of intervals in $\mathcal{G}$ is contained in an intersection of $k$ intervals of length $k+a-1$ . The complement of $\mathcal{I}$ thus contains a union of $k$ intervals of length $n-k-a+1$ . By the previous lemma, this union has cardinality $\geq n-a$ . Therefore, $\mathcal{I}$ contains at most $a$ elements. ∎

Lemma 11.

Let $(V,\mu)$ be a finite measure space such that $\mu(V)=b$ and let $V_{i}\subset V$ for $i=1,\ldots,k$ be such that $\mu(V_{i})\geq t$ . Then $\mu\left(\bigcap V_{i}\right)\geq kt-(k-1)b$ .

Proof.

\mu\left(\bigcap V_{i}\right)=b-\mu\left(\bigcup V_{i}^{c}\right)\geq b-\sum(b-\mu(V_{i}))\geq kt-(k-1)b.

∎

Theorem 12.

If $\frac{k}{k+1}<p\leq\frac{k+1}{k+2}<t$ for some positive integer $k$ , then bold play is optimal.

Proof.

Bold play gives a probability $p$ of reaching $t$ . We need to prove that $\mathbf{P}(S_{\gamma}\geq t)\leq p$ for arbitrary $\gamma$ . By proposition 6 we may assume that $\gamma$ is finite. It suffices to prove that $\mathbf{P}(S_{\gamma}\geq t)\leq p$ for rational $p$ , since $\pi(p,t)$ is monotonic in $p$ .

Let $n$ be the number of non-zero $c_{i}$ in $\gamma$ and let $p=\frac{a}{b}$ . Let $X_{i}$ be a sequence of $n$ independent discrete uniform $U\{0,b-1\}$ random variables, i.e, $X_{i}=c$ for $c\in\{0,\ldots,b-1\}$ with probability $\frac{1}{b}$ . Let $B_{i}^{0}=1_{[0,a)}(X_{i})$ for $1\leq i\leq n$ . Then $S_{\gamma}$ and $Y^{0}=\sum c_{i}B_{i}^{0}$ are identically distributed. Think of $c_{i}B_{i}^{0}$ as an assignment of weight $c_{i}$ to a random element in $\{0,\ldots,b-1\}=\mathbb{Z}/b\mathbb{Z}$ . Let $\ell(j)$ be the sum of the coefficients – the load – that is assigned to $j\in\mathbb{Z}/b\mathbb{Z}$ . Then $Y^{0}=\ell(0)+\cdots+\ell(a-1)$ , i.e., $Y^{0}$ is the load of $[0,a)$ . Instead of $[0,a)$ we might as well select any interval $[j,j+a)\subset\mathbb{Z}/b\mathbb{Z}$ . If $Y^{j}$ is the load of $[j,j+a)$ , then $S_{\gamma}\sim Y^{j}$ , and $\mathbf{P}(S_{\gamma}\geq t)=\frac{1}{b}\sum\mathbf{P}(Y^{j}\geq t)$ . We need to prove that $\sum\mathbf{P}(Y^{j}\geq t)\leq a$ .

Let $\Omega$ be the sample space of the $X_{i}$ . For $\omega\in\Omega$ , let $J(\omega)$ be the cardinality of $\mathcal{J}(\omega)=\{j\colon Y^{j}(\omega)\geq t\}\subset\mathbb{Z}/b\mathbb{Z}$ . In particular, $\mathbf{P}(S_{\gamma}\geq t)=\frac{1}{b}\mathbf{E}[J]$ . It suffices to prove that $J\leq a$ . Assume that $J(\omega)\geq a$ for some $\omega\in\Omega$ . Apply lemma 11 to the counting measure to find

\left|\bigcap_{l=0}^{k}(\mathcal{J}(\omega)-la)\right|\geq(k+1)a-kb.

Note that $i\in\mathcal{J}(\omega)-j$ if and only if $[i+j,i+j+a)$ has load $\geq t$ . Therefore, there are at least $(k+1)a-kb$ elements $i$ such that the intervals $[i,i+a),[i+a,i+2a),\ldots,[i+ka,i+(k+1)a)$ all have load $\geq t$ . The intersection of these $k+1$ intervals is equal to

I_{i}=[i,i+(k+1)a-kb)

It has load $\geq(k+1)t-k$ by lemma 11. Its complement $I_{i}^{c}$ has load $\leq k+1-(k+1)t<t$ . If $j\in\mathcal{J}(\omega)$ then $[j,j+a)$ has load $\geq t$ and therefore it intersects $I_{i}$ for all $i\in\bigcap_{l=0}^{k}(J(\omega)-la)$ . There are $\geq(k+1)a-kb$ such intervals $I_{i}$ , and we denote this family by $\mathcal{F}$ . Let $\mathcal{G}$ be the family of $[j,j+a)$ with $j\in J(\omega)$ . Lemma 10 applies since the length of $I_{i}$ is $(k+1)a-kb$ and since $a\leq b-\left((k+1)a-kb\right)$ . We conclude that $J(\omega)\leq a$ . ∎

With some additional effort, we can push this result to the hypothenuse.

Proposition 13.

If $p=t=\frac{k+1}{k+2}$ for some positive integer $k$ , then bold play is optimal if $k>1$ , and $c_{1}=c_{2}=c_{3}=\frac{1}{3}$ is optimal if $k=1$ .

Proof.

By proposition 6 it suffices to prove that $\mathbf{P}(S_{\gamma}\geq t)\leq p$ for finite $\gamma$ . We adopt the notation of the previous theorem. Let $n$ be the number of non-zero coefficients in $\gamma$ , and let $X_{i}$ be $n$ random selections of $\{0,\ldots,k+1\}$ . We assign the coefficients according to these selections and let $Y^{j}=1-\ell(j)$ be the load of the set $\{0,\ldots,k+1\}\setminus\{j\}$ . Each $Y^{j}$ is identically distributed to $S_{\gamma}$ . For $\omega\in\Omega$ let $J(\omega)$ be the number of $Y^{j}(\omega)$ that reach the threshold, or equivalently, the number of loads $\ell(j)\leq\frac{1}{k+2}$ . We have $\frac{1}{k+2}\mathbf{E}[J]=\mathbf{P}(S_{\gamma}\geq t)$ . In the proof above, we showed that $J\leq k+1$ if $t>p=\frac{k+1}{k+2}$ . This is no longer true now that we have $t=p$ . It may happen that $J(\omega)=k+2$ in which case all $Y^{j}(\omega)$ are equal to $\frac{k+1}{k+2}$ and all loads $\ell(j)$ are equal to $\frac{1}{k+2}$ . Note that this can only happen if all $c_{i}$ are bounded by $\frac{1}{k+2}$ , so $n\geq k+2$ .

We think of the coefficients as being assigned one by one in increasing order. In particular, $c_{n-1}$ and $c_{n}$ are placed last. If $J=k+2$ , then either $k$ or $k+1$ of the loads are equal to $\frac{1}{k+2}$ before $c_{n-1}$ and $c_{n}$ are placed. In the first case, there are two remaining loads $<\frac{1}{k+2}$ and the probability that $c_{n-1}$ are $c_{n}$ are placed here is $\frac{2}{(k+2)^{2}}$ . In the second case, there is only one remaining load $<\frac{1}{k+2}$ and the probability that $c_{n-1}$ and $c_{n}$ are placed here is $\frac{1}{(k+2)^{2}}$ . We conclude that $\mathbf{P}(J=k+2)\leq\frac{2}{(k+2)^{2}}$ and therefore

\mathbf{E}[J]\leq(k+2)\mathbf{P}(J=k+2)+(k+1)\mathbf{P}(J<k+2)=k+1+\mathbf{P}(J=k+2)

is bounded by $k+1+\frac{2}{(k+2)^{2}}$ . Thus we obtain $\mathbf{P}(S_{\gamma}\geq t)\leq\frac{k+1}{k+2}+\frac{2}{(k+2)^{3}}$ . This bound is reached if $k=1$ and $c_{1}=c_{2}=c_{3}=\frac{1}{3}$ .

Let $k>1$ and let $J(\omega)=k+2$ . We first consider the case that $c_{n-1}\not=c_{n}$ . suppose there are two remaining loads $<\frac{1}{k+2}$ before $c_{n-1}$ and $c_{n-2}$ are placed, then each $c_{n-1}$ and $c_{n}$ can only be assigned to a unique place to complete all loads to $\frac{1}{k+2}$ . Since $k>1$ , there are at least two loads $\ell(i_{1})=\ell(i_{2})=\frac{1}{k+2}$ before the final two coefficients are placed. Let $\bar{\omega}$ assign $c_{j}$ for $j<n-1$ in the same way as $\omega$ , but it reassigns $c_{n-1}$ to $i_{1}$ and $c_{n}$ to $i_{2}$ . Then $J(\bar{\omega})=k$ , because the loads at $i_{1}$ and $i_{2}$ exceed the threshold. We can reconstruct $\omega$ from $\bar{\omega}$ because the loads at $i_{1}$ and $i_{2}$ are the only ones that exceed the threshold for $\bar{\omega}$ , and their values are different because $c_{n-1}\not=c_{n}$ . We have a $1-1$ correspondence between $\omega\in\{J=k+2\}$ and $\bar{\omega}\in\{J=k\}$ . Let $\mathcal{E}=\{J=k+2\}$ and let $\mathcal{F}=\{\bar{\omega}\colon\omega\in\mathcal{E}\}$ . Then $\mathbf{P}(\mathcal{F})=\mathbf{P}(\mathcal{E})$ and $\mathcal{E}\cap\mathcal{F}=\emptyset$ . This implies that

\mathbf{E}[J]\leq(k+2)\mathbf{P}(\mathcal{E})+k\mathbf{P}(\mathcal{F})+(k+1)\mathbf{P}(\mathcal{E}^{c}\cap\mathcal{F}^{c})\leq k+1.

In particular $\mathbf{P}(S_{\gamma}\geq t)\leq\frac{k+1}{k+2}$ and bold play is optimal.

Finally, consider the remaining case $k>1$ and $J(\omega)=k+2$ and $c_{n-1}=c_{n}$ . In this case, we may switch the assignments of $c_{n-1}$ and $c_{n}$ to complete the loads. Let $\omega^{\prime}\in\Omega$ represent this switch (it may be equal to $\omega$ if the assignments are the same). Again, let $i_{1}$ and $i_{2}$ be two locations for which the loads have already been completed before $c_{n-1}$ and $c_{n}$ are placed. Let $\{\bar{\omega},\bar{\omega}^{\prime}\}$ be the elements which assign the first $n-2$ coefficients in the same way, but assigns the final two elements to $i_{1}$ and $i_{2}$ . In particular, $J(\bar{\omega})=J(\bar{\omega}^{\prime})=k$ . We can reconstruct $\{\omega,\omega^{\prime}\}$ from $\{\bar{\omega},\bar{\omega}^{\prime}\}$ , the correspondence is injective, so again $\mathbf{P}(\mathcal{E})=\mathbf{P}(\mathcal{F})$ and we conclude in the same way that bold play is optimal. ∎

If bold play is stable, as discussed above, then proposition 13 would imply that bold play is optimal if $p\leq\frac{k+1}{k+2}\leq t$ for $k>1$ . Establishing this stability of bold play appears to be hard, and we are able to settle only one specific case in the range $p<\frac{k+1}{k+2}=t$ .

Proposition 14.

If $p=\frac{2k+1}{2k+3}$ and $t=\frac{k+1}{k+2}$ , then bold play is optimal if $k>1$ .

Proof.

We randomly distribute the coefficients of a finite $\gamma$ over $2k+3$ locations. Let $Y^{j}=\ell(j)+\ldots+\ell(2k+j)$ be the load of the discrete interval $[j,2k+j+1)$ , where as before we reduce modulo $2k+3$ . Then $S_{\gamma}\sim Y^{j}$ and the sum of all $Y^{j}$ is equal to $2k+1$ . Let $J$ be the number of $Y^{j}$ that reach the threshold. Not all $Y^{j}$ can reach the threshold and therefore $J\leq 2k+2$ . Then

\mathbf{P}(S_{\gamma}\geq t)=\frac{\mathbf{E}[J]}{2k+3}\leq\frac{2k+1}{2k+3}\mathbf{P}(J\leq 2k+1)+\frac{2k+2}{2k+3}\mathbf{P}(J=2k+2).

We need to prove that $\mathbf{P}(S_{\gamma}\geq t)\leq\frac{2k+1}{2k+3}$ . If $\mathbf{P}(J=2k+2)=0$ then we are done. Therefore, we may assume that $\mathbf{P}(J=2k+2)>0$ . Only one of the $Y^{j}$ does not meet the threshold and without loss of generality we may assume it is $Y^{2}$ , which has load $1-\ell(0)-\ell(1)$ . The other $Y^{j}$ reach the threshold, and since the sum of all $Y^{j}$ is equal to $2k+1$ , we find that

2k+1\geq(2k+2)t+1-\ell(0)-\ell(1).

In other words, $\ell(0)+\ell(1)\geq\frac{2}{k+2}$ . If $\ell(0)>\frac{1}{k+2}$ then $Y^{j}$ does not reach the threshold if it does not include $\ell(0)$ . Only $2k+1$ of the $Y^{j}$ include $0$ , contradicting our assumption that $J=2k+2$ . Therefore $\ell(0)\leq\frac{1}{k+2}$ and since the same applies to $\ell(1)$ we have in fact that $\ell(0)=\ell(1)=\frac{1}{k+2}$ . Since $Y^{j}=1-\ell(j-1)-\ell(j-2)$ we have that all $\ell(i)+\ell(i+1)\leq\frac{1}{k+2}$ other than $\ell(0)+\ell(1)$ . In particular, $\ell(2)=\ell(2k+2)=0$ . We find that $Y^{2}=\frac{k}{k+2}$ and $Y^{1}=Y^{3}=\frac{k+1}{k+2}$ . The sum of the remaining $Y^{j}$ , with $j\not\in\{1,2,3\}$ is at least $2kt$ and the sum of all $Y^{j}$ is $2k+1$ . Since $2kt=2k+1-\frac{k}{k+2}-\frac{2k+2}{2k+1}$ all the remaining $Y^{j}$ have to be equal to $t$ . It follows that the loads alternate between zero and $\frac{1}{k+2}$ : $\ell(i)=\frac{1}{k+2}$ if $i$ is odd and $\ell(i)=0$ if $i>0$ is even. We conclude that if $J=k+2$ then all non-zero loads are equal and only two non-zero loads are consecutive. There are exactly $2k+3$ such arrangements. There are also $2k+3$ arrangements in which the non-zero loads are consecutive. In this case $J=k+2\leq 2k$ . It follows that $\mathbf{P}(J\leq 2k)\geq P(J=2k+2)$ , which implies that $\mathbf{E}[J]\leq 2k+1$ . Bold play is optimal. ∎

These results conclude our analysis of the upper right hand block of figure 1. A zigzag of triangles along the hypothenuse remains. Numerical results of Csóka [5] suggest that bold play is optimal for all of these triangles, except for the one touching on $\{(p,p)\colon\frac{1}{2}\leq p\leq\frac{2}{3}\}$ . In the next section we will confirm that bold play is not optimal for this triangle.

4. High threshold

We now consider the case $p\leq\frac{1}{2}<t$ , when bold play comes down to a single stake $c_{1}=1$ . We need to maximize $\mathbf{P}(S_{\gamma}\geq t)$ and we may assume that $\gamma$ is finite by proposition 6. Suppose that $\gamma$ has $\leq n$ non-zero coefficients, i.e., $c_{n+1}=0$ . Let $\mathcal{F}_{t,\gamma}$ be the family of $V\subset\{1,2,\ldots,n\}$ , such that $\sum_{i\in V}c_{i}\geq t$ . Let $p(V)=p^{|V|}(1-p)^{n-|V|}$ . Then

(2)

\mathbf{P}(S_{\gamma}\geq t)=\sum_{V\in\mathcal{F}_{t,\gamma}}p(V).

Therefore we need to determine the family $\mathcal{F}_{t,\gamma}$ that maximizes the sum on the right hand side. Problems of this type are studied in extremal combinatorics, see [8] for recent progress. A family $\mathcal{F}$ is intersecting if no two elements are disjoint. Two standard examples of intersecting families are $\mathcal{F}_{1}$ , the family of all $V$ such that $1\in V$ , and $\mathcal{F}_{>n/2}$ , the family of all subsets such that $|V|>n/2$ . Fishburn et al [9] settled the problem of maximizing

p(\mathcal{F})=\sum_{V\in\mathcal{F}}p(V)

over all intersecting families $\mathcal{F}$ :

Theorem 15 (Fishburn et al).

For a fixed $n$ , let $\mathcal{F}$ be any intersecting family of subsets from $\{1,\ldots,n\}$ . If $p\leq\frac{1}{2}$ then $p(\mathcal{F})$ is maximized by $\mathcal{F}_{1}$ . If $p\geq\frac{1}{2}$ and $n$ is odd, then $p(\mathcal{F})$ is maximized by $\mathcal{F}_{>n/2}$ .

Proof.

Following [9]. First suppose $n$ is odd. At most one of $V$ and $V^{c}$ can be in $\mathcal{F}$ . If $p\geq\frac{1}{2}$ , then $p(V)>p(V^{c})$ if $|V|>|V^{c}|$ . Therefore $p(\mathcal{F})$ is maximal if $\mathcal{F}$ contains each set of largest cardinality. It follows that $\mathcal{F}_{>n/2}$ maximizes $p(\mathcal{F})$ if $n$ is odd and $p\geq\frac{1}{2}$ .

Now consider an arbitrary $n$ and $p\leq\frac{1}{2}$ . Let $c_{a}=|\mathcal{F}^{a}|$ be the cardinality of the subfamily $\mathcal{F}^{a}=\{V\in\mathcal{F}\colon|V|=a\}.$ At most one of $V$ and $V^{c}$ can be in $\mathcal{F}$ and therefore $c_{a}+c_{n-a}\leq\binom{n}{a}$ . Since $p\leq\frac{1}{2}$ we have $p(V)>p(V^{c})$ if $|V|<|V^{c}|$ . For $a<\frac{n}{2}$ we want to maximize $c_{a}$ under the constraint $c_{a}+c_{n-a}\leq\binom{n}{a}$ (for $a=\frac{n}{2}$ it does not matter which of the two subsets we select, as long as we select one of them). By the Erdős-Ko-Rado theorem, if $a<\frac{n}{2}$ then $c_{a}$ is maximized by a family of subsets that contain one common element. For such a family, $c_{a}+c_{n-a}=\binom{n}{a}$ . It follows that $p(\mathcal{F}_{1})$ is maximal if $p\leq\frac{1}{2}$ . ∎

Note that $\mathcal{F}_{1}$ corresponds to $\mathcal{F}_{t,\gamma}$ if $t>\frac{1}{2}$ and $c_{1}=1$ , i.e., bold play.

Corollary 16.

If $p\leq\frac{1}{2}<t$ then bold play is optimal.

Proof.

If $t>\frac{1}{2}$ then $\mathcal{F}_{t,\gamma}$ is intersecting. The maximizing family $\mathcal{F}_{1}$ corresponds to $\gamma$ with $\gamma=(1,0,\ldots,0)$ . This takes care of the upper left-hand block in figure 1. ∎

The positive odds part of theorem 15 can be applied to the triangle touching on $\{(p,p)\colon\frac{1}{2}\leq p\leq\frac{2}{3}\}$ that we mentioned above. The family $\mathcal{F}_{>n/2}$ can be represented by $\mathcal{F}_{t,\gamma}$ if $n=2k+1$ and $\frac{1}{2}<t\leq\frac{k+1}{2k+1}$ , by taking $\gamma=(\frac{1}{2k+1},\ldots,\frac{1}{2k+1})$ .

Corollary 17.

If $\frac{1}{2}<p\leq t\leq\frac{2}{3}$ then bold play is not optimal.

Proof.

Choose $k$ maximal such that $t\leq\frac{k+1}{2k+1}$ and set $n=2k+1$ . Then $\mathcal{F}_{>n/2}$ is the unique maximizer of $p(\mathcal{F})$ and corresponds to $c_{1}=\cdots=c_{2k+1}=\frac{1}{2k+1}$ , which is not bold play. ∎

Finally, we settle the remaining part of the box in the upper left corner of figure 1.

Corollary 18.

If $p\leq t=\frac{1}{2}$ then bold play is optimal.

Proof.

Note that we may restrict our attention to $\gamma=(c_{1},c_{2},\ldots)$ such that $c_{1}\leq\frac{1}{2}$ and that bold play corresponds to $(\frac{1}{2},\frac{1}{2},0,\ldots)$ .

\begin{array}[]{ccl}\mathbf{P}(S_{\gamma}\geq\frac{1}{2})&=&p\mathbf{P}(S_{\gamma}\geq\frac{1}{2}\mid\beta_{1}=1)+(1-p)\mathbf{P}(S_{\gamma}\geq\frac{1}{2}\mid\beta_{1}=0)\\ &\leq&p+(1-p)\mathbf{P}(S_{\gamma}\geq\frac{1}{2}\mid\beta_{1}=0)\end{array}

If $\bar{\gamma}=\frac{1}{1-c_{1}}(c_{2},c_{3},\ldots)$ then $\mathbf{P}(S_{\gamma}\geq\frac{1}{2}\mid\beta_{1}=0)=\mathbf{P}(S_{\bar{\gamma}}\geq\frac{1}{2(1-c_{1})})\leq p$ by theorem 15. We find that $\mathbf{P}(S_{\gamma}\geq\frac{1}{2})\leq p+(1-p)p$ with equality for bold play. ∎

These results take care of the box in the upper left corner of figure 1, including its boundaries.

5. Unfavorable odds

A family $\mathcal{F}\subset 2^{\{1,\ldots,n\}}$ has matching number $k$ , denoted by $\nu(\mathcal{F})=k$ , if the maximum number of pairwise disjoint $V\in\mathcal{F}$ is equal to $k$ . In particular, $\mathcal{F}$ is intersecting if and only if $\nu(\mathcal{F})=1$ . The matching number of $\mathcal{F}_{t,\gamma}$ is bounded by $\frac{1}{t}$ , because $\sum_{j\in V}c_{j}\geq t$ for each $V\in\mathcal{F}_{t,\gamma}$ and $\gamma$ sums up to one.

A family $\mathcal{F}^{u}$ is $u$ -uniform if all its elements have cardinality $u$ . According to the Erdős matching conjecture [1, 10, 11], if $n\geq(k+1)u$ then the maximum cardinality $|\mathcal{F}^{u}|$ of a $u$ -uniform family such that $\nu(\mathcal{F}^{u})\leq k$ is either attained by $\mathcal{F}_{k}^{u}$ , the family of all $u$ -subsets containing at least one element from $\{1,\ldots,k\}$ , or by $\mathcal{F}_{[(k+1)u-1]}^{u}$ , the family containing all $u$ -subsets from $\{1,\ldots,(k+1)u-1\}$ . Frankl [10] proved that $\mathcal{F}_{k}^{u}$ has maximum cardinality if $n\geq(2k+1)u-k$ . For recent progress on this conjecture, see [11] and the references therein.

Theorem 19.

If $p<\frac{1}{2k+1}$ and $\frac{1}{k+1}<t$ for some positive integer $k$ then bold play is optimal.

Proof.

We need to prove that $\mathbf{P}(S_{\gamma}\geq t)\leq 1-(1-p)^{k}$ for finite $\gamma=(c_{1},c_{2},\ldots)$ . For a large enough $n$ we have that $c_{j}=0$ if $j>n$ . We have

\mathbf{P}(S_{\gamma}\geq t)=\sum_{\mathcal{F}_{t,\gamma}}p(V)=\sum_{j}|\mathcal{F}_{t,\gamma}^{j}|p^{j}(1-p)^{n-j}

where $|\mathcal{F}_{t,\gamma}^{j}|$ denotes the number of subsets of cardinality $j$ . By Frankl’s result, we can put a bound on $|\mathcal{F}_{t,\gamma}^{j}|$ if $(2k+1)j-k\leq n$ . For larger $j$ we simply bound by $\binom{n}{j}$ . In this way we get that $\mathbf{P}(S_{\gamma}\geq t)$ is bounded by

\sum_{j\leq{\frac{n+k}{2k+1}}}\left(\binom{n}{j}-\binom{n-k}{j}\right)p^{j}(1-p)^{n-j}+\sum_{j>{\frac{n+k}{2k+1}}}\binom{n}{j}p^{j}(1-p)^{n-j}

which is equal to

1-\sum_{j\leq{\frac{n+k}{2k+1}}}\binom{n-k}{j}p^{j}(1-p)^{n-j}=1-(1-p)^{k}\mathbf{P}\left(X\leq\frac{n+k}{2k+1}\right)

for $X\sim\mathrm{Bin}(n-k,p)$ . By our assumptions, there exists a $c<1$ such that $p<\frac{c}{2k+1}$ . If $n\to\infty$ then $\mathbf{P}\left(X\leq\frac{n+k}{2k+1}\right)\to 1$ since $\mathbf{E}[X]=(n-k)p<\frac{(n-k)c}{2k+1}$ . ∎

We can push this result to the hypothenuse, using the same approach as in the proof of proposition 13, in one particular case: $p=t=\frac{1}{3}$ . By stability, one would expect that bold play is optimal for any $p<t=\frac{1}{3}$ but we can only prove this for $p=\frac{1}{b}$ for integers $b>3$ .

Proposition 20.

Bold play is optimal if $t=\frac{1}{3}$ and $p=\frac{1}{b}$ for an integer $b\geq 3$ .

Proof.

We may assume that $\gamma$ is finite and we randomly assign its coefficients to $\{0,1,\ldots,b-1\}$ . We denote $\ell(0)$ , the load at zero, by $Y$ and we need to prove that

\mathbf{P}\left(Y\geq\frac{1}{3}\right)\leq p+(1-p)p+(1-p)^{2}p=r

which is the success probability of bold play if $t=\frac{1}{3}$ . Let $K$ be the number of loads exceeding the threshold of $\frac{1}{3}$ before the last two coefficients $c_{n-1}$ and $c_{n}$ are assigned. Obviously, $K$ is either equal to $0$ or $1$ or $2$ . We will show that $\mathbf{P}(Y\geq\frac{1}{3}|K=j)\leq r$ for $j\in\{0,1,2\}$ .

Suppose that $K=0$ . In this case, if $Y$ reaches the threshold, then at least one of the two final coefficients has to be placed in $0$ . This happens with probability $p+(1-p)p<r$ .

Suppose that $K=1$ . One load has reached the threshold before the final two coefficients are placed. This load is in $0$ with probability $p$ . If the load is not in $0$ , then at least one of the remaining two coefficients has to be placed there. This happens with probability $p+(1-p)p$ . We conclude that

\mathbf{P}\left(Y\geq\frac{1}{3}\,\middle|\,K=1\right)\leq p+(1-p)(p+(1-p)p)=r.

Suppose that $K=2$ . In other words, two loads have already reached the threshold before the final two coefficients are placed. The probability that one of these two loads is in $0$ is $2p$ . If none of the two loads is in $0$ , then $\ell(0)$ can only reach the threshold if both remaining coefficients are assigned to $0$ . The probability that this happens is $p^{2}$ .

\mathbf{P}\left(Y\geq\frac{1}{3}\,\middle|\,K=2\right)\leq 2p+(1-2p)p^{2}\leq r

if $p\leq\frac{1}{3}$ . ∎

6. Binomial tails

If conjecture 1 holds, then the tail probability is maximized by a Bernoulli average $\bar{X}_{k}$ and we need to determine the optimal $k$ . It is more convenient to state this in terms of binomials. For a fixed $p$ and $t$ , maximize

\mathbf{P}\left(\mathrm{Bin}(k,p)\geq kt\right)

for a positive integer $k$ . Since the probability increases if $k$ increases and $kt$ does not pass an integer, we may restrict our attention to $k$ such that $kt\leq n<(k+1)t$ for some integer $n$ . In other words, we need to only consider $k=\lfloor\frac{n}{t}\rfloor$ for $n\in\mathbb{N}$ . If $t=\frac{1}{a}$ is the reciprocal of an integer $a$ , then the $k$ are multiples of $a$ . This is a classical problem. In 1693 John Smith asked which $k$ is optimal if $a=6$ and $p=\frac{1}{6}$ . Or in his original words, which of the following events is most likely: fling at least one six with 6 dice, or at least two sixes with 12 dice, or at least three sixes with 18 dice. The problem was communicated by Samuel Pepys to Isaac Newton, who computed the probabilities. Chaundy and Bullard [4] gave a very nice historical description (more history can be found in [16, 17]) and solved the problem, see also [15].

Theorem 21 (Chaundy and Bullard).

For an integer $a>1$ , $\mathbf{P}(\mathrm{Bin}(ka,\frac{1}{a})\geq k)$ is maximal for $k=1$ . Even more so, the tail probabilities strictly decrease with $k$ .

In other words, if $p=t=\frac{1}{a}$ and if Csóka’s conjecture holds, then bold play is optimal. By stability, one would expect that bold play is optimal for $p\leq\frac{1}{a}\leq t$ . It turns out that it is possible to extend Chaundy and Bullard’s theorem in this direction and prove that $\mathbf{P}(\mathrm{Bin}(ka,p)\geq k)$ decreases with $k$ for arbitrary $p\leq\frac{1}{a}$ , see [19, Theorem 1.5.4].

7. Conclusion

We settled Csóka’s conjecture for a range of parameters building on combinatorial methods. Csóka’s conjecture predicts that $\mathbf{P}(S_{\gamma}\geq t)$ attains its maximum at an extreme point of the subset of positive non-increasing sequences $\gamma$ in the unit ball in $\ell_{1}$ . Perhaps variational methods need to be considered.

Christos Pelekis was supported by the Czech Science Foundation (GAČR project 18-01472Y), by the Czech Academy of Sciences (RVO: 67985840), and by a visitor grant of the Dutch mathematics cluster Diamant.

References

[1] N. Alon, P. Frankl, H. Huang, V. Rödl, A. Ruciński, B. Sudakov, Large matchings in uniform hypergraphs and the conjectures of Erdős and Samuels. J. Combin. Th., Ser. A 119 (2012) 1200–1215.
[2] I. Arieli, Y. Babichenko, R. Peretz, H. Peyton Young, The speed of innovation in social networks. Econometrica 88 no. 2 (2020), 569–594.
[3] L. Breiman, Probability, Reading, Addison-Wesley, 1968.
[4] T.W. Chaundy, J.E. Bullard, John Smith’s problem. Math. Gazette 44(350), (1960), 253–260.
[5] E. Csóka, Limit theory of discrete mathematics problems. arXiv1505.06984.
[6] L. E. Dubins and L. J. Savage, Inequalities for Stochastic Processes (How to Gamble If You Must). Dover, 1976.
[7] U. Feige, On sums of independent random variables with unbounded variances, and estimating the average degree in a graph. SIAM J. Comput. 35 (2006), 964–984.
[8] Y. Filmus, The weighted complete intersection theorem. J. Combin. Th. Ser. A 151 (2017), 84–101.
[9] P.C. Fishburn, P. Frankl, D. Freed, J. Lagarias, A.M. Odlyzko. Probabilities for intersecting systems and random subsets of finite sets. SIAM. J. Algebraic Discrete Methods 7 no. 1 (1986) 73–79.
[10] P. Frankl, Improved bounds on Erdős’ matching conjecture. J. Combin. Th., ser. A, 120 (2013) 1068–1072.
[11] P. Frankl, A. Kupavskii, Some results around the Erdős matching conjecture. Acta Math. Univ. Comenianae 88 no. 3 (2019), 695–699.
[12] J. Guo, S. He, Z. Ling, Y. Liu, Bounding probability of small deviation on sum of independent random variables: Combination of moment approach and Berry-Esseen theorem. arXiv:2003.03197.
[13] S. He, Z. Q. Luo, J. Nie, S. Zhang, Semidefinite relaxation bounds for indefinite homogeneous quadratic optimization. SIAM J. Optim. 19 (2007) 503–523.
[14] S. He, J. Zhang, S. Zhang, Bounding probability of small deviation: a fourth moment approach. Math. Oper. Res. 45 no. 1 (2010), 208–232.
[15] K. Jogdeo, S.M. Samuels, Monotone Convergence of Binomial Probabilities and a Generalization of Ramanujan’s Equation. Ann. Math. Statist. 39 no. 4, (1968), 1191–1195.
[16] T.H. Koornwinder, M.J. Schlosser, On an identity by Chaundy and Bullard, I. Indag. Math. (NS) 19, (2008), 239–261.
[17] T.H. Koornwinder, M.J. Schlosser, On an identity by Chaundy and Bullard, II. Indag. Math. (NS) 24, (2013), 174–180.
[18] R. Paulin, On some conjectures by Feige and Samuels. arXiv1703.05152.
[19] C. Pelekis, Search games on hypergraphs. PhD thesis, TU Delft, 2014.
[20] S.M. Samuels, On a Chebyshev-type inequality for sums of independent random variables. Ann. Math. Statist. 37 no. 1, (1966), 248–259.
[21] P. Tchebichef, Démonstration élémentaire d’une proposition générale de la theorie des probabilités. J. Reine. Angew. Math. 33 (1848), 259–267.

Institute of Applied Mathematics
Delft University of Technology
Mourikbroekmanweg 6
2628 XE Delft, The Netherlands
r.j.fokkink@tudelft.nl, l.e.meester@tudelft.nl
School of Electrical and Computer Engineering
National Technical University of Athens
Zografou, 15780, Greece
pelekis.chr@gmail.com

Optimizing stakes in simultaneous bets

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

Conjecture 1 (Csóka).

1. Related problems and results

Theorem 2 (Chebyshev, [21]).

Conjecture 3 (Samuels, [20]).

Conjecture 4 (Feige, [7]).

2. Properties of π​(p,t)\pi(p,t)

Proposition 5.

Proof.

Proposition 6.

Proof.

Theorem 7.

Proof.

Theorem 8.

Proof.

3. Favorable odds

Lemma 9.

Proof.

Lemma 10.

Proof.

Lemma 11.

Proof.

Theorem 12.

Proof.

Proposition 13.

Proof.

Proposition 14.

Proof.

4. High threshold

Theorem 15 (Fishburn et al).

Proof.

Corollary 16.

Proof.

Corollary 17.

Proof.

Corollary 18.

Proof.

5. Unfavorable odds

Theorem 19.

Proof.

Proposition 20.

Proof.

6. Binomial tails

Theorem 21 (Chaundy and Bullard).

7. Conclusion

References

2. Properties of $\pi(p,t)$