Money Pumps and Bounded Rationality

Joshua Lanier, Matthew Polisson, and John K.-H. Quah Emails: jlanier84@gmail.com; matt.polisson@leicester.ac.uk; ecsqkhj@nus.edu.sg.

The standard criterion of rationality in economics is the maximization of a utility function that is stable across multiple observations of an agent’s choice behavior. In this paper, we discuss two notions of the money pump that characterize two corresponding notions of utility-maximization. We explain the senses in which the amount of money that can be pumped from a consumer is a useful measure of the consumer’s departure from utility-maximization.

JEL Codes: D11, D12.

Keywords: money pump, revealed preference, GARP, cyclical monotonicity, quasilinear utility, additive utility, constrained optimization

1 Introduction

It has often been argued that an agent’s vulnerability to money pumps is a sign of bounded or limited rationality. This paper is devoted to the study of money pumps and how they can be used to measure an agent’s adherence to rational principles.

We begin our discussion with a simple example. Suppose that there are two goods and a consumer purchases the bundle $\mathbf{x}^{1}=(1,2)$ when prices are $\mathbf{p}^{1}=(1,2)$ and the bundle $\mathbf{x}^{2}=(2,1)$ when the prices are $\mathbf{p}^{2}=(2,1)$ . This situation is depicted on the left side of Figure 1 below. Note that in period 1, the bundle $\mathbf{x}^{2}$ is cheaper than $\mathbf{x}^{1}$ , while in period 2, $\mathbf{x}^{1}$ is cheaper than $\mathbf{x}^{2}$ . An arbitrageur can use this purchasing behavior to turn the consumer into a money pump. In particular, the arbitrageur may sell the consumer $\mathbf{x}^{1}$ and purchase $\mathbf{x}^{2}$ in period 1, which nets the arbitrageur $\mathbf{p}^{1}\cdot(\mathbf{x}^{1}-\mathbf{x}^{2})=1$ . Then in period 2 the arbitrageur can sell the consumer $\mathbf{x}^{2}$ and purchase $\mathbf{x}^{1}$ , netting the arbitrageur a further $\mathbf{p}^{2}\cdot(\mathbf{x}^{2}-\mathbf{x}^{1})=1$ . At the end of the day the consumer has been pumped for 2 dollars, while there is no net change to the arbitrageur’s stock of goods 1 and 2.

More generally, suppose that there are $L$ goods and we observe a consumer who purchases the consumption bundle $\mathbf{x}^{t}=(x_{1}^{t},x_{2}^{t},\ldots,x_{L}^{t})\in\mathbb{R}_{+}^{L}$ when prices are $\mathbf{p}^{t}=(p_{1}^{t},p_{2}^{t},\ldots,p_{L}^{t})\in\mathbb{R}_{++}^{L}$ , at some observation $t\in\{1,2,\ldots,T\}$ . The dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ admits a money pump if there are observations $t_{1},\,t_{2},\ldots,\,t_{K}$ with $t_{1}=t_{K}$ such that an arbitrageur who purchases $\mathbf{x}^{t_{k+1}}$ in period $t_{k}$ while simultaneously selling $\mathbf{x}^{t_{k}}$ to the consumer will make a strictly positive amount of money; formally,

\text{MP}_{t_{1},t_{2},\ldots,t_{K}}\coloneqq\sum_{k=1}^{K-1}\mathbf{p}^{t_{k}}\cdot(\mathbf{x}^{t_{k}}-\mathbf{x}^{t_{k+1}})>0.

(1)

There is another way of understanding the money pump which turns out to be crucial to the formal results in the paper. Notice that if $\text{MP}_{t_{1},t_{2},\ldots,t_{K}}>0$ then

\mathbf{p}^{t_{1}}\cdot\mathbf{x}^{t_{1}}+\mathbf{p}^{t_{2}}\cdot\mathbf{x}^{t_{2}}+\ldots+\mathbf{p}^{t_{K-1}}\cdot\mathbf{x}^{t_{K-1}}>\mathbf{p}^{t_{1}}\cdot\mathbf{x}^{t_{2}}+\mathbf{p}^{t_{2}}\cdot\mathbf{x}^{t_{3}}+\ldots+\mathbf{p}^{t_{K-2}}\cdot\mathbf{x}^{t_{K-1}}+\mathbf{p}^{t_{K-1}}\cdot\mathbf{x}^{t_{1}}

where we have used the requirement that $t_{1}=t_{K}$ . Instead of buying $\mathbf{x}^{t_{1}}$ at $t_{1}$ , and $\mathbf{x}^{t_{2}}$ at $t_{2}$ and so forth, the consumer could have purchased $\mathbf{x}^{t_{2}}$ at $t_{1}$ , $\mathbf{x}^{t_{3}}$ at $t_{2}$ , and so forth; by altering the timing of purchases in this way (interpreting the index $t$ as time), the consumer could have obtained the same set of bundles $\{\mathbf{x}^{t_{1}},\mathbf{x}^{t_{2}},\ldots,\mathbf{x}^{t_{K-1}}\}$ and saved money. In this sense, the consumer’s purchasing behavior is irrational and the presence of a money pump is equivalent to the presence of expenditure decisions which, taken as a whole across all observations, are cost inefficient.

But how is the presence or absence of a money pump related to the other standard by which economists judge rationality, namely, utility-maximization? Indeed, the absence of a money pump is also known as cyclical monotonicity and it is known that a dataset $D$ is cyclically monotone (i.e., free of money pumps) if and only if there is a quasilinear rationalization of $D$ , i.e., there is a well-behaved (in the sense of being continuous, strictly increasing, and concave) utility function $U:\mathbb{R}^{L}_{+}\to\mathbb{R}$ such that $\mathbf{x}^{t}$ maximizes $U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ , at all $t\leqslant T$ (see Brown and Calsamiglia (2007)). Alternatively, cyclical monotonicity is also equivalent to the additive rationalization of $D$ , by which we mean that there is a well-behaved utility function $U$ such that

\sum_{t=1}^{T}U(\mathbf{x}^{t})\geqslant\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})

for all $(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})\in\mathbb{R}^{LT}_{+}$ with $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}\leqslant\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ (see Browning (1989)). In other words, the consumer’s choices $(\mathbf{x}^{t})_{t=1}^{T}$ can be understood as maximizing an overall utility function $V:\mathbb{R}^{LT}_{+}\to\mathbb{R}$ that is additive across bundles at each observation, i.e.,

V(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})=\sum_{t=1}^{T}U(\mathbf{x}^{t}),

subject to the overall expenditure not exceeding $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ .¹¹1Notice that in quasilinear rationalization, as opposed to additive rationalization, the consumer is not subject to a budget; she can, in principle, spend as much as she likes, but expenditure is restrained because it incurs dis-utility. It is particularly clear that a dataset that has an additive rationalization must be free of money pumps: the consumer cannot be maximizing a function like $V$ if a reshuffling of purchases leads to savings, since this frees up money that could be used to buy more goods in one or more periods and thus increase $V$ .

These characterizations of cyclical monotonicity (recall that cyclical monotonicity is equivalent to the absence of a money pump) lead naturally to the following question. For a given dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ collected from a consumer, we could work out the optimal trading strategy of the arbitrageur (the strategy which nets the most money) and consequently we could work out the amount of money which would be extracted from the consumer by an arbitrageur following this optimal strategy. We use TMP (for total money pump) to denote this amount of money. It seems sensible to regard TMP as a measure of the consumer’s departure from rationality, but in what sense does the TMP actually capture irrational behavior? In particular, can TMP be understood as a measure of the degree to which behavior departs from either quasilinear or additive utility maximization?

A natural approach to measuring deviations from quasilinear rationalization was proposed by Allen and Rehbeck (2021).²²2Our comments here refer to their generalized approach as discussed in their Appendix B. Notice that a well-behaved utility function $U$ is a quasilinear rationalization of $D$ if and only if

\max_{\mathbf{x}\in\mathbb{R}^{L}_{+}}\Big{(}U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\Big{)}-\Big{(}U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\Big{)}=0\>\mbox{ for all $t\leqslant T$.}

When $D$ admits a money pump, no such $U$ exists and thus, for every well-behaved utility function $U$ ,

\max_{\mathbf{x}\in\mathbb{R}^{L}_{+}}\Big{(}U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\Big{)}-\Big{(}U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\Big{)}\geqslant 0\>\mbox{ for all $t\leqslant T$.}

with the inequality being strict for at least one observation $t$ . It follows that a natural measure of the consumer’s departure from quasilinear rationalization is

Q\coloneqq\inf_{U}\left\{\sum_{t=1}^{T}\left[\max_{\mathbf{x}\in\mathbb{R}^{L}_{+}}\Big{(}U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\Big{)}-\Big{(}U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\Big{)}\right]\right\}

where the infimum is taken over all well-behaved utility functions. $Q$ is a measure of inefficiency in the form of lost utility (summed across all $T$ periods).

In the case of additive rationalization, we can measure the severity of departures from this property using the cost efficiency approach advocated by Afriat (1973). Firstly, we observe that a dataset $D$ cannot be additively rationalized if and only if the consumer is cost inefficient in the following sense: for any well-behaved utility function $U$ , the required expenditure to hit the utility target $\sum_{t=1}^{T}U(\mathbf{x}^{t})$ is strictly lower than the amount actually spent; formally,

e_{U}\coloneqq\min\left\{\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}:\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})\geqslant\sum_{t=1}^{T}U(\mathbf{x}^{t})\right\}<\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}.

We can then measure the level of cost inefficiency by

A=\inf_{U}\left\{\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e_{U}\right\},

where the infimum is taken over all well-behaved utility functions.

The first main result of the paper says that these three ways of measuring rationality coincide; i.e., $Q=A=\mbox{TMP}$ .

The second part of the paper begins with the observation that the existence of a money pump is not always a compelling sign of limited rationality. Suppose a consumer purchases the bundle $\mathbf{x}^{1}=(1,1)$ when prices are $\mathbf{p}^{1}=(1,1)$ and purchases $\mathbf{x}^{2}=(2,2)$ when prices are $\mathbf{p}^{2}=(2,2)$ . This situation is depicted in Figure 1(b). This behavior clearly leads to a money pump. Specifically, the arbitrageur can, in period 1, buy bundle $\mathbf{x}^{2}$ and sell $\mathbf{x}^{1}$ netting $\mathbf{p}^{1}\cdot(\mathbf{x}^{1}-\mathbf{x}^{2})=-2$ . Then, in period 2, the arbitrageur can buy bundle $\mathbf{x}^{1}$ and sell $\mathbf{x}^{2}$ to make $\mathbf{p}^{2}\cdot(\mathbf{x}^{2}-\mathbf{x}^{1})=4$ . So the arbitrageur has extracted 2 dollars from the consumer. However, such behavior by a consumer may not be that irrational: for example, goods 1 and 2 could be festive goods which are consumed in greater quantities in the festive season (period 2), notwithstanding the higher prices. Put another way, the data has no additive rationalization because the consumer may not be maximizing an additive utility function that is symmetric across the two periods when observations are taken.

Notice also that there is a difference between the two examples depicted in Figure 1. The example depicted in Figure 1(b) clearly admits a rationalization, in the sense that there is a well-behaved utility function $U:\mathbb{R}^{L}_{+}\to\mathbb{R}$ such that $U(\mathbf{x}^{t})\geqslant U(\mathbf{x})$ for all bundles which are cheaper than $\mathbf{x}^{t}$ at price $\mathbf{p}^{t}$ , i.e., for all $\mathbf{x}$ such that $\mathbf{p}^{t}\cdot\mathbf{x}\leqslant\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ . On the other hand, this is plainly not the case with the example depicted in Figure 1(a). We know from Afriat’s Theorem (see Afriat (1967), Diewert (1973), and Varian (1982)) that a dataset $D$ admits a rationalization if and only if obeys a property called the generalized axiom of revealed preference (GARP). The latter property says that the dataset does not admit a money pump (as we have defined it), with the added requirement that the arbitrageur makes money at every round of the exchange. We call a money pump with this added property a constrained money pumps. While a money pump exists in the second example depicted in Figure 1(b), it is not a constrained money pump, because the arbitrageur makes money in one round and loses in another; indeed, GARP holds because a constrained money pump is not possible and (by Afriat’s Theorem) a rationalization exists. On the other hand, in the example depicted in Figure 1(a), the money pump is a constrained money pump and thus that example violates GARP and has no rationalization.

These observations suggest that we could use the amount of money extracted from constrained money pumps as a measure of rationality; this is (in essence) the approach proposed by Echenique, Lee, and Shum (2011) and Smeulders, Cherchye, Spieksma, and De Rock (2013) in their studies of rationality and constrained money pumps.³³3There is a significant literature on the measurement of deviations from rationality. Afriat (1973) proposed to measure deviations in terms of the proportion of one’s budget wasted in each period and then aggregating across periods by taking the maximum waste; he refers to his proposed measure as the critical cost efficiency index. Varian (1990) generalizes Afriat’s approach by proposing other ways of aggregating waste across periods. Houtman and Maks (1985) suggest measuring violations in terms of the number of observations which need to be dropped before the remaining dataset becomes rationalizable. Apesteguia and Ballester (2015) propose measuring violations by taking the Lebesgue measure of the affordable strict upper contour set (the set preferred to the bundle actually chosen) in each period and adding up these measures across observations. (We are referring to the approach discussed in Appendix D.3; their main paper deals with a discrete choice environment.) Dean and Martin (2016) propose assigning a cost for breaking each revealed preference relation and then calculating the cheapest way of breaking every revealed preference cycle (i.e. every violation of GARP). Halevy, Persitz, and Zrill (2018) show how measures of departures from rationality can be used to perform parametric estimation of utility functions. The second main result of our paper provides a foundation for this approach.⁴⁴4While our index as well as those of Echenique et al. (2011) and Smeulders et al. (2013) are based on the notion of a money pump there is actually an important distinction between our different measures. See Section 3.3 for details.

Given a dataset $D$ , let $\text{TMP}_{c}$ be the the amount of money which would be extracted from the consumer by an arbitrageur who was following his optimal trading strategy under the provision that he cannot lose money in any round of trading. It follows immediately from the definition that $\text{TMP}_{c}\leqslant\text{TMP}$ and $\text{TMP}_{c}=0$ if the consumer purchasing behavior satisfies GARP. We show that $\text{TMP}_{c}$ coincides with appropriately modified versions of the additive cost and quasilinear utility inefficiency indices. To explain further, let $e_{U}^{c}$ denote the smallest amount of money the consumer could pay (summed across $T$ periods) to hit the utility target $\sum_{t=1}^{T}U(\mathbf{x}^{t})$ , provided that in each period $t$ the consumer cannot spend more than $\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ . Clearly, the additional constraint guarantees that $e_{U}^{c}\geqslant e_{U}$ . Analogous to the case of $A$ , we could use $A_{c}=\inf_{U}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e_{U}^{c}$ (where the infimum is taken over all well-behaved utility functions) as a measure of constrained additive cost inefficiency. Lastly, we can define $Q_{c}$ in a way similar to our definition of $Q$ , with the difference being that the suprema are taken over all bundles $\mathbf{x}$ such that $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ instead of over all $\mathbf{x}\in\mathbb{R}_{+}^{L}$ . Like $Q$ , the index $Q_{c}$ is a measure how much more utility (net of expenditure) that a consumer with utility function $U$ can obtain by deviating from $(\mathbf{x}^{t})_{t\leqslant T}$ , but with an additional constraint on expenditure in each period. Obviously, $Q_{c}\leqslant Q$ . Our second main result states that $\text{TMP}_{c}=A_{c}=Q_{c}$ . Additionally, we show that these measures can be calculated by solving a simple linear program.

Outline of the paper. The remainder of the paper is organized as follows. Section 2 introduces the total money pump index TMP and presents our main theorem (Theorem 2) for TMP. Section 3 introduces the constrained total money pump index $\text{TMP}_{c}$ and presents our main theorem for $\text{TMP}_{c}$ (Theorem 4). This section also has a comparison between our measure $\text{TMP}_{c}$ and the money pump indices introduced in Echenique et al. (2011) and Smeulders et al. (2013). Section 4 concludes. The appendix gives a more careful explanation of the relationship between our measure of additive cost inefficiency and the critical cost efficiency index introduced in Afriat (1973), as well as all proofs omitted from the body of the paper.

2 The Money Pump

We assume that we observe the consumer’s purchases of $L$ goods. A consumption bundle is denoted $\mathbf{x}=(x_{1},x_{2},\ldots,x_{L})\in\mathbb{R}_{+}^{L}$ where $x_{\ell}$ is the quantity of good $\ell$ consumed. A price vector is denoted $\mathbf{p}=(p_{1},p_{2},\ldots,p_{L})\in\mathbb{R}_{++}^{L}$ where $p_{\ell}$ is the price of good $\ell$ . A dataset is a finite collection of consumption bundle / price pairs denoted $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ where intuitively $(\mathbf{p}^{t},\mathbf{x}^{t})$ means that the consumer purchased $\mathbf{x}^{t}$ when prices were $\mathbf{p}^{t}$ . A well-behaved utility function is a mapping $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ which is continuous, increasing, and concave.⁵⁵5A function $U:X\rightarrow\mathbb{R}$ is increasing if $\mathbf{x}\geqslant\mathbf{x}^{\prime}$ implies $U(\mathbf{x})\geqslant U(\mathbf{x}^{\prime})$ and $\mathbf{x}>\mathbf{x}^{\prime}$ implies $U(\mathbf{x})>U(\mathbf{x}^{\prime})$ .

A dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ is rationalized by a utility function $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ if
$U(\mathbf{x}^{t})\geqslant(>)\ U(\mathbf{x})$ for all $t$ and all $\mathbf{x}\in\mathbb{R}_{+}^{L}$ satisfying $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant(>)\ \mathbf{p}^{t}\cdot\mathbf{x}$ .⁶⁶6What we mean by this is that $U(\mathbf{x}^{t})\geqslant U(\mathbf{x})$ for all $t$ and all $\mathbf{x}\in\mathbb{R}_{+}^{L}$ satisfying $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ , and the former inequality is strict if the latter inequality is strict. The dataset $D$ is additively rationalized by a utility function $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ if

\sum_{t=1}^{T}U(\mathbf{x}^{t})\geqslant(>)\ \sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t}),\qquad\forall(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})\in\mathbb{R}_{+}^{LT}\text{ s.t. }\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant(>)\ \sum_{t=1}^{T}\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}

An additive rationalization requires that the consumer’s choices yield a higher additive (across-period) utility than any other affordable sequence of bundles $(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})$ . Note that if $U$ additively rationalizes the data then, for each $t$ , the choice $\mathbf{x}^{t}$ must yield more utility than any other bundle which costs less (i.e. any bundle $\mathbf{x}$ satisfying $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ ). Thus, if $U$ additively rationalizes the data then $U$ rationalizes the data as well. Recall, however, the second example in the Introduction which shows that the converse does not hold. The dataset $D$ is quasilinear rationalized by a utility function $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ if, for all $t$ ,

U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\ U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x},\qquad\qquad\forall\mathbf{x}\in\mathbb{R}_{+}^{L}

A quasilinear utility maximizer seeks to maximize utility net of expenditure. Note that in the case of quasilinear rationalization, the consumer is not constrained by a budget set, but expenditure leads to dis-utilty and has the effect of restraining purchases.

2.1 Money Pump and Cyclical Monotonicity

A dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ contains a money pump if there are observations $t_{1},\,t_{2},\ldots,\,t_{K}$ (drawn from $D$ ) such that $t_{1}=t_{K}$ and $\text{MP}_{t_{1},t_{2},\ldots,t_{K}}$ as defined by (1) is strictly positive. A dataset $D$ that is free of money pumps is said to satisfy cyclical monotonicity. Another way of saying the same thing is that a money pump exists if there is a permutation of $\{1,2,\ldots,T\}$ , denoted $\sigma$ , where, in period $t$ , the arbitrageur sells $\mathbf{x}^{t}$ and buys $\mathbf{x}^{\sigma(t)}$ ; and the arbitrageur pumps a strictly positive amount of money,

\text{MP}_{\sigma}=\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})

from the consumer. We define the total money pump (TMP) as the amount of money which would be extracted from the consumer by an arbitrageur following an optimal trading strategy. That is,

\text{TMP}=\sup_{\sigma}\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})

(2)

where the supremum is taken over all permutations $\sigma$ .

It is not hard to show that if $D$ is quasilinear rationalized by $U$ then $D$ is also additively rationalized by $U$ ; on the other hand, there is no guarantee that if $D$ is additively rationalized by $U$ , then the same $U$ provides a quasilinear rationalization of $D$ . The following result shows, among other things, that in fact these two types of rationalizations are empirically equivalent, and holds whenever $\text{TMP}=0$ .

Theorem 1.

Let $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ be a dataset. The following are equivalent.

1.

$\text{{\em TMP}}=0$ .
2.

The dataset $D$ satisfies cyclical monotonicity.
3.

The dataset $D$ can be additively rationalized by a utility function $U$ .
4.

The dataset $D$ can be additively rationalized by a well-behaved utility function $U$ .
5.

The dataset $D$ can be quasilinear rationalized by a utility function $U$ .
6.

The dataset $D$ can be quasilinear rationalized by a well-behaved utility function $U$ .

The claims in Theorem 1 are not new in the sense that they can easily be pieced together using existing results. In particular, the equivalence between statements 1 and 2 is obvious. The equivalence between statements 2, 3, and 4 is found in Browning (1989) and the equivalence between statements 2, 5, and 6 is found in Brown and Calsamiglia (2007). To keep this article reasonably self-contained, we provide a proof of Theorem 1 in the Appendix.

The equivalence between statements 3 and 4 and the equivalence between items 5 and 6 shows that there are no additional restrictions placed on the data by assuming that $U$ is well-behaved in either the additive rationalization or the quasilinear rationalization.

The equivalence between 1, 4, and 6 shows that there is a tight relationship between the existence of a money pump and additive and quasilinear rationalizations. This suggests that the value of the total money pump may be useful as a measure of the degree to which the consumer fails to act as an additive or quasilinear utility maximizer. We address this issue in the next subsection.

2.2 TMP as a measure of rationality

By defnition, a dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ can be additively rationalized if the observations, when taken as a whole, maximize an overall utility function $V:\mathbb{R}^{LT}_{+}\to\mathbb{R}$ that is additive across bundles at each observations, i.e., $V(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})=\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})$ , subject to the overall expenditure not exceeding the consumer’s total expenditure, which is $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ . When $D$ cannot be additively rationalized, how should we measure the extent of the violation? The cost-efficiency approach proposed by Afriat (1973) measures the amount of money which could have been saved by the consumer were they to have acted perfectly in line with the utility maximization hypothesis under investigation (the additive utility model in our case).⁷⁷7Appendix A discusses in greater detail the relationship between $A$ and Afriat’s critical cost efficiency index. For a utility function $U$ let $e^{U}$ denote the smallest amount of money for which a consumer with additive (across periods) utility function $U$ could obtain utility $\sum_{t=1}^{T}U(\mathbf{x}^{t})$ . That is,

e^{U}=\inf\left\{\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}:\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})\geqslant\sum_{t=1}^{T}U(\mathbf{x}^{t})\right\}

The additive cost inefficiency displayed by the consumer is the difference between the amount that the consumer actually spent and the smallest amount of money they could have spent to achieve the same utility. More formally, the additive cost inefficiency is the number

A=\inf_{U}\left(\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e^{U}\right)

(3)

where the infimum is taken over all utility functions.

Next, let us suppose that we would like to measure the extent to which the consumer has failed to act as a quasilinear utility maximizer. One approach, which might be termed the utility efficiency approach, is to measure the additional utility which the consumer could have derived had they acted perfectly in line with the model of utility maximization under investigation. Of course, this approach only makes sense when utility is cardinal (in particular, the utility function must be identified up to translation by the consumer’s behavior). If the utility function is only identified up to monotonic transformation then it makes no sense to talk about differences in utility. As the quasilinear utility function is indeed cardinal in the requisite sense, it is reasonable to define the quasilinear utility inefficiency displayed by the consumer as

Q=\inf_{U}\left(\sum_{t=1}^{T}\sup_{\mathbf{x}\in\mathbb{R}_{+}^{L}}\left[U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\right]-\left[U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\right]\right)

(4)

The term in the supremum represents the largest amount of quasilinear utility which could have been attained by the consumer in period $t$ whereas the rightmost term represents the amount of quasilinear utility actually achieved. The object $Q$ was introduced in Allen and Rehbeck (2021) as a measure of deviation from quasilinear utility maximization.⁸⁸8What we call $Q$ corresponds to the “minimum deviations” considered in Appendix B of Allen and Rehbeck (2021) when using (in their language) the aggregator $f(e_{1},e_{2},\ldots,e_{T})=\sum_{t=1}^{T}e_{t}$ .

Importantly, Allen and Rehbeck (2021) show that $Q$ is easy to calculate. In particular, $Q$ is equal to $\bar{\varepsilon}$ , the solution value for the linear programming problem of finding $(u_{1},u_{2},\ldots,u_{T})\in\mathbb{R}^{T}$ and $(\varepsilon_{1},\varepsilon_{2},\ldots,\varepsilon_{T})\in\mathbb{R}_{+}^{T}$ to solve {IEEEeqnarray}rCl min& ∑_t=1^T ε_t
s.t. u_s ⩽u_t + p^t ⋅( x^s - x^t ) + ε_t, for all s,t

To recap, we have introduced three distinct ways of quantifying deviations from cyclical monotonicity. It turns out that they are all the same.

Theorem 2.

For any dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ ,

\text{{\em TMP}}=A=Q=\bar{\varepsilon}.

(5)

Moreover, there exists a well-behaved utility function $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ that attains the infimum in the definitions of $A$ and $Q$ .

As noted above, the result $Q=\bar{\varepsilon}$ is shown in Allen and Rehbeck (2021). We include it here and provide a proof for the sake of completeness.

To give some insight into the proof of Theorem 2 let us focus on how we show that $\text{TMP}=Q$ (the idea behind showing that $\text{TMP}=A$ is similar). The “easy direction” is showing that $Q\geqslant\text{TMP}$ . Indeed, for any $U$ and permutation $\sigma$ we have {IEEEeqnarray*}rCl ∑_t=1^T sup_x∈R_+^L [ U(x) - p^t ⋅x] - [ U(x^t) - p^t ⋅x^t ] & ⩾ ∑_t=1^T [ U( x^σ(t) ) - p^t ⋅x^σ(t) ] - [ U(x^t) - p^t ⋅x^t ]
= ∑_t=1^T p^t ⋅( x^t - x^σ(t) ) and so the amount of quasilinear utility wasted for any $U$ is always greater than the amount of money which can be pumped for any $\sigma$ . Thus, $Q\geqslant\text{TMP}$ .

Showing that $\text{TMP}\geqslant Q$ is more delicate. The key insight we utilize is that for any permutation $\sigma$ which achieves the supremum in the definition of TMP it happens that the permuted dataset $D_{\sigma}=(\mathbf{p}^{t},\mathbf{x}^{\sigma(t)})_{t\leqslant T}$ satisfies cyclical monotonicity. In other words, the purchasing behavior of the arbitrageur, which is given by $D_{\sigma}$ , must satisfy cyclical monotonicity. Once this fact is established Theorem 1 can be applied to show that there exists a well-behaved utility function $U$ which rationalizes the permuted data. It can then be shown that the amount of money pumped via $\sigma$ is weakly greater than the amount of quasilinear utility wasted according to $U$ which establishes that $\text{TMP}\geqslant Q$ .

Recall that from items 3-6 of Theorem 1 we learned that when dealing with additive or quasilinear rationalizations there are no additional restrictions imposed on the data by requiring that the rationalizing utility function is well-behaved. This insight is preserved in Theorem 2 in the sense that the infima in the definitions of $A$ and $Q$ can always be achieved by well-behaved utility functions and thus we could have defined $A$ and $Q$ using infima over the collection of well-behaved utility functions without changing the content of these objects.

3 The Constrained Money Pump

3.1 GARP and the constrained money pump

A dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ satisfies the generalized axiom of revealed preferences (GARP) if, for all $t_{1},t_{2},\ldots,t_{K}$ with $t_{1}=t_{K}$ satisfying

\mathbf{p}^{t_{k}}\cdot\mathbf{x}^{t_{k}}\geqslant\mathbf{p}^{t_{k}}\cdot\mathbf{x}^{t_{k+1}},\qquad\qquad\text{ for all }k\in\{1,2,\ldots,K-1\}

(6)

it is not the case that any of the inequalities in (6) hold strictly. We know from Afriat (1967) that GARP is a necessary and sufficient condition for a dataset to be rationalized by a well-behaved utility function.

A permutation of $\{1,2,\ldots,T\}$ , denoted $\sigma$ , is constrained for $D$ if $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}$ for all $t$ . A constrained permutation $\sigma$ represents a trading strategy (sell $\mathbf{x}^{t}$ and buy $\mathbf{x}^{\sigma(t)}$ in each period $t$ ) which nets the arbitrageur a weakly positive sum in each period. The constrained total money pump index, denoted $\text{TMP}_{c}$ , is defined as in (2), but with the supremum taken over all constrained permutations. In other words, $\text{TMP}_{c}$ is the amount of money which would be extracted from the consumer by an arbitrageur who was following his optimal trading strategy under the provision that he cannot lose money in any round of trading.

From Theorem 1 we know that a consumer who satisfies GARP can be turned into a money pump provided they fail to satisfy the stronger property of cyclical monotonicity. It is however plain from the definitions that the consumer who satisfies GARP cannot be turned into a constrained money pump. We next present two models of behavior which turn out to be characterized by GARP.

The dataset $D$ is constrained additively rationalized by $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ if

\sum_{t=1}^{T}U(\mathbf{x}^{t})\geqslant\ \sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t}),\qquad\forall(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})\in\mathbb{R}_{+}^{LT}\ \text{ s.t. }\ \mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t},\ \forall t

where the leftmost inequality is required to be strict if any of the inequalities on the right hand side hold strictly. The constrained additive rationalization requires the consumer to maximize additive utility subject to period specific budget sets. In particular, the consumer is not allowed to reduce spending in one period in order to increasing spending in a different period. Due to this credit constraint it is easy to see that a dataset is rationalized by some utility function $U$ if and only if the dataset is constrained additively rationalized.

The dataset $D$ is constrained quasilinear rationalized by $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ if

U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x},\qquad\qquad\forall\mathbf{x}\in\mathbb{R}_{+}^{L}\text{ s.t. }\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}

In a constrained quasilinear rationalization, the observed choice $\mathbf{x}^{t}$ need only be superior (net of expenditure) to bundles which are cheaper than itself. Unlike an (unconstrained) quasilinear rationalization, it is not required that the net utility of $\mathbf{x}^{t}$ is higher than that of all alternative bundles.

The following result relates the concepts just introduced.

Theorem 3.

Let $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ be a dataset. The following are equivalent.

1.

$\text{{\em TMP}}_{c}=0$ .
2.

The dataset $D$ satisfies GARP.
3.

The dataset $D$ can be rationalized by a utility function $U$ .
4.

The dataset $D$ can be rationalized by a well-behaved utility function $U$ .
5.

The dataset $D$ can be constrained additively rationalized by a well-behaved $U$ .
6.

The dataset $D$ can be constrained quasilinear rationalized by an increasing and continuous $U$ .

It is easy to check that items 1 and 2 are equivalent. It is also easy to see that a utility function $U$ rationalizes $D$ if and only if $U$ constrained additively rationalizes $D$ and thus items 4 and 5 are equivalent. The equivalence between items 2, 3, and 4 is well-known and is part of Afriat’s Theorem. Thus, the only novel part of Theorem 3 is the equivalences involving item 6. Note that we do not guarantee that the constrained quasilinear utility function is well-behaved (however, it is increasing and continuous).

Theorem 3 suggests that $\text{TMP}_{c}$ can be used to measure the extent to which any of the equivalent conditions in the theorem are violated. This is the theme of the next subsection.

3.2 $\text{TMP}_{c}$ as a measure of rationality

Here we introduce constrained versions of the additive cost inefficiency and quasilinear utility inefficiency measures introduced in Section 2. Because Theorem 3 establishes that constrained additive and constrained quasilinear utility maximization are equivalent to GARP these measures can be thought of as reporting the extent to which GARP is violated.

To proceed, let $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ be a dataset. For a utility function $U$ let $e_{c}^{U}$ denote the smallest amount of money which a consumer with additive (across periods) utility function $U$ would need in order to obtain the utility level $\sum_{t=1}^{T}U(\mathbf{x}^{t})$ , while not spending more than $\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ in each period $t$ , i.e.,

e_{c}^{U}=\inf\left\{\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}:\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})\geqslant\sum_{t=1}^{T}U(\mathbf{x}^{t})\text{ and }\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t},\ \forall t\right\}

The constrained additive cost inefficiency displayed by the consumer is the amount of money the consumer could save while still obtaining the same additive utility provided that in each period spending is kept within the original budget set. More precisely, the constrained additive cost inefficiency is

A_{c}=\inf_{U}\left(\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e_{c}^{U}\right)

(7)

where the infimum is taken over all utility functions. The interpretation of $A_{c}$ is the same as with $A$ and, in particular, it could be thought of a measure of deviation from the model of constrained additive utility maximization via the cost efficiency approach.

For a price vector $\mathbf{p}\in\mathbb{R}_{++}^{L}$ and a number $m>0$ let $B(\mathbf{p},m)$ denote the linear budget set $B(\mathbf{p},m)=\{\mathbf{x}\in\mathbb{R}_{+}^{L}:m\geqslant\mathbf{p}\cdot\mathbf{x}\}$ . The constrained quasilinear utility inefficiency displayed by the consumer is the amount of extra quasilinear utility the consumer could have acquired by deviating from $\mathbf{x}^{t}$ , provided that in each period the consumer’s expenditure does not exceed $\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ . Formally, the constrained quasilinear utility inefficiency is

Q_{c}=\inf_{U}\left(\sum_{t=1}^{T}\sup_{\mathbf{x}\in B(\mathbf{p}^{t},\mathbf{p}^{t}\cdot\mathbf{x}^{t})}\left[U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\right]-\left[U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\right]\right)

(8)

where the infimum is taken over all utility functions. The interpretation of $Q_{c}$ is the same as with $Q$ . We know that $Q$ can be calculated by solving a linear programming problem and it is natural to conjecture that $Q_{c}$ can also be calculated in this fashion. Indeed, this happens to be the case. $Q_{c}$ coincides with $\bar{\varepsilon}_{c}$ , the value of the linear programming problem of finding $(u_{1},u_{2},\ldots,u_{T})\in\mathbb{R}^{T}$ and $(\varepsilon_{1},\varepsilon_{2},\ldots,\varepsilon_{T})\in\mathbb{R}_{+}^{T}$ to solve {IEEEeqnarray}rCl min& ∑_t=1^T ε_t
s.t. u_s ⩽u_t + p^t ⋅( x^s - x^t ) + ε_t, ∀s,t such that p^t ⋅x^t ⩾p^t ⋅x^s Note that the linear programs of (2.2) and (8) are almost identical. The difference is that the constraints in (8) only apply for pairs of observations $t,s$ where $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{s}$ whereas the constraints in (2.2) apply regardless of whether this condition holds or not.

The following result shows that all the measures of deviation from GARP just introduced are equivalent.

Theorem 4.

For any dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ ,

\text{{\em TMP}}_{c}=A_{c}=Q_{c}=\bar{\varepsilon}_{c}.

(9)

Moreover, there exists a continuous and increasing utility function $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ that attains the infimum in the definitions of $A_{c}$ and $Q_{c}$ .

The proof of Theorem 4 shares much in common with the proof of Theorem 2 (with a couple of added nuances). To explain further we focus on the proof that $\text{TMP}_{c}=Q_{c}$ (the proof that $\text{TMP}_{c}=A_{c}$ is similar). The “easy direction” is showing that $Q_{c}\geqslant\text{TMP}_{c}$ and the proof approach is essentially the same as the one we used to show $Q\geqslant\text{TMP}$ in Theorem 2.

To show that $\text{TMP}_{c}\geqslant Q_{c}$ we proceed as follows. Let $\sigma$ be a constrained permutation which achieves the supremum in the definition of $\text{TMP}_{c}$ . The first step of our proof is to show that the permuted dataset $D_{\sigma}=(\mathbf{p}^{t},\mathbf{x}^{\sigma(t)})$ cannot be pumped by any permutation $\sigma^{\prime}$ which is constrained for $D$ (note that we consider $\sigma^{\prime}$ which are constrained for $D$ and not permutations constrained for $D_{\sigma}$ ). Once we establish that $D_{\sigma}$ cannot be pumped in this fashion we employ Lemma 1 in the Appendix which guarantees that there exists a continuous and increasing utility function $U$ that satisfies $U(\mathbf{x}^{\sigma(t)})-\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ for all $\mathbf{x}\in B(\mathbf{p}^{t},\mathbf{p}^{t}\cdot\mathbf{x}^{t})$ . We then show that the amount of money which can be pumped with $\sigma$ is weakly greater than the amount of quasilinear utility wasted according to $U$ ; in other words, the extra utility the consumer could have gained by deviating from $\mathbf{x}^{t}$ , subject to any deviation costing weakly less than $\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ . This establishes $\text{TMP}_{c}\geqslant Q_{c}$ .

3.3 Comparison to other money pump approaches

Here we compare our $\text{TMP}_{c}$ with related measures introduced in Echenique, Lee, and Shum (2011) (henceforth ELS) and Smeulders, Cherchye, Spieksma, and De Rock (2013) (henceforth SCSD). The starting point for ELS is the observation that any violation of GARP (i.e. any sequence $t_{1},t_{2},\ldots,t_{K}$ with $t_{1}=t_{K}$ so that (6) holds with at least one strict inequality) can be exploited by an arbitrageur to pump money from the consumer. This observation led ELS to quantify the degree of the violation of GARP $t_{1},t_{2},\ldots,t_{K}$ in accordance with the amount of money which could be extracted by the arbitrageur. As a single dataset $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ can contain multiple violations of GARP it was proposed by ELS to take the average (either the mean or the median) of the amount of money which can be pumped (where the average is taken over each violation of GARP) in order to measure the degree of irrationality exhibited by the consumer. SCSD proved that calculating ELS’ average money pump (using either the mean or median) is NP-hard suggesting that this measure can be difficult to calculate in practice. SCSD proposed taking the maximum amount of money which can be pumped from the consumer through any single violation of GARP (i.e. any sequence $t_{1},t_{2},\ldots,t_{K}$ with $t_{1}=t_{K}$ so that (6) holds with at least one strict inequality). They show that this maximum money pump, in contrast to the average money pump, can be calculated in polynomial time. Recall that from Theorem 4 we know that our measure $\text{TMP}_{c}$ can be calculated by solving a linear programming problem and thus our measure too can be calculated in polynomial time.

The main difference between our measure, $\text{TMP}_{c}$ , and the measures introduced by ELS and SCSD is that our index is the amount of money which can be extracted using the optimal trading strategy of the arbitrageur. On the other hand, the indices of ELS and SCSD aggregate (using either an average or a maximum) the amount of money which can be extracted over each violation of GARP whether or not the optimal strategy of the arbitrageur would actually exploit this violation (and, as we shall see in the examples below, he may not). We also note that there are no theoretical results connecting either the measure of ELS or the measure of SCSD to quasilinear or additive rationalizations (or any other rationalization concept) in the sense of our Theorem 4.

To further elucidate the differences between our $\text{TMP}_{c}$ and the measures of ELS and SCSD we present two examples.

Example 1.

Suppose there are two goods and three observations. Let $\mathbf{x}^{1}=(1,2)$ , $\mathbf{x}^{2}=(2,1)$ , and $\mathbf{x}^{3}=(\tfrac{5}{3},\tfrac{5}{3})$ and let $\mathbf{p}^{1}=(1,2)$ , $\mathbf{p}^{2}=(2,1)$ , $\mathbf{p}^{3}=(2,1)$ . Note that the budget set in observations 2 and 3 are the same and so the budget sets in this example are the same as those displayed in Figure 1(a). It is easy to check that the violations of GARP are observation sequences $(1,2)$ , $(1,3)$ , $(1,2,3)$ , and $(1,3,2)$ .⁹⁹9To confirm, for instance, that $(1,3)$ is indeed a violation of GARP just note that $\mathbf{p}^{1}\cdot(\mathbf{x}^{1}-\mathbf{x}^{3})=0$ and $\mathbf{p}^{3}\cdot(\mathbf{x}^{3}-\mathbf{x}^{1})=1$ and thus (6) holds with one strict inequality for $t=1,3$ . The amounts of money which can be pumped from each violation are: $2$ , $1$ , $2$ , and $1$ , respectively.¹⁰¹⁰10To calculate the amount of money which can be pumped, for instance, from sequence $(1,3)$ just take $\mathbf{p}^{1}\cdot(\mathbf{x}^{3}-\mathbf{x}^{1})+\mathbf{p}^{3}\cdot(\mathbf{x}^{1}-\mathbf{x}^{3})=0+1=1$ . ELS’ mean and median money pumps are thus both equal to $1.5$ . On the other hand, the optimal strategy for the arbitrageur is to ignore observation 3 and pump via observations 1 and 2 and thus $\text{TMP}_{c}=2$ (the amount of money which can be extracted from observations 1 and 2).

Because ELS measure the average of the money pumps it is possible that a more rational dataset (in the sense that it has fewer GARP violations) can actually have a higher average money pump. For instance, if the choice in observation 3 of this example were perturbed slightly to $\tilde{\mathbf{x}}^{3}=(\tfrac{5}{3}-\varepsilon,\tfrac{5}{3}+2\varepsilon)$ for some small $\varepsilon>0$ then the only GARP violations are the observation sequences $(1,2)$ and $(1,2,3)$ and thus ELS’ average money pump would be $2$ . Thus surprisingly, we see that a perturbation to the data which made the data more rational (it removed several violations of GARP) resulted in ELS’ index actually increasing in value. Note that this perturbation has no effect on $\text{TMP}_{c}$ .

Example 2.

Suppose there are four goods and four observations. In each observation $t\in\{1,2,3,4\}$ the consumer purchases 1 unit of good $t$ and $0$ units of all other goods. Thus, $\mathbf{x}^{1}=(1,0,0,0)$ , $\mathbf{x}^{2}=(0,1,0,0)$ , $\mathbf{x}^{3}=(0,0,1,0)$ , and $\mathbf{x}^{4}=(0,0,0,1)$ . Let $\mathbf{p}^{1}=(4,2,8,1)$ , $\mathbf{p}^{2}=(2,4,8,8)$ , $\mathbf{p}^{3}=(8,8,4,2)$ , and $\mathbf{p}^{4}=(2,8,2,4)$ . Note that in observation 1 the consumer purchased $\mathbf{x}^{1}=(1,0,0,0)$ but could have attained either $\mathbf{x}^{2}=(0,1,0,0)$ or $\mathbf{x}^{4}=(0,0,0,1)$ for less money. Similarly, in period 2 the consumer could have purchased $\mathbf{x}^{1}$ for less money, in period 3 the consumer could have purchased $\mathbf{x}^{4}$ for less money, and in period 4 the consumer could have purchased either $\mathbf{x}^{1}$ or $\mathbf{x}^{3}$ for less money. Using these remarks it is easy to show that the violations of GARP are observation sequences: $(1,2)$ , $(1,4)$ , and $(3,4)$ . The amounts of money which can be pumped are: $4$ , $5$ , and $4$ , respectively. The maximum money pump of SCSD is thus $5$ . Note however that the arbitrageur would not wish to pump via observations $(1,4)$ (these being the observations corresponding to the money pump amount of $5$ ) as the arbitrageur could do better by pumping via observations $(1,2)$ and $(3,4)$ (this corresponds to using the permutation $\sigma$ which satisfies $\sigma(1)=2,\sigma(2)=1,\sigma(3)=4,\sigma(4)=3$ ) as following this strategy nets the arbitrageur a total of $8$ . Thus, while the maximum money pump is $5$ we see that $\text{TMP}_{c}=8$ .

The two preceding examples help make clear the distinction between our measure, which corresponds to the arbitrageur’s optimal strategy, with the measures of ELS and SCSD. In Example 1 we see that ELS’ average money pump incorporates several small violations of GARP which the arbitrageur’s optimal strategy ignores. Similarly, in Example 2 we that SCSD’s maximum money pump exploits the most severe single violation of GARP however the arbitrageur’s optimal strategy ignores this severe violation because he can make even more money by pumping two smaller violations.

4 Conclusion

In this article we propose two notions of the money pump. The total money pump (TMP) index is the total amount of money which can be extracted from the consumer via an arbitrage strategy. We show in Theorem 2 that this money pump is equivalent to a measure of additive cost inefficiency and is also equivalent to the measure of quasilinear utility inefficiency proposed by Allen and Rehbeck (2021). The constrained total money pump (TMP_c) index, is the total amount of money which can be extracted from the consumer via an arbitrage strategy, with the added condition that the arbitrageur must make money in each round in which he enacts his strategy. Theorem 4 shows that the constrained total money pump is equivalent to a measure of constrained additive cost inefficiency and constrained quasilinear utility inefficiency.

References

Afriat (1967) S. N. Afriat. The construction of utility functions from expenditure data. International Economic Review, 8(1):67–77, 1967.
Afriat (1973) S. N. Afriat. On a system of inequalities in demand analysis: An extension of the classical method. International Economic Review, 14(2):460–472, 1973.
Allen and Rehbeck (2021) Roy Allen and John Rehbeck. Satisficing, aggregation, and quasilinear utility. October 2021.
Apesteguia and Ballester (2015) Jose Apesteguia and Miguel A. Ballester. A measure of rationality and welfare. Journal of Political Economy, 123(6):1278–1310, 2015. ISSN 00223808, 1537534X. URL https://www.jstor.org/stable/10.1086/683838.
Brown and Calsamiglia (2007) Donald J. Brown and Caterina Calsamiglia. The nonparametric approach to applied welfare analysis. Economic Theory, 31(1):183–188, 2007. ISSN 09382259, 14320479. URL http://www.jstor.org/stable/27822509.
Browning (1989) Martin Browning. A nonparametric test of the life-cycle rational expections hypothesis. International Economic Review, 30(4):979–992, 1989. ISSN 00206598, 14682354. URL http://www.jstor.org/stable/2526762.
Dean and Martin (2016) Mark Dean and Daniel Martin. Measuring Rationality with the Minimum Cost of Revealed Preference Violations. The Review of Economics and Statistics, 98(3):524–534, 07 2016. ISSN 0034-6535.
Diewert (1973) W. E. Diewert. Afriat and revealed preference theory. Review of Economic Studies, 40(3):419–425, 1973.
Echenique et al. (2011) F. Echenique, S. Lee, and M. Shum. The money pump as a measure of revealed preference violations. Journal of Political Economy, 119(6):1201–1223, 2011.
Halevy et al. (2018) Y. Halevy, D. Persitz, and L. Zrill. Parametric recoverability of preferences. Journal of Political Economy, 126(4):1558–1593, 2018.
Houtman and Maks (1985) M. Houtman and J. A. H. Maks. Determining all maximal data subsets consistent with revealed preference. Kwantitatieve Methoden, 6(19):89–104, 1985.
Smeulders et al. (2013) Bart Smeulders, Laurens Cherchye, Frits C. R. Spieksma, and Bram De Rock. The money pump as a measure of revealed preference violations: A comment. Journal of Political Economy, 121(6):1248–1258, 2013. ISSN 00223808, 1537534X. URL http://www.jstor.org/stable/10.1086/674077.
Varian (1982) H. R. Varian. The nonparametric approach to demand analysis. Econometrica, 50(4):945–973, 1982.
Varian (1990) H. R. Varian. Goodness-of-fit in optimizing models. Journal of Econometrics, 46(1-2):125–140, 1990.

Appendix A Additive Cost Efficiency and Afriat’s CCEI

Here we show that the additive cost inefficiency $A$ , as defined by (3), is in fact a version of Afriat’s CCEI (after applying a suitable normalization to $A$ ). To proceed, we normalize $A$ by dividing by total expenditure

\tilde{A}=\inf_{U}\ \dfrac{\left(\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e^{U}\right)}{\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}}

Now, let $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ be some dataset and let $U$ be an arbitrary utility function. Let $\bar{e}_{U}^{t}$ denote the smallest amount of money which the consumer could have spent in period $t$ to acquire a bundle giving as much utility as $\mathbf{x}^{t}$ . That is, $\bar{e}_{U}^{t}=\inf\{\mathbf{p}^{t}\cdot\mathbf{x}:U(\mathbf{x})\geqslant U(\mathbf{x}^{t})\}$ . Let $\mathcal{U}$ be some collection of utility functions. The CCEI for $\mathcal{U}$ is the number¹¹¹¹11When $\mathcal{U}$ is the class of well-behaved utility functions then $\text{CCEI}_{\mathcal{U}}$ can be expressed in terms of the extent to which the budget constraints need to be relaxed in order for the data to satisfy GARP. That is, the CCEI is equal to the infimum number $e$ so that for all $t_{1},t_{2},\ldots,t_{K}$ with $t_{1}=t_{K}$ we have $(1-e)\mathbf{p}^{t_{k}}\cdot\mathbf{x}^{t_{k}}\geqslant\mathbf{p}^{t_{k}}\cdot\mathbf{x}^{t_{k+1}},\qquad\qquad\text{for all }k\in\{1,2,\ldots,K-1\}$ implies that none of these inequalities hold strictly. See Halevy et al. (2018) for the proof that our definition of the CCEI and the version involving relaxed budget sets are in fact equivalent when $\mathcal{U}$ is the collection of well-behaved utility functions.

\text{CCEI}_{\mathcal{U}}=\inf_{U\in\mathcal{U}}\sup_{t}\left(\dfrac{\mathbf{p}^{t}\cdot\mathbf{x}^{t}-\bar{e}_{t}^{U}}{\mathbf{p}^{t}\cdot\mathbf{x}^{t}}\right)

(10)

In other words, the CCEI considers the percent of the budget set which is wasted in each period and aggregates the measure over periods by taking a supremum.

How does the CCEI relate to $A$ ? To answer this question let us cease to consider $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ as $T$ separate purchasing occasions but rather let us consider $D$ as one giant purchasing occasion in which the consumer buys the bundle $(\mathbf{x}^{1},\mathbf{x}^{2},\ldots,\mathbf{x}^{T})\in\mathbb{R}_{+}^{TL}$ when prices are $(\mathbf{p}^{1},\mathbf{p}^{2},\ldots,\mathbf{p}^{T})\in\mathbb{R}_{++}^{TL}$ . Let $\mathcal{U}_{A}$ denote the collection of utility functions $V$ which take the additive form $V(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})=\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})$ for some $U$ . It follows that

e^{U}=\inf\left\{\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}:\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})\geqslant\sum_{t=1}^{T}U(\mathbf{x}^{t})\right\}=\bar{e}_{1}^{V}

where $V\in\mathcal{U}_{A}$ is the additive utility function with sub-utility function $U$ . Note that ‘1’ in the subscript of $\bar{e}_{1}^{V}$ denotes the single (giant) observation where $(\mathbf{x}^{1},\mathbf{x}^{2},\ldots,\mathbf{x}^{T})$ is purchased at prices $(\mathbf{p}^{1},\mathbf{p}^{2},\ldots,\mathbf{p}^{T})$ . As $e^{U}=\bar{e}_{1}^{V}$ it is clear that $\tilde{A}=\text{CCEI}_{\mathcal{U}_{A}}$ . Thus $\tilde{A}$ can be thought of as a version of Afriat’s CCEI.

Appendix B Proofs

Proof of Theorem 1..

Let $\sigma$ be some permutation of $\{1,2,\ldots,T\}$ and note that if $D$ is quasilinear rationalized by $U$ then $\sum_{t=1}^{T}U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\sum_{t=1}^{T}U(\mathbf{x}^{\sigma(t)})-\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}$ which, after rearranging, yields $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})\leqslant 0$ and thus item 5 implies item 1.

Next, suppose $D$ satisfies cyclical monotonicity. Let $U:X\rightarrow\mathbb{R}$ be defined by

U(\mathbf{x})=\inf\left(\mathbf{p}^{t_{K}}\cdot(\mathbf{x}-\mathbf{x}^{t_{K}})+\sum_{k=1}^{K-1}\mathbf{p}^{t_{k}}\cdot(\mathbf{x}^{t_{k+1}}-\mathbf{x}^{t_{k}})\right)

(11)

where the infimum is taken over all finite sequences $t_{1},t_{2},\ldots,t_{K}$ . As $D$ satisfies cyclical monotonicity it is easy to see that the infimum in (11) is always attained by some sequence $t_{1},t_{2},\ldots,t_{K}$ with at most $T$ elements. As such, $U$ is the pointwise infimum of finitely many well-behaved functions and is thus well-behaved. Let $t\in\{{1,2,\ldots,T}\}$ and let $\tilde{t}_{1},\tilde{t}_{2},\ldots,\tilde{t}_{K}$ be the sequence which attains the infimum in (11) for $U(\mathbf{x}^{t})$ . For any $\mathbf{x}\in\mathbb{R}_{+}^{L}$ we have

U(\mathbf{x}^{t})+\mathbf{p}^{t}\cdot(\mathbf{x}-\mathbf{x}^{t})=\mathbf{p}^{t}\cdot(\mathbf{x}-\mathbf{x}^{t})+\mathbf{p}^{\tilde{t}_{K}}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\tilde{t}_{K}})+\sum_{k=1}^{K-1}\mathbf{p}^{\tilde{t}_{k}}\cdot(\mathbf{x}^{\tilde{t}_{k+1}}-\mathbf{x}^{\tilde{t}_{k}})\geqslant U(\mathbf{x})

where the final inequality follows from the definition of $U$ . Rearranging the previous inequality gives $U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ and so $D$ is quasilinear rationalized by $U$ . Let $(\tilde{\mathbf{x}}^{1},\tilde{\mathbf{x}}^{2},\ldots,\tilde{\mathbf{x}}^{T})$ satisfy $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant(>)\ \mathbf{p}^{t}\cdot\tilde{\mathbf{x}}^{t}$ . Then, as $D$ is quasilinear rationalized by $U$ we see $\sum_{t=1}^{T}U(\mathbf{x}^{t})\geqslant\sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})+\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\tilde{\mathbf{x}}^{t})\geqslant(>)\ \sum_{t=1}^{T}U(\tilde{\mathbf{x}}^{t})$ and so $D$ is additively rationalized by $U$ . We have just shown that item 2 implies items 4 and 6. It is easy to see that items 1 and 2 are equivalent and that item 4 implies item 3 and item 6 implies item 5 and so the proof is complete. ∎

Proof of Theorem 2..

To show that $Q\geqslant\bar{\varepsilon}$ let $U$ be any utility function and define $u_{t}=U(\mathbf{x}^{t})$ and $\varepsilon_{t}=\sup_{s}\big{(}u_{s}-\mathbf{p}^{t}\cdot\mathbf{x}^{s}+u_{t}-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\big{)}$ . It is easy to see that these numbers $u_{t}$ and $\varepsilon_{t}$ constitute a feasible solution to (2.2) and further that $\sum_{t=1}^{T}\varepsilon_{t}$ lies below the quantity

\sum_{t=1}^{T}\sup_{\mathbf{x}\in\mathbb{R}_{+}^{L}}\left[U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\right]-\left[U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\right]

and so indeed $Q\geqslant\bar{\varepsilon}$ .

To see that $\bar{\varepsilon}\geqslant\text{TMP}$ let $(u_{1},u_{2},\ldots,u_{T})$ and $(\varepsilon_{1},\varepsilon_{2},\ldots,\varepsilon_{T})$ constitute a feasible solution to (2.2) and let $\sigma$ be any permutation of $\{1,2,\ldots,T\}$ . Using the constraint inequalities in (2.2) we see

\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})=\sum_{t=1}^{T}\left[u_{\sigma(t)}-u_{t}+\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})\right]\leqslant\sum_{t=1}^{T}\varepsilon_{t}=\bar{\varepsilon}

and so indeed $\bar{\varepsilon}\geqslant\text{TMP}$ .

To see that $A\geqslant\text{TMP}$ let $\sigma$ be any permutation of $\{1,2,\ldots,T\}$ and note that for any utility function $U$ we have $\sum_{t=1}^{T}U(\mathbf{x}^{\sigma(t)})=\sum_{t=1}^{T}U(\mathbf{x}^{t})$ and so

\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})\leqslant\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e^{U}

from which we see that indeed $A\geqslant\text{TMP}$ .

Next, let $\sigma$ be a permutation which achieves the supremum in the definition of the TMP (i.e. achieves the supremum in (2)). Let $D_{\sigma}=(\mathbf{p}^{t},\mathbf{x}^{\sigma(t)})_{t\leqslant T}$ . We claim that $D_{\sigma}$ satisfies cyclical monotonicity. For a contradiction suppose that this is not the case and thus there exists some permutation $\sigma^{\prime}$ which “money pumps” the dataset $D_{\sigma}$ in the sense that $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{\sigma^{\prime}(\sigma(t))})>0$ . We have

0<\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{\sigma^{\prime}(\sigma(t))})=\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma^{\prime}(\sigma(t))})-\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})

As $D_{\sigma}$ satisfies cyclical monotonicity we may appeal to Theorem 1 to see that there exists a well-behaved $U$ which rationalizes $D_{\sigma}$ . Therefore, $\sum_{t=1}^{T}[U(\mathbf{x}^{\sigma(t)})-\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}]-[U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}]=\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})=\text{TMP}$ and so we see that $\text{TMP}\geqslant Q$ . Noting that $\sum_{t=1}^{T}U(\mathbf{x}^{\sigma(t)})=\sum_{t=1}^{T}U(\mathbf{x}^{t})$ delivers $\text{TMP}\geqslant A$ . Thus, (5) holds and the infimum in the definitions of $A$ and $Q$ are attained by the well-behaved utility function $U$ . ∎

Lemma 1.

Let $D=(\mathbf{p}^{t},\mathbf{x}^{t})_{t\leqslant T}$ be a dataset and let $\mathbf{m}=(m^{1},m^{2},\ldots,m^{T})\in\mathbb{R}_{+}^{T}$ be a vector satisfying $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ for all $t$ . Suppose that for all permutations $\sigma$ satisfying $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}$ for all $t$ we have $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})\leqslant 0$ . Then, there exists a continuous and increasing utility function $U$ which satisfies $U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ for all $t$ and all $\mathbf{x}\in\mathbb{R}_{+}^{L}$ satisfying $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ .

Proof.

Let $\beta>0$ be some very large number. For each $t$ let $f_{t}:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ be defined by

f_{t}(\mathbf{x})=\begin{cases}\mathbf{p}^{t}\cdot(\mathbf{x}-\mathbf{x}^{t}),&\qquad\text{ if }m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}\\ \beta\mathbf{p}^{t}\cdot(\mathbf{x}-\mathbf{x}^{t}),&\qquad\text{ if }m^{t}<\mathbf{p}^{t}\cdot\mathbf{x}\end{cases}

Note that each $f_{t}$ is increasing and continuous. Let $U:\mathbb{R}_{+}^{L}\rightarrow\mathbb{R}$ be defined by

U(\mathbf{x})=\inf\left(f_{t_{K}}(\mathbf{x})+\sum_{k=1}^{K-1}f_{t_{k}}(\mathbf{x}^{t_{k+1}})\right)

(12)

where the infimum is taken over all finite sequences $t_{1},t_{2},\ldots,t_{K}$ . From our assumptions on $D$ we can, by taking $\beta$ sufficiently large, ensure that (i) for each $\mathbf{x}\in\mathbb{R}_{+}^{L}$ the infimum in (12) is attained by some finite sequence $t_{1},t_{2},\ldots,t_{K}$ which has at most $T$ elements and (ii) for all $t$ and all sequences $t_{1},t_{2},\ldots,t_{K}$ which attain the infimum in (12) with $\mathbf{x}=\mathbf{x}^{t}$ we have $m^{t_{K}}\geqslant\mathbf{p}^{t_{K}}\cdot\mathbf{x}^{t}$ and $m^{t_{k}}\geqslant\mathbf{p}^{t_{k}}\cdot\mathbf{x}^{t_{k+1}}$ for all and $k<K$ .

By property (i) we see that $U$ is the pointwise infimum of finitely many increasing and continuous functions and is thus increasing and continuous. Let $t\in\{{1,2,\ldots,T}\}$ and let $\tilde{t}_{1},\tilde{t}_{2},\ldots,\tilde{t}_{K}$ be the sequence which attains the infimum in (12) for $U(\mathbf{x}^{t})$ . For any $\mathbf{x}\in\mathbb{R}_{+}^{L}$ satisfying $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ we may use property (ii) to see that

U(\mathbf{x}^{t})+f_{t}(\mathbf{x})=f_{t}(\mathbf{x})+f_{\tilde{t}_{K}}(\mathbf{x}^{t})+\sum_{k=1}^{K-1}f_{\tilde{t}_{k}}(\mathbf{x}^{\tilde{t}_{k+1}})\geqslant U(\mathbf{x})

where the final inequality follows from the definition of $U$ . Rearranging the previous inequality and using the assumption that $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ gives $U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ and thus the proof is complete. ∎

Proof of Theorem 3..

As noted, the equivalence between items 2-4 is well-known and is part of Afriat’s Theorem and so we omit the proof. Also as noted, the equivalence between items 1 and 2 and the equivalence between items 4 and 5 are obvious and so we also do not prove these results either. Clearly item 6 implies item 3 and so to complete the proof it suffices to show that item 1 implies item 6. So, suppose $\text{TMP}_{c}=0$ . Take $\mathbf{m}=(m^{1},m^{2},\ldots,m^{T})$ where $m^{t}=\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ . It is easy to see that $D$ and $\mathbf{m}$ satisfy the hypothesis in Lemma 1 and so the lemma provides an increasing and continuous utility function $U$ which satisfies $U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ for all $\mathbf{x}\in\mathbb{R}_{+}^{L}$ satisfying $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}$ . In other words, $D$ is constrained quasilinear rationalized by $U$ and so the proof is complete. ∎

Proof of Theorem 4..

To show that $Q_{c}\geqslant\bar{\varepsilon}_{c}$ let $U$ be any utility function and define $u_{t}=U(\mathbf{x}^{t})$ and $\varepsilon_{t}=\sup_{s}\big{(}u_{s}-\mathbf{p}^{t}\cdot\mathbf{x}^{s}+u_{t}-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\big{)}$ where the supremum is taken over all $s$ such that $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{s}$ . It is easy to see that these numbers $u_{t}$ and $\varepsilon_{t}$ constitute a feasible solution to (8) and further that $\sum_{t=1}^{T}\varepsilon_{t}$ lies below the quantity

\sum_{t=1}^{T}\sup_{\mathbf{x}\in B(\mathbf{p}^{t},\mathbf{p}^{t}\cdot\mathbf{x}^{t})}\left[U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}\right]-\left[U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}\right]

and so indeed $Q_{c}\geqslant\bar{\varepsilon}_{c}$ .

To see that $\bar{\varepsilon}_{c}\geqslant\text{TMP}_{c}$ let $(u_{1},u_{2},\ldots,u_{T})$ and $(\varepsilon_{1},\varepsilon_{2},\ldots,\varepsilon_{T})$ constitute a feasible solution to (8) and let $\sigma$ be any constrained permutation of $\{1,2,\ldots,T\}$ . Using the constraint inequalities in (8) we see

\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})=\sum_{t=1}^{T}\left[u_{\sigma(t)}-u_{t}+\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})\right]\leqslant\sum_{t=1}^{T}\varepsilon_{t}=\bar{\varepsilon}_{c}

and so indeed $\bar{\varepsilon}_{c}\geqslant\text{TMP}_{c}$ .

To see that $A_{c}\geqslant\text{TMP}_{c}$ let $\sigma$ be any constrained permutation of $\{1,2,\ldots,T\}$ and note that for any utility function $U$ we have $\sum_{t=1}^{T}U(\mathbf{x}^{\sigma(t)})=\sum_{t=1}^{T}U(\mathbf{x}^{t})$ and so

\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})\leqslant\sum_{t=1}^{T}\mathbf{p}^{t}\cdot\mathbf{x}^{t}-e^{U}

from which we see that indeed $A_{c}\geqslant\text{TMP}_{c}$ .

Next, let $\sigma$ be a constrained permutation which achieves the supremum in definition of the $\text{TMP}_{c}$ . Let $D_{\sigma}=(\mathbf{p}^{t},\mathbf{x}^{\sigma(t)})_{t\leqslant T}$ and let $\mathbf{m}=(m^{1},m^{2},\ldots,m^{T})$ where $m^{t}=\mathbf{p}^{t}\cdot\mathbf{x}^{t}$ for all $t$ . We claim that $D_{\sigma}$ and $\mathbf{m}$ satisfy the hypotheses of Lemma 1 in the sense that for any permutation $\sigma^{\prime}$ satisfying $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma^{\prime}(\sigma(t))}$ we have $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{\sigma^{\prime}(\sigma(t))})\leqslant 0$ . For a contradiction suppose that this is not the case and thus there exists a permutation $\sigma^{\prime}$ satisfying (i) $m^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma^{\prime}(\sigma(t))}$ for all $t$ and (ii) $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{\sigma^{\prime}(\sigma(t))})>0$ . Using (i) and the fact that $\sigma$ is a constrained permutation for $D$ we see that for all $t$ we have $\mathbf{p}^{t}\cdot\mathbf{x}^{t}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}\geqslant\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma^{\prime}(\sigma(t))}$ and thus $\sigma^{\prime}\circ\sigma$ is a constrained permutation for $D$ . Using (ii) we see

0<\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{\sigma^{\prime}(\sigma(t))})=\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{t})-\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma^{\prime}(\sigma(t))}-\mathbf{x}^{t})

After rearranging we see that $\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma^{\prime}(\sigma(t))}-\mathbf{x}^{t})>\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{\sigma(t)}-\mathbf{x}^{t})$ which contradicts the assumption that $\sigma$ achieves the supremum in the definition of $\text{TMP}_{c}$ (recall that we have shown that $\sigma^{\prime}\circ\sigma$ is a constrained permutation for $D$ ). Having achieved a contradiction we conclude that indeed $D_{\sigma}$ and $\mathbf{m}$ satisfy the hypotheses in Lemma 1 and so the lemma guarantees that there exists a continuous and increasing utility function $U$ which satisfies $U(\mathbf{x}^{\sigma(t)})-\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}\geqslant U(\mathbf{x})-\mathbf{p}^{t}\cdot\mathbf{x}$ for all $\mathbf{x}\in B(\mathbf{p}^{t},\mathbf{p}^{t}\cdot\mathbf{x}^{t})$ . Therefore, $\sum_{t=1}^{T}[U(\mathbf{x}^{\sigma(t)})-\mathbf{p}^{t}\cdot\mathbf{x}^{\sigma(t)}]-[U(\mathbf{x}^{t})-\mathbf{p}^{t}\cdot\mathbf{x}^{t}]=\sum_{t=1}^{T}\mathbf{p}^{t}\cdot(\mathbf{x}^{t}-\mathbf{x}^{\sigma(t)})=\text{TMP}_{c}$ and so we see that $\text{TMP}_{c}\geqslant Q_{c}$ . Noting that $\sum_{t=1}^{T}U(\mathbf{x}^{\sigma(t)})=\sum_{t=1}^{T}U(\mathbf{x}^{t})$ delivers $\text{TMP}_{c}\geqslant A_{c}$ . Thus, (9) holds and the infimum in the definitions of $A_{c}$ and $Q_{c}$ are attained by the continuous and increasing utility function $U$ . ∎