This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Coopetition Against an Amazon

Ronen Gradwohl Department of Economics and Business Administration, Ariel University. Email: roneng@ariel.ac.il. Gradwohl gratefully acknowledges the support of National Science Foundation award number 1718670.    Moshe Tennenholtz Faculty of Industrial Engineering and Management, The Technion – Israel Institute of Technology. Email: moshet@ie.technion.ac.il. The work by Moshe Tennenholtz was supported by funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant number 740435).
Abstract

This paper studies cooperative data-sharing between competitors vying to predict a consumer’s tastes. We design optimal data-sharing schemes both for when they compete only with each other, and for when they additionally compete with an Amazon—a company with more, better data. We show that simple schemes—threshold rules that probabilistically induce either full data-sharing between competitors, or the full transfer of data from one competitor to another—are either optimal or approximately optimal, depending on properties of the information structure. We also provide conditions under which firms share more data when they face stronger outside competition, and describe situations in which this conclusion is reversed.

1 Introduction

A key challenge to firms competing in today’s electronic marketplace is competition against Big Tech companies that have considerably more data and so better predictive models. One way in which smaller firms may overcome this hurdle and survive or even thrive in such a market is to engage in coopetitive strategies—namely, to cooperate with other small firms that are its competitors. Such coopetitive strategies have increasingly become a field of study by both academics and practitioners (see, for example, Brandenburger and Nalebuff (2011) and Bengtsson and Kock (2000)), but they largely focus on industrial applications such as healthcare, IT, and service industries. In this paper, we study coopetition between e-commerce companies, and focus on the possibility of data sharing as a way to deal with their data imbalance vis-à-vis Big Tech.

Such coopetitive data-sharing can be undertaken by the firms themselves or by external service-providers. To facilitate the former, the newly burgeoning area of federated machine learning has as its goal the design of mechanisms that generate predictive models for firms based on such shared data (Yang et al., 2019). The assumption underlying federated learning is that these firms actually want predictive models based on all shared data, but in competitive scenarios this need not be the case. Although each firm certainly desires its own model to be as predictive as possible, it most likely does not wish the same for its competitors. So when is full data-sharing, leading to maximally predictive models for all firms, optimal? When is partial or no sharing better? And do these answers differ when firms compete against an Amazon?

In addition to using federated machine learning, coopetitive data-sharing can also be facilitated by an external service-provider. While tools for data sharing are prevalent—and include platforms such as Google Merchant and Azure Data Share—they are currently undergoing even further development. For example, in its European Strategy for Data, the European Commission plans the following: “In the period 2021-2027, the Commission will invest in a High Impact Project on European data spaces and federated cloud infrastructures. The project will fund infrastructures, data-sharing tools, architectures and governance mechanisms for thriving data-sharing and Artificial Intelligence ecosystems” (European Commission, 2020). But just as with federated learning, the potential benefits from data sharing may arise under partial, rather than full, sharing. For instance, one industry white paper urges service providers to

“…analyze the combined, crowdsourced data and generate benchmark analyses and comparative performance reports. Each participating client gains insights that it could not otherwise access, and each benefits from the service-host provider’s ability to slice and dice the aggregated data and share the results that are relevant to each client” (Loshen, 2014).

What is the optimal way for such service providers to combine, slice and dice, and share e-commerce firms’ data in order to facilitate successful coopetition against an Amazon?

Regardless of how coopetitive data-sharing is achieved—via federated learning or through a service provider—the answer to whether or not firms desire maximally predictive models for all depends on how these predictions are eventually used. If the firms involved act in completely unrelated markets, then of course full data-sharing is optimal—it yields a maximally predictive model for a firm, and that firm is not harmed by others’ better predictions. This is no longer true if the firms are competitors, since then a firm may be harmed by an improvement in its competitors’ predictions. In this case, whether or not the tradeoff is worthwhile depends on the details of the market in which firms compete.

In this paper we focus on one such market, which is motivated by recommender systems for online advertising and Long Tail retail. In our model, there are many types of consumers, and firms use their data to infer and take a tailored action for each consumer type. In the online advertising market, for example, firms use their data to personalize advertisements in order to maximize the probability that a consumer clicks on their ad. Data sharing is common in this market, and platforms such as Google Merchant and Azure Data Share specifically facilitate such sharing across clients in order to improve personalization and increase clickthrough rates.

For another concrete application, consider retailers in a Long Tail market—a market with an enormous number of low-demand goods that collectively make up substantial market share—interested in finding consumers for their products. Success in such markets crucially relies on retailers’ abilities to predict consumers’ tastes in order to match them to relevant products (Anderson, 2006; Pathak et al., 2010). On their own, smaller firms can be destroyed by giants with vastly greater amounts of data and thus a clear predictive advantage. In cooperation with other small firms, however, this data imbalance may be mitigated, possibly granting smaller firms a fighting chance. (We do not discuss here antitrust issues surrounding data sharing; the reader is referred to Martens et al. (2020) for a discussion thereof in the context of the EU’s European Strategy for Data.)

In this paper we study such cooperation between competing firms, and design data-sharing schemes that are optimal—ones that maximize total firm profits subject to each firm having the proper incentives to participate. More specifically, we study two settings, one without and the other with an Amazon. In the former, in which smaller firms compete only with one another, we show that coopetition is beneficial to the extent that firms can share data to simultaneously improve their respective predictions. In the optimal data-sharing scheme firms benefit from sharing because it allows them to better predict the tastes of consumers for whom no firm predicts well on its own.

In the latter setting, in which smaller firms compete with one another but also with an Amazon, sharing data about consumers for whom no firm predicts well can be beneficial, but is not optimal. We show that, in the optimal scheme, firms share data with others and weaken their own market positions on some market segments, in exchange for receiving data from others and strengthening their market positions on other segments.

The following simple example illustrates our model and previews some of our results. The example is framed as retailers offering goods to consumers who may make a purchase, but can also be interpreted as advertisers displaying advertisements to consumers who may click on the ads. In either case, suppose the market consists of two goods, g0g_{0} and g1g_{1}, that consumers may desire. Each consumer is described by a feature vector in {0,1}×{0,1}\{0,1\}\times\{0,1\} that determines that consumer’s taste: Consumers of types 00 and 11 are only interested in g0g_{0}, whereas consumers of types 01 and 10 are only interested in g1g_{1}. A priori, suppose the distribution of consumers in the market is such that an α\alpha fraction are of type 00, a β\beta fraction of type 01, a γ\gamma fraction of type 10, and a δ\delta fraction of type 11.

When a consumer shows up to the market, a retailer may offer him one of the goods (as a convention, we use masculine pronouns for consumers and feminine pronouns for retailers/players). If the retailer has data about the consumer’s type, she will offer the good desired by that consumer. Assume that when a consumer is offered the correct good he makes the purchase, and that this leads to a profit of 1 to the retailer. If there are two retailers offering this good to the consumer he chooses one at random from whom to make the purchase. A retailer whose good is not chosen gets profit 0.

Suppose now that there are two retailers pursuing the same consumer, but that they do not fully know the consumer’s type. Instead, the first retailer only knows the first bit of the consumer’s type, and the second retailer only knows the second bit. This situation is summarized in Figure 1, where the first retailer knows the row bit and the second knows the column bit. Which goods should the competing retailers offer the consumer? For this example, suppose for simplicity that α>2β2γ>4δ\alpha>2\beta\geq 2\gamma>4\delta. Then each of the retailers has a dominant strategy: If the bit they know is 0, offer g0g_{0}, and if the bit they know is 1, offer g1g_{1}.

To see that this is a dominant strategy, consider for example the row retailer, and suppose she learns that a consumer’s row bit is 0. She thus knows that the correct good is g0g_{0} with probability α/(α+β)\alpha/(\alpha+\beta) and g1g_{1} with probability β/(α+β)\beta/(\alpha+\beta). Her utility, however, depends also on the good offered by the column retailer. The row retailer’s worst-case utility from offering g0g_{0} is α/(2α+2β)\alpha/(2\alpha+2\beta), which occurs when the column player also offers g0g_{0}. Her best case utility from offering g1g_{1} is β/(α+β)\beta/(\alpha+\beta). Since we assumed α>2β\alpha>2\beta, offering g0g_{0} is best for the row player regardless of the other retailer’s offer, and is thus dominant. The analyses for the case in which the row retailer’s bit is 1, as well as for the column retailer’s strategy, are similar.

0 1
0 g0g_{0} g1g_{1}
1 g1g_{1} g0g_{0}
(a) Desired good
0 1
0 α\alpha β\beta
1 γ\gamma δ\delta
(b) Prior probability
Figure 1: Data sharing: An example

The result from deployment of these dominant strategies is that consumers of type 00 will be offered the correct good by both retailers, and will thus choose one at random. Consumers of types 01 and 10 will be offered the correct good by only one of the retailers, whereas consumers of type 11 will not make a purchase as they will not be offered the correct good. The expected profit of the first retailer will thus be α/2+γ\alpha/2+\gamma, and of the second retailer α/2+β\alpha/2+\beta.

Does data sharing improve retailers’ profits? Suppose retailers could share their respective information with one another, so that both always knew the consumer’s type. This would lead to both always offering the correct good, and hence to expected profits of 1/21/2 for each. However, this would be detrimental to the second retailer whenever α/2+β>1/2\alpha/2+\beta>1/2, since her profits with data sharing would be lower than without. Such cooperation is thus not individually rational – the second retailer will not want to cooperate with the first.

Instead, suppose there is a mediator—a trusted third-party that may represent either a service provider or a cryptographic protocol (such as a federated learning algorithm) run by the two retailers. The European Commission calls mediators common data spaces European Commission (2020), and describes them as “third-parties [who] may act as intermediaries and apply new technologies and ways of organising markets in order to…enable transactions that were previously not feasible. They can be private, public or community organisations that are neutral with respect to data uses…” (Martens et al., 2020). Given such a mediator, consider the following data-sharing scheme: both retailers share their data with the mediator, who then passes along the data to all retailers only if the consumer’s type is 11. If the consumer is not of type 11 then each retailer has her original data, and additionally infers that the consumer’s type is not 11 (for otherwise she would have learned that the type is 11).

How does this scheme affect retailers’ strategies? Clearly, when retailers learn that the consumer’s type is 11, both offer g0g_{0}. What happens if a retailer learns that the consumer’s type is not 11? Consider the row retailer. If her bit is 0, then learning that the consumer’s type is not 11 does not provide any new information, and so she still has the same dominant strategy of offering g0g_{0}. If her bit is 1, however, then she learns that the consumer’s type must be 10. But note that in this case, her best strategy is to offer g1g_{1}, which is the same as her dominant strategy absent a mediator. Thus, the mediator changes the row retailer’s behavior only when the consumer is of type 11. A similar analysis and conclusion hold for the column retailer.

Since retailers offer the same goods with and without data sharing in all cases except when the consumer is of type 11, this scheme changes retailer’s profits only in this latter case. In particular, it leads to an additional (ex ante) profit of δ/2\delta/2 for each retailer beyond her original profit—with probability δ\delta the consumer is of type 11, in which case the retailers split the additional surplus of 1—and is thus beneficial to both. Observation 1 shows that such a scheme is, in fact, not only individually rational but also optimal.

Suppose now that the two retailers are also competing against an Amazon for the consumer. Since the Amazon is a giant it has more data, and, in particular, we assume that it has complete information about each consumer’s type. In this three-way competition, the original dominant strategies of our two smaller retailers do not perform as well: They lead to profits of α/3+γ/2\alpha/3+\gamma/2 and α/3+β/2\alpha/3+\beta/2 to the first and second retailer, respectively, since when they offer the correct good the consumer now chooses amongst up to 3 retailers. The mediator described above, which reveals information when the type is 11, leads to higher profits, since now there is an additional δ/3\delta/3 to each of the smaller retailers. However, that mediator is no longer optimal, and the retailers can actually do better.

To see this, observe that, conditional on consumer type 01, the total profit to the small retailers is β/2\beta/2, since only the second makes the correct offer g1g_{1} and then competes with the Amazon. In contrast, if both retailers were to know the type and offer the correct good, then their total profit would be 2β/32\beta/3, namely β/3\beta/3 each. This is harmful to the second retailer as it involves a loss of profit, but a gain for the first and the sum. But the second retailer can be compensated by getting data from the first elsewhere, for instance on consumer type 10.

For simplicity of this example, suppose that β=γ\beta=\gamma, and consider a mediator that facilitates full data-sharing, in which both retailers learn each other’s data and thus have complete information about consumers’ types. Here, the profit of each is 1/3>α/3+β/2=α/3+γ/21/3>\alpha/3+\beta/2=\alpha/3+\gamma/2, so both gain from this data sharing. Furthermore, the total profit of the retailers is 2/32/3, which is the maximal utility they can obtain when competing against an Amazon. Thus, this data-sharing scheme is individually rational and optimal. Observe that it leads to higher retailer welfare than the optimal scheme absent a mediator.

Full data-sharing is not always individually rational, however. If β\beta is much greater than γ\gamma, then the second retailer is not sufficiently compensated by the first for sharing data about consumer 01. The second retailer will consequently be harmed by such data sharing, and so the scheme will not be individually rational. However, in Theorem 4 we show that there is a different data-sharing scheme that is individually rational and optimal, a scheme in which the first retailer shares all her data and the second shares some of her data. Overall, we show that such data-sharing coopetition against an Amazon is beneficial to the small retailers.

1.1 Contribution

Our model is more general than the simple example above, and involves many goods and many types of consumers. For the example it was useful to think of each consumer as having a feature vector describing his type, but in the model we take a more general approach that allows for a wider class of information structures. Furthermore, in general the retailers will not have dominant strategies, either with or without a mediator. Instead, we suppose that, absent a mediator, players play an arbitrary Bayesian Nash equilibrium, and construct mediators that lead to higher expected utilities in equilibrium.

In Section 3, we design optimal mediators for coopetitive data-sharing. In addition to maximizing total retailer profits, all our mediators are individually rational—retailers attain higher utilities when they use the mediators—and incentive compatible—in equilibrium, each retailer’s best strategy is to follow the mediators’ recommendations. One of our main results is that the optimal schemes are simple. In particular, instead of relying on the details of retailers’ data or on their strategies in equilibrium sans data-sharing (as in the introductory example above), our mediators consist of threshold rules that probabilistically induce either full data-sharing between retailers, or the full transfer of data from one retailer to another.

The analysis in Section 3 retains one of the assumptions present in the example above—that the joint data of the small retailers is sufficient to uniquely identify each consumer’s type. In Section 4 we drop this assumption, leading to a new set of challenges. This is because, when the assumption holds, there is no conflict between welfare maximization and equilibrium, and so the main hurdle in designing optimal mediators is the individual rationality constraint. Without the assumption, however, participants’ incentive compatibility constraints impose a limit on the total welfare that can be achieved. Nonetheless, our main results in this section are that variants of the simple mediators from Section 3 are approximately optimal here as well. We also provide examples showing that our approximation factor is tight.

Finally, in Section 5 we delve into the intriguing question of whether players share more data in the presence of an Amazon or in its absence. First, under the assumption that the joint data of the small retailers is sufficient to uniquely identify each consumer’s type, we show that if data sharing is strictly beneficial in the absence of an Amazon, then it is also strictly beneficial in its presence. This confirms the intuition that players share more data when facing stronger outside competition. However, we also show that this conclusion may be reversed when the assumption does not hold. In this latter case, we show that sometimes data sharing can be strictly beneficial in the absence of an Amazon, but not in its presence. We show that the reason this may happen depends on whether or not players’ equilibrium considerations conflict with welfare maximization.

The rest of the paper proceeds as follows. First we survey the related literature, and then, in Section 2, develop the formal model. This is followed by our main analyses in Sections 34, and 5. Concluding notes appear in Section 6.

1.2 Related Literature

Most broadly, our paper contributes to a burgeoning literature on competition in prediction and machine learning. Within this literature, papers such as Mansour et al. (2018), Ben-Porat and Tennenholtz (2019), and Feng et al. (2019) study different models of learning embedded in settings where participants compete with others in their predictions. They take participants’ data as given, and point to the effect competition has on optimal learning algorithms. Our focus, in contrast, is on the effect of data sharing on competitive prediction.

Because of its focus on data sharing, our paper is also related to the more established literature on strategic information-sharing, a literature that focuses on a number of distinct applications: oligopolistic competition (Clarke, 1983; Raith, 1996), financial intermediation (Pagano and Jappelli, 1993; Jappelli and Pagano, 2002; Gehrig and Stenbacka, 2007), supply chain management (Ha and Tong, 2008; Shamir and Shin, 2016), price discrimination (Liu and Serfes, 2006; Jentzsch et al., 2013), and competition between data brokers (Gu et al., 2019; Ichihashi, 2020). Much of this literature revolves around the question of whether it is beneficial for participating firms to pool all their data, or whether they would prefer to pool only some or none at all. As Bergemann and Morris (2013) demonstrate, however, more finely tuned data sharing can be beneficial even when full or partial pooling is not, and so the existing schemes do not exhaust the potential benefits of data sharing. In this paper we analyze precisely such finely tuned sharing.

A different paper that does consider more finely tuned sharing is that of de Zegher and Lo (2020), who study an infinitely repeated setting in which firms compete for market share. Their main result is the construction of a data sharing scheme that Pareto improves all participants’ welfare. Like our results, their scheme also only reveals partial information to participants. Unlike our approach, however, de Zegher and Lo (2020)’s scheme relies on the repeated nature of the interaction, and uses folk-theorem-type arguments to show that cooperation can be sustained. In addition, their focus is on data sharing between many small firms, and they do not consider the possible presence of an Amazon.

In terms of modeling and techniques, our paper falls into the literature on information design. Information design, recently surveyed by Bergemann and Morris (2019), is the study of how the allocation of information affects incentives and hence behavior, with a focus on the extent and limits of purely informational manipulation. The information design problem encompasses work on communication in games and on Bayesian persuasion. When the mediator is assumed to have only the information held by the players, or only the information they are willing to share with him, the problem maps to the one studied in the literature on communication in games. The goal of these studies is to characterize the equilibrium outcomes achievable when players are allowed to communicate prior to playing a fixed game, and where communication is captured by players’ interaction with a mediator. Myerson (1991) and Forges (1993) provide useful overviews. Our paper builds on this model by endowing the mediator with the specific aim of maximizing (some of) the players’ utilities.

A different setting, called Bayesian persuasion (Kamenica and Gentzkow, 2011), is one in which only the mediator (here called the sender) has payoff-relevant information, and in which he can commit to a particular information structure prior to observing that information. Initial work on Bayesian persuasion focused on the case of a single sender and a single player (Kamenica and Gentzkow, 2011), but more recent research also consists of settings with multiple senders (such as Gentzkow and Kamenica (2016)) and multiple players. Our paper is most closely related to the latter, and specifically to Bergemann and Morris (2016) and Galperti and Perego (2018), who develop a linear programming approach to study the effect of different information structures on the Bayes correlated equilibria of the subsequent game, as well as Mathevet et al. (2020), who extend the belief-based approach of Kamenica and Gentzkow (2011) to study optimal information structures under different solution concepts.

A closely related strand of the literature, recently surveyed by Bergemann and Bonatti (2019), focuses on the sale of information by a data broker. Most of this work differs from our paper in that it focuses on a market with one-sided information flow, with information going only from one party to another, whereas we study a market where information flow is bidirectional across firms. Nonetheless, the insights arising from that literature are related to ones we develop in this paper. For example, Bimpikis et al. (2019) consider a monopolistic data broker who sells data to firms who compete in a downstream market. A major insight in that paper is that the amount of information optimally sold by the data broker depends on the nature of downstream competition, and, in particular, that more (resp., less) data is sold if that competition features strategic complements (resp., substitutes). While our paper studies data sharing as opposed to data sale, our results also shed light on the effect the nature of competition has on the amount of data shared. In our model firms’ actions are strategic substitutes—when more firms offer the correct good to a particular consumer (e.g., due to the presence of an Amazon), the value of offering the correct good decreases. Our results indicate that sometimes there is more data sharing in the absence of an Amazon and hence when strategic substitutes are weaker, which is consistent with the insight of Bimpikis et al. (2019). However, we also show that sometimes the opposite holds.

There is also a relationship between information sale and information sharing, which is pointed out by Bergemann and Bonatti (2019). These authors relate their general model of a market for information to incentives for information sharing, and point to the study of finely tuned sharing schemes as an open problem.

Our paper is also related to a set of papers that focuses on designing mediators to achieve various goals, such as to improve the incentives of players, make equilibria robust to collusion, or implement correlated equilibria while guaranteeing privacy Monderer and Tennenholtz (2004, 2009); Kearns et al. (2014). The first two differ from our work in that they make stronger assumptions about the mediator’s capabilities, such as changing payoffs or limiting player actions, and the third focuses on a setting with many players that is quite different from our own.

Finally, our work is conceptually related to research on federated learning, and in particular on cross-silo federated learning Kairouz et al. (2019). This framework consists of a set of agents with individual data whose goal is to jointly compute a predictive model. A recent emphasis within federated learning is on incentivizing the agents to participate, sometimes through monetary transfers Zhan et al. (2021) and sometimes by providing different models to different agents Lyu et al. (2020). The focus of these papers is on fairness—agents that provide more data should be compensated more generously. Our paper differs, in that agents are assumed to utilize the resulting models in some competition, and this subsequent competition drives agents’ incentives to participate. Our work is thus orthogonal to that surveyed in Zhan et al. (2021): we do not focus on the algorithmic aspects of computing a joint model, but rather on the incentives and joint benefits of participating even when other agents are competitors.

2 Model and Preliminaries

There is a set GG of goods and a population of consumers interested in obtaining one of them. Each consumer has one of a finite set of types, ωΩ\omega\in\Omega, that describe the good gωGg_{\omega}\in G in which he is interested. There are three players who compete for consumers: two regular players indexed 1 and 2, and an Amazon, a player indexed 0. We will separate the analysis to two settings: first, when the Amazon player 0 is not present and players 1 and 2 compete only with one another; and second, when they additionally compete with Amazon. The model and definitions here apply to both settings.

Player 0 (if present) has complete information of the consumer’s type. In Section 3 we assume that players 1 and 2 have jointly complete information—that, when combined, their respective data uniquely identifies each consumer’s type—but that each player on her own may only be partially informed. In Section 4 we remove this assumption.

To model this informational setting, we represent players’ data using the information partition model of Aumann (1976): Each player ii is endowed with a partition Πi\Pi_{i} of Ω\Omega, where Πi\Pi_{i} is a set of disjoint, nonempty sets whose union is Ω\Omega. For each ωΩ\omega\in\Omega we denote by Pi(ω)P_{i}(\omega) the unique element of Πi\Pi_{i} that contains ω\omega, with the interpretation that if the realized type of consumer is ω\omega, each player ii only learns that the type belongs to the set Pi(ω)P_{i}(\omega).

Framing the example from the introduction within this model would associate Ω\Omega with {0,1}2\{0,1\}^{2} and the partitions P1(00)=P1(01)={00,01}P_{1}(00)=P_{1}(01)=\{00,01\}, P1(10)=P1(11)={10,11}P_{1}(10)=P_{1}(11)=\{10,11\}, P2(00)=P2(10)={00,10}P_{2}(00)=P_{2}(10)=\{00,10\}, and P2(01)=P2(11)={01,11}P_{2}(01)=P_{2}(11)=\{01,11\}.

In this model, player 0’s complete information means that P0(ω)={ω}P_{0}(\omega)=\{\omega\} for all ωΩ\omega\in\Omega, and players 1 and 2’s jointly complete information means that P1(ω)P2(ω)={ω}P_{1}(\omega)\cap P_{2}(\omega)=\{\omega\} for all ωΩ\omega\in\Omega. We further assume that, before obtaining any information, all players have a common prior π\pi over Ω\Omega.

To model data sharing between players 1 and 2 we suppose there is a mediator that gathers each player’s information and shares it with the other in some way. Formally, a mediator is a function :2Ω×2ΩΔ(M2)\mathcal{M}:2^{\Omega}\times 2^{\Omega}\mapsto\Delta\left(M^{2}\right), where MM is an arbitrary message space. The range is a distribution over pairs of messages, where the first (respectively, second) is the message sent to player 1 (respectively, player 2).

We begin with an informal description of the game: Players offer consumers a good, and consumers choose a player from whom to acquire the good. Consumers are single-minded: For each ω\omega there is a unique gωGg_{\omega}\in G such that the consumer will only choose a player who offers good gωg_{\omega}. If there is more than one such player, the consumer chooses uniformly at random between them. One interpretation of this consumer behavior is that he chooses by “satsificing”—making a random choice among options that are “good enough” Simon (1956), where gωg_{\omega} represents these good-enough options. See Ben-Porat and Tennenholtz (2019) for a similar approach.

We assume that prices and costs are fixed, and normalize a player’s utility to 1 if she is chosen and to 0 otherwise. So, for example, in the online advertising application of our model, consumers are assumed to click on a random ad amongst those ads that are most relevant; once they click, the expected profit to the advertiser (conversion rate times profit from a sale) is 1. The normalization to 1 is actually without loss of generality. If we had a different profit pωp_{\omega} for each consumer type ω\omega, then we could change every pωp_{\omega} to 1, modify the prior probability π(ω)\pi(\omega) of type ω\omega to pωπ(ω)p_{\omega}\cdot\pi(\omega), and then renormalize the prior to be a proper distribution. This would leave all of our analysis intact.

The following is the order of events, given a fixed mediator \mathcal{M}. It applies both to the case in which there are only two players {1,2}\{1,2\} and to the case in which there is an additional Amazon player, indexed 0.

  1. 1.

    Each of players 1 and 2 chooses whether or not to opt into using the mediator.

  2. 2.

    If one or both players {1,2}\{1,2\} did not opt into using the mediator, then each player ii obtains her base value, the utility viv_{i} (described below).

  3. 3.

    If both players {1,2}\{1,2\} opted into using the mediator, then:

    1. (a)

      A consumer of type ω\omega is chosen from Ω\Omega with prior distribution π\pi.

    2. (b)

      If present, player 0 learns P0(ω)P_{0}(\omega).

    3. (c)

      Messages (M1,M2)(M_{1},M_{2}) are chosen from the distribution (P1(ω),P2(ω))\mathcal{M}(P_{1}(\omega),P_{2}(\omega)), and each i{1,2}i\in\{1,2\} learns Pi(ω)P_{i}(\omega) and MiM_{i}.

    4. (d)

      Each player simultaneously chooses a good to offer the consumer, and then the consumer chooses a player from whom to obtain the good.

There are several ways to interpret the base values v1v_{1} and v2v_{2}. Our main interpretation is that these are the expected utilities of the players in the game without data sharing. To formalize this, consider the unmediated game Γ=(Ω,𝒫,G|𝒫|,(Pi())i𝒫,(ui)i𝒫,π)\Gamma=\left(\Omega,\mathcal{P},G^{\left|{\mathcal{P}}\right|},(P_{i}(\cdot))_{i\in\mathcal{P}},(u_{i})_{i\in\mathcal{P}},\pi\right), where 𝒫\mathcal{P} is the set of participating players, and is either {1,2}\{1,2\} or {0,1,2}\{0,1,2\}, GG is the set of actions, Pi()P_{i}(\cdot) is the information of player ii (which consists of the partition element of the realized type), uiu_{i} is ii’s utility function (described below), and π\pi is the common prior over Ω\Omega. Given this Bayesian game, let v=(v1,v2)v=(v_{1},v_{2}) be the expected utilities in some Bayesian Nash equilibrium (BNE) of Γ\Gamma—a profile (s1,s2)(s_{1},s_{2}) in which sis_{i} is the best strategy for each player ii conditional on her information and the assumption that the other player plays the strategy sjs_{j}.

Although this is the main interpretation of the base values, our model and results permit additional and more general interpretations as well. One additional interpretation is to suppose each viv_{i} is the minimax value of player ii in the unmediated game. In this second interpretation, we could imagine player jj “punishing” player ii if the latter does not opt into using the mediator, by playing a strategy that minimizes the latter’s payoff (off the equilibrium path). A third, more general interpretation is that the base values also depend on factors outside of the specific game, such as firm size, customer base, and so on, in addition to the primitives of the game Γ\Gamma.

In our constructions of mediators in Sections 3 and 4, the interpretation of vv will not matter—our mediators will be optimal given any such values, regardless of whether they are derived endogenously as the equilibrium utilities of the game absent a mediator, or whether they arise exogenously from factors outside of the specific game. However, in Section 5, in which we compare data sharing across two different environments—with and without a mediator—we will stick with the main interpretation of base values as equilibrium payoffs absent a mediator. This is because, when we compare two different environments, the payoffs players could attain in equilibrium without a mediator will differ across these environments.

Now, as noted above, data sharing is modeled by a mediator :2Ω×2ΩΔ(M2)\mathcal{M}:2^{\Omega}\times 2^{\Omega}\mapsto\Delta\left(M^{2}\right). Without loss of generality we invoke the revelation principle and assume that M=GM=G, the set of possible goods. We interpret the messages of the mediator as recommended actions to the players, one recommendation for each player.

Formally, if both players 1 and 2 opt into using the mediator \mathcal{M}, then all play the mediated Bayesian game Γ=(Ω,𝒫,G|𝒫|,,(ui)i𝒫,π)\Gamma^{\mathcal{M}}=\left(\Omega,\mathcal{P},G^{\left|{\mathcal{P}}\right|},\mathcal{I}^{\mathcal{M}},(u_{i})_{i\in\mathcal{P}},\pi\right). 𝒫\mathcal{P} is the set of participating players, and is either {1,2}\{1,2\} or {0,1,2}\{0,1,2\}. The function :ΩΔ(T|𝒫|)\mathcal{I}^{\mathcal{M}}:\Omega\mapsto\Delta(T^{\left|{\mathcal{P}}\right|}) denotes the information of players in the game for each state, where T=2Ω×MT=2^{\Omega}\times M is the set of possible pieces of information a player may have—a partition element and a message from the mediator. For player 0, this information consists of the realized type of consumer only, and so ()0=(P0(),)\mathcal{I}^{\mathcal{M}}(\cdot)_{0}=\left(P_{0}(\cdot),\emptyset\right) (where the \emptyset means player 0 gets no message from the mediator). For players 1 and 2, the information consists of both the partition element of the realized type of consumer and the action recommended to her by the mediator, and so ()i=(Pi(),(P1(),P2())i)\mathcal{I}^{\mathcal{M}}(\cdot)_{i}=\left(P_{i}(\cdot),\mathcal{M}(P_{1}(\cdot),P_{2}(\cdot))_{i}\right).

Next, each player’s set of actions in the mediated game is GG, and her utility function ui=ui:Ω×G|𝒫|u_{i}=u_{i}:\Omega\times G^{\left|{\mathcal{P}}\right|}\mapsto{\mathbb{R}}. The latter is equal to 0 if player ii’s action ggωg\neq g_{\omega}, and otherwise it is equal to 1/k1/k, where kk is the total number of players who play action gωg_{\omega}. The nonzero utility corresponds to utility 1 if a consumer chooses the player’s good, which occurs if a player offers the consumer the correct good and the consumer chooses uniformly amongst all players that do so. We often write E[ui()]E[u_{i}(\cdot)] when the expectation is over the choice of ω\omega, in which case we omit the dependence of uiu_{i} on ω\omega for brevity. Finally, (mixed) strategies of players in Γ\Gamma are functions si:TΔ(G)s_{i}:T\mapsto\Delta(G). We also denote by si(ω)=si(i(ω))s_{i}(\omega)=s_{i}(\mathcal{I}_{i}(\omega)).

An important note about the mediator is in order. We assume that when players opt into participating, they truthfully reveal their data Pi(ω)P_{i}(\omega) to him. Players’ strategic behavior is relevant in their choice of opting in or not, and then in whether or not they follow the mediator’s recommendation in their interaction with the consumer. While truthful reporting is clearly restrictive, it is a natural assumption in our context—for instance, when service providers act as mediators, they are typically also the ones who host retailers’ data in the cloud, and so already have the (true) data on their servers. Similarly, federated machine learning algorithms work based on the assumption that participants share their true data. Furthermore, truthful reporting can be justified by the repeated nature of the interaction. Observe that if a firm is not truthful in its reporting, the other firm will discover this at some point, as the mediator’s recommendations will not pan out as well as they should. Thus, if firms have the ability to opt out of using the mediator at some point in the future, then each firm can use the threat of opting out to incentivize the other to report truthfully.

Individual rationality, incentive compatibility, and optimality

A first observation is that player 0, if present, has a dominant strategy:

Lemma 1

In any game Γ\Gamma^{\mathcal{M}}, player 0 has the unique dominant strategy s0(ω)=gωs_{0}(\omega)=g_{\omega}.

The proof is straightforward: playing action gωg_{\omega} leads to positive utility to player 0, regardless of the other firms’ actions. Playing any other action leads to utility zero. This implies that always offering the correct good is the unique dominant strategy for player 0. We will thus take that player’s strategy as fixed throughout.

We will be interested in designing mediators that players actually wish to utilize, and so would like them to satisfy two requirements: first, that players want to follow the mediator’s recommendations, and so that following these recommendations forms an equilibrium; and second, that players prefer their payoffs with the mediator over their payoffs without (the base values). Formally, we say that:

  • \mathcal{M} is incentive compatible (IC) if the strategy profile s¯\overline{s} in which players 1 and 2 always follow \mathcal{M}’s recommendation is a BNE of Γ\Gamma^{\mathcal{M}}.

  • \mathcal{M} is individually rational (IR) if the expected utility of each player i{1,2}i\in\{1,2\} under s¯\overline{s} in Γ\Gamma^{\mathcal{M}} is (weakly) greater than viv_{i}.

Observe that if a mediator is both IC and IR, then in equilibrium players will opt in and then follow the mediator’s recommendations.

Finally, as we are interested in the extent to which data sharing benefits players 1 and 2, we will study optimal mediators, namely, ones in which the sum of these two players’ utilities is maximal subject to the IC and IR constraints. To this end, denote by W()W(\mathcal{M}) the sum of players 1 and 2’s expected utilities under s¯\overline{s} in Γ\Gamma^{\mathcal{M}}. Then:

Definition 1

Mediator \mathcal{M} is optimal if W()W()W(\mathcal{M})\geq W(\mathcal{M}^{\prime}) for any other \mathcal{M}^{\prime} that is IC and IR.

3 Jointly Complete Information

3.1 Coopetition Without an Amazon

We begin our analysis with the case of jointly complete information and without an Amazon. We first consider the question, under what base values vv does there exist an IR, IC mediator? We then turn to our main result for the section, the construction of an optimal mediator.

3.1.1 When does an IR, IC mediator exist?

We begin with a simple observation. Since for any mediator \mathcal{M} and conditional on any realized type ω\omega the total payoff to players is at most 1, it must be the case that W()1W(\mathcal{M})\leq 1. Thus, if v1+v2>1v_{1}+v_{2}>1, then there does not exist any IR mediator.

However, even if v1+v21v_{1}+v_{2}\leq 1, there may not exist a mediator that is both IR and IC. In the example from the introduction, for instance, in the simple case without an Amazon and where α>2β2γ>4δ\alpha>2\beta\geq 2\gamma>4\delta, the row retailer can guarantee herself a payoff of (α+γ)/2(\alpha+\gamma)/2 by ignoring the mediator and playing her dominant actions. This implies that the most the column retailer can obtain is 1(α+γ)/21-(\alpha+\gamma)/2. Any mediator that leads to a higher payoff to the column retailer must therefore fail to satisfy the IC constraints.

Theorem 1 below generalizes the bound in this example. But first, some notation: Consider the strategy sjs_{j}^{\prime} of player jj that, for every Pj(ω)P_{j}(\omega), chooses one of the goods that is most likely to be correct: sj(ω)argmaxgGPr[g=gω|Pj(ω)]s_{j}^{\prime}(\omega)\in\arg\max_{g\in G}\Pr[g=g_{\omega}|P_{j}(\omega)]. Note that sjs^{\prime}_{j} is a best-response to a player iji\neq j who always chooses the correct good gωg_{\omega}. Furthermore, let αj\alpha_{j} be the overall probability that sjs_{j}^{\prime} chooses the correct good: αj=Pr[sj(ω)=gω]\alpha_{j}=\Pr\left[s^{\prime}_{j}(\omega)=g_{\omega}\right].

Theorem 1

Suppose vivjv_{i}\geq v_{j}. Then an IR, IC mediator exists only if v1+v21v_{1}+v_{2}\leq 1 and viE[ui(gω,sj(ω))]v_{i}\leq E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right].

In words, the theorem states that an IR, IC mediator exists only if each player’s base value is bounded above by her utility in the hypothetical situation in which she always offers the correct good, and the other player best-responds conditional only on Pj(ω)P_{j}(\omega) (in addition to the obvious constraint v1+v21v_{1}+v_{2}\leq 1). In Theorem 2 below we will see that these conditions are also sufficient.

We note that if the base values vv are the equilibrium expected utilities of the game without a mediator, then the conditions in Theorem 1 are always satisfied. To see this, observe that the mediator that recommends to each player her equilibrium action (and nothing more) is IC, by the definition of a BNE. It is also IR, since players’ utilities are the same whether or not they opt into using this mediator.

  • Proof:  Proof. The condition v1+v21v_{1}+v_{2}\leq 1 must hold for any IR mediator \mathcal{M}, since W()1W(\mathcal{M})\leq 1. Now, suppose towards a contradiction that there exists an IC mediator for which vi>E[ui(gω,sj(ω))]v_{i}>E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right]. This implies that, under \mathcal{M}, the expected utility of player j=3ij=3-i is strictly less than 1E[ui(gω,sj(ω))]1-E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right]. However, player jj then has a profitable deviation to sjs_{j}^{\prime}. When playing sjs_{j}^{\prime}, on each partition element Pj(ω)P_{j}(\omega) player jj will get utility at least

    Pr[sj(ω)=gω]/2=E[uj(sj(ω),gω)]1E[ui(gω,sj(ω))],\Pr\left[s^{\prime}_{j}(\omega)=g_{\omega}\right]/2=E\left[u_{j}(s_{j}^{\prime}(\omega),g_{\omega})\right]\geq 1-E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right],

    and exactly this utility whenever ii always chooses the correct good gωg_{\omega}. Since this is a profitable deviation, the mediator \mathcal{M} is not IC, a contradiction.    

For the rest of Section 3.1, denote a pair vv that satisfies the conditions in Theorem 1 as a feasible vv.

3.1.2 An optimal mediator

We now describe an optimal mediator, at the same time showing that the conditions on vv in Theorem 1 are also sufficient. We begin with a simple observation:

Observation 1

If \mathcal{M} is IR, IC, and W()=1W(\mathcal{M})=1, then \mathcal{M} is optimal.

This observation implies that the mediator in the example of the introduction, for the case without an Amazon, is optimal (when the base values are retailers’ utilities under their dominant strategies). One drawback of that mediator, however, is that it depends on the retailers’ strategies in the unmediated game—in particular, it utilizes the fact that, in equilibrium, neither retailer offers the correct good in the bottom-right cell of the matrix. Our aim here is to design a mediator that does not depend on such strategies, but only on players’ respective information and base values.

The idea underlying the mediator we construct is the following. Suppose vivjv_{i}\geq v_{j}. Then if vi1/2v_{i}\leq 1/2, then the mediator will facilitate full data-sharing, and so each player will offer the correct good in every state. If vi>1/2v_{i}>1/2 then player ii will obtain all the information and player jj will obtain only partial information: she will get recommendation gωg_{\omega} with probability less than 1, and otherwise will get recommendation sj(ω)s_{j}^{\prime}(\omega). We now formally describe the mediator, and then show that it is indeed IR, IC, and optimal.

Mediator 1 Mediator for 𝒫={1,2}\mathcal{P}=\{1,2\} with jointly complete information
1:procedure noAv\mathcal{M}^{v}_{noA}(V,WV,W) \triangleright V=P1(ω)V=P_{1}(\omega) and W=P2(ω)W=P_{2}(\omega) for realized ωΩ\omega\in\Omega
2:  ωVW\omega\leftarrow V\cap W
3:  iargmaxk{1,2}vki\leftarrow\arg\max_{k\in\{1,2\}}v_{k}
4:  j3ij\leftarrow 3-i
5:  if vi1/2v_{i}\leq 1/2 then return (gω,gω)\left(g_{\omega},g_{\omega}\right) \triangleright Full data-sharing
6:  else
7:   gjsj(ω)g_{j}\leftarrow s_{j}^{\prime}(\omega)
8:   Choose γ[0,1]\gamma\in\left[0,1\right] uniformly at random.
9:   if γ<22viαj1αj\gamma<\frac{2-2v_{i}-\alpha_{j}}{1-\alpha_{j}} then
10:     return (gω,gω)(g_{\omega},g_{\omega})
11:   else if i=1i=1 then return (gω,gj)(g_{\omega},g_{j})
12:   else return (gj,gω)(g_{j},g_{\omega})      
Theorem 2

For any feasible vv, the mediator noAv\mathcal{M}^{v}_{noA} above is IR, IC, and optimal.

  • Proof:   First, consider the case in which vi1/2v_{i}\leq 1/2. In this case, strategy s¯\overline{s} in ΓnoAv\Gamma^{\mathcal{M}^{v}_{noA}} leads to expected utility 1/21/2 to each player, and so the mediator is IR. Furthermore, the strategy s¯\overline{s} is a BNE, since each player always chooses gωg_{\omega}, which is dominant, and so the mediator is also IC. Finally, since in this case W(ΓnoAv)=1W\left(\Gamma^{\mathcal{M}^{v}_{noA}}\right)=1, Observation 1 implies that the mediator is optimal.

    Next, suppose vi>1/2v_{i}>1/2. Observe that, conditional on line 10 being activated, the expected utility of player jj is 1/21/2, whereas conditional on line 11 or 12, the expected utility of player jj is αj/2\alpha_{j}/2 (since with probability αj\alpha_{j} she chooses the correct good, and player ii always chooses the correct good). Thus, overall the expected utility of player jj is

    22viαj1αj12+(122viαj1αj)αj2=1vi.\frac{2-2v_{i}-\alpha_{j}}{1-\alpha_{j}}\cdot\frac{1}{2}+\left(1-\frac{2-2v_{i}-\alpha_{j}}{1-\alpha_{j}}\right)\cdot\frac{\alpha_{j}}{2}=1-v_{i}.

    Since player ii always chooses the correct good, it must be the case that W(ΓnoAv)=1W\left(\Gamma^{\mathcal{M}^{v}_{noA}}\right)=1, and so the utility of player ii is viv_{i}. Finally, since v1+v21v_{1}+v_{2}\leq 1, this implies that the utility of player jj is at least vjv_{j}. Thus, the mediator is IR.

    Next, observe that s¯\overline{s} in ΓnoAv\Gamma^{\mathcal{M}^{v}_{noA}} is dominant for player ii, since she always chooses gωg_{\omega}. It is also optimal for player jj, since she either also chooses gωg_{\omega}, or chooses the best response conditional on her information Pj(ω)P_{j}(\omega). Thus, the strategy s¯\overline{s} is a BNE, and so the mediator is IC.

    Finally, since W(ΓnoAv)=1W\left(\Gamma^{\mathcal{M}^{v}_{noA}}\right)=1, Observation 1 implies that the mediator is optimal.    

3.2 Coopetition Against an Amazon

In this section we analyze the game with an Amazon. As in Section 3.1, we first consider the question, under what base values vv does there exist an IR, IC mediator? We then turn to our main result for the section, the construction of an optimal mediator.

3.2.1 When does an IR, IC mediator exist?

Once again, we begin with a simple observation. Since for any mediator \mathcal{M} and any type ω\omega the total welfare of players 1 and 2 is at most 2/32/3 (since Amazon always knows the type, offers the correct good, and gets at least 1/31/3 of the surplus), it must be the case that W()2/3W(\mathcal{M})\leq 2/3. Thus, if v1+v2>2/3v_{1}+v_{2}>2/3, then there does not exist any IR mediator.

However, and again as above, even if v1+v22/3v_{1}+v_{2}\leq 2/3 there may still not exist a mediator that is both IR and IC. In the example from the introduction, for instance, in the simple case with an Amazon and where α>2β2γ>4δ\alpha>2\beta\geq 2\gamma>4\delta, the row retailer can guarantee herself a payoff of (α+γ)/3(\alpha+\gamma)/3 by ignoring the mediator and playing her dominant actions. This implies that the most the column retailer can obtain is 2/3(α+γ)/32/3-(\alpha+\gamma)/3. Any mediator that leads to a higher payoff to the column retailer must therefore fail to satisfy the IC constraints.

The result below consists of bounds on the base values under which an IR, IC mediator exists.

Theorem 3

Suppose vivjv_{i}\geq v_{j}. Then an IR, IC mediator exists only if viE[ui(gω,sj(ω))]v_{i}\leq E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right] and vj12viv_{j}\leq 1-2v_{i}.

As with Theorem 1, we note that if the base values vv are the equilibrium expected utilities of the game without a mediator, then the conditions in Theorem 3 are satisfied. Again, this follows from the observation that the mediator that recommends to each player her equilibrium action is both IR and IC.

Before proving the theorem we develop some notation and intuition. For any mediator \mathcal{M} and player k{1,2}k\in\{1,2\} let βk=Pr[((ω)k=gω)((ω)3kgω)]\beta_{k}^{\mathcal{M}}=\Pr\left[(\mathcal{M}(\omega)_{k}=g_{\omega})\cap(\mathcal{M}(\omega)_{3-k}\neq g_{\omega})\right] and β=Pr[(ω)k=(ω)3k=gω]\beta^{\mathcal{M}}=\Pr\left[\mathcal{M}(\omega)_{k}=\mathcal{M}(\omega)_{3-k}=g_{\omega}\right]. With this notation, the expected utility of player kk under s¯\overline{s} in Γ\Gamma^{\mathcal{M}} is βk/2+β/3\beta^{\mathcal{M}}_{k}/2+\beta^{\mathcal{M}}/3: For any ω\omega, if player kk offers gωg_{\omega} and the other player, 3k3-k, does not, then the consumer chooses between players kk and 0, and so each gets 1/21/2. If both offer gωg_{\omega}, the consumer chooses between players kk, 3k3-k, and 0, and so each gets 1/31/3.

An IR, IC mediator \mathcal{M} is one for which s¯\overline{s} is a BNE, and in which βk/2+β/3vk\beta_{k}^{\mathcal{M}}/2+\beta^{\mathcal{M}}/3\geq v_{k} for both players. In addition to the incentive constraints, we also have the constraints βi+βj+β1\beta_{i}^{\mathcal{M}}+\beta_{j}^{\mathcal{M}}+\beta^{\mathcal{M}}\leq 1, and all three variables non-negative.

Finally, the total welfare of players 1 and 2 in \mathcal{M} is W()=(β1+β2)/2+2β/3W(\mathcal{M})=(\beta^{\mathcal{M}}_{1}+\beta^{\mathcal{M}}_{2})/2+2\beta^{\mathcal{M}}/3, and so an optimal mediator is one that maximizes this sum subject to the IC and IR constraints above. Denote the linear program above, disregarding the IC constraint, as LP(v1,v2)\mathrm{LP}(v_{1},v_{2}):

maximizeβ1+β22+2β3subject toβi2+β3vi,i=1,2βi0,i=1,2β0β1+β2+β1\begin{array}[]{lr@{}ll}\text{maximize}&\frac{\beta_{1}+\beta_{2}}{2}+\frac{2\beta}{3}&\\ \text{subject to}&\frac{\beta_{i}}{2}+\frac{\beta}{3}&\geq v_{i},&i=1,2\\ &\beta_{i}&\geq 0,&i=1,2\\ &\beta&\geq 0&\\ &\beta_{1}+\beta_{2}+\beta&\leq 1&\end{array}

To prove Theorem 3, we need to identify conditions on vv under which the LP is feasible. To do this, it will be helpful to begin with some lemmas that characterize the LP’s optimal solution:

Lemma 2

Suppose vivjv_{i}\geq v_{j} and that LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) is feasible. Then any optimal solution (β1,β2,β)(\beta_{1},\beta_{2},\beta) to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) satisfies:

  • (a)

    β1+β2+β=1\beta_{1}+\beta_{2}+\beta=1.

  • (b)

    βj=0\beta_{j}=0.

  • (c)

    If vi>1/3v_{i}>1/3 then ii’s IR constraint binds: βi/2+β/3=vi\beta_{i}/2+\beta/3=v_{i}.

  • Proof:   Suppose (a) does not hold, and that β1+β2+β<1\beta_{1}+\beta_{2}+\beta<1. This means we can increase β\beta to β>β\beta^{\prime}>\beta without violating any of the constraints. The value of the LP\mathrm{LP} is now (β1+β2)/2+2β/3>(β1+β2)/2+2β/3(\beta_{1}+\beta_{2})/2+2\beta^{\prime}/3>(\beta_{1}+\beta_{2})/2+2\beta/3, contradicting the optimality (β1,β2,β)(\beta_{1},\beta_{2},\beta).

    Now suppose (a) holds, but (b) does not. If βi<βj\beta_{i}<\beta_{j} then the solution (β2,β1,β)(\beta_{2},\beta_{1},\beta) is also feasible and optimal, so assume without loss of generality that βiβj\beta_{i}\geq\beta_{j}. Consider now the solution βi=βiβj\beta_{i}^{\prime}=\beta_{i}-\beta_{j}, βj=0\beta_{j}^{\prime}=0, and β=β+2βj\beta^{\prime}=\beta+2\beta_{j}. Since

    βi/2+β/3=(βiβj)/2+(β+2βj)/3>βi/2+β/3vi\beta_{i}^{\prime}/2+\beta^{\prime}/3=(\beta_{i}-\beta_{j})/2+(\beta+2\beta_{j})/3>\beta_{i}/2+\beta/3\geq v_{i}

    and

    βj/2+β/3=(β+2βj)/3>βj/2+β/3vj,\beta_{j}^{\prime}/2+\beta^{\prime}/3=(\beta+2\beta_{j})/3>\beta_{j}/2+\beta/3\geq v_{j},

    this new solution is feasible. Moreover,

    (β1+β2)/2+2β/3>(β1+β2)/2+2β/3,(\beta_{1}^{\prime}+\beta_{2}^{\prime})/2+2\beta^{\prime}/3>(\beta_{1}+\beta_{2})/2+2\beta/3,

    contradicting the optimality of (β1,β2,β)(\beta_{1},\beta_{2},\beta).

    Now suppose (a) and (b) hold, but that (c) does not. Then there is some ε>0\varepsilon>0 such that βi/2+β/3>vi\beta_{i}^{\prime}/2+\beta^{\prime}/3>v_{i}, where βi=βiε\beta_{i}^{\prime}=\beta_{i}-\varepsilon and β=β+ε\beta^{\prime}=\beta+\varepsilon, and so (β1,β2,β)(\beta_{1}^{\prime},\beta_{2}^{\prime},\beta^{\prime}) is also a feasible solution. Furthermore,

    βi/2+2β/3>βi/2+2β/3,\beta_{i}^{\prime}/2+2\beta^{\prime}/3>\beta_{i}/2+2\beta/3,

    contradicting the optimality of (β1,β2,β)(\beta_{1},\beta_{2},\beta).    

One implication of Lemma 2 is that at an optimal solution, β\beta is maximized. This follows from the observation that the objective function is (β1+β2+β)/2+β/6=1/2+β/6(\beta_{1}+\beta_{2}+\beta)/2+\beta/6=1/2+\beta/6 when β1+β2+β=1\beta_{1}+\beta_{2}+\beta=1. In addition, Lemma 2 pins down the value of the optimal solution to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}):

Lemma 3

Suppose vivjv_{i}\geq v_{j} and that LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) is feasible. Then the value opt(v1,v2)\mathrm{opt}(v_{1},v_{2}) of an optimal solution to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) is one of the following:

  • If vi1/3v_{i}\leq 1/3 then opt(v1,v2)=2/3\mathrm{opt}(v_{1},v_{2})=2/3.

  • If vi>1/3v_{i}>1/3 then opt(v1,v2)=1vi\mathrm{opt}(v_{1},v_{2})=1-v_{i}.

  • Proof:   Lemma 2 implies that at the optimal solution we have βi+β=1\beta_{i}+\beta=1, and so the value at the optimum is βi/2+2β/3\beta_{i}/2+2\beta/3. In the first case, the optimal solution occurs at the point β1=β2=0\beta_{1}=\beta_{2}=0, β=1\beta=1: it maximizes the objective, while both IR constraints remain slack since vi1/3=β/3v_{i}\leq 1/3=\beta/3.

    In the second case, Lemma 2 implies that βi/2+β/3=vi\beta_{i}/2+\beta/3=v_{i} and that βi=1β\beta_{i}=1-\beta. These imply that vi=(1β)/2+β/3v_{i}=(1-\beta)/2+\beta/3, and so that β=36vi\beta=3-6v_{i} and βi=13+6vi\beta_{i}=1-3+6v_{i}. Plugging these into the objective yields

    opt(v1,v2)=βi2+2β3=13+6vi2+2(36vi)3=1vi.\mathrm{opt}(v_{1},v_{2})=\frac{\beta_{i}}{2}+\frac{2\beta}{3}=\frac{1-3+6v_{i}}{2}+\frac{2(3-6v_{i})}{3}=1-v_{i}.

     

Lemmas 2 and 3 characterize the maximal values attainable by an IR mediator, and are the analogues of the condition that v1+v21v_{1}+v_{2}\leq 1 when there is no Amazon. However, before proving Theorem 3, we need two additional lemmas. Fix a mediator \mathcal{M}, and consider the hypothetical situation in which player ii obtains additional information, and in each state ω\omega she fully learns that the state is ω\omega and so plays the dominant strategy s^i(ω)=gω\hat{s}_{i}(\omega)=g_{\omega}. In this new game, denote the best response of player jj as sj()s^{\prime}_{j}(\mathcal{M}), the strategy that, for every Pj(ω)P_{j}(\omega) and (ω)i\mathcal{M}(\omega)_{i}, chooses some gg that is most-likely correct conditional on Pj(ω)P_{j}(\omega) and (ω)i\mathcal{M}(\omega)_{i}. Also observe that here, player jj is never the only player to choose gωg_{\omega}, since player ii always plays gωg_{\omega}.

Player jj is worse off in this hypothetical game than when all follow \mathcal{M}’s recommendations:

Lemma 4

E[uj(sj(),s^i)]E[uj(s¯j,s¯i)]E[u_{j}(s^{\prime}_{j}(\mathcal{M}),\hat{s}_{i})]\leq E[u_{j}(\overline{s}_{j},\overline{s}_{i})] for any mediator \mathcal{M}.

  • Proof:   Fix a particular state ω\omega. For every action gg of player jj, who conditions on Pj(ω)P_{j}(\omega) and (ω)j\mathcal{M}(\omega)_{j}, the utility E[uj(g,s^i)|Pj(ω),(ω)j]E[uj(g,s¯i)|Pj(ω),(ω)j]E[u_{j}(g,\hat{s}_{i})|P_{j}(\omega),\mathcal{M}(\omega)_{j}]\leq E[u_{j}(g,\overline{s}_{i})|P_{j}(\omega),\mathcal{M}(\omega)_{j}], since ii makes correct choices more often in the former. This implies that E[uj(sj(),s^i)|Pj(ω),(ω)j]E[uj(s¯j,s¯i)|Pj(ω),(ω)j]E[u_{j}(s^{\prime}_{j}(\mathcal{M}),\hat{s}_{i})|P_{j}(\omega),\mathcal{M}(\omega)_{j}]\leq E[u_{j}(\overline{s}_{j},\overline{s}_{i})|P_{j}(\omega),\mathcal{M}(\omega)_{j}], since in both sides of the inequality jj chooses an optimal action conditional on her information, but in the RHS the utility from every choice gg is higher than its corresponding utility in the LHS. This implies that E[uj(sj(),s^i)]E[uj(s¯j,s¯i)]E[u_{j}(s^{\prime}_{j}(\mathcal{M}),\hat{s}_{i})]\leq E[u_{j}(\overline{s}_{j},\overline{s}_{i})].    

Although in the hypothetical game player jj is worse off, the sum of players’ utilities is higher:

Lemma 5

The sum of players’ utilities is higher under (sj(),s^i)(s^{\prime}_{j}(\mathcal{M}),\hat{s}_{i}) than under (s¯j,s¯i)(\overline{s}_{j},\overline{s}_{i}) in \mathcal{M}.

  • Proof:   Again, we show that this holds conditional on every Pj(ω)P_{j}(\omega) and (ω)j\mathcal{M}(\omega)_{j}. Fix ω\omega, and observe that

    Pr[s^i(ω)=s¯j(ω)=gω|Pj(ω),(ω)j]Pr[s¯i(ω)=s¯j(ω)=gω|Pj(ω),(ω)j],\Pr\left[\hat{s}_{i}(\omega)=\overline{s}_{j}(\omega)=g_{\omega}|P_{j}(\omega),\mathcal{M}(\omega)_{j}\right]\geq\Pr\left[\overline{s}_{i}(\omega)=\overline{s}_{j}(\omega)=g_{\omega}|P_{j}(\omega),\mathcal{M}(\omega)_{j}\right],

    since s^i\hat{s}_{i} always correctly chooses gωg_{\omega}. Furthermore,

    Pr[s^i(ω)=sj()=gω|Pj(ω),(ω)j]Pr[s^i(ω)=s¯j(ω)=gω|Pj(ω),(ω)j],\Pr\left[\hat{s}_{i}(\omega)=s_{j}^{\prime}(\mathcal{M})=g_{\omega}|P_{j}(\omega),\mathcal{M}(\omega)_{j}\right]\geq\Pr\left[\hat{s}_{i}(\omega)=\overline{s}_{j}(\omega)=g_{\omega}|P_{j}(\omega),\mathcal{M}(\omega)_{j}\right],

    since sjs_{j}^{\prime} chooses the good that maximizes the probability of choosing gωg_{\omega}. Thus,

    Pr[s^i(ω)=sj()=gω|Pj(ω),(ω)j]Pr[s¯i(ω)=s¯j(ω)=gω|Pj(ω),(ω)j].\Pr\left[\hat{s}_{i}(\omega)=s_{j}^{\prime}(\mathcal{M})=g_{\omega}|P_{j}(\omega),\mathcal{M}(\omega)_{j}\right]\geq\Pr\left[\overline{s}_{i}(\omega)=\overline{s}_{j}(\omega)=g_{\omega}|P_{j}(\omega),\mathcal{M}(\omega)_{j}\right].

    Now recall that LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) is maximized when β\beta, the probability that both players correctly choose gωg_{\omega} is maximized. Thus, the value of LP(v1,v2)\mathrm{LP}(v_{1},v_{2}), the sum of players’ utilities, is higher under (sj,s^i)(s^{\prime}_{j},\hat{s}_{i}) than under (s¯1,s¯2)(\overline{s}_{1},\overline{s}_{2}).    

We can now prove Theorem 3.

  • Proof:   For the first condition, the upper bound on viv_{i}, fix some IR, IC mediator \mathcal{M}. We will argue that the utility to player ii under \mathcal{M} is at most E[ui(gω,sj())]E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\mathcal{M}))\right]. Lemma 4 implies that player jj’s utility decreases if player ii additionally obtains all the information (even when jj responds optimally), and Lemma 5 implies that in this same situation the sum of utilities increases. Together, these lemmas imply that ii’s utility is higher when she obtains all the information and jj only learns Pj(ω)P_{j}(\omega) and (ω)j\mathcal{M}(\omega)_{j}, than when ii also only learns Pi(ω)P_{i}(\omega) and (ω)i\mathcal{M}(\omega)_{i}. Furthermore, when ii has all the information, she is best off when jj has the least information—that is, when (ω)j=Pj(ω)\mathcal{M}(\omega)_{j}=P_{j}(\omega). This is because any additional information that causes jj to improve upon her payoff will do so at the expense of player ii’s payoff (by transferring weight from βi\beta_{i}^{\mathcal{M}} to β\beta^{\mathcal{M}}). Thus, in any IR, IC mediator, the highest possible utility to player ii is E[ui(gω,sj(ω))]E\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right], since (ω)j=Pj(ω)\mathcal{M}(\omega)_{j}=P_{j}(\omega) and so sj(ω)=sj()s_{j}^{\prime}(\omega)=s_{j}^{\prime}(\mathcal{M}).

    The second condition, vj12viv_{j}\leq 1-2v_{i}, follows from Lemma 3. If vi1/3v_{i}\leq 1/3, then the maximal welfare of any IR mediator is 2/32/3, and so vj2/3vi12viv_{j}\leq 2/3-v_{i}\leq 1-2v_{i}. If vi>1/3v_{i}>1/3, then the lemma implies that the maximal welfare of any IR mediator is at most 1vi1-v_{i}. Since player ii gets at least viv_{i}, player jj can get at most 12vi1-2v_{i}.    

For the rest of Section 3.2, denote a pair vv that satisfies the conditions in Theorem 3 as a feasible vv.

3.2.2 An optimal mediator

We now describe an optimal mediator, at the same time showing that the conditions in Theorem 3 are not only necessary but also sufficient.

We begin with a definition.

Definition 2

A mediator \mathcal{M} is fully revealing to player ii if the mediator’s recommendation to player ii always coincides with the optimal good: s¯i(ω)=gω\overline{s}_{i}(\omega)=g_{\omega} for every ω\omega. A mediator facilitates full data-sharing if it is fully revealing to both players 𝒫{0}\mathcal{P}\setminus\{0\}.

Fix the set of players to be 𝒫={0,1,2}\mathcal{P}=\{0,1,2\}. The idea underlying the mediator is the same as that of noAv\mathcal{M}_{noA}^{v}. Suppose vivjv_{i}\geq v_{j}. Then if vi1/3v_{i}\leq 1/3, then the mediator will facilitate full data-sharing, and so each player will offer the correct good in every state. If vi>1/3v_{i}>1/3 then player ii will obtain all the information and player jj will obtain only partial information: she will get recommendation gωg_{\omega} with probability less than 1, and otherwise will get recommendation sj(ω)s_{j}^{\prime}(\omega). The challenge is to maximize the probability of recommendation gωg_{\omega} subject to the IR constraints, and to do this in such a way that following the recommendations is a BNE.

We now formally describe the mediator, and then show that it is indeed IR, IC, and optimal. The main differences from noAv\mathcal{M}_{noA}^{v} in Mediator 1 are lines 5 and 9.

Mediator 2 Mediator for 𝒫={0,1,2}\mathcal{P}=\{0,1,2\} with jointly complete information
1:procedure Av\mathcal{M}^{v}_{A}(V,WV,W) \triangleright V=P1(ω)V=P_{1}(\omega) and W=P2(ω)W=P_{2}(\omega) for realized ωΩ\omega\in\Omega
2:  ωVW\omega\leftarrow V\cap W
3:  iargmaxk{1,2}vki\leftarrow\arg\max_{k\in\{1,2\}}v_{k}
4:  j3ij\leftarrow 3-i
5:  if vi1/3v_{i}\leq 1/3 then return (gω,gω)\left(g_{\omega},g_{\omega}\right) \triangleright Full data-sharing
6:  else
7:   gjsj(ω)g_{j}\leftarrow s_{j}^{\prime}(\omega)
8:   Choose γ[0,1]\gamma\in\left[0,1\right] uniformly at random.
9:   if γ<36viαj1αj\gamma<\frac{3-6v_{i}-\alpha_{j}}{1-\alpha_{j}} then
10:     return (gω,gω)(g_{\omega},g_{\omega})
11:   else if i=1i=1 then return (gω,gj)(g_{\omega},g_{j})
12:   else return (gj,gω)(g_{j},g_{\omega})      
Theorem 4

For any feasible vv, the mediator Av\mathcal{M}^{v}_{A} above is IR, IC, and optimal.

An interesting feature of Av\mathcal{M}^{v}_{A} is that it either facilitates full data-sharing, or is fully revealing to one of the players. These features are inherent to any optimal IR mediator:

Theorem 5

For any feasible vv and any optimal IR mediator \mathcal{M}, one of the following holds:

  • \mathcal{M} is fully revealing to both players 1 and 2.

  • \mathcal{M} is fully revealing to one of the players, and the IR constraint binds for that player (i.e., she is indifferent between \mathcal{M} and her base value).

Note that Mediator 1 for the case of no Amazon also satisfies the features described in Theorem 5. In that case, however, these features do not characterize all mediators, as there exist mediators that are optimal for the case of no Amazon that do not have them. One such mediator is the one described in the example from the introduction. With an Amazon, in contrast, these features characterize all optimal IR mediators.

We begin by showing that the mediator above is optimal. Our approach is to show that W(Av)W(\mathcal{M}^{v}_{A}) is equal to the value of the optimal solution to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}), and that the strategy s¯\overline{s} in Av\mathcal{M}^{v}_{A} is a BNE. Observe that LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) does not contain the IC constraints, implying that they are not binding, and so only the IR constraints are relevant for optimality. This will no longer be the case without the assumption of jointly complete information, analyzed in Section 4 below.

We already laid most of the groundwork for the proofs of Theorems 4 and 5 when we proved Theorem 3, but do need one last lemma:

Lemma 6

Suppose vv is feasible and vi>1/3v_{i}>1/3. Then αj=Pr[sj(ω)=gω]36vi\alpha_{j}=\Pr\left[s^{\prime}_{j}(\omega)=g_{\omega}\right]\leq 3-6v_{i}.

  • Proof:   Since vv is feasible, it holds that E[ui(gω,sj(ω))]viE\left[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))\right]\geq v_{i}. Next, note that E[ui(gω,sj(ω))]=αj/3+(1αj)/2E[u_{i}(g_{\omega},s_{j}^{\prime}(\omega))]=\alpha_{j}/3+(1-\alpha_{j})/2. Together, these facts imply that αj36vi\alpha_{j}\leq 3-6v_{i}, as claimed.    

We are now ready to prove Theorem 4.

  • Proof:   First, consider the case in which vi1/3v_{i}\leq 1/3. In this case, strategy s¯\overline{s} in ΓAv\Gamma^{\mathcal{M}^{v}_{A}} leads to expected utility 1/31/3 to each player, which satisfies the IR constraint. Furthermore, strategy s¯\overline{s} is a BNE, since each player always chooses gωg_{\omega}, which is dominant. Finally, W(Av)W\left(\mathcal{M}^{v}_{A}\right) is maximal by Lemma 3.

    Next, suppose vi>1/3v_{i}>1/3. Observe that s¯\overline{s} in ΓAv\Gamma^{\mathcal{M}^{v}_{A}} is dominant for player ii, since she always chooses gωg_{\omega}. It is also optimal for player jj, since she either also chooses gωg_{\omega}, or chooses the best response conditional on her information Pj(ω)P_{j}(\omega). Thus, the mediator is IC.

    We now show that Av\mathcal{M}^{v}_{A} is optimal. For simplicity, henceforth denote by =Av\mathcal{M}=\mathcal{M}^{v}_{A}. First note that βj=0\beta^{\mathcal{M}}_{j}=0. Next, by Lemma 6, it holds that αj36vi\alpha_{j}\leq 3-6v_{i}. Since vi(1/3,1/2]v_{i}\in(1/3,1/2], this implies that αj[0,1)\alpha_{j}\in[0,1). These, in turn imply that

    36viαj1αj[0,1],\frac{3-6v_{i}-\alpha_{j}}{1-\alpha_{j}}\in[0,1],

    so line 9 of Mediator 2 is valid. From that line we have that

    β=αj+(1αj)36viαj1αj=36vi,\beta^{\mathcal{M}}=\alpha_{j}+(1-\alpha_{j})\cdot\frac{3-6v_{i}-\alpha_{j}}{1-\alpha_{j}}=3-6v_{i},

    since both players choose gωg_{\omega} when sj(ω)=gωs^{\prime}_{j}(\omega)=g_{\omega} and when sj(ω)gωs^{\prime}_{j}(\omega)\neq g_{\omega} but γ<(36viαj)/(1αj)\gamma<(3-6v_{i}-\alpha_{j})/(1-\alpha_{j}). This means that

    W()=βi2+2β3=13+6vi2+2(36vi)3=1vi,W(\mathcal{M})=\frac{\beta_{i}^{\mathcal{M}}}{2}+\frac{2\beta^{\mathcal{M}}}{3}=\frac{1-3+6v_{i}}{2}+\frac{2(3-6v_{i})}{3}=1-v_{i},

    which, by Lemma 3, is the value of the optimal solution to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}).

    Finally, since ii’s utility is viv_{i} and so jj’s utility is 12vi1-2v_{i}, the mediator is IR.    

Finally, we can also prove Theorem 5.

  • Proof:   Suppose first that vv is such that vjvi1/3v_{j}\leq v_{i}\leq 1/3. In this case, full data-sharing is IC, and so any optimal mediator \mathcal{M} must have W()2/3W(\mathcal{M})\geq 2/3. However, the maximum value of LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) is 2/32/3, and the only way to obtain this is to have β=1\beta^{\mathcal{M}}=1. This is equivalent to full data-sharing, and so the only mediator that is optimal is the full data-sharing one.

    Now suppose that vi>1/3v_{i}>1/3. By the proof of Theorem 4, an optimal solution to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) is attainable by some mediator, in particular by Av\mathcal{M}_{A}^{v}. Thus, any mediator \mathcal{M} that is optimal must also yield W()W(\mathcal{M}) that is equal to the optimal value of LP(v1,v2)\mathrm{LP}(v_{1},v_{2}). By Lemma 2, any optimal solution to LP(v1,v2)\mathrm{LP}(v_{1},v_{2}) must have βj=0\beta_{j}=0 and βi/2+β/3=vi\beta_{i}/2+\beta/3=v_{i}. Thus, the mediator \mathcal{M} must satisfy these as well: the first equality implying that \mathcal{M} is fully revealing to player ii, and the second that player ii’s IR constraint binds.    

4 No Jointly Complete Information

We now drop the assumption that players 1 and 2 have jointly complete information. Formally, for a given type ω\omega let S(ω)=P1(ω)P2(ω)S(\omega)=P_{1}(\omega)\cap P_{2}(\omega), and call each such S(ω)S(\omega) a segment. The assumption of jointly complete information states that all segments are singletons, and in this section we consider the more general setting in which segments may contain more than one type of consumer.

This setting presents new challenges, as there is now a new conflict between optimal strategies and welfare. To see this, consider some segment S={ω1,ω2}S=\{\omega^{1},\omega^{2}\} for which Pr[ω1]=4Pr[ω2]\Pr[\omega^{1}]=4\cdot\Pr[\omega^{2}]. Then, conditional on segment SS, both players have a dominant strategy, namely, to offer g=gω1g=g_{\omega^{1}}. To see this, observe that a player’s utility from offering gω1g_{\omega^{1}} is at least Pr[ω1]/2\Pr[\omega^{1}]/2 regardless of the other’s action, the utility from offering gω2g_{\omega^{2}} is at most Pr[ω2]\Pr[\omega^{2}], and by assumption Pr[ω1]/2=2Pr[ω2]>Pr[ω2]\Pr[\omega^{1}]/2=2\Pr[\omega^{2}]>\Pr[\omega^{2}]. This leads to total welfare Pr[ω1]\Pr[\omega^{1}]. However, if players were to separate rather than pool—one offering gω1g_{\omega^{1}} and the other gω2g_{\omega^{2}}—then the total welfare would be Pr[ω1]+Pr[ω2]\Pr[\omega^{1}]+\Pr[\omega^{2}], which is higher. Note that this conflict between optimal strategies and welfare does not occur when players have jointly complete information, since then the pooling action, which is dominant, is also welfare-maximizing. Finally, although we illustrated this conflict for the setting without an Amazon, it is of course also present with an Amazon.

This conflict between optimal strategies and welfare maximization makes the design of mediators significantly more challenging. In principle, one could formulate the problem as a complex linear program, whose solution could be used to find the optimal mediator. Here, however, we take a different approach. We will show that simple mediators—in particular, variants of the ones from Section 3 for the setting with jointly complete information—are approximately optimal. And while this approach entails some loss in terms of optimality, we will show that these simple mediators are not only approximately optimal relative to the optimal IR, IC mediator, but rather that they are approximately optimal relative to a higher benchmark—namely, the welfare that can be achieved by any mediator, even one that is not IR or IC. Then, to complement our results, we will show that the approximation factors our mediators achieve are tight or nearly-tight, and that no other IR and IC mediator can in general achieve a better approximation factor relative to that same benchmark.

We begin with some notation. For any given segment S(ω)S(\omega), denote by gω1g_{\omega}^{1} the good gg that has the highest probability Pr[g=gω|S(ω)]\Pr\left[g=g_{\omega}|S(\omega)\right] in S(ω)S(\omega), and by gω2g_{\omega}^{2} the good with the second highest probability, under the prior π\pi. If there are several such goods, fix two such goods arbitrarily. Next, let ϕω1=Pr[gω1=gω|S(ω)]\phi_{\omega}^{1}=\Pr\left[g_{\omega}^{1}=g_{\omega}|S(\omega)\right] be the probability of gω1g_{\omega}^{1} conditional on S(ω)S(\omega), and ϕω2=Pr[gω2=gω|S(ω)]\phi_{\omega}^{2}=\Pr\left[g_{\omega}^{2}=g_{\omega}|S(\omega)\right] be the respective probability of gω2g_{\omega}^{2}. We will also use the notation that, for a given segment S{S(ω):ωΩ}S\in\{S(\omega):\omega\in\Omega\}, the goods gS1g_{S}^{1} and gS2g_{S}^{2} are the goods with the highest and second-highest probabilities in SS, with ϕS1\phi_{S}^{1} and ϕS2\phi_{S}^{2} their respective conditional probabilities. Finally, let ϕ1\phi^{1} be the total (unconditional) weight of goods gS1g_{S}^{1} over all segments SS, and ϕ2\phi^{2} be the total weight of goods gS2g_{S}^{2}: Formally,

ϕk=S{S(ω):ωΩ}ϕSkPr[S] for each k{1,2}.\phi^{k}=\sum_{S\in\{S(\omega):\omega\in\Omega\}}\phi_{S}^{k}\cdot\Pr\left[S\right]~~~~~~\mbox{ for each }k\in\{1,2\}.

Now, as our main interest lies in coopetition against an Amazon, we will focus here on the setting with an Amazon. We note that, while we drop the assumption that players 1 and 2 have jointly complete information, we still assume that the Amazon player 0 has complete information. That is, even though in any given segment SS players 1 and 2 do not know which type ωS\omega\in S is realized, player 0 does know, and offers each such type the correct good gωg_{\omega}.

We proceed as follows. We begin in Section 4.1 with the design of an optimal mediator for a particular setting of parameters, namely, for the case in which ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2} for every segment SS. We then turn to our main analysis in Section 4.2, for the case in which ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2} for every segment SS. There we design the approximately optimal simple mediator mentioned above, and analyze the tightness of the approximation factor. Finally, in Section 4.3 we show how to combine the two mediators in order to obtain an approximately optimal mediator for the general setting, without restrictions on the segments.

4.1 Optimal Mediator for ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2}

We begin with the simpler case in which ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2} for every segment SS. This case is simpler because here, conditional on any given segment, there is no conflict between optimal strategies and welfare maximization. In particular, both involve separating, with one player offering gS1g_{S}^{1} and the other offering gS2g_{S}^{2}:

Claim 1

Conditional on any segment SS, if ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2} then the separating strategy profile is both welfare maximizing and an equilibrium.

  • Proof:   We will compare the only two reasonable player choices for welfare maximization conditional on segment SS: (i) pooling on gS1g_{S}^{1}, or (ii) separating, with one player offering gS1g_{S}^{1} and the other gS2g_{S}^{2}.

    • (i)

      Pooling: Here each of players 1 and 2 obtains utility ϕS1/3\phi_{S}^{1}/3—when type ωS\omega\in S with gω=gS1g_{\omega}=g_{S}^{1} is realized (which happens with probability ϕS1\phi_{S}^{1}), the players split the surplus of 1 with player 0, and so each gets 1/31/3. Welfare of players 1 and 2 is thus 23ϕS1\frac{2}{3}\cdot\phi_{S}^{1}.

    • (ii)

      Separating: Here the player who offers gS1g_{S}^{1} obtains utility ϕS1/2\phi_{S}^{1}/2—when type ωS\omega\in S with gω=gS1g_{\omega}=g_{S}^{1} is realized, that player splits the surplus with player 0. Similarly, the player who offers gS2g_{S}^{2} obtains utility ϕS2/2\phi_{S}^{2}/2. Welfare of players 1 and 2 is thus (ϕS1+ϕS2)/2(\phi_{S}^{1}+\phi_{S}^{2})/2.

    Observe that, when ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2}, we have that (ϕS1+ϕS2)/2>23ϕS1(\phi_{S}^{1}+\phi_{S}^{2})/2>\frac{2}{3}\cdot\phi_{S}^{1}, and so the welfare maximizing choices are the separating ones.

    In addition to being welfare-maximizing, the separating actions also form an equilibrium (conditional on SS). Fix a segment SS, and suppose player 1 offers gS1g_{S}^{1} and player 2 offers gS2g_{S}^{2}. Then player 1 clearly does not wish to deviate, as she currently obtains utility ϕS1/2\phi_{S}^{1}/2, a deviation to gS2g_{S}^{2} will lead to utility ϕS2/3\phi_{S}^{2}/3, and a deviation to any other good will lead to utility at most ϕS1/2\phi_{S}^{1}/2 (since gS1g_{S}^{1} has maximal probability in SS). Player 2 also does not wish to deviate, as she currently obtains utility ϕS2/2\phi^{2}_{S}/2, a deviation to gS1g^{1}_{S} will lead to utility ϕS1/3\phi_{S}^{1}/3, and by assumption ϕS2/2ϕS1/3\phi^{2}_{S}/2\geq\phi_{S}^{1}/3.    

Given the lack of conflict between welfare maximization and equilibrium shown in Claim 1, we can now construct optimal mediators for this parameter setting. The mediator is straightforward: it facilitates full data-sharing between the players, and recommends the actions that lead to a separating equilibrium.

Mediator 3 Mediator for 𝒫={0,1,2}\mathcal{P}=\{0,1,2\} without jointly complete information
1:procedure 1v\mathcal{M}^{v}_{1}(SS) \triangleright S=P1(ω)P2(ω)S=P_{1}(\omega)\cap P_{2}(\omega) for realized ωΩ\omega\in\Omega
2:  if ϕ1=ϕ2\phi^{1}=\phi^{2} then return (gS1,gS2)(g_{S}^{1},g_{S}^{2})
3:  else
4:   iargmaxk{1,2}vki\leftarrow\arg\max_{k\in\{1,2\}}v_{k}
5:   j3ij\leftarrow 3-i
6:   Choose γ[0,1]\gamma\in\left[0,1\right] uniformly at random.
7:   if γ<2viϕ2ϕ1ϕ2\gamma<\frac{2v_{i}-\phi^{2}}{\phi^{1}-\phi^{2}} then
8:     if i=1i=1 then return (gS1,gS2)(g_{S}^{1},g_{S}^{2})
9:     else return (gS2,gS1)(g_{S}^{2},g_{S}^{1})      
10:   else if i=1i=1 then return (gS2,gS1)(g_{S}^{2},g_{S}^{1})
11:   else return (gS1,gS2)(g_{S}^{1},g_{S}^{2})      
Theorem 6

Suppose v1+v2(ϕ1+ϕ2)/2v_{1}+v_{2}\leq(\phi^{1}+\phi^{2})/2 and max{v1,v2}ϕ1/2\max\{v_{1},v_{2}\}\leq\phi^{1}/2. Then the mediator 1v\mathcal{M}^{v}_{1} above is IR, IC, and optimal.

Observe that the conditions on vv in Theorem 6 are also necessary for the existence of any IR mediator, since the maximal welfare attainable by any mediator in this setting is (ϕ1+ϕ2)/2(\phi^{1}+\phi^{2})/2, and the maximal utility any single player can attain is ϕ1/2\phi^{1}/2.

  • Proof:   Suppose first that ϕ1=ϕ2\phi^{1}=\phi^{2}, and so the conditional in line 2 of Mediator 3 is true. In this case, each player obtains utility ϕ1/2\phi^{1}/2 under 1v\mathcal{M}^{v}_{1}, and the mediator is thus IR. In addition, since by Claim 1 separation is an equilibrium on every segment SS, the mediator is also IC. Finally, the total welfare to the players is (ϕ1+ϕ2)/2(\phi^{1}+\phi^{2})/2, which is maximal.

    Next, suppose ϕ1>ϕ2\phi^{1}>\phi^{2}. Observe that, in this case, the mediator mixes between utility ϕ1/2\phi^{1}/2 to player ii and ϕ2/2\phi^{2}/2 to player jj, and utility ϕ2/2\phi^{2}/2 to player ii and ϕ1/2\phi^{1}/2 to player jj, depending on the value of γ\gamma. Regardless of the realizations of γ\gamma, the total welfare to the players is always (ϕ1+ϕ2)/2(\phi^{1}+\phi^{2})/2. This is the optimal welfare attainable.

    Next, the expected utility of player ii in 1v\mathcal{M}^{v}_{1} is

    2viϕ2ϕ1ϕ2ϕ12+(12viϕ2ϕ1ϕ2)ϕ22=vi.\frac{2v_{i}-\phi^{2}}{\phi^{1}-\phi^{2}}\cdot\frac{\phi^{1}}{2}+\left(1-\frac{2v_{i}-\phi^{2}}{\phi^{1}-\phi^{2}}\right)\frac{\phi^{2}}{2}=v_{i}.

    Furthermore, since v1+v2(ϕ1+ϕ2)/2v_{1}+v_{2}\leq(\phi^{1}+\phi^{2})/2 and total welfare is (ϕ1+ϕ2)/2(\phi^{1}+\phi^{2})/2, the utility to player jj is at least vjv_{j}. Thus, the mediator is IR.

    Finally, since by Claim 1 separation is an equilibrium on every segment SS, the mediator is IC.    

4.2 Approximately Optimal Mediator for ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2}

Suppose now that ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2} for every segment SS. This case is more complicated since, conditional on any segment, there may be a conflict between optimal strategies and welfare maximization:

Claim 2

Conditional on any segment SS, if ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2} then pooling on gS1g_{S}^{1} is the dominant strategy. If ϕS13ϕS2\phi_{S}^{1}\leq 3\cdot\phi_{S}^{2} then separating is welfare-maximizing, whereas if ϕS1>3ϕS2\phi_{S}^{1}>3\cdot\phi_{S}^{2} then pooling is welfare-maximizing.

  • Proof:   Again, we will compare the only two reasonable player choices conditional on segment SS: (i) pooling on gS1g_{S}^{1}, or (ii) separating, with one player offering gS1g_{S}^{1} and the other gS2g_{S}^{2}. As in the proof of Claim 1, pooling leads to individual utilities ϕS1/3\phi_{S}^{1}/3 and so welfare 23ϕS1\frac{2}{3}\cdot\phi_{S}^{1}, whereas separating leads to individual utilities ϕS1/2\phi_{S}^{1}/2 and ϕS2/2\phi_{S}^{2}/2 and so welfare (ϕS1+ϕS2)/2(\phi_{S}^{1}+\phi_{S}^{2})/2. Then 23ϕS1>(ϕS1+ϕS2)/2\frac{2}{3}\cdot\phi_{S}^{1}>(\phi_{S}^{1}+\phi_{S}^{2})/2 if and only if ϕS1>3ϕS2\phi_{S}^{1}>3\cdot\phi_{S}^{2}, as claimed.

    In terms of optimality, however, observe that choosing gS1g_{S}^{1} leads to utility at least ϕS1/3\phi^{1}_{S}/3, whereas choosing any other good leads to utility at most ϕS2/2\phi^{2}_{S}/2. The former is greater than the latter whenever ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2}, and so constitutes a dominant action (conditional on SS).    

Our goal is to design a mediator that is approximately optimal relative to a particular benchmark. We denote the benchmark  OPT, which is the maximal welfare attainable by any mediator, even disregarding the IR and IC constraints. We will show that the mediator below, which is a variant of the mediator Av\mathcal{M}^{v}_{A} for the case of jointly complete information, is IR, IC, and obtains welfare at least 34 OPT\frac{3}{4}\cdot\text{\rm\,OPT}. We will then show that this approximation factor is tight.

Before describing the mediator, recall the strategy sjs_{j}^{\prime} from Section 3 that, for every Pj(ω)P_{j}(\omega), chooses one of the goods that is most likely to be correct. Here we will use a different formulation of sjs_{j}^{\prime}. In particular, let sj(ω)argmaxgGE[uj(g,gω1)|Pj(ω)]s_{j}^{\prime}(\omega)\in\arg\max_{g\in G}E\left[u_{j}(g,g^{1}_{\omega})|P_{j}(\omega)\right]. That is, sjs_{j}^{\prime} is an optimal action for player jj conditional on knowing only Pj(ω)P_{j}(\omega), in the hypothetical situation where ii always plays the most-likely correct good gω1g^{1}_{\omega} in every segment. Also, let αj\alpha_{j} and αi\alpha_{i} be the expected utilities of players jj and ii in this hypothetical situation: αj=E[uj(sj(ω),gω1)]\alpha_{j}=E\left[u_{j}(s_{j}^{\prime}(\omega),g_{\omega}^{1})\right] and αi=E[ui(gω1,sj(ω))]\alpha_{i}=E\left[u_{i}(g_{\omega}^{1},s_{j}^{\prime}(\omega))\right]. Note that, under jointly complete information, the formulation of sjs_{j}^{\prime} is identical to the one given in Section 3. Without jointly complete information, however, they may differ.

Mediator 4 Mediator for 𝒫={0,1,2}\mathcal{P}=\{0,1,2\} without jointly complete information
1:procedure 2v\mathcal{M}^{v}_{2}(V,WV,W) \triangleright V=P1(ω)V=P_{1}(\omega) and W=P2(ω)W=P_{2}(\omega) for realized ωΩ\omega\in\Omega
2:  SVWS\leftarrow V\cap W
3:  iargmaxk{1,2}vki\leftarrow\arg\max_{k\in\{1,2\}}v_{k}
4:  j3ij\leftarrow 3-i
5:  if viϕ1/3v_{i}\leq\phi^{1}/3 then return (gS1,gS1)\left(g^{1}_{S},g^{1}_{S}\right) \triangleright Full data-sharing
6:  else
7:   gjsj(ω)g_{j}\leftarrow s_{j}^{\prime}(\omega)
8:   Choose γ[0,1]\gamma\in\left[0,1\right] uniformly at random.
9:   if γ<3αi3vi3αiϕ1\gamma<\frac{3\alpha_{i}-3v_{i}}{3\alpha_{i}-\phi^{1}} then
10:     return (gS1,gS1)(g^{1}_{S},g^{1}_{S})
11:   else if i=1i=1 then return (gS1,gj)(g^{1}_{S},g_{j})
12:   else return (gj,gS1)(g_{j},g^{1}_{S})      
Theorem 7

Let vivjv_{i}\geq v_{j}, and suppose viE[ui(gω1,sj(ω))]v_{i}\leq E\left[u_{i}(g^{1}_{\omega},s_{j}^{\prime}(\omega))\right] and vjϕ1/2viv_{j}\leq\phi^{1}/2-v_{i}. Then the mediator 2v\mathcal{M}^{v}_{2} above is IR, IC, and has welfare W(2v)34 OPTW(\mathcal{M}^{v}_{2})\geq\frac{3}{4}\cdot\text{\rm\,OPT}.

We note that, unlike the conditions on vv in Theorems 24, and 6, the conditions here are not necessary but only sufficient.

  • Proof:   Consider first the case in which viϕ1/3v_{i}\leq\phi^{1}/3. Here, the utility to each player is ϕ1/3vivj\phi^{1}/3\geq v_{i}\geq v_{j}, and so the mediator is IR. Furthermore, players always pool on gω1g_{\omega}^{1} which is an equilibrium by Claim 2. Thus, in this case the mediator is also IC. Finally, total welfare here is 23ϕ1\frac{2}{3}\cdot\phi^{1}. Below we will show that this is within a 3/43/4 factor of  OPT.

    Before turning to the approximation factor, consider the case in which vi>ϕ1/3v_{i}>\phi^{1}/3. Observe that the total welfare in this case is at least ϕ1/2\phi^{1}/2. If on a given segment S(ω)S(\omega) the players pool on gω1g_{\omega}^{1}, then welfare here is 23ϕω1>12ϕω1\frac{2}{3}\cdot\phi^{1}_{\omega}>\frac{1}{2}\cdot\phi^{1}_{\omega}. If the players separate, then the welfare here is at least ϕω1/2\phi^{1}_{\omega}/2, since player ii always chooses gω1g_{\omega}^{1}. Thus, total welfare is at least ϕ1/2\phi^{1}/2.

    Next, the expected utility of player ii in this case is

    3αi3vi3αiϕ1ϕ13+(13αi3vi3αiϕ1)αi=vi.\frac{3\alpha_{i}-3v_{i}}{3\alpha_{i}-\phi^{1}}\cdot\frac{\phi^{1}}{3}+\left(1-\frac{3\alpha_{i}-3v_{i}}{3\alpha_{i}-\phi^{1}}\right)\alpha_{i}=v_{i}.

    Furthermore, since the total welfare is at least ϕ1/2\phi^{1}/2, player jj obtains welfare at least ϕ1/2vivj\phi^{1}/2-v_{i}\geq v_{j}, and so the mediator is IR here as well. In addition, since by Claim 2 the action gω1g_{\omega}^{1} is dominant in every segment S(ω)S(\omega), and in the mediator player ii always chooses gω1g_{\omega}^{1} and player jj either pools or best responds given her information, the mediator is IC.

    Finally, we turn to the approximation factor. By Claim 2, the maximal welfare possible on a segment SS in which ϕS13ϕS2\phi_{S}^{1}\leq 3\cdot\phi_{S}^{2} is (ϕS1+ϕS2)/2(\phi_{S}^{1}+\phi_{S}^{2})/2, and our mediator guarantees welfare at least ϕS12\frac{\phi_{S}^{1}}{2}. Hence, here we have approximation ratio at least

    ϕS1/2(ϕS1+ϕS2)/2ϕS1/2(ϕS1+ϕS1/3)/2=34.\frac{\phi_{S}^{1}/2}{(\phi_{S}^{1}+\phi_{S}^{2})/2}\geq\frac{\phi_{S}^{1}/2}{(\phi_{S}^{1}+\phi_{S}^{1}/3)/2}=\frac{3}{4}.

    In addition, by Claim 2, the maximal welfare possible on a segment SS in which ϕS1>3ϕS2\phi_{S}^{1}>3\cdot\phi_{S}^{2} is 2ϕS1/32\phi_{S}^{1}/3, and our mediator guarantees welfare at least ϕS12\frac{\phi_{S}^{1}}{2}. Hence, here we have approximation ratio at least

    ϕS1/22ϕS1/3=34.\frac{\phi_{S}^{1}/2}{2\phi_{S}^{1}/3}=\frac{3}{4}.

    Since a ratio of 3/43/4 is achieved for each segment, it is also achieved overall, and W(2v)34 OPTW(\mathcal{M}^{v}_{2})\geq\frac{3}{4}\cdot\text{\rm\,OPT}.    

We now show that the approximation factor of 3/43/4 is tight, by giving an example wherein there does not exist an IR mediator, not even one that is not IC, that achieves a factor higher than 3/43/4.

Example 1

Fix some ε>0\varepsilon>0, and suppose there are 1/ε+1\lceil 1/\varepsilon\rceil+1 segments. Within each segment SS there are two correct goods, gS1g_{S}^{1} and gS2g_{S}^{2}, that satisfy ϕS1=ϕS2/ε\phi_{S}^{1}=\phi_{S}^{2}/\varepsilon. Furthermore, all the gS1g_{S}^{1} goods are distinct, whereas for every SS the good gS2=g2g_{S}^{2}=g^{2} is the same. Player 1 has all the information, and so P1(ω)=S(ω)P_{1}(\omega)=S(\omega) for every ω\omega, whereas player 2 has none, and so P2(ω)=ΩP_{2}(\omega)=\Omega for every ω\omega. Finally, v1=ϕ1/2v_{1}=\phi^{1}/2 and v2=0v_{2}=0.

In this example, maximal welfare is achieved when all players pool on gS1g_{S}^{1} for every SS, and so  OPT=23ϕ1\text{\rm\,OPT}=\frac{2}{3}\cdot\phi^{1}. However, any IR mediator must give player 1 utility ϕ1/2\phi^{1}/2, and the only way to achieve this is to give player 2 no additional information, so that player 1 always chooses gS1g_{S}^{1} and player 2 always chooses g2g^{2} (this is, in fact, what the mediator 2v\mathcal{M}^{v}_{2} does). Total welfare here is then (ϕ1+ϕ2)/2=(1+ε)ϕ1/2(\phi^{1}+\phi^{2})/2=(1+\varepsilon)\phi^{1}/2, and so the approximation ratio is

(1+ε)ϕ1/22ϕ1/3=(1+ε)34ε034.\frac{(1+\varepsilon)\phi^{1}/2}{2\phi^{1}/3}=(1+\varepsilon)\cdot\frac{3}{4}\xrightarrow[\varepsilon\to 0]{}\frac{3}{4}.

In the previous example, the main bottleneck to achieving an approximation factor higher than 3/43/4 was the IR constraint. In this next example we show that even without the IR constraints, there does not exist an IC mediator that achieves an approximation factor higher than 4/54/5.

Example 2

Fix some ε>0\varepsilon>0, and suppose that in every segment SS there are two correct goods, gS1g_{S}^{1} and gS2g_{S}^{2}, that satisfy ϕS1=(32+ε)ϕS2\phi_{S}^{1}=\left(\frac{3}{2}+\varepsilon\right)\phi_{S}^{2}. Furthermore, the goods in every segment are distinct from goods in other segments. By Claim 2, maximal welfare here is achieved when players separate on each segment, and so  OPT=(ϕ1+ϕ2)/2\text{\rm\,OPT}=(\phi^{1}+\phi^{2})/2.

However, again by Claim 2, the dominant action for players is to pool on gS1g_{S}^{1}. Note that in any IC mediator, no player will ever choose a good g=gS2g=g_{S}^{2}, regardless of her information, since she can always improve her payoff by deviating to the strictly better gS1g_{S}^{1} (here we use the fact that all goods are distinct). Thus, the most any IC mediator can attain is when all players pool on the top good, leading to welfare 23ϕ1\frac{2}{3}\cdot\phi^{1} and so approximation factor

23ϕ1ϕ1+ϕ22=23ϕ1ϕ1+ϕ1/(3/2+ε)2=2+4ε352+εε045.\frac{\frac{2}{3}\cdot\phi^{1}}{\frac{\phi^{1}+\phi^{2}}{2}}=\frac{\frac{2}{3}\cdot\phi^{1}}{\phi^{1}+\frac{\phi^{1}/(3/2+\varepsilon)}{2}}=\frac{2+\frac{4\varepsilon}{3}}{\frac{5}{2}+\varepsilon}\xrightarrow[\varepsilon\to 0]{}\frac{4}{5}.

4.3 Approximately Optimal Mediator for General Setting

Each of the mediators in the previous two sections is IR, IC, and approximately-optimal under a particular setting of parameters, but not in the other. Mediator 1v\mathcal{M}^{v}_{1} of Mediator 3 is not IC when ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2} for some segment SS, since players wish to pool but the mediator facilitates separating. 2v\mathcal{M}^{v}_{2} of Mediator 4 is not IC when ϕS1<32ϕS2\phi_{S}^{1}<\frac{3}{2}\cdot\phi_{S}^{2} for some segment SS, since players wish to separate but the mediator facilitates pooling. In this section we show that the two mediators can be combined in order to yield an IR, IC, and approximately optimal mediator for any parameter setting. The main idea is straightforward: to run 1v\mathcal{M}^{v}_{1} of Mediator 3 on segments SS for which ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2}, and to run 2v\mathcal{M}^{v}_{2} of Mediator 4 on segments SS for which ϕS1>32ϕS2\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2}.

Before formalizing this mediator, we need some notation. Let S1={S:ϕS132ϕS2}S_{1}=\{S:\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2}\} and S2={S:ϕS1>32ϕS2}S_{2}=\{S:\phi_{S}^{1}>\frac{3}{2}\cdot\phi_{S}^{2}\}. Also, for each {1,2}\ell\in\{1,2\} let ϕ1\phi^{1}_{\ell} be the total weight of the most-likely correct goods conditional on SS_{\ell}, and ϕ2\phi^{2}_{\ell} be the total weight of the second most-likely correct goods conditional SS_{\ell}. Formally, for each k{1,2}k\in\{1,2\},

ϕk=SSϕSkPr[S|S].\phi_{\ell}^{k}=\sum_{S\in S_{\ell}}\phi_{S}^{k}\cdot\Pr\left[S|S_{\ell}\right].

Finally, modify the definitions of sjs_{j}^{\prime}, αj\alpha_{j}, and αi\alpha_{i} from Section 4.2 as follows. First, for any ω\omega with S(ω)S2S(\omega)\in S_{2} let sj(ω)argmaxgGE[uj(g,gω1)|Pj(ω),S2]s_{j}^{\prime}(\omega)\in\arg\max_{g\in G}E\left[u_{j}(g,g^{1}_{\omega})|P_{j}(\omega),S_{2}\right], so it is an optimal action for player jj conditional on the information that ωPj(ω)\omega\in P_{j}(\omega) and S(ω)S2S(\omega)\in S_{2}, in the hypothetical situation where ii always plays the most-likely correct good gω1g^{1}_{\omega} in every segment S(ω)S(\omega). Also, let αj\alpha_{j} and αi\alpha_{i} be the expected utilities of players jj and ii in this hypothetical situation: αj=E[uj(sj(ω),gω1)|S2]\alpha_{j}=E\left[u_{j}(s_{j}^{\prime}(\omega),g_{\omega}^{1})|S_{2}\right] and αi=E[ui(gω1,sj(ω))|S2]\alpha_{i}=E\left[u_{i}(g_{\omega}^{1},s_{j}^{\prime}(\omega))|S_{2}\right].

Mediator 5 Mediator for 𝒫={0,1,2}\mathcal{P}=\{0,1,2\} without jointly complete information
1:procedure 3v\mathcal{M}^{v}_{3}(V,WV,W) \triangleright V=P1(ω)V=P_{1}(\omega) and W=P2(ω)W=P_{2}(\omega) for realized ωΩ\omega\in\Omega
2:  SVWS\leftarrow V\cap W
3:  if ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2} then run 1v(S)\mathcal{M}^{v}_{1}(S) with (ϕ11,ϕ12)(\phi^{1}_{1},\phi^{2}_{1}) replacing (ϕ1,ϕ2)(\phi^{1},\phi^{2})
4:  else run 2v(V,W)\mathcal{M}^{v}_{2}(V,W) with (ϕ21,ϕ22)(\phi^{1}_{2},\phi^{2}_{2}) replacing (ϕ1,ϕ2)(\phi^{1},\phi^{2})   
Theorem 8

Let vivjv_{i}\geq v_{j}, and suppose that v1+v2(ϕ11+ϕ12)/2v_{1}+v_{2}\leq(\phi^{1}_{1}+\phi^{2}_{1})/2, that vimax{E[ui(gω1,sj(ω))|S2],ϕ112}v_{i}\leq\max\left\{E\left[u_{i}(g^{1}_{\omega},s_{j}^{\prime}(\omega))|S_{2}\right],\frac{\phi^{1}_{1}}{2}\right\}, and that vjϕ21/2viv_{j}\leq\phi^{1}_{2}/2-v_{i}. Then the mediator 3v\mathcal{M}^{v}_{3} above is IR, IC, and has welfare W(3v)34 OPTW(\mathcal{M}^{v}_{3})\geq\frac{3}{4}\cdot\text{\rm\,OPT}.

  • Proof:   Conditional on S1S_{1}, the conditions on vv of Theorem 6 are satisfied, and so 1v\mathcal{M}^{v}_{1} is IR and IC. Conditional on S2S_{2}, the conditions on vv of Theorem 7 are satisfied, and so 2v\mathcal{M}^{v}_{2} is IR and IC. That is, conditional on either S1S_{1} or S2S_{2}, the expected utilities of either player kk is at least vkv_{k}. Thus, kk’s expected utility in 3v\mathcal{M}^{v}_{3}, which is a mixture of her utilities conditional on S1S_{1} and on S2S_{2}, must also be greater than vkv_{k}. This implies that 3v\mathcal{M}^{v}_{3} is IR.

    Furthermore, since 1v\mathcal{M}^{v}_{1} and 2v\mathcal{M}^{v}_{2} are IC, for either player kk the strategy s¯k\overline{s}_{k} is a best-response to s¯3k\overline{s}_{3-k} conditional on either S1S_{1} or S2S_{2}. Thus, 3v\mathcal{M}^{v}_{3} is also IC.

    Finally, we consider the welfare guaranteed by 2v\mathcal{M}^{v}_{2}. Conditional on S1S_{1}, the welfare attained is (ϕ11+ϕ12)/2(\phi_{1}^{1}+\phi_{1}^{2})/2, by Theorem 6. This is the optimal attainable by any mediator when ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2} for every segment SS, by Claim 1. Hence, conditional on S1S_{1}, the welfare achieved by 3v\mathcal{M}^{v}_{3} is equal to  OPT (conditional on S1S_{1}).

    Conditional on S2S_{2}, the welfare attained on any segment SS is at least 34\frac{3}{4} times the welfare attainable on this segment by any mediator (by the proof of Theorem 7). Hence, the welfare achieved by 3v\mathcal{M}^{v}_{3} is at least 34 OPT\frac{3}{4}\cdot\text{\rm\,OPT} (conditional on S2S_{2}).

    Since conditional on either S1S_{1} or S2S_{2} yields welfare at least 34\frac{3}{4} times the optimal, the total welfare of 3v\mathcal{M}^{v}_{3} is W(3v)34 OPTW(\mathcal{M}^{v}_{3})\geq\frac{3}{4}\cdot\text{\rm\,OPT}.    

5 Do Players Share More Data With or Without an Amazon?

In this section we consider the intriguing question of whether players optimally share more data in the presence of an Amazon or in its absence. While it seems intuitive that players would share more data when facing stronger outside competition, we show here that this is not necessarily the case. We also show that the reason the intuition fails is related to the possible conflict between equilibrium and welfare maximization identified in Claim 2.

Because we are interested in comparing data sharing across two different environments—without and with an Amazon—we focus on our main interpretation of the base values vv, namely, that they correspond to the equilibrium expected utilities absent a mediator. Of course, because the two environments differ, so will these equilibrium expected utilities. We will thus be interested in comparing the benefit of data sharing relative to firms’ no-sharing equilibrium utilities vnoAv^{noA} absent an Amazon with the benefit of data sharing relative to firms’ no-sharing equilibrium utilities vAv^{A} in the presence of an Amazon.

We begin our analysis in Section 5.1, where we show that, when there is jointly complete information, the intuition that firms share more data in the presence of an Amazon than in its absence does hold. In particular, we show that if, when there is no Amazon, data sharing strictly increases welfare relative to the welfare-maximizing equilibrium sans data sharing, then optimal data sharing strictly increases welfare in the environment with an Amazon relative to any equilibrium sans data sharing.

In Section 5.2 we then drop the assumption of jointly complete information, and show that this result no longer holds. In particular, we describe two settings:

  1. 1.

    A setting where full data-sharing is optimal in the presence of an Amazon, but no data-sharing is optimal in the absence of an Amazon.

  2. 2.

    A setting where full data-sharing is optimal in the absence of an Amazon, but no data-sharing is optimal in the presence of an Amazon. In this setting, data sharing is strictly beneficial to firms in the absence of an Amazon but not in its presence.

In Section 5.2 we also identify and discuss the driving force behind this difference, namely, the possible conflict between equilibrium and welfare maximization.

5.1 Conditions for Sharing with Jointly Complete Information

In this section we show that, under jointly complete information, when players can strictly benefit from data sharing in the absence of an Amazon, they can also strictly benefit from data sharing in the presence of an Amazon.

Theorem 9

Fix an unmediated game Γ=(Ω,𝒫,G2,(Pi())i𝒫,(ui)i𝒫,π)\Gamma=\left(\Omega,\mathcal{P},G^{2},(P_{i}(\cdot))_{i\in\mathcal{P}},(u_{i})_{i\in\mathcal{P}},\pi\right) in which there is jointly complete information and no Amazon, and let (v1,v2)(v_{1},v_{2}) be the expected utilities in a BNE of Γ\Gamma for which v1+v2v_{1}+v_{2} is maximal. Also, let Γ=(Ω,𝒫,G3,(Pi())i𝒫,(ui)i𝒫,π)\Gamma^{\prime}=\left(\Omega,\mathcal{P},G^{3},(P_{i}(\cdot))_{i\in\mathcal{P}},(u_{i})_{i\in\mathcal{P}},\pi\right) be the same unmediated game, except with an Amazon, and let (v1,v2)(v_{1}^{\prime},v_{2}^{\prime}) be the expected utilities in some BNE of Γ\Gamma^{\prime}. Then if there exists an IR, IC mediator \mathcal{M} in the game without an Amazon such that W()>v1+v2W(\mathcal{M})>v_{1}+v_{2}, there also exists an IR, IC mediator \mathcal{M}^{\prime} in the game with an Amazon such that W()>v1+v2W(\mathcal{M}^{\prime})>v_{1}^{\prime}+v_{2}^{\prime}.

The converse of Theorem 9 is not true—there are examples where players strictly benefit from data sharing in the presence of an Amazon but not in its absence. A simple example is the game described in Figure 1 in the Introduction, with parameters α>β=γ>δ=0\alpha>\beta=\gamma>\delta=0. In the absence of an Amazon, the unique equilibrium there yields maximal welfare of 1, since at least one consumer always offers the correct good. Thus, data sharing cannot strictly improve welfare. In the presence of an Amazon, however, data sharing can be strictly beneficial, as described in the Introduction.

  • Proof:   Suppose towards a contradiction that there exists an IR, IC mediator \mathcal{M} in the game without an Amazon such that W()>v1+v2W(\mathcal{M})>v_{1}+v_{2}, but that for some (v1,v2)(v_{1}^{\prime},v_{2}^{\prime}) there does not exist an IR, IC mediator \mathcal{M}^{\prime} in the game with an Amazon such that W()>v1+v2W(\mathcal{M}^{\prime})>v_{1}^{\prime}+v_{2}^{\prime}. Let EE be the equilibrium in the game with an Amazon that yields payoffs (v1,v2)(v_{1}^{\prime},v_{2}^{\prime}), and let 0\mathcal{M}_{0} be the mediator that does nothing other than recommend players their actions from the equilibrium EE. This mediator is IR, IC, and yields welfare W(0)=v1+v2W(\mathcal{M}_{0})=v_{1}^{\prime}+v_{2}^{\prime}. Since, by assumption, no IR, IC mediator yields higher welfare, 0\mathcal{M}_{0} is optimal relative to the feasible pair (v1,v2)(v_{1}^{\prime},v_{2}^{\prime}). Thus, by Theorem 5, 0\mathcal{M}_{0} is fully revealing to at least one of the two players. In particular, this implies that the BNE EE in the game with an Amazon is one in which at least one of the players always offers the correct good gωg_{\omega} to every consumer ω\omega.

    We now argue that this implies that the same EE is an equilibrium also in the game without an Amazon. The player ii that always offers the correct good is clearly playing a dominant strategy, both in the game with an Amazon and in the game without, as any deviation can only lower her utility. The other player jj is playing a best response in the game with an Amazon. Because both player ii and the Amazon always offer the correct good, jj’s strategy must maximize the probability of offering the correct good conditional on knowing (only) Pj(ω)P_{j}(\omega). However, this strategy is also an equilibrium in the game without an Amazon, since ii always offers the correct good.

    The equilibrium EE is such that at least one of the players always offers the correct good, and so in the game without an Amazon it yields a total welfare of 1. However, this contradicts the assumption that there exists a mediator \mathcal{M} that strictly improves welfare, since no mediator can yield welfare W()>1W(\mathcal{M})>1.    

5.2 Data-Sharing with No Jointly Complete Information

The analysis of Section 5.1 shows that when players strictly benefit from data sharing in the absence of an Amazon, they necessarily also benefit from data sharing in the presence of an Amazon. This is consistent with the intuition that players can gain more by sharing data when they also face outside competition. However, this result relies on the assumption of jointly complete information. In this section we drop the assumption, and show that in that case this intuition is not complete. In particular, while there are settings in which players benefit more from sharing data in the presence of an Amazon, there are also settings in which players benefit from data sharing in the absence of an Amazon but not in the presence of an Amazon.

More specifically, in Section 5.2.1 below we describe a setting where:

  • In the absence of an Amazon, the optimal welfare is achieved when players share no data and not when they fully share data;

  • In the presence of an Amazon, the optimal welfare is achieved when players fully share data and not when they share no data.

In contrast, in Section 5.2.1 we describe a setting where:

  • In the absence of an Amazon, the optimal welfare is achieved when players fully share data and not when they share no data;

  • In the presence of an Amazon, the optimal welfare is achieved when players share no data and not when they fully share data.

The main driving force behind the distinction between the two settings is whether or not there is a conflict between equilibrium and welfare maximization, as formalized in Claim 2. The situation described in Section 5.2.1 is one where ϕS13ϕS2\phi_{S}^{1}\geq 3\cdot\phi_{S}^{2} for every segment SS. In this case, under full data sharing, it is a dominant strategy for players to pool on gS1g^{1}_{S} (both with and without an Amazon). In the presence of an Amazon, such pooling is also welfare maximizing, by Claim 2. In the absence of an Amazon, however, separating is always welfare maximizing. Thus, full data sharing leads to the optimal welfare in the former case, but not in the latter.

In contrast, the situation described in Section 5.2.2 is one where ϕS1(32ϕS2,2ϕS2)\phi_{S}^{1}\in(\frac{3}{2}\phi_{S}^{2},2\phi_{S}^{2}). In this case, under full data sharing, in the absence of an Amazon it is an equilibrium for players to separate, whereas in the presence of an Amazon it is dominant for players to pool (again, by Claim 2). Furthermore, in both settings, separating is the welfare maximizing outcome. Thus, full data sharing leads to the optimal welfare in the absence of an Amazon, but not in its presence.

5.2.1 More Sharing With an Amazon

When there is an Amazon, if for each segment SS it holds that ϕS13ϕS2\phi_{S}^{1}\geq 3\cdot\phi_{S}^{2}, then maximal welfare is achieved when players pool on ϕS1\phi_{S}^{1} (by Claim 2). This maximal welfare is can be achieved by full data-sharing (assuming the IR constraint is satisfied): On each segment SS, the mediator recommends gS1g_{S}^{1} to both players, and this is optimal for them (again by Claim 2). In contrast, when there is no Amazon, full data-sharing may not be optimal (regardless of the IR constraint), as here maximal welfare is attained by separation.

In this section we construct an example in which full data-sharing is strictly suboptimal when there is no Amazon. While the example from the introduction illustrated this, it is not quite what we desire, since in that example the mediator with an Amazon is also an optimal mediator without. What we would like is an example where full data-sharing is optimal only with an Amazon. In the analysis that follows, we first ignore the IR constraint. At the end of the section we then show that, with the proper modification, the argument holds also when taking the IR constraint into account.

In the following construction, for simplicity we refer to different types of consumers ω\omega by the good gωg_{\omega} they desire. There are two segments, S1={g1}S^{1}=\{g_{1}\} and S2={g1,g2}S^{2}=\{g_{1},g_{2}\}, and players’ partitions are Π1={S1,S2}\Pi_{1}=\{S^{1},S^{2}\} and Π2=Ω\Pi_{2}=\Omega. That is, player 1 learns whether the realized ω\omega lies in the segment S1S^{1} (and so the correct good is g1g_{1}) or S2S^{2} (and so the correct good is g1g_{1} or g2g_{2}), whereas player 2 does not learn anything. The prior is such that both segments are equally likely, and that Pr[g2|S2]>3Pr[g1|S2]>0\Pr\left[g_{2}|S^{2}\right]>3\Pr\left[g_{1}|S^{2}\right]>0. That is, in segment S2S^{2}, the good g2g_{2} is more than three times more likely to be correct than good g1g_{1}.

In the presence of an Amazon, the welfare maximizing choices lead to pooling on g1g_{1} in segment S1S^{1} and pooling on g2g_{2} in segment S2S^{2}, by Claim 2. This can be achieved by full data-sharing, leading to an equilibrium in which players take these pooling actions (again, by Claim 2). In contrast, with no data sharing, player 2 chooses the action g1g_{1} (in both segments), leading to lower welfare due to separation on S2S^{2}.

Now suppose there is no Amazon. Here the welfare maximizing choices have players separating on segment S2S^{2}. Without any additional information, player 2 chooses action g1g_{1}, leading to such optimal separation. With data sharing, however, player 2 takes action g2g_{2} on S2S^{2}, pooling with player 1, leading to lower welfare. Thus, in this example, full data-sharing is optimal in the presence of an Amazon but not in its absence. Furthermore, without an Amazon, it is optimal to not share any data.

Finally, recall that we thus far neglected the IR constraint. We now modify the example so that the conclusion holds when also taking the IR constraints into account. In particular, duplicate the construction with a different set of segments and consumers, but where players 1 and 2 have the opposite information. Formally, in addition to the two segments, S1={g1}S^{1}=\{g_{1}\} and S2={g1,g2}S^{2}=\{g_{1},g_{2}\}, suppose there are also segments S¯1={g¯1}\overline{S}^{1}=\{\overline{g}_{1}\} and S¯2={g¯1,g¯2}\overline{S}^{2}=\{\overline{g}_{1},\overline{g}_{2}\}, where the goods g1g^{1} and g2g^{2} are distinct from the goods g¯1\overline{g}^{1} and g¯2\overline{g}^{2}. Furthermore, the partitions are Π1={S1,S2,S¯1S¯2}\Pi_{1}=\{S^{1},S^{2},\overline{S}^{1}\vee\overline{S}^{2}\} and Π2={S1S2,S¯1,S¯2}\Pi_{2}=\{S^{1}\vee S^{2},\overline{S}^{1},\overline{S}^{2}\}. That is, if the realized segment is S1S^{1} or S2S^{2}, then again player 1 learns the segment and player 2 learns only that the realized segment is either S1S^{1} or S2S^{2}. Symmetrically, if the realized segment is S¯1\overline{S}^{1} or S¯2\overline{S}^{2}, then player 2 learns the segment and player 1 learns only that the realized segment is either S¯1\overline{S}^{1} or S¯2\overline{S}^{2}. Finally, suppose that the realized segment is S1S^{1} or S2S^{2} with probability 1/21/2, and S¯1\overline{S}^{1} or S¯2\overline{S}^{2} with probability 1/21/2.

Because the example is symmetric, the utilities of the two players, absent data sharing, are equal. Similarly, the utilities of the two players, under full data-sharing, are also equal. Thus, whenever full data-sharing leads to strictly higher welfare than no data-sharing (as in the presence of an Amazon above), then it also satisfies the IR constraint. And if no data-sharing is optimal in the original example (as in the absence of an Amazon above), then it is also optimal in the modified setting. Thus, the conclusion that full data-sharing may be optimal in the presence of an Amazon but not in its absence holds also when we take the IR constraint into account.

5.2.2 More Sharing Without an Amazon

When there is no Amazon, the optimal welfare is achieved when, for each segment SS, players separate—one choosing gS1g_{S}^{1} and the other gS2g_{S}^{2}—as this leads to maximal utility gS1+gS2g_{S}^{1}+g_{S}^{2} in each segment. Furthermore, if ϕS1<2ϕS2\phi_{S}^{1}<2\phi_{S}^{2} for every segment SS, then this maximal welfare can be achieved by full data-sharing: On each segment SS, the mediator recommends gS1g_{S}^{1} to one of the players, and gS2g_{S}^{2} to the other. In contrast, in the presence of an Amazon, if ϕS1<2ϕS2\phi_{S}^{1}<2\phi_{S}^{2} for every segment SS then full data-sharing may not be optimal. In this section we construct an example in which full data-sharing is strictly suboptimal with an Amazon. As in Section 5.2.1 above, in the following construction, for simplicity we refer to different types of consumers ω\omega by the good gωg_{\omega} they desire. We also ignore the IR constraint. However, again as in Section 5.2.1 above, the conclusion holds also when we take the IR constraint into account if we consider a modified, symmetric example in which the segments and partitions are duplicated.

Suppose there are three segments, S1={g1}S^{1}=\{g_{1}\}, S2={g3}S^{2}=\{g_{3}\}, and S3={g1,g2,g3}S^{3}=\{g_{1},g_{2},g_{3}\}, and players’ partitions are Π1={S1,S2S3}\Pi_{1}=\{S^{1},S^{2}\vee S^{3}\} and Π2={S2,S1S3}\Pi_{2}=\{S^{2},S^{1}\vee S^{3}\}. That is, player 1 learns whether the realized ω\omega lies in the segment S1S^{1}, and so gω=g1g_{\omega}=g_{1}, or whether ω\omega lies in one of S2S^{2} or S3S^{3}, and so the correct good for the realized type is one of g1g_{1}, g2g_{2}, or g3g_{3} (with relative probabilities to be specified shortly). Player 2 is similar, but learns whether the realized segment is S2S^{2} or one of S1S^{1} or S3S^{3}. The prior is such that the segments are equally likely. Denote by ϕ3k=Pr[gk|S3]\phi^{k}_{3}=\Pr\left[g_{k}|S_{3}\right] the probability that good gkg_{k} is correct, conditional on segment S3S^{3}, and suppose that ϕ31(3ϕ322,2ϕ32)\phi^{1}_{3}\in\left(\frac{3\cdot\phi^{2}_{3}}{2},2\cdot\phi^{2}_{3}\right) and that ϕ32>ϕ33>ϕ31/3\phi^{2}_{3}>\phi^{3}_{3}>\phi^{1}_{3}/3.

Suppose there is no Amazon. The optimal welfare here, (2+ϕ31+ϕ32)/3(2+\phi^{1}_{3}+\phi^{2}_{3})/3, is attained when at least one player chooses g1g_{1} conditional on S1S^{1}, at least one player chooses g3g_{3} conditional on S2S^{2}, and when one player chooses g1g_{1} and another chooses g2g_{2} conditional on S3S^{3}. Full data-sharing leads players to learn the correct segment, and so to maximal welfare: on segments S1S^{1} and S2S^{2} players pool on the correct good, and on segment S3S^{3} it is an equilibrium for players to separate.

In contrast, without data sharing, while player 2 will end up choosing g1g_{1} on S3S^{3}, player 1 will end up choosing g3g_{3} on S3S^{3}. This is because player 1 cannot differentiate between S2S^{2} and S3S^{3}, and conditional on S2S3S^{2}\vee S^{3} the strategy of choosing good g3g_{3} is dominant. This, however, leads to total welfare (2+ϕ31+ϕ33)/3<(2+ϕ31+ϕ32)/3(2+\phi^{1}_{3}+\phi^{3}_{3})/3<(2+\phi^{1}_{3}+\phi^{2}_{3})/3, and so full data-sharing is better than no sharing.

Now suppose there is an Amazon. When there is no data-sharing, then conditional on segments S1S^{1} or S2S^{2}, welfare will be 2/32/3, since for both players it is a dominant strategy to offer the correct good. On segment S3S^{3} the strategies g3g_{3} and g1g_{1} of players 1 and 2, respectively, will again be dominant (as in the case of no Amazon), leading to conditional welfare (ϕ31+ϕ33)/2(\phi^{1}_{3}+\phi^{3}_{3})/2 here.

What happens will full data-sharing? Here, on segment S3S^{3}, players will pool on action g1g_{1}, by Claim 2 and the assumption that ϕ31>32ϕ32\phi_{3}^{1}>\frac{3}{2}\phi_{3}^{2}. Thus, welfare conditional on S3S^{3} will be

23ϕ31<ϕ31+ϕ332,\frac{2}{3}\cdot\phi^{1}_{3}<\frac{\phi^{1}_{3}+\phi^{3}_{3}}{2},

where the inequality follows from the assumption that ϕ33>ϕ31/3\phi^{3}_{3}>\phi^{1}_{3}/3. Since, conditional on S1S^{1} or S2S^{2}, welfare is the same with full sharing and with no sharing, total (unconditional) welfare is higher with no sharing than with full sharing.

Furthermore, observe that under no data-sharing, welfare is maximal conditional on S1S_{1} or S2S_{2}. Under S3S_{3}, higher welfare could (only) be attained if players were to separate on g1g_{1} and g2g_{2}. However, such separation can never occur in equilibrium: Any player that is supposed to offer g2g_{2} can strictly benefit by deviating to offering g1g_{1}, regardless of the other player’s action (by Claim 2 and since ϕ31>32ϕ32\phi_{3}^{1}>\frac{3}{2}\phi_{3}^{2}). Therefore, under no data-sharing players obtain maximal welfare subject to the IC constraint being satisfied.

Thus, in this example, full data-sharing is optimal in the absence of an Amazon, but not in its presence. In contrast, no data-sharing is optimal in the presence of an Amazon, but not in its absence. In particular, this implies that data sharing is strictly beneficial in the absence of an Amazon, but not in its presence.

6 Conclusion

In this paper we proposed a simple model to study coopetitive data sharing between firms. We designed several data-sharing schemes, in the form of mediators, that are optimal in their respective domains, with and without an Amazon.

In our model, we assumed that the mediator only facilitates data sharing between firms. One natural extension is to allow the mediator to also use monetary transfers between them. In Appendix 7.1 we analyze this extension. We show that transfers can loosen the IR constraint, implying that, under jointly complete information, full data-sharing is often optimal. However, when there is no jointly complete information, full data-sharing may violate the IC constraint, and in this case our mediators (even without transfers) are close to optimal.

One limitation of our model is that it considers data sharing between only two firms. What happens when there are n>2n>2 small firms, and a mediator that can facilitate sharing between all of them? We leave a thorough analysis of optimal mediators in this setting for future work, but provide some preliminary results in Appendix 7.2. In particular, we analyze the case of jointly complete information. We show that, in coopetition without an Amazon, a generalization of the mediator from the introductory example to nn players is optimal. In coopetition against an Amazon, however, the situation is more complicated. Nonetheless, we provide necessary and sufficient conditions for full data-sharing to be optimal in this case.

Coopetitive data-sharing is crucial for firms’ survival in current online markets, and the model and results of this paper have only scratched the surface of what can and should be done. In particular, the paper leaves numerous directions open for future research. In addition to a thorough analysis of optimal data-sharing with more than 2 players, a particularly important direction is to analyze data sharing in a variety of markets different from the one studied here.

References

  • Anderson (2006) Anderson, C. (2006). The long tail: Why the future of business is selling less of more. Hachette Books.
  • Aumann (1976) Aumann, R. J. (1976). Agreeing to disagree. The annals of statistics 1236–1239.
  • Ben-Porat and Tennenholtz (2019) Ben-Porat, O. and Tennenholtz, M. (2019). Regression equilibrium. In Proceedings of the 2019 ACM Conference on Economics and Computation. 173–191.
  • Bengtsson and Kock (2000) Bengtsson, M. and Kock, S. (2000). “Coopetition” in business networks – to cooperate and compete simultaneously. Industrial Marketing Management, 29 411–426.
  • Bergemann and Bonatti (2019) Bergemann, D. and Bonatti, A. (2019). Markets for information: An introduction. Annual Review of Economics, 11 85–107.
  • Bergemann and Morris (2013) Bergemann, D. and Morris, S. (2013). Robust predictions in games with incomplete information. Econometrica, 81 1251–1308.
  • Bergemann and Morris (2016) Bergemann, D. and Morris, S. (2016). Bayes correlated equilibrium and the comparison of information structures in games. Theoretical Economics, 11 487–522.
  • Bergemann and Morris (2019) Bergemann, D. and Morris, S. (2019). Information design: A unified perspective. Journal of Economic Literature, 57 44–95.
  • Bimpikis et al. (2019) Bimpikis, K., Crapis, D. and Tahbaz-Salehi, A. (2019). Information sale and competition. Management Science, 65 2646–2664.
  • Brandenburger and Nalebuff (2011) Brandenburger, A. M. and Nalebuff, B. J. (2011). Co-opetition. Crown Business.
  • Clarke (1983) Clarke, R. N. (1983). Collusion and the incentives for information sharing. The Bell Journal of Economics 383–394.
  • de Zegher and Lo (2020) de Zegher, J. F. and Lo, I. (2020). Crowdsourcing market information from competitors. Available at SSRN 3537625.
  • European Commission (2020) European Commission (2020). A European strategy for data. https://eur-lex.europa.eu/legal-content/EN/TXT/?qid=1593073685620&uri=CELEX:52020DC0066. Accessed: 2021-05-13.
  • Feng et al. (2019) Feng, Y., Gradwohl, R., Hartline, J., Johnsen, A. and Nekipelov, D. (2019). Bias-variance games. arXiv preprint arXiv:1909.03618.
  • Forges (1993) Forges, F. (1993). Five legitimate definitions of correlated equilibrium in games with incomplete information. Theory and Decision, 35 277–310.
  • Galperti and Perego (2018) Galperti, S. and Perego, J. (2018). A dual perspective on information design. Available at SSRN 3297406.
  • Gehrig and Stenbacka (2007) Gehrig, T. and Stenbacka, R. (2007). Information sharing and lending market competition with switching costs and poaching. European Economic Review, 51 77–99.
  • Gentzkow and Kamenica (2016) Gentzkow, M. and Kamenica, E. (2016). Competition in persuasion. The Review of Economic Studies, 84 300–322.
  • Gu et al. (2019) Gu, Y., Madio, L. and Reggiani, C. (2019). Data brokers co-opetition. Available at SSRN 3308384.
  • Ha and Tong (2008) Ha, A. Y. and Tong, S. (2008). Contracting and information sharing under supply chain competition. Management Science, 54 701–715.
  • Ichihashi (2020) Ichihashi, S. (2020). Competing data intermediaries. Manuscript, available at https://shota2.github.io/research/data.pdf.
  • Jappelli and Pagano (2002) Jappelli, T. and Pagano, M. (2002). Information sharing, lending and defaults: Cross-country evidence. Journal of Banking & Finance, 26 2017–2045.
  • Jentzsch et al. (2013) Jentzsch, N., Sapi, G. and Suleymanova, I. (2013). Targeted pricing and customer data sharing among rivals. International Journal of Industrial Organization, 31 131–144.
  • Kairouz et al. (2019) Kairouz, P., McMahan, H. B., Avent, B., Bellet, A., Bennis, M., Bhagoji, A. N., Bonawitz, K., Charles, Z., Cormode, G., Cummings, R. et al. (2019). Advances and open problems in federated learning. arXiv preprint arXiv:1912.04977.
  • Kamenica and Gentzkow (2011) Kamenica, E. and Gentzkow, M. (2011). Bayesian persuasion. American Economic Review, 101 2590–2615.
  • Kearns et al. (2014) Kearns, M., Pai, M., Roth, A. and Ullman, J. (2014). Mechanism design in large games: Incentives and privacy. In Proceedings of the 5th Conference on Innovations in Theoretical Computer Science. 403–410.
  • Liu and Serfes (2006) Liu, Q. and Serfes, K. (2006). Customer information sharing among rival firms. European Economic Review, 50 1571–1600.
  • Loshen (2014) Loshen, D. (2014). Leveraging cloud-based analytics for crowd-sourced intelligence. https://gigaom.com/report/leveraging-cloud-based-analytics-for-crowdsourced-intelligence/. [Gigaom Research; accessed 6-June-2020].
  • Lyu et al. (2020) Lyu, L., Yu, J., Nandakumar, K., Li, Y., Ma, X., Jin, J., Yu, H. and Ng, K. S. (2020). Towards fair and privacy-preserving federated deep models. IEEE Transactions on Parallel and Distributed Systems, 31 2524–2541.
  • Mansour et al. (2018) Mansour, Y., Slivkins, A. and Wu, Z. S. (2018). Competing bandits: Learning under competition. In 9th Innovations in Theoretical Computer Science Conference (ITCS 2018).
  • Martens et al. (2020) Martens, B., De Streel, A., Graef, I., Tombal, T. and Duch-Brown, N. (2020). Business-to-business data sharing: An economic and legal analysis. European Commission, JRC Digital Economy Working Paper Series, 2020. https://ec.europa.eu/jrc/sites/jrcsh/files/jrc121336.pdf. Accessed: 2021-05-13.
  • Mathevet et al. (2020) Mathevet, L., Perego, J. and Taneva, I. (2020). On information design in games. Journal of Political Economy, 128 1370–1404.
  • Monderer and Tennenholtz (2004) Monderer, D. and Tennenholtz, M. (2004). k-implementation. Journal of Artificial Intelligence Research, 21 37–62.
  • Monderer and Tennenholtz (2009) Monderer, D. and Tennenholtz, M. (2009). Strong mediated equilibrium. Artificial Intelligence, 173 180–195.
  • Myerson (1991) Myerson, R. B. (1991). Game Theory: Analysis of Conflict. Harvard University Press.
  • Pagano and Jappelli (1993) Pagano, M. and Jappelli, T. (1993). Information sharing in credit markets. The Journal of Finance, 48 1693–1718.
  • Pathak et al. (2010) Pathak, B., Garfinkel, R., Gopal, R. D., Venkatesan, R. and Yin, F. (2010). Empirical analysis of the impact of recommender systems on sales. Journal of Management Information Systems, 27 159–188.
  • Raith (1996) Raith, M. (1996). A general model of information sharing in oligopoly. Journal of economic theory, 71 260–288.
  • Shamir and Shin (2016) Shamir, N. and Shin, H. (2016). Public forecast information sharing in a market with competing supply chains. Management Science, 62 2994–3022.
  • Simon (1956) Simon, H. A. (1956). Rational choice and the structure of the environment. Psychological review, 63 129.
  • Yang et al. (2019) Yang, Q., Liu, Y., Chen, T. and Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10 1–19.
  • Zhan et al. (2021) Zhan, Y., Zhang, J., Hong, Z., Wu, L., Li, P. and Guo, S. (2021). A survey of incentive mechanism design for federated learning. IEEE Transactions on Emerging Topics in Computing.

Appendix

7 Extensions

7.1 Transfers

In this section we consider an extension to the model: the possibility of monetary transfers between players. With transfers, the mediator may require one player to pay an amount cc, and another to receive that amount cc, when opting in. Assume total utilities are quasi-linear in these transfers—that is, a player’s total utility is her utility from the game plus the change in her monetary position. How does the possibility of such transfers affect the design of mediators?

A first observation is that the optimality of mediator noAv\mathcal{M}^{v}_{noA}, under jointly complete information and without an Amazon, is not affected, since that mediator leads to a globally optimal sum of player’s utilities. Any transfers will only change the distribution of utilities (and hence whether or not the mediator is IR), not their sum.

When players have jointly complete information and they compete against an Amazon, however, the optimal mediator can be different. In particular, when there are transfers, then a mediator that is fully revealing to both players is optimal whenever v1+v22/3v_{1}+v_{2}\leq 2/3. If both vi1/3v_{i}\leq 1/3 and vj1/3v_{j}\leq 1/3, then the mediator that is fully revealing to both is optimal even without a transfer (this is what Av\mathcal{M}_{A}^{v} does). If vi1/3vjv_{i}\geq 1/3\geq v_{j}, let the mediator set a transfer c=(vi1/3)c=(v_{i}-1/3), have player jj transfer cc to player ii, and then always recommend gωg_{\omega} to both players. After the transfer this leads to utility 1/3+c1/3+c to player ii and 1/3c1/3-c to player jj. Observe that 1/3+c=1/3+(vi1/3)=vi1/3+c=1/3+(v_{i}-1/3)=v_{i} and 1/3c=1/3(vi1/3)=2/3vivj1/3-c=1/3-(v_{i}-1/3)=2/3-v_{i}\geq v_{j}, and so the mediator is IR. The mediator is IC since each player plays the dominant action gωg_{\omega} for every ω\omega. And, finally, the mediator is optimal, since the sum of utilities is 2/32/3, which is globally maximal.

When there is no jointly complete information the situation is slightly more complicated, and here we consider three settings of parameters. First, if ϕS132ϕS2\phi_{S}^{1}\leq\frac{3}{2}\cdot\phi_{S}^{2}, then the mediator 1v\mathcal{M}_{1}^{v} of Mediator 3 is IR, IC, and optimal, even without transfers. Second, if ϕS1>3ϕS2\phi_{S}^{1}>3\cdot\phi_{S}^{2}, then by Claim 2 the pooling strategy is both dominant and welfare maximizing. In this case, a mediator like the mediator with jointly complete information and transfers above will be optimal whenever v1+v223ϕ1v_{1}+v_{2}\leq\frac{2}{3}\cdot\phi^{1}. If both viϕ1/3v_{i}\leq\phi^{1}/3 and vjϕ1/3v_{j}\leq\phi^{1}/3, then the mediator that is fully revealing to both is optimal even without a transfer (as in 2v\mathcal{M}_{2}^{v}). If viϕ1/3vjv_{i}\geq\phi^{1}/3\geq v_{j}, let the mediator set a transfer c=(viϕ1/3)c=(v_{i}-\phi^{1}/3), have player jj transfer cc to player ii, and then always recommend gωg_{\omega} to both players. This mediator is IR, IC, and optimal, by the same reasoning as above.

Third, if ϕS1(32ϕS2,3ϕS2)\phi_{S}^{1}\in(\frac{3}{2}\cdot\phi_{S}^{2},3\cdot\phi_{S}^{2}), then full data-sharing does not always satisfy the IC constraint. This is demonstrated by the example in Section 5.2.1, in which the optimal IC mediator is one in which there is no data sharing (and it is strictly better than full data-sharing). Since the example is symmetric, players’ payoffs are identical, and so transfers do not alleviate the situation. Nonetheless, recall that mediator 3v\mathcal{M}^{v}_{3} (analyzed in Theorem 8) yields welfare at least 34 OPT\frac{3}{4}\cdot\text{\rm\,OPT}, where  OPT is the maximal welfare achieved by any mediator, including one that is not IR or IC. Furthermore, Example 2 shows that no mediator (not even one that does no satisfy IR) can in general achieve more than 45 OPT\frac{4}{5}\cdot\text{\rm\,OPT}. This implies that even the best mediator with transfers can yield at most 45 OPT\frac{4}{5}\cdot\text{\rm\,OPT}. Our mediator 3v\mathcal{M}^{v}_{3} achieves close to this—namely, 34 OPT\frac{3}{4}\cdot\text{\rm\,OPT}without transfers.

7.2 Many Players

Throughout this paper we analyzed the case of 2 players and possibly an Amazon. In this section we discuss an extension of the model and results to n>2n>2 players. We leave a thorough analysis of optimal mediators in this setting for future work, but here we provide some preliminary results. In particular, we consider the case in which players have jointly complete information: i{1,,n}Pi(ω)={ω}\bigcap_{i\in\{1,\ldots,n\}}P_{i}(\omega)=\{\omega\} for every ωΩ\omega\in\Omega, where Pi(ω)P_{i}(\omega) is the partition element of player ii when consumer type ω\omega is realized.

The main take-away from this section is that in coopetition without an Amazon, a generalization of the mediator from the introductory example to nn players is optimal. In coopetition against an Amazon, however, the situation is more complicated. We provide necessary and sufficient conditions for full data-sharing to be optimal in this case.

7.2.1 Many Players Without Amazon

Let E=(s1,,sn)E=(s_{1},\ldots,s_{n}) be a BNE of the unmediated game, where si(ω)s_{i}(\omega) is the equilibrium strategy of player ii when she obtains information Pi(ω)P_{i}(\omega). We will design an optimal mediator for the setting in which, for every ii, the base value viv_{i} is at most the utility of player ii in EE. Consider the following mediator, and observe that it uses the equilibrium EE in its construction (in contrast with noAv\mathcal{M}_{noA}^{v}, which only uses the base values vv).

Mediator 6 Mediator for 𝒫={1,,n}\mathcal{P}=\{1,\ldots,n\} given equilibrium E=(s1,,sn)E=(s_{1},\ldots,s_{n})
1:procedure noAE\mathcal{M}^{E}_{noA}(V1,,VnV_{1},\ldots,V_{n}) \triangleright Vi=Pi(ω)V_{i}=P_{i}(\omega) for realized ωΩ\omega\in\Omega and every ii
2:  ωi{1,,n}Vi\omega\leftarrow\bigcap_{i\in\{1,\ldots,n\}}V_{i}
3:  gia draw from distribution si(ω)g_{i}\leftarrow\mbox{a draw from distribution }s_{i}(\omega) for every ii
4:  if (gigωi𝒫)\left(g_{i}\neq g_{\omega}~\forall i\in\mathcal{P}\right) then return (gω,,gω)\left(g_{\omega},\ldots,g_{\omega}\right)
5:  else return (g1,,gn)\left(g_{1},\ldots,g_{n}\right)   
Theorem 10

Fix a BNE E=(s1,,sn)E=(s_{1},\ldots,s_{n}) of the unmediated game with no Amazon, and for every ii let viv_{i} be at most the expected utility of player ii in EE. Then noAE\mathcal{M}^{E}_{noA} is IR, IC, and optimal.

In mediator noAE\mathcal{M}^{E}_{noA}, every player obtains the same utility as under EE, plus additional utility whenever no player correctly chooses gωg_{\omega}. Thus, the mediator is IR. Furthermore, since under this mediator, for every ω\omega at least one player chooses gωg_{\omega}, and so the total welfare is 1, which is maximal. The main challenge in the proof is to show that the mediator is IC: in the mediator each player either plays her equilibrium strategy si(ω)s_{i}(\omega) or the dominant action gωg_{\omega}, and the proof involves showing that, conditional on not obtaining recommendation gωg_{\omega}, the strategy si(ω)s_{i}(\omega) is still optimal.

  • Proof:   Recall that s¯\overline{s} is the strategy profile in which players always follow noAE\mathcal{M}^{E}_{noA}’s recommendation. We first show that all players receive (weakly) higher expected utilities under s¯\overline{s} than under EE. Consider some ωΩ\omega\in\Omega, and some realizations (g1,,gn)(g_{1},\ldots,g_{n}) of (s1(ω),,sn(ω))(s_{1}(\omega),\ldots,s_{n}(\omega)). Consider two events: If gigωg_{i}\neq g_{\omega} for all ii then without the mediator, all players obtain utility zero. The mediator here recommends gωg_{\omega} to all, leading to positive utility to all players. Conditional on this event, then, the mediator is beneficial to all players. The other event is the complement of this first event, in which case the mediator recommends players’ BNE actions. Since the mediator’s recommendation does not change players’ actions relative to EE, it also does not affect their utilities. Overall, then, all players prefer s¯\overline{s} in noAE\mathcal{M}^{E}_{noA} to EE, and so the mediator is IR.

    Next, we show that s¯\overline{s} is a BNE of ΓnoAE\Gamma^{\mathcal{M}^{E}_{noA}}. Fix ω\omega and a player ii, and suppose all other players jj play s¯j\overline{s}_{j}. Denote this profile of other players as s¯i(ω)\overline{s}_{-i}(\omega) To simplify notation, denote by =noAE\mathcal{M}=\mathcal{M}^{E}_{noA} and by (ω)i\mathcal{M}(\omega)_{i} the distribution over recommendations to player ii in state ω\omega. Also fix some gsupp(si(ω))g\in\mathrm{supp}(s_{i}(\omega)), and let α=Pr[(si(ω)g)|Pi(ω)((ω)i=g)]\alpha=\Pr\left[(s_{i}(\omega)\neq g)|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\right]. Then when the mediator recommends action gg, player ii’s expected utility from following that recommendation is

    E\displaystyle E [ui(g,s¯i(ω))|Pi(ω)((ω)i=g)]\displaystyle\left[u_{i}(g,\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\right]
    =αE[ui(g,s¯i(ω))|Pi(ω)((ω)i=g)(si(ω)g)]\displaystyle=\alpha E\left[u_{i}(g,\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)\neq g)\right]
    +(1α)E[ui(g,s¯i(ω))|Pi(ω)((ω)i=g)(si(ω)=g)]\displaystyle~~~+(1-\alpha)E\left[u_{i}(g,\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)=g)\right] (1)
    =α/n+(1α)E[ui(g,s¯i(ω))|Pi(ω)((ω)i=g)(si(ω)=g)]\displaystyle=\alpha/n+(1-\alpha)E\left[u_{i}(g,\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)=g)\right] (2)
    =α/n+(1α)E[ui(g,si(ω))|Pi(ω)((ω)i=g)(si(ω)=g)]\displaystyle=\alpha/n+(1-\alpha)E\left[u_{i}(g,s_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)=g)\right] (3)
    =α/n+1αPr[(ω)i=g|Pi(ω)(si(ω)=g)](E[ui(g,si(ω))|Pi(ω)(si(ω)=g)]\displaystyle=\alpha/n+\frac{1-\alpha}{\Pr\left[\mathcal{M}(\omega)_{i}=g|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]}\cdot\biggl{(}E\left[u_{i}(g,s_{-i}(\omega))|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]
    E[ui(g,si(ω))|Pi(ω)((ω)ig)(si(ω)=g)]Pr[(ω)ig|Pi(ω)(si(ω)=g)])\displaystyle~~~~-E\left[u_{i}(g,s_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}\neq g\right)\cap(s_{i}(\omega)=g)\right]\cdot\Pr\left[\mathcal{M}(\omega)_{i}\neq g|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]\biggr{)} (4)
    =α/n+(1α)(E[ui(g,si(ω))|Pi(ω)(si(ω)=g)]Pr[(ω)i=g|Pi(ω)(si(ω)=g)],\displaystyle=\alpha/n+\frac{(1-\alpha)\cdot(E\left[u_{i}(g,s_{-i}(\omega))|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]}{\Pr\left[\mathcal{M}(\omega)_{i}=g|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]}, (5)

    where (1) follows from the law of total expectation, (2) follows since si(ω)(ω)is_{i}(\omega)\neq\mathcal{M}(\omega)_{i} implies that the mediator recommended gωg_{\omega} to all players, (3) follows since si(ω)=(ω)is_{i}(\omega)=\mathcal{M}(\omega)_{i} implies that the mediator recommended the equilibrium action to all players, (4) follows from another application of the law of total expectation, and (5) follows since (ω)ig=si(ω)\mathcal{M}(\omega)_{i}\neq g=s_{i}(\omega) implies that ggωg\neq g_{\omega}.

Similarly, ii’s expected utility from not following the mediator’s recommendation gg and playing ggg^{\prime}\neq g instead is

E\displaystyle E [ui(g,s¯i(ω))|Pi(ω)((ω)i=g)]\displaystyle\left[u_{i}(g^{\prime},\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\right]
=αE[ui(g,s¯i(ω))|Pi(ω)((ω)i=g)(si(ω)g)]\displaystyle=\alpha E\left[u_{i}(g^{\prime},\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)\neq g)\right]
+(1α)E[ui(g,s¯i(ω))|Pi(ω)((ω)i=g)(si(ω)=g)]\displaystyle~~~+(1-\alpha)E\left[u_{i}(g^{\prime},\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)=g)\right]
=(1α)E[ui(g,s¯i(ω))|Pi(ω)((ω)i=g)(si(ω)=g)]\displaystyle=(1-\alpha)E\left[u_{i}(g^{\prime},\overline{s}_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)=g)\right]
=(1α)E[ui(g,si(ω))|Pi(ω)((ω)i=g)(si(ω)=g)]\displaystyle=(1-\alpha)E\left[u_{i}(g^{\prime},s_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\cap(s_{i}(\omega)=g)\right]
=1αPr[(ω)i=g|Pi(ω)(si(ω)=g)](E[ui(g,si(ω))|Pi(ω)(si(ω)=g)]\displaystyle=\frac{1-\alpha}{\Pr\left[\mathcal{M}(\omega)_{i}=g|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]}\cdot\biggl{(}E\left[u_{i}(g^{\prime},s_{-i}(\omega))|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]
E[ui(g,si(ω))|Pi(ω)((ω)ig)(si(ω)=g)]Pr[(ω)ig|Pi(ω)(si(ω)=g)])\displaystyle~~~~-E\left[u_{i}(g^{\prime},s_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}\neq g\right)\cap(s_{i}(\omega)=g)\right]\cdot\Pr\left[\mathcal{M}(\omega)_{i}\neq g|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]\biggr{)}
(1α)(E[ui(g,si(ω))|Pi(ω)(si(ω)=g)]Pr[(ω)i=g|Pi(ω)(si(ω)=g)].\displaystyle\leq\frac{(1-\alpha)\cdot(E\left[u_{i}(g^{\prime},s_{-i}(\omega))|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]}{\Pr\left[\mathcal{M}(\omega)_{i}=g|P_{i}(\omega)\cap(s_{i}(\omega)=g)\right]}.

Since EE is an equilibrium, it holds that

E[ui(g,si(ω))|Pi(ω)si(ω)=g)]E[ui(g,si(ω))|Pi(ω)si(ω)=g)],E\left[u_{i}(g,s_{-i}(\omega))|P_{i}(\omega)\cap s_{i}(\omega)=g)\right]\geq E\left[u_{i}(g^{\prime},s_{-i}(\omega))|P_{i}(\omega)\cap s_{i}(\omega)=g)\right],

and so

E[ui(g,si(ω))|Pi(ω)((ω)i=g)]E[ui(g,si(ω))|Pi(ω)((ω)i=g)].E\left[u_{i}(g,s_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\right]\geq E\left[u_{i}(g^{\prime},s_{-i}(\omega))|P_{i}(\omega)\cap\left(\mathcal{M}(\omega)_{i}=g\right)\right].

Additionally, note that if (ω)i=g\mathcal{M}(\omega)_{i}=g for some gsupp(si(ω))g\not\in\mathrm{supp}(s_{i}(\omega)) then player ii is certain that g=gωg=g_{\omega}, and so following the mediator’s recommendation is optimal. Thus, in both cases, following the mediator’s recommendation is optimal, and so s¯\overline{s} is a BNE.

Finally, we show that noAE\mathcal{M}^{E}_{noA} is optimal with respect to EE. In any state ω\omega, the sum of players’ utilities conditional on that state is at most 1. This maximal utility is achieved whenever at least one player plays gωg_{\omega}. Under noAE\mathcal{M}^{E}_{noA}, at least one player plays gωg_{\omega} in every ω\omega. Hence, the sum of players’ (unconditional) expected utilities is W(noAE)=1W(\mathcal{M}^{E}_{noA})=1, the maximum possible in the game.    

7.2.2 Many Players Against Amazon

Suppose now that there are nn players and an Amazon, that there is jointly complete information, and that vv is an arbitrary profile of base values.

Claim 3

The mediator \mathcal{M} that is fully revealing to all players i{1,,n}i\in\{1,\ldots,n\} and always recommends gωg_{\omega} is IR, IC, and optimal if and only if vi1/(n+1)v_{i}\leq 1/(n+1) for all ii.

  • Proof:   The strategy s¯\overline{s} in a fully revealing \mathcal{M} is a BNE, since each player plays gωg_{\omega}, which is dominant. It is IR, since each player ii’s expected utility in \mathcal{M} is 1/(n+1)vi1/(n+1)\geq v_{i}. Finally, it is optimal: Whenever kk players correctly choose gωg_{\omega}, the sum of players’ utilities is k/(k+1)k/(k+1). Under \mathcal{M}, the sum of players’ utilities is n/(n+1)n/(n+1), which is the most they can jointly obtain.

    The reverse direction holds by the observation that if a player has base value vi>1/(n+1)v_{i}>1/(n+1) then her IR constraint is violated under a fully revealing mediator.