Coopetition Against an Amazon
Abstract
This paper studies cooperative data-sharing between competitors vying to predict a consumer’s tastes. We design optimal data-sharing schemes both for when they compete only with each other, and for when they additionally compete with an Amazon—a company with more, better data. We show that simple schemes—threshold rules that probabilistically induce either full data-sharing between competitors, or the full transfer of data from one competitor to another—are either optimal or approximately optimal, depending on properties of the information structure. We also provide conditions under which firms share more data when they face stronger outside competition, and describe situations in which this conclusion is reversed.
1 Introduction
A key challenge to firms competing in today’s electronic marketplace is competition against Big Tech companies that have considerably more data and so better predictive models. One way in which smaller firms may overcome this hurdle and survive or even thrive in such a market is to engage in coopetitive strategies—namely, to cooperate with other small firms that are its competitors. Such coopetitive strategies have increasingly become a field of study by both academics and practitioners (see, for example, Brandenburger and Nalebuff (2011) and Bengtsson and Kock (2000)), but they largely focus on industrial applications such as healthcare, IT, and service industries. In this paper, we study coopetition between e-commerce companies, and focus on the possibility of data sharing as a way to deal with their data imbalance vis-à-vis Big Tech.
Such coopetitive data-sharing can be undertaken by the firms themselves or by external service-providers. To facilitate the former, the newly burgeoning area of federated machine learning has as its goal the design of mechanisms that generate predictive models for firms based on such shared data (Yang et al., 2019). The assumption underlying federated learning is that these firms actually want predictive models based on all shared data, but in competitive scenarios this need not be the case. Although each firm certainly desires its own model to be as predictive as possible, it most likely does not wish the same for its competitors. So when is full data-sharing, leading to maximally predictive models for all firms, optimal? When is partial or no sharing better? And do these answers differ when firms compete against an Amazon?
In addition to using federated machine learning, coopetitive data-sharing can also be facilitated by an external service-provider. While tools for data sharing are prevalent—and include platforms such as Google Merchant and Azure Data Share—they are currently undergoing even further development. For example, in its European Strategy for Data, the European Commission plans the following: “In the period 2021-2027, the Commission will invest in a High Impact Project on European data spaces and federated cloud infrastructures. The project will fund infrastructures, data-sharing tools, architectures and governance mechanisms for thriving data-sharing and Artificial Intelligence ecosystems” (European Commission, 2020). But just as with federated learning, the potential benefits from data sharing may arise under partial, rather than full, sharing. For instance, one industry white paper urges service providers to
“…analyze the combined, crowdsourced data and generate benchmark analyses and comparative performance reports. Each participating client gains insights that it could not otherwise access, and each benefits from the service-host provider’s ability to slice and dice the aggregated data and share the results that are relevant to each client” (Loshen, 2014).
What is the optimal way for such service providers to combine, slice and dice, and share e-commerce firms’ data in order to facilitate successful coopetition against an Amazon?
Regardless of how coopetitive data-sharing is achieved—via federated learning or through a service provider—the answer to whether or not firms desire maximally predictive models for all depends on how these predictions are eventually used. If the firms involved act in completely unrelated markets, then of course full data-sharing is optimal—it yields a maximally predictive model for a firm, and that firm is not harmed by others’ better predictions. This is no longer true if the firms are competitors, since then a firm may be harmed by an improvement in its competitors’ predictions. In this case, whether or not the tradeoff is worthwhile depends on the details of the market in which firms compete.
In this paper we focus on one such market, which is motivated by recommender systems for online advertising and Long Tail retail. In our model, there are many types of consumers, and firms use their data to infer and take a tailored action for each consumer type. In the online advertising market, for example, firms use their data to personalize advertisements in order to maximize the probability that a consumer clicks on their ad. Data sharing is common in this market, and platforms such as Google Merchant and Azure Data Share specifically facilitate such sharing across clients in order to improve personalization and increase clickthrough rates.
For another concrete application, consider retailers in a Long Tail market—a market with an enormous number of low-demand goods that collectively make up substantial market share—interested in finding consumers for their products. Success in such markets crucially relies on retailers’ abilities to predict consumers’ tastes in order to match them to relevant products (Anderson, 2006; Pathak et al., 2010). On their own, smaller firms can be destroyed by giants with vastly greater amounts of data and thus a clear predictive advantage. In cooperation with other small firms, however, this data imbalance may be mitigated, possibly granting smaller firms a fighting chance. (We do not discuss here antitrust issues surrounding data sharing; the reader is referred to Martens et al. (2020) for a discussion thereof in the context of the EU’s European Strategy for Data.)
In this paper we study such cooperation between competing firms, and design data-sharing schemes that are optimal—ones that maximize total firm profits subject to each firm having the proper incentives to participate. More specifically, we study two settings, one without and the other with an Amazon. In the former, in which smaller firms compete only with one another, we show that coopetition is beneficial to the extent that firms can share data to simultaneously improve their respective predictions. In the optimal data-sharing scheme firms benefit from sharing because it allows them to better predict the tastes of consumers for whom no firm predicts well on its own.
In the latter setting, in which smaller firms compete with one another but also with an Amazon, sharing data about consumers for whom no firm predicts well can be beneficial, but is not optimal. We show that, in the optimal scheme, firms share data with others and weaken their own market positions on some market segments, in exchange for receiving data from others and strengthening their market positions on other segments.
The following simple example illustrates our model and previews some of our results. The example is framed as retailers offering goods to consumers who may make a purchase, but can also be interpreted as advertisers displaying advertisements to consumers who may click on the ads. In either case, suppose the market consists of two goods, and , that consumers may desire. Each consumer is described by a feature vector in that determines that consumer’s taste: Consumers of types 00 and 11 are only interested in , whereas consumers of types 01 and 10 are only interested in . A priori, suppose the distribution of consumers in the market is such that an fraction are of type 00, a fraction of type 01, a fraction of type 10, and a fraction of type 11.
When a consumer shows up to the market, a retailer may offer him one of the goods (as a convention, we use masculine pronouns for consumers and feminine pronouns for retailers/players). If the retailer has data about the consumer’s type, she will offer the good desired by that consumer. Assume that when a consumer is offered the correct good he makes the purchase, and that this leads to a profit of 1 to the retailer. If there are two retailers offering this good to the consumer he chooses one at random from whom to make the purchase. A retailer whose good is not chosen gets profit 0.
Suppose now that there are two retailers pursuing the same consumer, but that they do not fully know the consumer’s type. Instead, the first retailer only knows the first bit of the consumer’s type, and the second retailer only knows the second bit. This situation is summarized in Figure 1, where the first retailer knows the row bit and the second knows the column bit. Which goods should the competing retailers offer the consumer? For this example, suppose for simplicity that . Then each of the retailers has a dominant strategy: If the bit they know is 0, offer , and if the bit they know is 1, offer .
To see that this is a dominant strategy, consider for example the row retailer, and suppose she learns that a consumer’s row bit is 0. She thus knows that the correct good is with probability and with probability . Her utility, however, depends also on the good offered by the column retailer. The row retailer’s worst-case utility from offering is , which occurs when the column player also offers . Her best case utility from offering is . Since we assumed , offering is best for the row player regardless of the other retailer’s offer, and is thus dominant. The analyses for the case in which the row retailer’s bit is 1, as well as for the column retailer’s strategy, are similar.
0 | 1 | |
---|---|---|
0 | ||
1 |
0 | 1 | |
---|---|---|
0 | ||
1 |
The result from deployment of these dominant strategies is that consumers of type 00 will be offered the correct good by both retailers, and will thus choose one at random. Consumers of types 01 and 10 will be offered the correct good by only one of the retailers, whereas consumers of type 11 will not make a purchase as they will not be offered the correct good. The expected profit of the first retailer will thus be , and of the second retailer .
Does data sharing improve retailers’ profits? Suppose retailers could share their respective information with one another, so that both always knew the consumer’s type. This would lead to both always offering the correct good, and hence to expected profits of for each. However, this would be detrimental to the second retailer whenever , since her profits with data sharing would be lower than without. Such cooperation is thus not individually rational – the second retailer will not want to cooperate with the first.
Instead, suppose there is a mediator—a trusted third-party that may represent either a service provider or a cryptographic protocol (such as a federated learning algorithm) run by the two retailers. The European Commission calls mediators common data spaces European Commission (2020), and describes them as “third-parties [who] may act as intermediaries and apply new technologies and ways of organising markets in order to…enable transactions that were previously not feasible. They can be private, public or community organisations that are neutral with respect to data uses…” (Martens et al., 2020). Given such a mediator, consider the following data-sharing scheme: both retailers share their data with the mediator, who then passes along the data to all retailers only if the consumer’s type is 11. If the consumer is not of type 11 then each retailer has her original data, and additionally infers that the consumer’s type is not 11 (for otherwise she would have learned that the type is 11).
How does this scheme affect retailers’ strategies? Clearly, when retailers learn that the consumer’s type is 11, both offer . What happens if a retailer learns that the consumer’s type is not 11? Consider the row retailer. If her bit is 0, then learning that the consumer’s type is not 11 does not provide any new information, and so she still has the same dominant strategy of offering . If her bit is 1, however, then she learns that the consumer’s type must be 10. But note that in this case, her best strategy is to offer , which is the same as her dominant strategy absent a mediator. Thus, the mediator changes the row retailer’s behavior only when the consumer is of type 11. A similar analysis and conclusion hold for the column retailer.
Since retailers offer the same goods with and without data sharing in all cases except when the consumer is of type 11, this scheme changes retailer’s profits only in this latter case. In particular, it leads to an additional (ex ante) profit of for each retailer beyond her original profit—with probability the consumer is of type 11, in which case the retailers split the additional surplus of 1—and is thus beneficial to both. Observation 1 shows that such a scheme is, in fact, not only individually rational but also optimal.
Suppose now that the two retailers are also competing against an Amazon for the consumer. Since the Amazon is a giant it has more data, and, in particular, we assume that it has complete information about each consumer’s type. In this three-way competition, the original dominant strategies of our two smaller retailers do not perform as well: They lead to profits of and to the first and second retailer, respectively, since when they offer the correct good the consumer now chooses amongst up to 3 retailers. The mediator described above, which reveals information when the type is 11, leads to higher profits, since now there is an additional to each of the smaller retailers. However, that mediator is no longer optimal, and the retailers can actually do better.
To see this, observe that, conditional on consumer type 01, the total profit to the small retailers is , since only the second makes the correct offer and then competes with the Amazon. In contrast, if both retailers were to know the type and offer the correct good, then their total profit would be , namely each. This is harmful to the second retailer as it involves a loss of profit, but a gain for the first and the sum. But the second retailer can be compensated by getting data from the first elsewhere, for instance on consumer type 10.
For simplicity of this example, suppose that , and consider a mediator that facilitates full data-sharing, in which both retailers learn each other’s data and thus have complete information about consumers’ types. Here, the profit of each is , so both gain from this data sharing. Furthermore, the total profit of the retailers is , which is the maximal utility they can obtain when competing against an Amazon. Thus, this data-sharing scheme is individually rational and optimal. Observe that it leads to higher retailer welfare than the optimal scheme absent a mediator.
Full data-sharing is not always individually rational, however. If is much greater than , then the second retailer is not sufficiently compensated by the first for sharing data about consumer 01. The second retailer will consequently be harmed by such data sharing, and so the scheme will not be individually rational. However, in Theorem 4 we show that there is a different data-sharing scheme that is individually rational and optimal, a scheme in which the first retailer shares all her data and the second shares some of her data. Overall, we show that such data-sharing coopetition against an Amazon is beneficial to the small retailers.
1.1 Contribution
Our model is more general than the simple example above, and involves many goods and many types of consumers. For the example it was useful to think of each consumer as having a feature vector describing his type, but in the model we take a more general approach that allows for a wider class of information structures. Furthermore, in general the retailers will not have dominant strategies, either with or without a mediator. Instead, we suppose that, absent a mediator, players play an arbitrary Bayesian Nash equilibrium, and construct mediators that lead to higher expected utilities in equilibrium.
In Section 3, we design optimal mediators for coopetitive data-sharing. In addition to maximizing total retailer profits, all our mediators are individually rational—retailers attain higher utilities when they use the mediators—and incentive compatible—in equilibrium, each retailer’s best strategy is to follow the mediators’ recommendations. One of our main results is that the optimal schemes are simple. In particular, instead of relying on the details of retailers’ data or on their strategies in equilibrium sans data-sharing (as in the introductory example above), our mediators consist of threshold rules that probabilistically induce either full data-sharing between retailers, or the full transfer of data from one retailer to another.
The analysis in Section 3 retains one of the assumptions present in the example above—that the joint data of the small retailers is sufficient to uniquely identify each consumer’s type. In Section 4 we drop this assumption, leading to a new set of challenges. This is because, when the assumption holds, there is no conflict between welfare maximization and equilibrium, and so the main hurdle in designing optimal mediators is the individual rationality constraint. Without the assumption, however, participants’ incentive compatibility constraints impose a limit on the total welfare that can be achieved. Nonetheless, our main results in this section are that variants of the simple mediators from Section 3 are approximately optimal here as well. We also provide examples showing that our approximation factor is tight.
Finally, in Section 5 we delve into the intriguing question of whether players share more data in the presence of an Amazon or in its absence. First, under the assumption that the joint data of the small retailers is sufficient to uniquely identify each consumer’s type, we show that if data sharing is strictly beneficial in the absence of an Amazon, then it is also strictly beneficial in its presence. This confirms the intuition that players share more data when facing stronger outside competition. However, we also show that this conclusion may be reversed when the assumption does not hold. In this latter case, we show that sometimes data sharing can be strictly beneficial in the absence of an Amazon, but not in its presence. We show that the reason this may happen depends on whether or not players’ equilibrium considerations conflict with welfare maximization.
1.2 Related Literature
Most broadly, our paper contributes to a burgeoning literature on competition in prediction and machine learning. Within this literature, papers such as Mansour et al. (2018), Ben-Porat and Tennenholtz (2019), and Feng et al. (2019) study different models of learning embedded in settings where participants compete with others in their predictions. They take participants’ data as given, and point to the effect competition has on optimal learning algorithms. Our focus, in contrast, is on the effect of data sharing on competitive prediction.
Because of its focus on data sharing, our paper is also related to the more established literature on strategic information-sharing, a literature that focuses on a number of distinct applications: oligopolistic competition (Clarke, 1983; Raith, 1996), financial intermediation (Pagano and Jappelli, 1993; Jappelli and Pagano, 2002; Gehrig and Stenbacka, 2007), supply chain management (Ha and Tong, 2008; Shamir and Shin, 2016), price discrimination (Liu and Serfes, 2006; Jentzsch et al., 2013), and competition between data brokers (Gu et al., 2019; Ichihashi, 2020). Much of this literature revolves around the question of whether it is beneficial for participating firms to pool all their data, or whether they would prefer to pool only some or none at all. As Bergemann and Morris (2013) demonstrate, however, more finely tuned data sharing can be beneficial even when full or partial pooling is not, and so the existing schemes do not exhaust the potential benefits of data sharing. In this paper we analyze precisely such finely tuned sharing.
A different paper that does consider more finely tuned sharing is that of de Zegher and Lo (2020), who study an infinitely repeated setting in which firms compete for market share. Their main result is the construction of a data sharing scheme that Pareto improves all participants’ welfare. Like our results, their scheme also only reveals partial information to participants. Unlike our approach, however, de Zegher and Lo (2020)’s scheme relies on the repeated nature of the interaction, and uses folk-theorem-type arguments to show that cooperation can be sustained. In addition, their focus is on data sharing between many small firms, and they do not consider the possible presence of an Amazon.
In terms of modeling and techniques, our paper falls into the literature on information design. Information design, recently surveyed by Bergemann and Morris (2019), is the study of how the allocation of information affects incentives and hence behavior, with a focus on the extent and limits of purely informational manipulation. The information design problem encompasses work on communication in games and on Bayesian persuasion. When the mediator is assumed to have only the information held by the players, or only the information they are willing to share with him, the problem maps to the one studied in the literature on communication in games. The goal of these studies is to characterize the equilibrium outcomes achievable when players are allowed to communicate prior to playing a fixed game, and where communication is captured by players’ interaction with a mediator. Myerson (1991) and Forges (1993) provide useful overviews. Our paper builds on this model by endowing the mediator with the specific aim of maximizing (some of) the players’ utilities.
A different setting, called Bayesian persuasion (Kamenica and Gentzkow, 2011), is one in which only the mediator (here called the sender) has payoff-relevant information, and in which he can commit to a particular information structure prior to observing that information. Initial work on Bayesian persuasion focused on the case of a single sender and a single player (Kamenica and Gentzkow, 2011), but more recent research also consists of settings with multiple senders (such as Gentzkow and Kamenica (2016)) and multiple players. Our paper is most closely related to the latter, and specifically to Bergemann and Morris (2016) and Galperti and Perego (2018), who develop a linear programming approach to study the effect of different information structures on the Bayes correlated equilibria of the subsequent game, as well as Mathevet et al. (2020), who extend the belief-based approach of Kamenica and Gentzkow (2011) to study optimal information structures under different solution concepts.
A closely related strand of the literature, recently surveyed by Bergemann and Bonatti (2019), focuses on the sale of information by a data broker. Most of this work differs from our paper in that it focuses on a market with one-sided information flow, with information going only from one party to another, whereas we study a market where information flow is bidirectional across firms. Nonetheless, the insights arising from that literature are related to ones we develop in this paper. For example, Bimpikis et al. (2019) consider a monopolistic data broker who sells data to firms who compete in a downstream market. A major insight in that paper is that the amount of information optimally sold by the data broker depends on the nature of downstream competition, and, in particular, that more (resp., less) data is sold if that competition features strategic complements (resp., substitutes). While our paper studies data sharing as opposed to data sale, our results also shed light on the effect the nature of competition has on the amount of data shared. In our model firms’ actions are strategic substitutes—when more firms offer the correct good to a particular consumer (e.g., due to the presence of an Amazon), the value of offering the correct good decreases. Our results indicate that sometimes there is more data sharing in the absence of an Amazon and hence when strategic substitutes are weaker, which is consistent with the insight of Bimpikis et al. (2019). However, we also show that sometimes the opposite holds.
There is also a relationship between information sale and information sharing, which is pointed out by Bergemann and Bonatti (2019). These authors relate their general model of a market for information to incentives for information sharing, and point to the study of finely tuned sharing schemes as an open problem.
Our paper is also related to a set of papers that focuses on designing mediators to achieve various goals, such as to improve the incentives of players, make equilibria robust to collusion, or implement correlated equilibria while guaranteeing privacy Monderer and Tennenholtz (2004, 2009); Kearns et al. (2014). The first two differ from our work in that they make stronger assumptions about the mediator’s capabilities, such as changing payoffs or limiting player actions, and the third focuses on a setting with many players that is quite different from our own.
Finally, our work is conceptually related to research on federated learning, and in particular on cross-silo federated learning Kairouz et al. (2019). This framework consists of a set of agents with individual data whose goal is to jointly compute a predictive model. A recent emphasis within federated learning is on incentivizing the agents to participate, sometimes through monetary transfers Zhan et al. (2021) and sometimes by providing different models to different agents Lyu et al. (2020). The focus of these papers is on fairness—agents that provide more data should be compensated more generously. Our paper differs, in that agents are assumed to utilize the resulting models in some competition, and this subsequent competition drives agents’ incentives to participate. Our work is thus orthogonal to that surveyed in Zhan et al. (2021): we do not focus on the algorithmic aspects of computing a joint model, but rather on the incentives and joint benefits of participating even when other agents are competitors.
2 Model and Preliminaries
There is a set of goods and a population of consumers interested in obtaining one of them. Each consumer has one of a finite set of types, , that describe the good in which he is interested. There are three players who compete for consumers: two regular players indexed 1 and 2, and an Amazon, a player indexed 0. We will separate the analysis to two settings: first, when the Amazon player 0 is not present and players 1 and 2 compete only with one another; and second, when they additionally compete with Amazon. The model and definitions here apply to both settings.
Player 0 (if present) has complete information of the consumer’s type. In Section 3 we assume that players 1 and 2 have jointly complete information—that, when combined, their respective data uniquely identifies each consumer’s type—but that each player on her own may only be partially informed. In Section 4 we remove this assumption.
To model this informational setting, we represent players’ data using the information partition model of Aumann (1976): Each player is endowed with a partition of , where is a set of disjoint, nonempty sets whose union is . For each we denote by the unique element of that contains , with the interpretation that if the realized type of consumer is , each player only learns that the type belongs to the set .
Framing the example from the introduction within this model would associate with and the partitions , , , and .
In this model, player 0’s complete information means that for all , and players 1 and 2’s jointly complete information means that for all . We further assume that, before obtaining any information, all players have a common prior over .
To model data sharing between players 1 and 2 we suppose there is a mediator that gathers each player’s information and shares it with the other in some way. Formally, a mediator is a function , where is an arbitrary message space. The range is a distribution over pairs of messages, where the first (respectively, second) is the message sent to player 1 (respectively, player 2).
We begin with an informal description of the game: Players offer consumers a good, and consumers choose a player from whom to acquire the good. Consumers are single-minded: For each there is a unique such that the consumer will only choose a player who offers good . If there is more than one such player, the consumer chooses uniformly at random between them. One interpretation of this consumer behavior is that he chooses by “satsificing”—making a random choice among options that are “good enough” Simon (1956), where represents these good-enough options. See Ben-Porat and Tennenholtz (2019) for a similar approach.
We assume that prices and costs are fixed, and normalize a player’s utility to 1 if she is chosen and to 0 otherwise. So, for example, in the online advertising application of our model, consumers are assumed to click on a random ad amongst those ads that are most relevant; once they click, the expected profit to the advertiser (conversion rate times profit from a sale) is 1. The normalization to 1 is actually without loss of generality. If we had a different profit for each consumer type , then we could change every to 1, modify the prior probability of type to , and then renormalize the prior to be a proper distribution. This would leave all of our analysis intact.
The following is the order of events, given a fixed mediator . It applies both to the case in which there are only two players and to the case in which there is an additional Amazon player, indexed 0.
-
1.
Each of players 1 and 2 chooses whether or not to opt into using the mediator.
-
2.
If one or both players did not opt into using the mediator, then each player obtains her base value, the utility (described below).
-
3.
If both players opted into using the mediator, then:
-
(a)
A consumer of type is chosen from with prior distribution .
-
(b)
If present, player 0 learns .
-
(c)
Messages are chosen from the distribution , and each learns and .
-
(d)
Each player simultaneously chooses a good to offer the consumer, and then the consumer chooses a player from whom to obtain the good.
-
(a)
There are several ways to interpret the base values and . Our main interpretation is that these are the expected utilities of the players in the game without data sharing. To formalize this, consider the unmediated game , where is the set of participating players, and is either or , is the set of actions, is the information of player (which consists of the partition element of the realized type), is ’s utility function (described below), and is the common prior over . Given this Bayesian game, let be the expected utilities in some Bayesian Nash equilibrium (BNE) of —a profile in which is the best strategy for each player conditional on her information and the assumption that the other player plays the strategy .
Although this is the main interpretation of the base values, our model and results permit additional and more general interpretations as well. One additional interpretation is to suppose each is the minimax value of player in the unmediated game. In this second interpretation, we could imagine player “punishing” player if the latter does not opt into using the mediator, by playing a strategy that minimizes the latter’s payoff (off the equilibrium path). A third, more general interpretation is that the base values also depend on factors outside of the specific game, such as firm size, customer base, and so on, in addition to the primitives of the game .
In our constructions of mediators in Sections 3 and 4, the interpretation of will not matter—our mediators will be optimal given any such values, regardless of whether they are derived endogenously as the equilibrium utilities of the game absent a mediator, or whether they arise exogenously from factors outside of the specific game. However, in Section 5, in which we compare data sharing across two different environments—with and without a mediator—we will stick with the main interpretation of base values as equilibrium payoffs absent a mediator. This is because, when we compare two different environments, the payoffs players could attain in equilibrium without a mediator will differ across these environments.
Now, as noted above, data sharing is modeled by a mediator . Without loss of generality we invoke the revelation principle and assume that , the set of possible goods. We interpret the messages of the mediator as recommended actions to the players, one recommendation for each player.
Formally, if both players 1 and 2 opt into using the mediator , then all play the mediated Bayesian game . is the set of participating players, and is either or . The function denotes the information of players in the game for each state, where is the set of possible pieces of information a player may have—a partition element and a message from the mediator. For player 0, this information consists of the realized type of consumer only, and so (where the means player 0 gets no message from the mediator). For players 1 and 2, the information consists of both the partition element of the realized type of consumer and the action recommended to her by the mediator, and so .
Next, each player’s set of actions in the mediated game is , and her utility function . The latter is equal to 0 if player ’s action , and otherwise it is equal to , where is the total number of players who play action . The nonzero utility corresponds to utility 1 if a consumer chooses the player’s good, which occurs if a player offers the consumer the correct good and the consumer chooses uniformly amongst all players that do so. We often write when the expectation is over the choice of , in which case we omit the dependence of on for brevity. Finally, (mixed) strategies of players in are functions . We also denote by .
An important note about the mediator is in order. We assume that when players opt into participating, they truthfully reveal their data to him. Players’ strategic behavior is relevant in their choice of opting in or not, and then in whether or not they follow the mediator’s recommendation in their interaction with the consumer. While truthful reporting is clearly restrictive, it is a natural assumption in our context—for instance, when service providers act as mediators, they are typically also the ones who host retailers’ data in the cloud, and so already have the (true) data on their servers. Similarly, federated machine learning algorithms work based on the assumption that participants share their true data. Furthermore, truthful reporting can be justified by the repeated nature of the interaction. Observe that if a firm is not truthful in its reporting, the other firm will discover this at some point, as the mediator’s recommendations will not pan out as well as they should. Thus, if firms have the ability to opt out of using the mediator at some point in the future, then each firm can use the threat of opting out to incentivize the other to report truthfully.
Individual rationality, incentive compatibility, and optimality
A first observation is that player 0, if present, has a dominant strategy:
Lemma 1
In any game , player 0 has the unique dominant strategy .
The proof is straightforward: playing action leads to positive utility to player 0, regardless of the other firms’ actions. Playing any other action leads to utility zero. This implies that always offering the correct good is the unique dominant strategy for player 0. We will thus take that player’s strategy as fixed throughout.
We will be interested in designing mediators that players actually wish to utilize, and so would like them to satisfy two requirements: first, that players want to follow the mediator’s recommendations, and so that following these recommendations forms an equilibrium; and second, that players prefer their payoffs with the mediator over their payoffs without (the base values). Formally, we say that:
-
•
is incentive compatible (IC) if the strategy profile in which players 1 and 2 always follow ’s recommendation is a BNE of .
-
•
is individually rational (IR) if the expected utility of each player under in is (weakly) greater than .
Observe that if a mediator is both IC and IR, then in equilibrium players will opt in and then follow the mediator’s recommendations.
Finally, as we are interested in the extent to which data sharing benefits players 1 and 2, we will study optimal mediators, namely, ones in which the sum of these two players’ utilities is maximal subject to the IC and IR constraints. To this end, denote by the sum of players 1 and 2’s expected utilities under in . Then:
Definition 1
Mediator is optimal if for any other that is IC and IR.
3 Jointly Complete Information
3.1 Coopetition Without an Amazon
We begin our analysis with the case of jointly complete information and without an Amazon. We first consider the question, under what base values does there exist an IR, IC mediator? We then turn to our main result for the section, the construction of an optimal mediator.
3.1.1 When does an IR, IC mediator exist?
We begin with a simple observation. Since for any mediator and conditional on any realized type the total payoff to players is at most 1, it must be the case that . Thus, if , then there does not exist any IR mediator.
However, even if , there may not exist a mediator that is both IR and IC. In the example from the introduction, for instance, in the simple case without an Amazon and where , the row retailer can guarantee herself a payoff of by ignoring the mediator and playing her dominant actions. This implies that the most the column retailer can obtain is . Any mediator that leads to a higher payoff to the column retailer must therefore fail to satisfy the IC constraints.
Theorem 1 below generalizes the bound in this example. But first, some notation: Consider the strategy of player that, for every , chooses one of the goods that is most likely to be correct: . Note that is a best-response to a player who always chooses the correct good . Furthermore, let be the overall probability that chooses the correct good: .
Theorem 1
Suppose . Then an IR, IC mediator exists only if and .
In words, the theorem states that an IR, IC mediator exists only if each player’s base value is bounded above by her utility in the hypothetical situation in which she always offers the correct good, and the other player best-responds conditional only on (in addition to the obvious constraint ). In Theorem 2 below we will see that these conditions are also sufficient.
We note that if the base values are the equilibrium expected utilities of the game without a mediator, then the conditions in Theorem 1 are always satisfied. To see this, observe that the mediator that recommends to each player her equilibrium action (and nothing more) is IC, by the definition of a BNE. It is also IR, since players’ utilities are the same whether or not they opt into using this mediator.
-
Proof: Proof. The condition must hold for any IR mediator , since . Now, suppose towards a contradiction that there exists an IC mediator for which . This implies that, under , the expected utility of player is strictly less than . However, player then has a profitable deviation to . When playing , on each partition element player will get utility at least
and exactly this utility whenever always chooses the correct good . Since this is a profitable deviation, the mediator is not IC, a contradiction.
3.1.2 An optimal mediator
We now describe an optimal mediator, at the same time showing that the conditions on in Theorem 1 are also sufficient. We begin with a simple observation:
Observation 1
If is IR, IC, and , then is optimal.
This observation implies that the mediator in the example of the introduction, for the case without an Amazon, is optimal (when the base values are retailers’ utilities under their dominant strategies). One drawback of that mediator, however, is that it depends on the retailers’ strategies in the unmediated game—in particular, it utilizes the fact that, in equilibrium, neither retailer offers the correct good in the bottom-right cell of the matrix. Our aim here is to design a mediator that does not depend on such strategies, but only on players’ respective information and base values.
The idea underlying the mediator we construct is the following. Suppose . Then if , then the mediator will facilitate full data-sharing, and so each player will offer the correct good in every state. If then player will obtain all the information and player will obtain only partial information: she will get recommendation with probability less than 1, and otherwise will get recommendation . We now formally describe the mediator, and then show that it is indeed IR, IC, and optimal.
Theorem 2
For any feasible , the mediator above is IR, IC, and optimal.
-
Proof: First, consider the case in which . In this case, strategy in leads to expected utility to each player, and so the mediator is IR. Furthermore, the strategy is a BNE, since each player always chooses , which is dominant, and so the mediator is also IC. Finally, since in this case , Observation 1 implies that the mediator is optimal.
Next, suppose . Observe that, conditional on line 10 being activated, the expected utility of player is , whereas conditional on line 11 or 12, the expected utility of player is (since with probability she chooses the correct good, and player always chooses the correct good). Thus, overall the expected utility of player is
Since player always chooses the correct good, it must be the case that , and so the utility of player is . Finally, since , this implies that the utility of player is at least . Thus, the mediator is IR.
Next, observe that in is dominant for player , since she always chooses . It is also optimal for player , since she either also chooses , or chooses the best response conditional on her information . Thus, the strategy is a BNE, and so the mediator is IC.
Finally, since , Observation 1 implies that the mediator is optimal.
3.2 Coopetition Against an Amazon
In this section we analyze the game with an Amazon. As in Section 3.1, we first consider the question, under what base values does there exist an IR, IC mediator? We then turn to our main result for the section, the construction of an optimal mediator.
3.2.1 When does an IR, IC mediator exist?
Once again, we begin with a simple observation. Since for any mediator and any type the total welfare of players 1 and 2 is at most (since Amazon always knows the type, offers the correct good, and gets at least of the surplus), it must be the case that . Thus, if , then there does not exist any IR mediator.
However, and again as above, even if there may still not exist a mediator that is both IR and IC. In the example from the introduction, for instance, in the simple case with an Amazon and where , the row retailer can guarantee herself a payoff of by ignoring the mediator and playing her dominant actions. This implies that the most the column retailer can obtain is . Any mediator that leads to a higher payoff to the column retailer must therefore fail to satisfy the IC constraints.
The result below consists of bounds on the base values under which an IR, IC mediator exists.
Theorem 3
Suppose . Then an IR, IC mediator exists only if and .
As with Theorem 1, we note that if the base values are the equilibrium expected utilities of the game without a mediator, then the conditions in Theorem 3 are satisfied. Again, this follows from the observation that the mediator that recommends to each player her equilibrium action is both IR and IC.
Before proving the theorem we develop some notation and intuition. For any mediator and player let and . With this notation, the expected utility of player under in is : For any , if player offers and the other player, , does not, then the consumer chooses between players and 0, and so each gets . If both offer , the consumer chooses between players , , and 0, and so each gets .
An IR, IC mediator is one for which is a BNE, and in which for both players. In addition to the incentive constraints, we also have the constraints , and all three variables non-negative.
Finally, the total welfare of players 1 and 2 in is , and so an optimal mediator is one that maximizes this sum subject to the IC and IR constraints above. Denote the linear program above, disregarding the IC constraint, as :
To prove Theorem 3, we need to identify conditions on under which the LP is feasible. To do this, it will be helpful to begin with some lemmas that characterize the LP’s optimal solution:
Lemma 2
Suppose and that is feasible. Then any optimal solution to satisfies:
-
(a)
.
-
(b)
.
-
(c)
If then ’s IR constraint binds: .
-
Proof: Suppose (a) does not hold, and that . This means we can increase to without violating any of the constraints. The value of the is now , contradicting the optimality .
Now suppose (a) holds, but (b) does not. If then the solution is also feasible and optimal, so assume without loss of generality that . Consider now the solution , , and . Since
and
this new solution is feasible. Moreover,
contradicting the optimality of .
Now suppose (a) and (b) hold, but that (c) does not. Then there is some such that , where and , and so is also a feasible solution. Furthermore,
contradicting the optimality of .
One implication of Lemma 2 is that at an optimal solution, is maximized. This follows from the observation that the objective function is when . In addition, Lemma 2 pins down the value of the optimal solution to :
Lemma 3
Suppose and that is feasible. Then the value of an optimal solution to is one of the following:
-
•
If then .
-
•
If then .
-
Proof: Lemma 2 implies that at the optimal solution we have , and so the value at the optimum is . In the first case, the optimal solution occurs at the point , : it maximizes the objective, while both IR constraints remain slack since .
In the second case, Lemma 2 implies that and that . These imply that , and so that and . Plugging these into the objective yields
Lemmas 2 and 3 characterize the maximal values attainable by an IR mediator, and are the analogues of the condition that when there is no Amazon. However, before proving Theorem 3, we need two additional lemmas. Fix a mediator , and consider the hypothetical situation in which player obtains additional information, and in each state she fully learns that the state is and so plays the dominant strategy . In this new game, denote the best response of player as , the strategy that, for every and , chooses some that is most-likely correct conditional on and . Also observe that here, player is never the only player to choose , since player always plays .
Player is worse off in this hypothetical game than when all follow ’s recommendations:
Lemma 4
for any mediator .
-
Proof: Fix a particular state . For every action of player , who conditions on and , the utility , since makes correct choices more often in the former. This implies that , since in both sides of the inequality chooses an optimal action conditional on her information, but in the RHS the utility from every choice is higher than its corresponding utility in the LHS. This implies that .
Although in the hypothetical game player is worse off, the sum of players’ utilities is higher:
Lemma 5
The sum of players’ utilities is higher under than under in .
-
Proof: Again, we show that this holds conditional on every and . Fix , and observe that
since always correctly chooses . Furthermore,
since chooses the good that maximizes the probability of choosing . Thus,
Now recall that is maximized when , the probability that both players correctly choose is maximized. Thus, the value of , the sum of players’ utilities, is higher under than under .
We can now prove Theorem 3.
-
Proof: For the first condition, the upper bound on , fix some IR, IC mediator . We will argue that the utility to player under is at most . Lemma 4 implies that player ’s utility decreases if player additionally obtains all the information (even when responds optimally), and Lemma 5 implies that in this same situation the sum of utilities increases. Together, these lemmas imply that ’s utility is higher when she obtains all the information and only learns and , than when also only learns and . Furthermore, when has all the information, she is best off when has the least information—that is, when . This is because any additional information that causes to improve upon her payoff will do so at the expense of player ’s payoff (by transferring weight from to ). Thus, in any IR, IC mediator, the highest possible utility to player is , since and so .
The second condition, , follows from Lemma 3. If , then the maximal welfare of any IR mediator is , and so . If , then the lemma implies that the maximal welfare of any IR mediator is at most . Since player gets at least , player can get at most .
3.2.2 An optimal mediator
We now describe an optimal mediator, at the same time showing that the conditions in Theorem 3 are not only necessary but also sufficient.
We begin with a definition.
Definition 2
A mediator is fully revealing to player if the mediator’s recommendation to player always coincides with the optimal good: for every . A mediator facilitates full data-sharing if it is fully revealing to both players .
Fix the set of players to be . The idea underlying the mediator is the same as that of . Suppose . Then if , then the mediator will facilitate full data-sharing, and so each player will offer the correct good in every state. If then player will obtain all the information and player will obtain only partial information: she will get recommendation with probability less than 1, and otherwise will get recommendation . The challenge is to maximize the probability of recommendation subject to the IR constraints, and to do this in such a way that following the recommendations is a BNE.
We now formally describe the mediator, and then show that it is indeed IR, IC, and optimal. The main differences from in Mediator 1 are lines 5 and 9.
Theorem 4
For any feasible , the mediator above is IR, IC, and optimal.
An interesting feature of is that it either facilitates full data-sharing, or is fully revealing to one of the players. These features are inherent to any optimal IR mediator:
Theorem 5
For any feasible and any optimal IR mediator , one of the following holds:
-
•
is fully revealing to both players 1 and 2.
-
•
is fully revealing to one of the players, and the IR constraint binds for that player (i.e., she is indifferent between and her base value).
Note that Mediator 1 for the case of no Amazon also satisfies the features described in Theorem 5. In that case, however, these features do not characterize all mediators, as there exist mediators that are optimal for the case of no Amazon that do not have them. One such mediator is the one described in the example from the introduction. With an Amazon, in contrast, these features characterize all optimal IR mediators.
We begin by showing that the mediator above is optimal. Our approach is to show that is equal to the value of the optimal solution to , and that the strategy in is a BNE. Observe that does not contain the IC constraints, implying that they are not binding, and so only the IR constraints are relevant for optimality. This will no longer be the case without the assumption of jointly complete information, analyzed in Section 4 below.
We already laid most of the groundwork for the proofs of Theorems 4 and 5 when we proved Theorem 3, but do need one last lemma:
Lemma 6
Suppose is feasible and . Then .
-
Proof: Since is feasible, it holds that . Next, note that . Together, these facts imply that , as claimed.
We are now ready to prove Theorem 4.
-
Proof: First, consider the case in which . In this case, strategy in leads to expected utility to each player, which satisfies the IR constraint. Furthermore, strategy is a BNE, since each player always chooses , which is dominant. Finally, is maximal by Lemma 3.
Next, suppose . Observe that in is dominant for player , since she always chooses . It is also optimal for player , since she either also chooses , or chooses the best response conditional on her information . Thus, the mediator is IC.
We now show that is optimal. For simplicity, henceforth denote by . First note that . Next, by Lemma 6, it holds that . Since , this implies that . These, in turn imply that
so line 9 of Mediator 2 is valid. From that line we have that
since both players choose when and when but . This means that
which, by Lemma 3, is the value of the optimal solution to .
Finally, since ’s utility is and so ’s utility is , the mediator is IR.
Finally, we can also prove Theorem 5.
-
Proof: Suppose first that is such that . In this case, full data-sharing is IC, and so any optimal mediator must have . However, the maximum value of is , and the only way to obtain this is to have . This is equivalent to full data-sharing, and so the only mediator that is optimal is the full data-sharing one.
Now suppose that . By the proof of Theorem 4, an optimal solution to is attainable by some mediator, in particular by . Thus, any mediator that is optimal must also yield that is equal to the optimal value of . By Lemma 2, any optimal solution to must have and . Thus, the mediator must satisfy these as well: the first equality implying that is fully revealing to player , and the second that player ’s IR constraint binds.
4 No Jointly Complete Information
We now drop the assumption that players 1 and 2 have jointly complete information. Formally, for a given type let , and call each such a segment. The assumption of jointly complete information states that all segments are singletons, and in this section we consider the more general setting in which segments may contain more than one type of consumer.
This setting presents new challenges, as there is now a new conflict between optimal strategies and welfare. To see this, consider some segment for which . Then, conditional on segment , both players have a dominant strategy, namely, to offer . To see this, observe that a player’s utility from offering is at least regardless of the other’s action, the utility from offering is at most , and by assumption . This leads to total welfare . However, if players were to separate rather than pool—one offering and the other —then the total welfare would be , which is higher. Note that this conflict between optimal strategies and welfare does not occur when players have jointly complete information, since then the pooling action, which is dominant, is also welfare-maximizing. Finally, although we illustrated this conflict for the setting without an Amazon, it is of course also present with an Amazon.
This conflict between optimal strategies and welfare maximization makes the design of mediators significantly more challenging. In principle, one could formulate the problem as a complex linear program, whose solution could be used to find the optimal mediator. Here, however, we take a different approach. We will show that simple mediators—in particular, variants of the ones from Section 3 for the setting with jointly complete information—are approximately optimal. And while this approach entails some loss in terms of optimality, we will show that these simple mediators are not only approximately optimal relative to the optimal IR, IC mediator, but rather that they are approximately optimal relative to a higher benchmark—namely, the welfare that can be achieved by any mediator, even one that is not IR or IC. Then, to complement our results, we will show that the approximation factors our mediators achieve are tight or nearly-tight, and that no other IR and IC mediator can in general achieve a better approximation factor relative to that same benchmark.
We begin with some notation. For any given segment , denote by the good that has the highest probability in , and by the good with the second highest probability, under the prior . If there are several such goods, fix two such goods arbitrarily. Next, let be the probability of conditional on , and be the respective probability of . We will also use the notation that, for a given segment , the goods and are the goods with the highest and second-highest probabilities in , with and their respective conditional probabilities. Finally, let be the total (unconditional) weight of goods over all segments , and be the total weight of goods : Formally,
Now, as our main interest lies in coopetition against an Amazon, we will focus here on the setting with an Amazon. We note that, while we drop the assumption that players 1 and 2 have jointly complete information, we still assume that the Amazon player 0 has complete information. That is, even though in any given segment players 1 and 2 do not know which type is realized, player 0 does know, and offers each such type the correct good .
We proceed as follows. We begin in Section 4.1 with the design of an optimal mediator for a particular setting of parameters, namely, for the case in which for every segment . We then turn to our main analysis in Section 4.2, for the case in which for every segment . There we design the approximately optimal simple mediator mentioned above, and analyze the tightness of the approximation factor. Finally, in Section 4.3 we show how to combine the two mediators in order to obtain an approximately optimal mediator for the general setting, without restrictions on the segments.
4.1 Optimal Mediator for
We begin with the simpler case in which for every segment . This case is simpler because here, conditional on any given segment, there is no conflict between optimal strategies and welfare maximization. In particular, both involve separating, with one player offering and the other offering :
Claim 1
Conditional on any segment , if then the separating strategy profile is both welfare maximizing and an equilibrium.
-
Proof: We will compare the only two reasonable player choices for welfare maximization conditional on segment : (i) pooling on , or (ii) separating, with one player offering and the other .
-
(i)
Pooling: Here each of players 1 and 2 obtains utility —when type with is realized (which happens with probability ), the players split the surplus of 1 with player 0, and so each gets . Welfare of players 1 and 2 is thus .
-
(ii)
Separating: Here the player who offers obtains utility —when type with is realized, that player splits the surplus with player 0. Similarly, the player who offers obtains utility . Welfare of players 1 and 2 is thus .
Observe that, when , we have that , and so the welfare maximizing choices are the separating ones.
In addition to being welfare-maximizing, the separating actions also form an equilibrium (conditional on ). Fix a segment , and suppose player 1 offers and player 2 offers . Then player 1 clearly does not wish to deviate, as she currently obtains utility , a deviation to will lead to utility , and a deviation to any other good will lead to utility at most (since has maximal probability in ). Player 2 also does not wish to deviate, as she currently obtains utility , a deviation to will lead to utility , and by assumption .
-
(i)
Given the lack of conflict between welfare maximization and equilibrium shown in Claim 1, we can now construct optimal mediators for this parameter setting. The mediator is straightforward: it facilitates full data-sharing between the players, and recommends the actions that lead to a separating equilibrium.
Theorem 6
Suppose and . Then the mediator above is IR, IC, and optimal.
Observe that the conditions on in Theorem 6 are also necessary for the existence of any IR mediator, since the maximal welfare attainable by any mediator in this setting is , and the maximal utility any single player can attain is .
-
Proof: Suppose first that , and so the conditional in line 2 of Mediator 3 is true. In this case, each player obtains utility under , and the mediator is thus IR. In addition, since by Claim 1 separation is an equilibrium on every segment , the mediator is also IC. Finally, the total welfare to the players is , which is maximal.
Next, suppose . Observe that, in this case, the mediator mixes between utility to player and to player , and utility to player and to player , depending on the value of . Regardless of the realizations of , the total welfare to the players is always . This is the optimal welfare attainable.
Next, the expected utility of player in is
Furthermore, since and total welfare is , the utility to player is at least . Thus, the mediator is IR.
Finally, since by Claim 1 separation is an equilibrium on every segment , the mediator is IC.
4.2 Approximately Optimal Mediator for
Suppose now that for every segment . This case is more complicated since, conditional on any segment, there may be a conflict between optimal strategies and welfare maximization:
Claim 2
Conditional on any segment , if then pooling on is the dominant strategy. If then separating is welfare-maximizing, whereas if then pooling is welfare-maximizing.
-
Proof: Again, we will compare the only two reasonable player choices conditional on segment : (i) pooling on , or (ii) separating, with one player offering and the other . As in the proof of Claim 1, pooling leads to individual utilities and so welfare , whereas separating leads to individual utilities and and so welfare . Then if and only if , as claimed.
In terms of optimality, however, observe that choosing leads to utility at least , whereas choosing any other good leads to utility at most . The former is greater than the latter whenever , and so constitutes a dominant action (conditional on ).
Our goal is to design a mediator that is approximately optimal relative to a particular benchmark. We denote the benchmark OPT, which is the maximal welfare attainable by any mediator, even disregarding the IR and IC constraints. We will show that the mediator below, which is a variant of the mediator for the case of jointly complete information, is IR, IC, and obtains welfare at least . We will then show that this approximation factor is tight.
Before describing the mediator, recall the strategy from Section 3 that, for every , chooses one of the goods that is most likely to be correct. Here we will use a different formulation of . In particular, let . That is, is an optimal action for player conditional on knowing only , in the hypothetical situation where always plays the most-likely correct good in every segment. Also, let and be the expected utilities of players and in this hypothetical situation: and . Note that, under jointly complete information, the formulation of is identical to the one given in Section 3. Without jointly complete information, however, they may differ.
Theorem 7
Let , and suppose and . Then the mediator above is IR, IC, and has welfare .
We note that, unlike the conditions on in Theorems 2, 4, and 6, the conditions here are not necessary but only sufficient.
-
Proof: Consider first the case in which . Here, the utility to each player is , and so the mediator is IR. Furthermore, players always pool on which is an equilibrium by Claim 2. Thus, in this case the mediator is also IC. Finally, total welfare here is . Below we will show that this is within a factor of OPT.
Before turning to the approximation factor, consider the case in which . Observe that the total welfare in this case is at least . If on a given segment the players pool on , then welfare here is . If the players separate, then the welfare here is at least , since player always chooses . Thus, total welfare is at least .
Next, the expected utility of player in this case is
Furthermore, since the total welfare is at least , player obtains welfare at least , and so the mediator is IR here as well. In addition, since by Claim 2 the action is dominant in every segment , and in the mediator player always chooses and player either pools or best responds given her information, the mediator is IC.
Finally, we turn to the approximation factor. By Claim 2, the maximal welfare possible on a segment in which is , and our mediator guarantees welfare at least . Hence, here we have approximation ratio at least
In addition, by Claim 2, the maximal welfare possible on a segment in which is , and our mediator guarantees welfare at least . Hence, here we have approximation ratio at least
Since a ratio of is achieved for each segment, it is also achieved overall, and .
We now show that the approximation factor of is tight, by giving an example wherein there does not exist an IR mediator, not even one that is not IC, that achieves a factor higher than .
Example 1
Fix some , and suppose there are segments. Within each segment there are two correct goods, and , that satisfy . Furthermore, all the goods are distinct, whereas for every the good is the same. Player 1 has all the information, and so for every , whereas player 2 has none, and so for every . Finally, and .
In this example, maximal welfare is achieved when all players pool on for every , and so . However, any IR mediator must give player 1 utility , and the only way to achieve this is to give player 2 no additional information, so that player 1 always chooses and player 2 always chooses (this is, in fact, what the mediator does). Total welfare here is then , and so the approximation ratio is
In the previous example, the main bottleneck to achieving an approximation factor higher than was the IR constraint. In this next example we show that even without the IR constraints, there does not exist an IC mediator that achieves an approximation factor higher than .
Example 2
Fix some , and suppose that in every segment there are two correct goods, and , that satisfy . Furthermore, the goods in every segment are distinct from goods in other segments. By Claim 2, maximal welfare here is achieved when players separate on each segment, and so .
However, again by Claim 2, the dominant action for players is to pool on . Note that in any IC mediator, no player will ever choose a good , regardless of her information, since she can always improve her payoff by deviating to the strictly better (here we use the fact that all goods are distinct). Thus, the most any IC mediator can attain is when all players pool on the top good, leading to welfare and so approximation factor
4.3 Approximately Optimal Mediator for General Setting
Each of the mediators in the previous two sections is IR, IC, and approximately-optimal under a particular setting of parameters, but not in the other. Mediator of Mediator 3 is not IC when for some segment , since players wish to pool but the mediator facilitates separating. of Mediator 4 is not IC when for some segment , since players wish to separate but the mediator facilitates pooling. In this section we show that the two mediators can be combined in order to yield an IR, IC, and approximately optimal mediator for any parameter setting. The main idea is straightforward: to run of Mediator 3 on segments for which , and to run of Mediator 4 on segments for which .
Before formalizing this mediator, we need some notation. Let and . Also, for each let be the total weight of the most-likely correct goods conditional on , and be the total weight of the second most-likely correct goods conditional . Formally, for each ,
Finally, modify the definitions of , , and from Section 4.2 as follows. First, for any with let , so it is an optimal action for player conditional on the information that and , in the hypothetical situation where always plays the most-likely correct good in every segment . Also, let and be the expected utilities of players and in this hypothetical situation: and .
Theorem 8
Let , and suppose that , that , and that . Then the mediator above is IR, IC, and has welfare .
-
Proof: Conditional on , the conditions on of Theorem 6 are satisfied, and so is IR and IC. Conditional on , the conditions on of Theorem 7 are satisfied, and so is IR and IC. That is, conditional on either or , the expected utilities of either player is at least . Thus, ’s expected utility in , which is a mixture of her utilities conditional on and on , must also be greater than . This implies that is IR.
Furthermore, since and are IC, for either player the strategy is a best-response to conditional on either or . Thus, is also IC.
Finally, we consider the welfare guaranteed by . Conditional on , the welfare attained is , by Theorem 6. This is the optimal attainable by any mediator when for every segment , by Claim 1. Hence, conditional on , the welfare achieved by is equal to OPT (conditional on ).
Conditional on , the welfare attained on any segment is at least times the welfare attainable on this segment by any mediator (by the proof of Theorem 7). Hence, the welfare achieved by is at least (conditional on ).
Since conditional on either or yields welfare at least times the optimal, the total welfare of is .
5 Do Players Share More Data With or Without an Amazon?
In this section we consider the intriguing question of whether players optimally share more data in the presence of an Amazon or in its absence. While it seems intuitive that players would share more data when facing stronger outside competition, we show here that this is not necessarily the case. We also show that the reason the intuition fails is related to the possible conflict between equilibrium and welfare maximization identified in Claim 2.
Because we are interested in comparing data sharing across two different environments—without and with an Amazon—we focus on our main interpretation of the base values , namely, that they correspond to the equilibrium expected utilities absent a mediator. Of course, because the two environments differ, so will these equilibrium expected utilities. We will thus be interested in comparing the benefit of data sharing relative to firms’ no-sharing equilibrium utilities absent an Amazon with the benefit of data sharing relative to firms’ no-sharing equilibrium utilities in the presence of an Amazon.
We begin our analysis in Section 5.1, where we show that, when there is jointly complete information, the intuition that firms share more data in the presence of an Amazon than in its absence does hold. In particular, we show that if, when there is no Amazon, data sharing strictly increases welfare relative to the welfare-maximizing equilibrium sans data sharing, then optimal data sharing strictly increases welfare in the environment with an Amazon relative to any equilibrium sans data sharing.
In Section 5.2 we then drop the assumption of jointly complete information, and show that this result no longer holds. In particular, we describe two settings:
-
1.
A setting where full data-sharing is optimal in the presence of an Amazon, but no data-sharing is optimal in the absence of an Amazon.
-
2.
A setting where full data-sharing is optimal in the absence of an Amazon, but no data-sharing is optimal in the presence of an Amazon. In this setting, data sharing is strictly beneficial to firms in the absence of an Amazon but not in its presence.
In Section 5.2 we also identify and discuss the driving force behind this difference, namely, the possible conflict between equilibrium and welfare maximization.
5.1 Conditions for Sharing with Jointly Complete Information
In this section we show that, under jointly complete information, when players can strictly benefit from data sharing in the absence of an Amazon, they can also strictly benefit from data sharing in the presence of an Amazon.
Theorem 9
Fix an unmediated game in which there is jointly complete information and no Amazon, and let be the expected utilities in a BNE of for which is maximal. Also, let be the same unmediated game, except with an Amazon, and let be the expected utilities in some BNE of . Then if there exists an IR, IC mediator in the game without an Amazon such that , there also exists an IR, IC mediator in the game with an Amazon such that .
The converse of Theorem 9 is not true—there are examples where players strictly benefit from data sharing in the presence of an Amazon but not in its absence. A simple example is the game described in Figure 1 in the Introduction, with parameters . In the absence of an Amazon, the unique equilibrium there yields maximal welfare of 1, since at least one consumer always offers the correct good. Thus, data sharing cannot strictly improve welfare. In the presence of an Amazon, however, data sharing can be strictly beneficial, as described in the Introduction.
-
Proof: Suppose towards a contradiction that there exists an IR, IC mediator in the game without an Amazon such that , but that for some there does not exist an IR, IC mediator in the game with an Amazon such that . Let be the equilibrium in the game with an Amazon that yields payoffs , and let be the mediator that does nothing other than recommend players their actions from the equilibrium . This mediator is IR, IC, and yields welfare . Since, by assumption, no IR, IC mediator yields higher welfare, is optimal relative to the feasible pair . Thus, by Theorem 5, is fully revealing to at least one of the two players. In particular, this implies that the BNE in the game with an Amazon is one in which at least one of the players always offers the correct good to every consumer .
We now argue that this implies that the same is an equilibrium also in the game without an Amazon. The player that always offers the correct good is clearly playing a dominant strategy, both in the game with an Amazon and in the game without, as any deviation can only lower her utility. The other player is playing a best response in the game with an Amazon. Because both player and the Amazon always offer the correct good, ’s strategy must maximize the probability of offering the correct good conditional on knowing (only) . However, this strategy is also an equilibrium in the game without an Amazon, since always offers the correct good.
The equilibrium is such that at least one of the players always offers the correct good, and so in the game without an Amazon it yields a total welfare of 1. However, this contradicts the assumption that there exists a mediator that strictly improves welfare, since no mediator can yield welfare .
5.2 Data-Sharing with No Jointly Complete Information
The analysis of Section 5.1 shows that when players strictly benefit from data sharing in the absence of an Amazon, they necessarily also benefit from data sharing in the presence of an Amazon. This is consistent with the intuition that players can gain more by sharing data when they also face outside competition. However, this result relies on the assumption of jointly complete information. In this section we drop the assumption, and show that in that case this intuition is not complete. In particular, while there are settings in which players benefit more from sharing data in the presence of an Amazon, there are also settings in which players benefit from data sharing in the absence of an Amazon but not in the presence of an Amazon.
More specifically, in Section 5.2.1 below we describe a setting where:
-
•
In the absence of an Amazon, the optimal welfare is achieved when players share no data and not when they fully share data;
-
•
In the presence of an Amazon, the optimal welfare is achieved when players fully share data and not when they share no data.
In contrast, in Section 5.2.1 we describe a setting where:
-
•
In the absence of an Amazon, the optimal welfare is achieved when players fully share data and not when they share no data;
-
•
In the presence of an Amazon, the optimal welfare is achieved when players share no data and not when they fully share data.
The main driving force behind the distinction between the two settings is whether or not there is a conflict between equilibrium and welfare maximization, as formalized in Claim 2. The situation described in Section 5.2.1 is one where for every segment . In this case, under full data sharing, it is a dominant strategy for players to pool on (both with and without an Amazon). In the presence of an Amazon, such pooling is also welfare maximizing, by Claim 2. In the absence of an Amazon, however, separating is always welfare maximizing. Thus, full data sharing leads to the optimal welfare in the former case, but not in the latter.
In contrast, the situation described in Section 5.2.2 is one where . In this case, under full data sharing, in the absence of an Amazon it is an equilibrium for players to separate, whereas in the presence of an Amazon it is dominant for players to pool (again, by Claim 2). Furthermore, in both settings, separating is the welfare maximizing outcome. Thus, full data sharing leads to the optimal welfare in the absence of an Amazon, but not in its presence.
5.2.1 More Sharing With an Amazon
When there is an Amazon, if for each segment it holds that , then maximal welfare is achieved when players pool on (by Claim 2). This maximal welfare is can be achieved by full data-sharing (assuming the IR constraint is satisfied): On each segment , the mediator recommends to both players, and this is optimal for them (again by Claim 2). In contrast, when there is no Amazon, full data-sharing may not be optimal (regardless of the IR constraint), as here maximal welfare is attained by separation.
In this section we construct an example in which full data-sharing is strictly suboptimal when there is no Amazon. While the example from the introduction illustrated this, it is not quite what we desire, since in that example the mediator with an Amazon is also an optimal mediator without. What we would like is an example where full data-sharing is optimal only with an Amazon. In the analysis that follows, we first ignore the IR constraint. At the end of the section we then show that, with the proper modification, the argument holds also when taking the IR constraint into account.
In the following construction, for simplicity we refer to different types of consumers by the good they desire. There are two segments, and , and players’ partitions are and . That is, player 1 learns whether the realized lies in the segment (and so the correct good is ) or (and so the correct good is or ), whereas player 2 does not learn anything. The prior is such that both segments are equally likely, and that . That is, in segment , the good is more than three times more likely to be correct than good .
In the presence of an Amazon, the welfare maximizing choices lead to pooling on in segment and pooling on in segment , by Claim 2. This can be achieved by full data-sharing, leading to an equilibrium in which players take these pooling actions (again, by Claim 2). In contrast, with no data sharing, player 2 chooses the action (in both segments), leading to lower welfare due to separation on .
Now suppose there is no Amazon. Here the welfare maximizing choices have players separating on segment . Without any additional information, player 2 chooses action , leading to such optimal separation. With data sharing, however, player 2 takes action on , pooling with player 1, leading to lower welfare. Thus, in this example, full data-sharing is optimal in the presence of an Amazon but not in its absence. Furthermore, without an Amazon, it is optimal to not share any data.
Finally, recall that we thus far neglected the IR constraint. We now modify the example so that the conclusion holds when also taking the IR constraints into account. In particular, duplicate the construction with a different set of segments and consumers, but where players 1 and 2 have the opposite information. Formally, in addition to the two segments, and , suppose there are also segments and , where the goods and are distinct from the goods and . Furthermore, the partitions are and . That is, if the realized segment is or , then again player 1 learns the segment and player 2 learns only that the realized segment is either or . Symmetrically, if the realized segment is or , then player 2 learns the segment and player 1 learns only that the realized segment is either or . Finally, suppose that the realized segment is or with probability , and or with probability .
Because the example is symmetric, the utilities of the two players, absent data sharing, are equal. Similarly, the utilities of the two players, under full data-sharing, are also equal. Thus, whenever full data-sharing leads to strictly higher welfare than no data-sharing (as in the presence of an Amazon above), then it also satisfies the IR constraint. And if no data-sharing is optimal in the original example (as in the absence of an Amazon above), then it is also optimal in the modified setting. Thus, the conclusion that full data-sharing may be optimal in the presence of an Amazon but not in its absence holds also when we take the IR constraint into account.
5.2.2 More Sharing Without an Amazon
When there is no Amazon, the optimal welfare is achieved when, for each segment , players separate—one choosing and the other —as this leads to maximal utility in each segment. Furthermore, if for every segment , then this maximal welfare can be achieved by full data-sharing: On each segment , the mediator recommends to one of the players, and to the other. In contrast, in the presence of an Amazon, if for every segment then full data-sharing may not be optimal. In this section we construct an example in which full data-sharing is strictly suboptimal with an Amazon. As in Section 5.2.1 above, in the following construction, for simplicity we refer to different types of consumers by the good they desire. We also ignore the IR constraint. However, again as in Section 5.2.1 above, the conclusion holds also when we take the IR constraint into account if we consider a modified, symmetric example in which the segments and partitions are duplicated.
Suppose there are three segments, , , and , and players’ partitions are and . That is, player 1 learns whether the realized lies in the segment , and so , or whether lies in one of or , and so the correct good for the realized type is one of , , or (with relative probabilities to be specified shortly). Player 2 is similar, but learns whether the realized segment is or one of or . The prior is such that the segments are equally likely. Denote by the probability that good is correct, conditional on segment , and suppose that and that .
Suppose there is no Amazon. The optimal welfare here, , is attained when at least one player chooses conditional on , at least one player chooses conditional on , and when one player chooses and another chooses conditional on . Full data-sharing leads players to learn the correct segment, and so to maximal welfare: on segments and players pool on the correct good, and on segment it is an equilibrium for players to separate.
In contrast, without data sharing, while player 2 will end up choosing on , player 1 will end up choosing on . This is because player 1 cannot differentiate between and , and conditional on the strategy of choosing good is dominant. This, however, leads to total welfare , and so full data-sharing is better than no sharing.
Now suppose there is an Amazon. When there is no data-sharing, then conditional on segments or , welfare will be , since for both players it is a dominant strategy to offer the correct good. On segment the strategies and of players 1 and 2, respectively, will again be dominant (as in the case of no Amazon), leading to conditional welfare here.
What happens will full data-sharing? Here, on segment , players will pool on action , by Claim 2 and the assumption that . Thus, welfare conditional on will be
where the inequality follows from the assumption that . Since, conditional on or , welfare is the same with full sharing and with no sharing, total (unconditional) welfare is higher with no sharing than with full sharing.
Furthermore, observe that under no data-sharing, welfare is maximal conditional on or . Under , higher welfare could (only) be attained if players were to separate on and . However, such separation can never occur in equilibrium: Any player that is supposed to offer can strictly benefit by deviating to offering , regardless of the other player’s action (by Claim 2 and since ). Therefore, under no data-sharing players obtain maximal welfare subject to the IC constraint being satisfied.
Thus, in this example, full data-sharing is optimal in the absence of an Amazon, but not in its presence. In contrast, no data-sharing is optimal in the presence of an Amazon, but not in its absence. In particular, this implies that data sharing is strictly beneficial in the absence of an Amazon, but not in its presence.
6 Conclusion
In this paper we proposed a simple model to study coopetitive data sharing between firms. We designed several data-sharing schemes, in the form of mediators, that are optimal in their respective domains, with and without an Amazon.
In our model, we assumed that the mediator only facilitates data sharing between firms. One natural extension is to allow the mediator to also use monetary transfers between them. In Appendix 7.1 we analyze this extension. We show that transfers can loosen the IR constraint, implying that, under jointly complete information, full data-sharing is often optimal. However, when there is no jointly complete information, full data-sharing may violate the IC constraint, and in this case our mediators (even without transfers) are close to optimal.
One limitation of our model is that it considers data sharing between only two firms. What happens when there are small firms, and a mediator that can facilitate sharing between all of them? We leave a thorough analysis of optimal mediators in this setting for future work, but provide some preliminary results in Appendix 7.2. In particular, we analyze the case of jointly complete information. We show that, in coopetition without an Amazon, a generalization of the mediator from the introductory example to players is optimal. In coopetition against an Amazon, however, the situation is more complicated. Nonetheless, we provide necessary and sufficient conditions for full data-sharing to be optimal in this case.
Coopetitive data-sharing is crucial for firms’ survival in current online markets, and the model and results of this paper have only scratched the surface of what can and should be done. In particular, the paper leaves numerous directions open for future research. In addition to a thorough analysis of optimal data-sharing with more than 2 players, a particularly important direction is to analyze data sharing in a variety of markets different from the one studied here.
References
- Anderson (2006) Anderson, C. (2006). The long tail: Why the future of business is selling less of more. Hachette Books.
- Aumann (1976) Aumann, R. J. (1976). Agreeing to disagree. The annals of statistics 1236–1239.
- Ben-Porat and Tennenholtz (2019) Ben-Porat, O. and Tennenholtz, M. (2019). Regression equilibrium. In Proceedings of the 2019 ACM Conference on Economics and Computation. 173–191.
- Bengtsson and Kock (2000) Bengtsson, M. and Kock, S. (2000). “Coopetition” in business networks – to cooperate and compete simultaneously. Industrial Marketing Management, 29 411–426.
- Bergemann and Bonatti (2019) Bergemann, D. and Bonatti, A. (2019). Markets for information: An introduction. Annual Review of Economics, 11 85–107.
- Bergemann and Morris (2013) Bergemann, D. and Morris, S. (2013). Robust predictions in games with incomplete information. Econometrica, 81 1251–1308.
- Bergemann and Morris (2016) Bergemann, D. and Morris, S. (2016). Bayes correlated equilibrium and the comparison of information structures in games. Theoretical Economics, 11 487–522.
- Bergemann and Morris (2019) Bergemann, D. and Morris, S. (2019). Information design: A unified perspective. Journal of Economic Literature, 57 44–95.
- Bimpikis et al. (2019) Bimpikis, K., Crapis, D. and Tahbaz-Salehi, A. (2019). Information sale and competition. Management Science, 65 2646–2664.
- Brandenburger and Nalebuff (2011) Brandenburger, A. M. and Nalebuff, B. J. (2011). Co-opetition. Crown Business.
- Clarke (1983) Clarke, R. N. (1983). Collusion and the incentives for information sharing. The Bell Journal of Economics 383–394.
- de Zegher and Lo (2020) de Zegher, J. F. and Lo, I. (2020). Crowdsourcing market information from competitors. Available at SSRN 3537625.
- European Commission (2020) European Commission (2020). A European strategy for data. https://eur-lex.europa.eu/legal-content/EN/TXT/?qid=1593073685620&uri=CELEX:52020DC0066. Accessed: 2021-05-13.
- Feng et al. (2019) Feng, Y., Gradwohl, R., Hartline, J., Johnsen, A. and Nekipelov, D. (2019). Bias-variance games. arXiv preprint arXiv:1909.03618.
- Forges (1993) Forges, F. (1993). Five legitimate definitions of correlated equilibrium in games with incomplete information. Theory and Decision, 35 277–310.
- Galperti and Perego (2018) Galperti, S. and Perego, J. (2018). A dual perspective on information design. Available at SSRN 3297406.
- Gehrig and Stenbacka (2007) Gehrig, T. and Stenbacka, R. (2007). Information sharing and lending market competition with switching costs and poaching. European Economic Review, 51 77–99.
- Gentzkow and Kamenica (2016) Gentzkow, M. and Kamenica, E. (2016). Competition in persuasion. The Review of Economic Studies, 84 300–322.
- Gu et al. (2019) Gu, Y., Madio, L. and Reggiani, C. (2019). Data brokers co-opetition. Available at SSRN 3308384.
- Ha and Tong (2008) Ha, A. Y. and Tong, S. (2008). Contracting and information sharing under supply chain competition. Management Science, 54 701–715.
- Ichihashi (2020) Ichihashi, S. (2020). Competing data intermediaries. Manuscript, available at https://shota2.github.io/research/data.pdf.
- Jappelli and Pagano (2002) Jappelli, T. and Pagano, M. (2002). Information sharing, lending and defaults: Cross-country evidence. Journal of Banking & Finance, 26 2017–2045.
- Jentzsch et al. (2013) Jentzsch, N., Sapi, G. and Suleymanova, I. (2013). Targeted pricing and customer data sharing among rivals. International Journal of Industrial Organization, 31 131–144.
- Kairouz et al. (2019) Kairouz, P., McMahan, H. B., Avent, B., Bellet, A., Bennis, M., Bhagoji, A. N., Bonawitz, K., Charles, Z., Cormode, G., Cummings, R. et al. (2019). Advances and open problems in federated learning. arXiv preprint arXiv:1912.04977.
- Kamenica and Gentzkow (2011) Kamenica, E. and Gentzkow, M. (2011). Bayesian persuasion. American Economic Review, 101 2590–2615.
- Kearns et al. (2014) Kearns, M., Pai, M., Roth, A. and Ullman, J. (2014). Mechanism design in large games: Incentives and privacy. In Proceedings of the 5th Conference on Innovations in Theoretical Computer Science. 403–410.
- Liu and Serfes (2006) Liu, Q. and Serfes, K. (2006). Customer information sharing among rival firms. European Economic Review, 50 1571–1600.
- Loshen (2014) Loshen, D. (2014). Leveraging cloud-based analytics for crowd-sourced intelligence. https://gigaom.com/report/leveraging-cloud-based-analytics-for-crowdsourced-intelligence/. [Gigaom Research; accessed 6-June-2020].
- Lyu et al. (2020) Lyu, L., Yu, J., Nandakumar, K., Li, Y., Ma, X., Jin, J., Yu, H. and Ng, K. S. (2020). Towards fair and privacy-preserving federated deep models. IEEE Transactions on Parallel and Distributed Systems, 31 2524–2541.
- Mansour et al. (2018) Mansour, Y., Slivkins, A. and Wu, Z. S. (2018). Competing bandits: Learning under competition. In 9th Innovations in Theoretical Computer Science Conference (ITCS 2018).
- Martens et al. (2020) Martens, B., De Streel, A., Graef, I., Tombal, T. and Duch-Brown, N. (2020). Business-to-business data sharing: An economic and legal analysis. European Commission, JRC Digital Economy Working Paper Series, 2020. https://ec.europa.eu/jrc/sites/jrcsh/files/jrc121336.pdf. Accessed: 2021-05-13.
- Mathevet et al. (2020) Mathevet, L., Perego, J. and Taneva, I. (2020). On information design in games. Journal of Political Economy, 128 1370–1404.
- Monderer and Tennenholtz (2004) Monderer, D. and Tennenholtz, M. (2004). k-implementation. Journal of Artificial Intelligence Research, 21 37–62.
- Monderer and Tennenholtz (2009) Monderer, D. and Tennenholtz, M. (2009). Strong mediated equilibrium. Artificial Intelligence, 173 180–195.
- Myerson (1991) Myerson, R. B. (1991). Game Theory: Analysis of Conflict. Harvard University Press.
- Pagano and Jappelli (1993) Pagano, M. and Jappelli, T. (1993). Information sharing in credit markets. The Journal of Finance, 48 1693–1718.
- Pathak et al. (2010) Pathak, B., Garfinkel, R., Gopal, R. D., Venkatesan, R. and Yin, F. (2010). Empirical analysis of the impact of recommender systems on sales. Journal of Management Information Systems, 27 159–188.
- Raith (1996) Raith, M. (1996). A general model of information sharing in oligopoly. Journal of economic theory, 71 260–288.
- Shamir and Shin (2016) Shamir, N. and Shin, H. (2016). Public forecast information sharing in a market with competing supply chains. Management Science, 62 2994–3022.
- Simon (1956) Simon, H. A. (1956). Rational choice and the structure of the environment. Psychological review, 63 129.
- Yang et al. (2019) Yang, Q., Liu, Y., Chen, T. and Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10 1–19.
- Zhan et al. (2021) Zhan, Y., Zhang, J., Hong, Z., Wu, L., Li, P. and Guo, S. (2021). A survey of incentive mechanism design for federated learning. IEEE Transactions on Emerging Topics in Computing.
Appendix
7 Extensions
7.1 Transfers
In this section we consider an extension to the model: the possibility of monetary transfers between players. With transfers, the mediator may require one player to pay an amount , and another to receive that amount , when opting in. Assume total utilities are quasi-linear in these transfers—that is, a player’s total utility is her utility from the game plus the change in her monetary position. How does the possibility of such transfers affect the design of mediators?
A first observation is that the optimality of mediator , under jointly complete information and without an Amazon, is not affected, since that mediator leads to a globally optimal sum of player’s utilities. Any transfers will only change the distribution of utilities (and hence whether or not the mediator is IR), not their sum.
When players have jointly complete information and they compete against an Amazon, however, the optimal mediator can be different. In particular, when there are transfers, then a mediator that is fully revealing to both players is optimal whenever . If both and , then the mediator that is fully revealing to both is optimal even without a transfer (this is what does). If , let the mediator set a transfer , have player transfer to player , and then always recommend to both players. After the transfer this leads to utility to player and to player . Observe that and , and so the mediator is IR. The mediator is IC since each player plays the dominant action for every . And, finally, the mediator is optimal, since the sum of utilities is , which is globally maximal.
When there is no jointly complete information the situation is slightly more complicated, and here we consider three settings of parameters. First, if , then the mediator of Mediator 3 is IR, IC, and optimal, even without transfers. Second, if , then by Claim 2 the pooling strategy is both dominant and welfare maximizing. In this case, a mediator like the mediator with jointly complete information and transfers above will be optimal whenever . If both and , then the mediator that is fully revealing to both is optimal even without a transfer (as in ). If , let the mediator set a transfer , have player transfer to player , and then always recommend to both players. This mediator is IR, IC, and optimal, by the same reasoning as above.
Third, if , then full data-sharing does not always satisfy the IC constraint. This is demonstrated by the example in Section 5.2.1, in which the optimal IC mediator is one in which there is no data sharing (and it is strictly better than full data-sharing). Since the example is symmetric, players’ payoffs are identical, and so transfers do not alleviate the situation. Nonetheless, recall that mediator (analyzed in Theorem 8) yields welfare at least , where OPT is the maximal welfare achieved by any mediator, including one that is not IR or IC. Furthermore, Example 2 shows that no mediator (not even one that does no satisfy IR) can in general achieve more than . This implies that even the best mediator with transfers can yield at most . Our mediator achieves close to this—namely, —without transfers.
7.2 Many Players
Throughout this paper we analyzed the case of 2 players and possibly an Amazon. In this section we discuss an extension of the model and results to players. We leave a thorough analysis of optimal mediators in this setting for future work, but here we provide some preliminary results. In particular, we consider the case in which players have jointly complete information: for every , where is the partition element of player when consumer type is realized.
The main take-away from this section is that in coopetition without an Amazon, a generalization of the mediator from the introductory example to players is optimal. In coopetition against an Amazon, however, the situation is more complicated. We provide necessary and sufficient conditions for full data-sharing to be optimal in this case.
7.2.1 Many Players Without Amazon
Let be a BNE of the unmediated game, where is the equilibrium strategy of player when she obtains information . We will design an optimal mediator for the setting in which, for every , the base value is at most the utility of player in . Consider the following mediator, and observe that it uses the equilibrium in its construction (in contrast with , which only uses the base values ).
Theorem 10
Fix a BNE of the unmediated game with no Amazon, and for every let be at most the expected utility of player in . Then is IR, IC, and optimal.
In mediator , every player obtains the same utility as under , plus additional utility whenever no player correctly chooses . Thus, the mediator is IR. Furthermore, since under this mediator, for every at least one player chooses , and so the total welfare is 1, which is maximal. The main challenge in the proof is to show that the mediator is IC: in the mediator each player either plays her equilibrium strategy or the dominant action , and the proof involves showing that, conditional on not obtaining recommendation , the strategy is still optimal.
-
Proof: Recall that is the strategy profile in which players always follow ’s recommendation. We first show that all players receive (weakly) higher expected utilities under than under . Consider some , and some realizations of . Consider two events: If for all then without the mediator, all players obtain utility zero. The mediator here recommends to all, leading to positive utility to all players. Conditional on this event, then, the mediator is beneficial to all players. The other event is the complement of this first event, in which case the mediator recommends players’ BNE actions. Since the mediator’s recommendation does not change players’ actions relative to , it also does not affect their utilities. Overall, then, all players prefer in to , and so the mediator is IR.
Next, we show that is a BNE of . Fix and a player , and suppose all other players play . Denote this profile of other players as To simplify notation, denote by and by the distribution over recommendations to player in state . Also fix some , and let . Then when the mediator recommends action , player ’s expected utility from following that recommendation is
(1) (2) (3) (4) (5) where (1) follows from the law of total expectation, (2) follows since implies that the mediator recommended to all players, (3) follows since implies that the mediator recommended the equilibrium action to all players, (4) follows from another application of the law of total expectation, and (5) follows since implies that .
Similarly, ’s expected utility from not following the mediator’s recommendation and playing instead is
Since is an equilibrium, it holds that
and so
Additionally, note that if for some then player is certain that , and so following the
mediator’s recommendation is optimal. Thus, in both cases, following the mediator’s recommendation is optimal, and so is a BNE.
Finally, we show that is optimal with respect to . In any state , the sum of players’ utilities conditional on that state is at most 1. This maximal utility is achieved whenever at least one player plays . Under , at least one player plays in every . Hence, the sum of players’ (unconditional) expected utilities is , the maximum possible in the game.
7.2.2 Many Players Against Amazon
Suppose now that there are players and an Amazon, that there is jointly complete information, and that is an arbitrary profile of base values.
Claim 3
The mediator that is fully revealing to all players and always recommends is IR, IC, and optimal if and only if for all .
-
Proof: The strategy in a fully revealing is a BNE, since each player plays , which is dominant. It is IR, since each player ’s expected utility in is . Finally, it is optimal: Whenever players correctly choose , the sum of players’ utilities is . Under , the sum of players’ utilities is , which is the most they can jointly obtain.
The reverse direction holds by the observation that if a player has base value then her IR constraint is violated under a fully revealing mediator.