Probably approximately correct stability of allocations in uncertain coalitional games with private sampling
Abstract
We study coalitional games with exogenous uncertainty in the coalition value, in which each agent is allowed to have private samples of the uncertainty. As a consequence, the agents may have a different perception of stability of the grand coalition. In this context, we propose a novel methodology to study the out-of-sample coalitional rationality of allocations in the set of stable allocations (i.e., the core). Our analysis builds on the framework of probably approximately correct learning. Initially, we state a priori and a posteriori guarantees for the entire core. Furthermore, we provide a distributed algorithm to compute a compression set that determines the generalization properties of the a posteriori statements. We then refine our probabilistic robustness bounds by specialising the analysis to a single payoff allocation, taking, also in this case, both a priori and a posteriori approaches. Finally, we consider a relaxed -core to include nearby allocations and also address the case of empty core. For this case, probabilistic statements are given on the eventual stability of allocations in the -core.
keywords:
Uncertain coalitional games; Statistical learning; Data privacy1 Introduction
Multi-agent systems are pervasive across various fields, including engineering (Raja and Grammatico, 2021; Karaca and Kamgarpour, 2020; Han et al., 2019; Fele et al., 2017, 2018), economics, and social sciences (McCain, 2008). Even though agents often behave as self-interested entities, their limited ability to improve their utility can, in certain scenarios, motivate them to form coalitions to achieve a higher individual payoff. This situation can be modelled through coalitional games (Chalkiadakis et al., 2011). As each agent’s participation in a coalition is subject to their own payoff being maximised, a question emerges about how to allocate the total value of the coalition such that no agent deviates from it. This problem is known as stability of agents’ allocations.
In real-world scenarios, the value of a coalition is typically affected by uncertainty. The consideration of uncertainty in the coalitional values of a game finds its roots in seminal works such as Charnes and Granot (1976, 1977, 1973), which provided solution concepts to yield stable allocations against uncertainty. Suijs et al. (1999) discussed non-emptiness of the core for a particular class of stochastic games. Chalkiadakis and Boutilier (2004) and Ieong and Shoham (2008) addressed uncertainty through Bayesian learning methods, while Li and Conitzer (2015) investigated different solution concepts that maximize the probability of obtaining stable allocations. Chalkiadakis and Boutilier (2008) and Guéneron and Bonnet (2021) explored the dynamics of repeated stochastic coalitional games.
The connection between probably approximately correct (PAC) learning and uncertain coalitional games has been explored in Balcan et al. (2015). The spirit of Procaccia and Rosenschein (2006) is similar, using Vapnik-Chervonenkis (VC) theory to learn the winning coalitions for the class of the so-called simple games. Balcan et al. (2015) focused on a complementary problem where only a randomized subset of coalitions is considered. Both these works evaluate the sample complexity from the VC theory perspective, and hence their results suffer from the associated conservativeness. Pantazis et al. (2022a) leverage the scenario approach (Campi and Garatti, 2021, 2018a, 2018b; Garatti and Campi, 2022) to provide distribution-free guarantees on the stability of allocations in a PAC manner, based on samples from the exogenous uncertainty affecting the value functions. On a parallel line of research, Pantazis et al. (2023) propose a data-driven Wasserstein-based distributionally robust approach for allocations’ stability.
Unlike the aforementioned works, we consider a more general setting where the uncertainty data is privately drawn by each agent. As such, information is heterogeneous across agents, and the samples are regarded as a private resource. When data samples are commonly shared among agents, the work of Pantazis et al. (2022a) provide guarantees on stability of allocations. In particular, the developments in Pantazis et al. (2022a) rely on the notion of scenario core – a data-driven approximation of the robust core based on the hypothesis that all uncertainty samples are shared among the agents (Raja and Grammatico, 2021). Given this common set of samples, the perception of a stable allocation based on the available data is identical across agents. This is no longer the case for private sampling, as different samples of the exogenous uncertainty can result in a different perception of stability by each agent. We adopt a PAC learning approach and leverage the concept of compression (Margellos et al., 2015), i.e., the set of samples essential for the reconstruction of the scenario core and whose cardinality affects the generalization properties of the provided guarantees. We give a priori and a posteriori certificates for the entire scenario core obtained by private sampling, and propose a distributed algorithm to calculate a compression set.
We then focus on the specific allocation returned by some algorithm (akin, e.g., to the distributed payoff algorithm by Raja and Grammatico (2021)). We then provide a priori coalition stability guarantees for this allocation (Theorem 3). We show how the dimension of the problem – in our case the number of agents – plays a key role. Less conservative probabilistic bounds can often be obtained by taking an a posteriori approach (Campi et al., 2018), i.e., based on observation of the realized uncertainty. In this case, the probabilistic bounds can be significantly improved and even a tighter a priori bound can be obtained (see Theorem 4). Finally, we consider a relaxed -core to include nearby allocations and also address the case of empty core. We leverage recents results by Campi and Garatti (2021) and study the probabilistic stability of allocations in the -core.
2 Scenario approach for stochastic coalitional games
We consider a coalitional game with agents identified by the set . We denote the number of possible subcoalitions, excluding the grand coalition, by , i.e., . The worth of a coalition is given by the so called value function; here we consider transferable utility problems, i.e., the value of coalitions is expressed by a real value which can be split among participating agents. We posit that the value of each coalition is subject to uncertainty. Then, the value of a coalition is a function that given an uncertainty realization returns the total payoff for the agents in . The value of the grand coalition is the deterministic quantity .
The uncertain coalitional game is then defined as the tuple , where is some unknown probability measure over . A vector , where is the payoff received by agent , is called an allocation. For a given uncertainty realization , an allocation is strictly rational for the members of if ; the latter implies that agents have an incentive to form this coalition for this particular uncertainty realization. An allocation is efficient if . Efficient allocations such that there are no incentives for agents to deviate the grand coalition are called stable. The set of all such allocations is the core of the game.
The notion of robust core is proposed to account for the presence of uncertainty; see, e.g., Pantazis et al. (2022a); Raja and Grammatico (2021); Nedić and Bauso (2013). We formally define it as . For any possible realization of the uncertainty , any allocation gives the agents no incentive to defect from the grand coalition and form sub-coalitions. Unfortunately, computing explicitly the robust core is hard, as we assume no knowledge on the uncertainty support (nor on the underlying probability distribution ). To circumvent this challenge, we adopt a data-driven methodology and approximate the robust core by drawing a finite number of independent and identically distributed (i.i.d.) samples , where denotes the -fold cartesian product of ; we refer to vectors as multi-samples. This constitutes the scenario game , whose core is the set , referred to as the scenario core.
We now take the notion of allocation stability a step further, considering the more general setting where every agent has only access to a private set of samples from . Let be the multi-sample privately drawn by agent . We say that an allocation of is stable with respect to if and , s.t. .
This immediately leads to the following extension of the scenario core:
Definition 2.1.
Let be some multi-sample drawn by agent . The scenario core with private sampling is given by
(1) |
Unless differently specified, we assume the following conditions hold throughout the paper:
Assumption 1
-
(i)
Each agent draws independent samples from the probability distribution . The samples drawn by any agent are independent from those drawn by other agents.
-
(ii)
is non-empty for any multi-sample .
In the following, we let , and . Also, for brevity, we will use to denote all which allow agent as a member.
On the basis of privately available data, we wish to provide guarantees on the probability that allocations will remain stable (i.e., within ) for any future, yet unseen, uncertainty realization. Capitalizing on Pantazis et al. (2022b); Fabiani et al. (2022); Pantazis et al. (2022a), we define two probabilistic notions of instability. In particular, the first refers to a particular allocation in the core, whereas the second involves the entire set .
Definition 2.2.
-
(i)
Let . For any , is the probability of allocation instability.
-
(ii)
Let . We call probability of core instability.
thus denotes the probability with respect to the realizations of that, for some with value function , at least one of the allocations in the scenario core will become unstable, i.e., will be dominated by the option of defecting from the grand coalition to form .
To bound the probability of core instability in a PAC fashion, we introduce two key concepts from statistical learning theory, namely the algorithm and the compression set (Margellos et al., 2015). The latter refers to the fact that only a subset of data from may be sufficient to produce the same scenario core. As we will see later this underpins the quality of the provided probabilistic stability guarantees.
Definition 2.3.
A mapping that takes as input a multi-sample and returns the scenario core of game is called an algorithm. With -probability one w.r.t. the choice of , a subset is a compression set for if .
Any compression set of least cardinality is called minimal compression set. Another important notion used in our derivations is the support rank (Schildbach et al., 2012).
Definition 2.4.
Consider the maximal unconstrained subspace of a constraint in the form , i.e., for all , where , with . Then, the support rank of this constraint is given by .
In words, the support rank of an uncertain constraint is equal to the dimension of the problem at hand minus the dimension of the maximal unconstrained space of the constraint.
3 PAC stability guarantees for the scenario core with private sampling
3.1 A posteriori collective stability guarantees
In the following theorem we bound with high confidence the probability that some allocation will become unstable, i.e., that the scenario core – computed on the basis of samples – will be reduced after a new uncertainty realization.
Theorem 1
Suppose that each agent independently draws a multi-sample and let . Fix a confidence parameter and choose such that . Consider an algorithm that takes as input and returns the scenario core . It holds:
(2) |
where is the cardinality of the subset of a compression relative to agent , quantified a posteriori, and satisfies
(3) |
Proof: First, note that due to Assumption 1-(i). Then, the following inequalities hold.
Given some new uncertainty realization , let designate the scenario core built from the multisample . The first equality stems from Definition 2.2, and expresses the fact that for to be a strict subset of , there must exist some allocation in which, due to , violates the rationality condition for some subcoalition , i.e., , causing the departure of some agent . The third inequality is obtained by applying the subadditivity property, while the second to last from applying Bonferroni’s inequality (Margellos et al., 2018; Falsone et al., 2020). Note now that can be found as the solution of the feasibility problem
(4) |
Let be the collection of (sub)sets of allocations satisfying the coalitional constraints obtained from data corresponding to agent . Considering the unique set , note that , i.e., satisfies the consistency assumption required by Campi et al. (2018, Th. 1). From this,
(5) |
from which (2) follows.
A compression (sub)set originated from agent’s uncertainty samples can be obtained by means of Algorithm 1; its cardinality can then be obtained as .
Note that differently from the compression algorithm in Pantazis et al. (2022a), Algorithm 1 is distributed among agents. At each iteration of Algorithm 1, a feasibility program is solved where coalitional rationality is enforced with equality for each coalition in which agent is allowed to participate. This allows to identify whether the sample that maximizes is critical for the construction of a stable allocation set for agent . The latter is verified if the problem is feasible, and the sample collected as part of the compression set . Note that unless coordination is imposed among agents, the compression set obtained through Algorithm 1 is non-minimal.
3.2 A priori collective stability guarantees
As a byproduct of our a posteriori analysis, an a priori bound can be obtained. Specifically one can obtain an a priori bound by considering the worst case value of over all possible combinations for which (where ). The following theorem provides then an a priori bound for the entire core with private samples.
Theorem 2
Fix and choose such that . It holds that
(6) |
where , with defined as in (3), is a quantity independent from the given multi-sample.
Proof: We apply the a posteriori result in Pantazis et al. (2022b, Th. 1) – which admits the number of facets of a randomized feasibility domain as an upper bound to the cardinality of the minimal compression set – to , where each facet is associated to some subcoalition in . Now, let denote the cardinality of the minimal (sub)compression set relative to agent , quantified a posteriori: by Pantazis et al. (2022b, Th.1) it holds that . Then, , for any such that . By definition of it follows
where the last inequality holds because of Theorem 1.
The results above can be applied to the entire set of allocations that are stable w.r.t. the observed data; because of their generality, these theoretical guarantees tend to be conservative. In what follows we specialise our analysis to a single allocation.
4 PAC stability of a single allocation with private sampling
Let be the unique allocation in which maximises some convex utility function .111Uniqueness of the maximiser is without loss of generality. If multiple maximisers are possible, a convex tie-break rule can be applied to single out a unique solution. Recalling the notion of support rank as per Definition 2.4, the following result holds for .
Theorem 3
Suppose that each agent draws a multi-sample and let . Fix , and choose such that . Then,
(7) |
where , with being the support rank of the coalitional rationality constraints corresponding to agent , i.e., relative to all such that .
Proof: Similarly to the proofline of Theorem 1, we have that
To obtain the last inequality consider the following optimization program
(8) |
We then group the constraints depending on which agent they correspond to. Let . With defined as above, from Schildbach et al. (2012) we have
which concludes the proof.
Note that the confidence parameter in Theorem 3 depends on the number of samples of agent and the violation level set individually by the agent to determine the probability of satisfaction of the rationality constraints. Most importantly, depends on the so called support rank of the coalitional constraints corresponding to , which is in any instance upper bounded by the dimension of the decision variable in (8). Theorem 3 is an a priori result and in certain cases can be conservative. The following result provides an improved probabilistic bound, based on the observation of the uncertainty realization. Let
(9) |
Note that (9) is compatible with (3), and allows to explicitly compute as a function of .
Theorem 4
Fix and choose such that . Set as in (9). Then, for the unique allocation it holds
(10) |
Recall that, as in Theorem 4, is the cardinality of the (sub)compression set associated with the rationality constraints relative to all , quantified a posteriori, e.g., by means of the procedure illustrated in Campi et al. (2018, §II).
Proof: For each agent , taking the conditional probability over all other agents’ multi-samples, Theorem 1 in Campi et al. (2018) allows to state
which, by integrating w.r.t. the probability of realization of all other agents’ scenarios, becomes
(11) |
The proof is concluded by applying relation (11) in the proofline of Theorem 3.
Finally, an a priori bound can be derived from Theorem 4 by considering the worst case among all a posteriori observable compressions ; this bound is nontrivial by noticing that , which follows from Calafiore and Campi (2006, Th. 3).
Corollary 1.
Let , with defined as in (9). Then
(12) |
4.1 The case of empty core
Lifting Assumption 1-(ii) on non-emptiness of the scenario core, we define a relaxed version of the latter, the so called -core for the case of private uncertainty sampling as follows:
Definition 4.1.
For any , fixed for all , the scenario -core is given by
(13) |
It is worth pointing out that the -core allows to extend the analysis to allocations “closest” to the core, and the interest in it may not be necessarily restricted to cases where the standard core is empty (this is recovered by setting for all ). Also, Definition 4.1 contemplates a different relaxation parameter for every agent, allowing to specialise the definition according to individual preferences, or possibly different information on, e.g., each agent’s sampling.
On these grounds, here we address the problem of i) providing an allocation with (relaxed) coalitional stability certificates and ii) measuring how agents’ multi-samples contribute to lack of coalitional stability. These questions can be answered at once by solving
(14a) | |||
(14b) | |||
(14c) |
In what follows, we consider without loss of generality that for any multi-sample , (14) returns a unique pair (this can be ensured by using a convex tie-break rule).
Assumption 2
For every allocation , for any .
Assumption 2 is related to non-degeneracy and is often satisfied when does not accumulate as is the case for continuous probability distributions. Now, for , consider the polynomial equation (Campi and Garatti, 2021, Th. 1)
(15) |
For , (15) has two solutions in . We denote the smallest as , and further let for . We can now propose the following a posteriori statement on :
Theorem 5
Proof: Let , where the -th component of is 1 if (and 0 otherwise), and . Then (14c) can be rewritten as
We note that in our setting Campi and Garatti (2021, Assum. 1) is satisfied; under Assumption 2, it is then possible to apply Campi and Garatti (2021, Th. 1), which yields
(17) |
where is derived from (15) as described above. By applying arguments similar to those used in proving Theorems 3 and 4, we obtain (16).
5 Conclusion
In this work we considered uncertain coalitional games and proposed a data-driven methodology to study the stability of allocations for the general setting where uncertainty is privately sampled by the agents. Future work will investigate stability of allocations through a distributionally robust framework, as well as analyse the case where the value of the grand coalition can also be affected by uncertainty.
F. Fele gratefully acknowledges support from grant RYC2021-033960-I funded by MCIN/AEI/ 10.13039/501100011033 and European Union NextGenerationEU/PRTR, as well as from grant PID2022-142946NA-I00 funded by MCIN/AEI/ 10.13039/501100011033 and by ERDF A way of making Europe.
References
- Balcan et al. (2015) Maria-Florina Balcan, Ariel D. Procaccia, and Yair Zick. Learning cooperative games. In IJCAI15: Proceedings of the 24th International Conference on Artificial Intelligence, 2015.
- Calafiore and Campi (2006) Giuseppe C. Calafiore and Marco C. Campi. The scenario approach to robust control design. IEEE Transactions on Automatic Control, 51(5):742–753, 2006. 10.1109/TAC.2006.875041.
- Campi and Garatti (2018a) Marco C. Campi and Simone Garatti. Introduction to the scenario approach. Society for Industrial and Applied Mathematics, 2018a.
- Campi and Garatti (2018b) Marco C. Campi and Simone Garatti. Wait-and-judge scenario optimization. Mathematical Programming, 167:155–189, 2018b.
- Campi and Garatti (2021) Marco C. Campi and Simone Garatti. A theory of the risk for optimization with relaxation and its application to support vector machines. Journal of Machine Learning Research, 22(288):1–38, 2021.
- Campi et al. (2018) Marco C. Campi, Simone Garatti, and Federico Alessandro Ramponi. A general scenario theory for nonconvex optimization and decision making. IEEE Transactions on Automatic Control, 63(12):4067–4078, 2018. 10.1109/TAC.2018.2808446.
- Chalkiadakis and Boutilier (2004) Georgios Chalkiadakis and Craig Boutilier. Bayesian reinforcement learning for coalition formation under uncertainty. In AAMAS 2004: Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems., pages 1090–1097, 2004.
- Chalkiadakis and Boutilier (2008) Georgios Chalkiadakis and Craig Boutilier. Sequential decision making in repeated coalition formation under uncertainty. In AAMAS08: Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, 2008.
- Chalkiadakis et al. (2011) Georgios Chalkiadakis, Edith Elkind, and Michael Wooldridge. Computational aspects of cooperative game theory. Synthesis Lectures on Artificial Intelligence and Machine Learning, 5(6):1–168, 2011.
- Charnes and Granot (1973) Abraham Charnes and Daniel Granot. Prior solutions: Extensions of convex nucleus solutions to chance-constrained games. In Proceedings of the Computer Science and Statistics Seventh Symposium at Iowa State University, pages 323–332, 1973.
- Charnes and Granot (1976) Abraham Charnes and Daniel Granot. Coalitional and chance-constrained solutions to n-person games I: The prior satisficing nucleolus. SIAM Journal on Applied Mathematics, 31(2):358–367, 1976.
- Charnes and Granot (1977) Abraham Charnes and Daniel Granot. Coalitional and chance-constrained solutions to n-person games II: Two-stage solutions. Operations Research, 25(6):1013–1019, 1977.
- Fabiani et al. (2022) Filippo Fabiani, Kostas Margellos, and Paul J. Goulart. Probabilistic feasibility guarantees for solution sets to uncertain variational inequalities. Automatica, 137:110120, 2022. ISSN 0005-1098.
- Falsone et al. (2020) Alessandro Falsone, Kostas Margellos, Maria Prandini, and Simone Garatti. A scenario-based approach to multi-agent optimization with distributed information. IFAC-PapersOnLine, 53(2):20–25, 2020. ISSN 2405-8963. 10.1016/j.ifacol.2020.12.034. 21st IFAC World Congress.
- Fele et al. (2017) Filiberto Fele, José M. Maestre, and Eduardo F. Camacho. Coalitional control: Cooperative game theory and control. IEEE Control Systems Magazine, 37(1):53–69, 2017.
- Fele et al. (2018) Filiberto Fele, Ezequiel Debada, José M. Maestre, and Eduardo F. Camacho. Coalitional control for self-organizing agents. IEEE Transactions on Automatic Control, 63(9):2883–2897, 2018. 10.1109/TAC.2018.2792301.
- Garatti and Campi (2022) Simone Garatti and Marco C. Campi. Risk and complexity in scenario optimization. Mathematical Programming, 191:243–279, 2022.
- Guéneron and Bonnet (2021) Josselin Guéneron and Grégory Bonnet. Are exploration-based strategies of interest for repeated stochastic coalitional games? Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. The PAAMS Collection, pages 89–100, 2021.
- Han et al. (2019) Liyang Han, Thomas Morstyn, and Malcolm McCulloch. Incentivizing prosumer coalitions with energy management using cooperative game theory. IEEE Transactions on Power Systems, 34(1):303–313, 2019.
- Ieong and Shoham (2008) Samuel Ieong and Yoav Shoham. Bayesian coalitional games. In Proceedings of the 23rd AAAI Conference on Artificial Intelligence, pages 95–100, 2008.
- Karaca and Kamgarpour (2020) Orcun Karaca and Maryam Kamgarpour. Core-selecting mechanisms in electricity markets. IEEE Transactions on Smart Grid, 11(3):2604–2614, 2020. 10.1109/TSG.2019.2958710.
- Li and Conitzer (2015) Yuqian Li and Vincent Conitzer. Cooperative game solution concepts that maximize stability under noise. In Proceedings of the 29th AAAI Conference on Artificial Intelligence, page 979–985, 2015.
- Margellos et al. (2015) Kostas Margellos, Maria Prandini, and John Lygeros. On the connection between compression learning and scenario-based single-stage and cascading optimization problems. IEEE Transactions on Automatic Control, 60(10):2716–2721, 2015.
- Margellos et al. (2018) Kostas Margellos, Alessandro Falsone, Simone Garatti, and Maria Prandini. Distributed constrained optimization and consensus in uncertain networks via proximal minimization. IEEE Transactions on Automatic Control, 63(5):1372–1387, 2018. 10.1109/TAC.2017.2747505.
- McCain (2008) Roger A. McCain. Cooperative games and cooperative organizations. The Journal of Socio-Economics, 37(6):2155–2167, 2008. ISSN 1053-5357.
- Nedić and Bauso (2013) Angelia Nedić and Dario Bauso. Dynamic coalitional tu games: Distributed bargaining among players’ neighbors. IEEE Transactions on Automatic Control, 58(6):1363–1376, 2013. 10.1109/TAC.2012.2236716.
- Pantazis et al. (2022a) George Pantazis, Filippo Fabiani, Filiberto Fele, and Kostas Margellos. Probabilistically robust stabilizing allocations in uncertain coalitional games. IEEE Control Systems Letters, 6:3128–3133, 2022a. 10.1109/LCSYS.2022.3182152.
- Pantazis et al. (2022b) George Pantazis, Filiberto Fele, and Kostas Margellos. On the probabilistic feasibility of solutions in multi-agent optimization problems under uncertainty. European Journal of Control, 63:186–195, 2022b. ISSN 0947-3580.
- Pantazis et al. (2023) George Pantazis, Barbara Franci, Sergio Grammatico, and Kostas Margellos. Distributionally robust stability of payoff allocations in stochastic coalitional games. Accepted for publication at the IEEE Conference on Decision and Control 2023, 2023.
- Procaccia and Rosenschein (2006) Ariel D. Procaccia and Jeffrey S. Rosenschein. Learning to identify winning coalitions in the PAC model. In AAMAS06 - 5th International Joint Conference on Autonomous Agents and Multi-agent Systems, page 673–675. Association for Computing Machinery, NY, US, 2006. 10.1145/1160633.1160751.
- Raja and Grammatico (2021) Aitazaz Ali Raja and Sergio Grammatico. Payoff distribution in robust coalitional games on time-varying networks. IEEE Transactions on Control of Network Systems, 2021.
- Schildbach et al. (2012) Georg Schildbach, Lorenzo Fagiano, and Manfred Morari. Randomized solutions to convex programs with multiple chance constraints. SIAM J. Optim., 23:2479–2501, 2012.
- Suijs et al. (1999) Jeroen Suijs, Peter Borm, Anja De Waegenaere, and Stef Tijs. Cooperative games with stochastic payoffs. European Journal of Operational Research, 113(1):193–205, 1999. ISSN 0377-2217. 10.1016/S0377-2217(97)00421-9.