Social Welfare Maximization in cross-silo Federated Learning
Abstract
As one of the typical settings of Federated Learning (FL), cross-silo FL allows organizations to jointly train an optimal Machine Learning (ML) model. In this case, some organizations may try to obtain the global model without contributing their local training, lowering the social welfare. In this paper, we model the interactions among organizations in cross-silo FL as a public goods game for the first time and theoretically prove that there exists a social dilemma where the maximum social welfare is not achieved in Nash equilibrium. To overcome this social dilemma, we employ the Multi-player Multi-action Zero-Determinant (MMZD) strategy to maximize the social welfare. With the help of the MMZD, an individual organization can unilaterally control the social welfare without extra cost. Experimental results validate that the MMZD strategy is effective in maximizing the social welfare.
Index Terms:
Federated learning, Public goods game, Zero-determinant strategy, Social welfare, Game theoryI Introduction
In Federated Learning (FL), clients cooperatively train a Machine Learning (ML) model with their decentralized datasets under the coordination of a central server[1]. One of the typical settings of FL is cross-silo FL[2] where a neutral third-party agent acts as the central server and clients are a group of organizations, aiming to jointly train an optimal ML model for their respective use. In this case, these organizations are also the owners of the global model and can utilize the well-trained global model to further process tasks for their own interests. 111This work is partly supported by the US NSF under grant CNS-2105004. 22footnotetext: Corresponding author. An optimal global model with high performance requires the organizations in cross-silo FL to collaborate efficiently so as to bring considerable benefits to all participants, which can be regarded as the maximization of the social welfare. In fact, there are many studies on optimizing the social welfare in cross-silo FL to directly improving the model performance[3, 4, 5, 6], increasing the convergence speed[7], reducing the communication cost[8], protecting privacy[9, 10, 11] and security[12], etc.
However, since every organization participating in cross-silo FL can obtain the final global model regardless of its contribution, the well-trained model becomes a public good, which is non-excludable and non-rivalrous for all organizations[13]. This leads to the emergence of selfish behaviors that some organizations may only consider their own interests via inactively participating in local training while obtaining the final global model for free or at a lower cost. The spread of this behavior can result in huge loss of the social welfare, and then none of the organizations can get the optimal model, which compromises the long-term stability and sustainability of cross-silo FL.
The existing studies motivate organizations to fully contribute to cross-silo FL by designing an incentive mechanism[13]. However, this requires extra negotiation costs since organizations need to reach a consensus on the mechanism in advance, and demands additional running costs where a distributed algorithm runs over all organizations, clearly adding more burden to organizations.
In this paper, we take a brand-new approach using the Multi-player Multi-action Zero-Determinant (MMZD) strategy[14] to maximize the social welfare in cross-silo FL without causing additional costs for organizations. Another outstanding advantage of our method is that it can control the social welfare and be applied to any cross-silo FL scenario no matter what strategies or actions organizations perform. In summary, our contributions include:
1) We model the interactions among organizations in cross-silo FL as a public goods game for the first time, focusing on the organization’s strategy rather than designing an extra mechanism to solve the social welfare maximization problem.
2) We reveal the existence of the social dilemma in cross-silo FL by mathematical proof for the first time, which demonstrates the adverse effect of selfish behaviors in cross-silo FL from the perspective of game theory. This can be used as a theoretical basis for exploring organizations’ behaviors in cross-silo FL.
3) We overcome the social dilemma by employing the MMZD strategy from view of an individual organization. Specifically, any organization can unilaterally maximize the social welfare, which ensures the social welfare in cross-silo FL at a certain level and maintains the stability of the system. This approach can also extend the application domains of the MMZD.
4) Experiments prove the effectiveness of the MMZD strategy in maximizing the social welfare.
II System Model
We consider a cross-silo FL scenario with a set of organizations, denoted as . All organizations rely on a central server to collaboratively conduct global model training for a specific task, where each of them has their own data for local training. The goal of organizations is to obtain an optimal global model, minimizing the loss with all datasets. The central server collects the results of local model updates from all organizations, aggregates to obtain the global model, and then distributes it to everyone for the next round of training.
In each round of local training, every organization performs iterations of model training. We denote the number of global aggregations as . For the current task, the strategy of organization , denoted as , represents the number of global aggregation rounds it participates in the task. Then, denotes the strategy vector of all organizations. Here we assume that all organizations in this cross-silo FL may participate in fewer global aggregations due to laziness or selfishness, but they do not carry out malicious attacks, such as model poisoning attack.
According to the cross-silo FL model, all organizations get the same model in return. Inspired by [13], we define the revenue of organization as
(1) |
where denotes the unit revenue of organization by using the returned final model, denotes the precision of the untrained model, and denotes the precision of the trained global model with corresponding strategy vector . can be modeled as , with positive coefficients and [15] being derived based on the loss function, neural network, and local datasets. In particular, we have . The revenue of each organization is proportional to the difference between the expected loss after rounds aggregation (i.e.,) and the minimum expected loss (i.e.,)[13]. As the number of total participation rounds increases, the marginal decrease of the difference reduces.
We define the cost of organization as . The cost is composed of the organization’s computation cost and its communication cost . On the one hand, the computation cost , where is a positive parameter, denoting the computation cost of each iteration in organization ’s local training111As [16] shows, , where is the effective capacitance coefficient of organization ’s computing chipset, denotes the calculation processing capacity, denotes the number of data units, and denotes the number of CPU cycles required by organization to process one data unit.. On the other hand, the communication cost is defined as a parameter since we assume that every organization uploads its updates in each global aggregation round. If it chooses not to participate in global aggregation, it will submit a zero vector as updates.
Then the utility of organization is defined as the difference between its revenue and cost:
(2) |
According to previous statements, we model the interactions among organizations as a cross-silo FL game.
Definition 1.
(Cross-silo FL game). In the cross-silo FL game, organizations participating in cross-silo FL act as players, where their actions and utilities are and , respectively.
The cross-silo FL game can be iterative since these organizations in cross-silo FL usually cooperate for a long time to finish multiple FL tasks. Each game round in the cross-silo FL game corresponds to a certain FL task. Moreover, the social welfare in cross-silo FL game can be denoted as the total utility of all organizations , namely . In the cross-silo FL game, we find that the social dilemma occurs if , which can be summarized as below.
Theorem 1.
(Social dilemma). If , there exists a social dilemma in the cross-silo FL game.
Proof.
The social dilemma forms when the Nash equilibrium is not the point of maximum social welfare. First, we study the Nash equilibrium of the cross-silo FL game. Referring to (2), we can derive the derivative of as . Given , we have , which leads to . Thus, the derivative of is negative, and the utility function decreases monotonically with . The Nash equilibrium strategy of each organization is , so the Nash equilibrium point is . Noted that, there is a natural and necessary premise of FL that the utility of the well-trained model must be positive. Thus, we prove that the point with the social welfare higher than that in the Nash equilibrium point . So the social dilemma exists if . ∎
The condition in the above theorem indicates that if any organization only trains the local model using its local dataset, the utility is negative. In fact, this condition strengthens organizations’ motivation to participate in global training in cross-silo FL game.
III Social welfare maximization
According to the analysis above, we can see that the underlying cause of the social dilemma is selfishness, leading to the loss of all organizations, namely the low social welfare. Aiming to solve this problem, we resort to the Multi-player Multi-action Zero-Determinant (MMZD) strategy for the social welfare maximization in this section. In each game round, any organization can choose the action , so there are possible outcomes for each game round. We assume that the organizations have one-round-memory since a long-memory player has no priority against others with short memory[14]. For arbitrary organization , its mixed strategy is defined as:
(3) |
where represents the probability of organization choosing action in the current game round and other organizations’ choosing the same actions as -th outcome of the previous game round. In addition, the corresponding utility vector is denoted as:
(4) |
where each utility of organization choosing action in the -th outcome can be calculated by , with denoting the action vector corresponding to the -th outcome but .
In the cross-silo FL model, an organization’s current move depends only on its last action and the strategy vector in the last game round. We can construct a Markov matrix , with each element denoting the one-step transition probability from state to .
As demonstrated in[14], a certain column of is only controlled by organization ’s strategy. We assume the stationary vector of is , then organization ’s expected utility in the stationary state is:
which makes a linear combination of all organizations’ expected utilities yielding the following equation:
(5) |
In the above equation, and are constants. Thus, when organization chooses a strategy satisfying , where is a non-zero constant and is under the control of organization , the corresponding column of and the last column of will be proportional. (5) becomes:
(6) |
We further study the social welfare maximization problem with the help of the MMZD in this circumstance. Take organization performing the MMZD strategy as an example. According to (6), by setting , the social welfare can be calculated as . Thus, the issue of maximizing the social welfare is equivalent to the following optimization problem:
We denote as the th element in , then we can solve the above optimization problem by considering the following two cases:
1) . To meet the constraint , we can get the lower bound of as follows:
To meet the constraint , we can get the upper bound of as follows:
Only if , can have a feasible solution, which is equivalent to . If there exists satisfying the above constraint, we can obtain the minimum value of as follow:
(7) |
2) . Similarly, when , we have ; considering , we have . In addition, is feasible only when , i.e., . Finally, we can get the following result:
(8) |
In summary, by (7) and (8), organization can unilaterally set the expected social welfare with the MMZD strategy meeting , with each element of calculated by:
where denotes the -th element in .
IV Experiments
In this section, we evaluate the performance of the MMZD strategy to maximize the social welfare based on simulation experiments. All experiments are implemented using Matlab R2021a on a laptop with 2.3 GHz Intel Core i5-8300H processor. We set since the number of organizations in cross-silo FL is usually small. We consider that , . and are derived from the simulation dataset. In addition, we set . For every control group with different strategy settings, we repeat the experiment for 100 times.




In order to verify the effectiveness of the MMZD strategy on maximizing the social welfare, we compare it with other five classical strategies, by simulating the entire cross-silo FL process for 20 rounds of game. The numerical results of maximum social welfare denote the sum of all organizations’ utilities. In Fig. 1, organization 1 adopts MMZD, all-defection (ALLD)[17], all-cooperation (ALLC)[17], and random (Rand) strategies[17]. Other organizations use ALLD, ALLC, Rand, Tit-For-Tat (TFT)[18], and mixed (Mixed) strategies[17]. Specifically, ALLD strategy is defined as: the organization does not perform local training at all, only submits a zero vector in global aggregation. While ALLC strategy means that the organization participates in all global aggregation with their local updates in every game round. Organizations which adopt Rand strategy randomly participate in global aggregation from to rounds with the probability of . TFT strategy is defined as the organizations randomly choose the number of participating global aggregation from to when the sum of global aggregation rounds in last game round is less than , otherwise they randomly choose the number of participating global aggregation from to . We define the mixed strategy as adopting a specific strategy chosen from ALLC, ALLD, Rand, and TFT. By comparing Fig. 1(a) with the other three figures, we can find that the MMZD strategy can effectively control the social welfare. This can prove that free-riding behavior is avoided in some ways.

Fig. 2 plots the expected social welfare changes in each game round, as the organization 1 adopts the MMZD strategy and other organizations employ different strategies. Fig. 1(a) displays the final result in Fig. 2, which indicates that no matter what kind of strategies other organizations adopt, the social welfare finally converges to a fixed value, verifying the power of the proposed social welfare maximization game.
V conclusion
In this paper, we define the cross-silo FL game among organizations as a public goods game, revealing the social dilemma in cross-silo FL theoretically. In order to overcome the social dilemma, we propose a brand-new method using the MMZD to solve the social welfare maximization problem. By the means of the MMZD, an individual organization can unilaterally control social welfare at a certain level, regardless of other organizations’ strategies. Moreover, our approach can maintain the stability and sustainability of the system without extra cost. Simulation results prove that the MMZD strategy can efficiently and effectively control social welfare, which reduces the loss from selfish behaviors.
References
- [1] Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Artificial intelligence and statistics. PMLR, 2017, pp. 1273–1282.
- [2] Peter Kairouz, H Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, et al., “Advances and open problems in federated learning,” arXiv preprint arXiv:1912.04977, 2019.
- [3] Yutao Huang, Lingyang Chu, Zirui Zhou, Lanjun Wang, Jiangchuan Liu, Jian Pei, and Yong Zhang, “Personalized cross-silo federated learning on non-iid data,” in Proceedings of the AAAI Conference on Artificial Intelligence, 2021, vol. 35, pp. 7865–7873.
- [4] Yansheng Wang, Yongxin Tong, Dingyuan Shi, and Ke Xu, “An efficient approach for cross-silo federated learning to rank,” in 2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 2021, pp. 1128–1139.
- [5] Kishore Nandury, Anand Mohan, and Frederick Weber, “Cross-silo federated training in the cloud with diversity scaling and semi-supervised learning,” in ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 3085–3089.
- [6] Umer Majeed, Latif U Khan, and Choong Seon Hong, “Cross-silo horizontal federated learning for flow-based time-related-features oriented traffic classification,” in 2020 21st Asia-Pacific Network Operations and Management Symposium (APNOMS). IEEE, 2020, pp. 389–392.
- [7] Othmane Marfoq, Chuan Xu, Giovanni Neglia, and Richard Vidal, “Throughput-optimal topology design for cross-silo federated learning,” arXiv preprint arXiv:2010.12229, 2020.
- [8] Chengliang Zhang, Suyi Li, Junzhe Xia, Wei Wang, Feng Yan, and Yang Liu, “Batchcrypt: Efficient homomorphic encryption for cross-silo federated learning,” in 2020 USENIX Annual Technical Conference (USENIXATC 20), 2020, pp. 493–506.
- [9] Mikko A Heikkilä, Antti Koskela, Kana Shimizu, Samuel Kaski, and Antti Honkela, “Differentially private cross-silo federated learning,” arXiv preprint arXiv:2007.05553, 2020.
- [10] Qinbin Li, Bingsheng He, and Dawn Song, “Practical one-shot federated learning for cross-silo setting,” in Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21), 2021.
- [11] Guodong Long, Tao Shen, Yue Tan, Leah Gerrard, Allison Clarke, and Jing Jiang, “Federated learning for privacy-preserving open innovation future on digital health,” arXiv preprint arXiv:2108.10761, 2021.
- [12] Zhifeng Jiang, Wei Wang, and Yang Liu, “Flashe: Additively symmetric homomorphic encryption for cross-silo federated learning,” arXiv preprint arXiv:2109.00675, 2021.
- [13] Ming Tang and Vincent WS Wong, “An incentive mechanism for cross-silo federated learning: A public goods perspective,” in IEEE INFOCOM 2021-IEEE Conference on Computer Communications. IEEE, 2021, pp. 1–10.
- [14] Xiaofan He, Huaiyu Dai, Peng Ning, and Rudra Dutta, “Zero-determinant strategies for multi-player multi-action iterated games,” IEEE Signal Processing Letters, vol. 23, no. 3, pp. 311–315, 2016.
- [15] Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, and Zhihua Zhang, “On the convergence of fedavg on non-iid data,” arXiv preprint arXiv:1907.02189, 2019.
- [16] Nguyen H Tran, Wei Bao, Albert Zomaya, Minh NH Nguyen, and Choong Seon Hong, “Federated learning over wireless networks: Optimization model design and analysis,” in IEEE INFOCOM 2019-IEEE Conference on Computer Communications. IEEE, 2019, pp. 1387–1395.
- [17] Qin Hu, Shengling Wang, Peizi Ma, Xiuzhen Cheng, Weifeng Lv, and Rongfang Bie, “Quality control in crowdsourcing using sequential zero-determinant strategies,” IEEE Transactions on Knowledge and Data Engineering, vol. 32, no. 5, pp. 998–1009, 2019.
- [18] Martin Nowak and Karl Sigmund, “A strategy of win-stay, lose-shift that outperforms tit-for-tat in the prisoner’s dilemma game,” Nature, vol. 364, no. 6432, pp. 56–58, 1993.