This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Maximally Satisfying Lower Quotas
in the Hospitals/Residents Problem with Tiesthanks: This work was partially supported by the joint project of Kyoto University and Toyota Motor Corporation, titled “Advanced Mathematical Science for Mobility Society”.

Hiromichi Goko Frontier Research Center, Toyota Motor Corporation, Aichi 471-8572, Japan, E-mail: hiromichi_goko@mail.toyota.co.jp    Kazuhisa Makino Research Institute for Mathematical Sciences, Kyoto University, Kyoto 606-8502, Japan., E-mail: makino@kurims.kyoto-u.ac.jp, Supported by JSPS KAKENHI Grant Numbers JP20H05967, JP19K22841, and JP20H00609.    Shuichi Miyazaki Academic Center for Computing and Media Studies, Kyoto University, Kyoto 606-8501, Japan, E-mail: shuichi@media.kyoto-u.ac.jp, Supported by JSPS KAKENHI Grant Numbers JP20K11677 and JP16H02782.    Yu Yokoi Principles of Informatics Research Division, National Institute of Informatics, Tokyo 101-8430, Japan, E-mail: yokoi@nii.ac.jp, Supported by JSPS KAKENHI Grant Number JP18K18004 and JST PRESTO Grant Number JPMJPR212B.
Abstract

Motivated by the serious problem that hospitals in rural areas suffer from a shortage of residents, we study the Hospitals/Residents model in which hospitals are associated with lower quotas and the objective is to satisfy them as much as possible. When preference lists are strict, the number of residents assigned to each hospital is the same in any stable matching because of the well-known rural hospitals theorem; thus there is no room for algorithmic interventions. However, when ties are introduced to preference lists, this will no longer apply because the number of residents may vary over stable matchings.

In this paper, we formulate an optimization problem to find a stable matching with the maximum total satisfaction ratio for lower quotas. We first investigate how the total satisfaction ratio varies over choices of stable matchings in four natural scenarios and provide the exact values of these maximum gaps. Subsequently, we propose a strategy-proof approximation algorithm for our problem; in one scenario it solves the problem optimally, and in the other three scenarios, which are NP-hard, it yields a better approximation factor than that of a naive tie-breaking method. Finally, we show inapproximability results for the above-mentioned three NP-hard scenarios.

1 Introduction

The Hospitals/Residents model (HR), a many-to-one matching model, has been extensively studied since the seminal work of Gale and Shapley [13]. Its input consists of a set of residents and a set of hospitals. Each resident has a preference over hospitals; similarly, each hospital has a preference over residents. In addition, each hospital is associated with a positive integer called the upper quota, which specifies the maximum number of residents it can accept. In this model, stability is the central solution concept, which requires the nonexistence of a blocking pair, i.e., a resident–hospital pair that has an incentive to deviate jointly from the current matching. In the basic model, each agent (resident or hospital) is assumed to have a strict preference for possible partners. For this model, the resident-oriented Gale–Shapley algorithm (also known as the deferred acceptance mechanism) is known to find a stable matching. This algorithm has advantages from both computational and strategic viewpoints: it runs in linear time and is strategy-proof for residents.

In reality, people typically have indifference among possible partners. Accordingly, a stable matching model that allows ties in preference lists, denoted by HRT in the context of HR, was introduced [21]. For such a model, several definitions of stability are possible. Among them, weak stability provides a natural concept, in which agents have no incentive to move within the ties. It is known that if we break the ties of an instance II arbitrarily, any stable matching of the resultant instance is a weakly stable matching of II. Hence, the Gale–Shapley algorithm can still be used to obtain a weakly stable matching. In applications, typically, ties are broken randomly, or participants are forced to report strict preferences even if their true preferences have ties. Hereafter, “stability” in the presence of ties refers to “weak stability,” unless stated otherwise.

It is commonly known that HR plays an important role not only in theory but also in practice; for example, in assigning students to high schools [1, 2] and residents to hospitals [31]. In such applications, “imbalance” is one of the major problems. For example, hospitals in urban areas are generally more popular than those in rural areas; hence it is likely that the former are well-staffed whereas the latter suffer from a shortage of doctors. One possible solution to this problem is to introduce a lower quota of each hospital, which specifies the minimum number of residents required by a hospital, and obtain a stable matching that satisfies both the upper and lower quotas. However, such a matching may not exist in general [17, 29], and determining if such a stable matching exists in HRT is known to be NP-complete (which is an immediate consequence from page 276 of [30]).

In general, it is too pessimistic to assume that a shortage of residents would force hospitals to go out of operation. In some cases, the hospital simply has to reduce its service level according to how much its lower quota is satisfied. In this scenario, a hospital will wish to satisfy the lower quota as much as possible, if not completely. To formulate this situation, we introduce the following optimization problem, which we call HRT to Maximally Satisfy Lower Quotas (HRT-MSLQ). Specifically, let RR and HH be the sets of residents and hospitals, respectively. All members in RR and HH have complete preference lists that may contain ties. Each hospital hh has an upper quota u(h)u(h), the maximum number of residents it can accept. The stability of a matching is defined with respect to these preference lists and upper quotas, as in conventional HRT. In addition, each hospital hh is associated with a lower quota (h)\ell(h), which specifies the minimum number of residents required to keep its service level. We assume that (h)u(h)|R|\ell(h)\leq u(h)\leq|R| for each hHh\in H. For a stable matching MM, let M(h)M(h) be the set of residents assigned to hh. The satisfaction ratio, sM(h)s_{M}(h), of hospital hHh\in H (with respect to (h)\ell(h)) is defined as sM(h)=min{1,|M(h)|(h)}s_{M}(h)=\min\left\{1,\frac{|M(h)|}{\ell(h)}\right\}. Here, we let sM(h)=1s_{M}(h)=1 if (h)=0\ell(h)=0, because the lower quota is automatically satisfied in this case. The satisfaction ratio reflects a situation in which hospital hh’s service level increases linearly with respect to the number of residents up to (h)\ell(h) but does not increase after that, even though hh is still willing to accept u(h)(h)u(h)-\ell(h) more residents. These u(h)(h)u(h)-\ell(h) positions may be considered as “marginal seats,” which do not affect the service level but provide hospitals with advantages, such as generous work shifts. Our HRT-MSLQ problem asks us to maximize the total satisfaction ratio over the family \mathcal{M} of all stable matchings in the problem instance, i.e.,

maxMhHsM(h).\max_{M\in\mathcal{M}}\sum_{h\in H}s_{M}(h).

The following are some remarks on our problem: (1) To our best knowledge, almost all previous works on lower quotas have investigated cases with no ties and have assumed lower quotas to be hard constraints. Refer to the discussion at the end of this section. (2) Our assumption that all preference lists are complete is theoretically a fundamental scenario used to study the satisfaction ratio for lower quotas. Moreover, there exist several cases in which this assumption is valid [5, 15]. For example, according to Goto et al. [15], a complete list assumption is common in student–laboratory assignment in engineering departments of Japanese universities because it is mandatory that every student be assigned. (3) If preference lists contain no ties, the satisfaction ratio sM(h)s_{M}(h) is identical for any stable matching MM because of the rural hospitals theorem [14, 31, 32]. Hence, there is no chance for algorithms to come into play if the stability is not relaxed. In our setting (i.e., with ties), the rural hospitals theorem implies that our task is essentially to find an optimal tie-breaking. However, it is unclear how to find such a tie-breaking. (4) Alternative objective functions may be considered to reflect our objective of satisfying the lower quotas. In Appendix E, we introduce three such natural objective functions and briefly discuss their behaviors.

Our Contributions.  First, we study the goodness of any stable matching in terms of the total satisfaction ratios. For a problem instance II, let OPT(I){\rm OPT}(I) and WST(I){\rm WST}(I), respectively, denote the maximum and minimum total satisfaction ratios of the stable matchings of II. For a family of problem instances {\cal I}, let Λ()=maxIOPT(I)WST(I)\Lambda({\cal I})=\max_{I\in{\cal I}}\frac{{\rm OPT}(I)}{{\rm WST}(I)} denote the maximum gap of the total satisfaction ratios. In this paper, we consider the following four fundamental scenarios of {\cal I}: (i) general model, which consists of all problem instances, (ii) uniform model, in which all hospitals have the same upper and lower quotas, (iii) marriage model, in which each hospital has an upper quota of 11 and a lower quota of either 0 or 11, and (iv) R-side ML model, in which all residents have identical preference lists. The exact values of Λ()\Lambda({\cal I}) for all such fundamental scenarios are listed in the first row of Table 1, where n=|R|n=|R|. In the uniform model, we write θ=u(h)(h)\theta=\frac{u(h)}{\ell(h)} for the ratio of the upper and lower quotas, which is common to all hospitals. Further detailed analyses can be found in Table 1 of Appendix C.

Subsequently, we consider our problem algorithmically. Note that the aforementioned maximum gap corresponds to the worst-case approximation factor of the arbitrarily tie-breaking Gale–Shapley algorithm, which is frequently used in practice; this algorithm first breaks ties in the preference lists of agents arbitrarily and then applies the Gale–Shapley algorithm on the resulting preference lists. This correspondence easily follows from the rural hospitals theorem, as explained in Proposition 20 in Appendix C.

In this paper, we show that there are two types of difficulties inherent in our problem HRT-MSLQ for all scenarios except (iv). Even for scenarios (i)–(iii), we show that (1) the problem is NP-hard and that (2) there is no algorithm that is strategy-proof for residents and always returns an optimal solution; see Section 6 and Appendix A.1.

We then consider strategy-proof approximation algorithms. We propose a strategy-proof algorithm Double Proposal, which is applicable in all above possible scenarios, whose approximation factor is substantially better than that of the arbitrary tie-breaking method. The approximation factors are listed in the second row of Table 1, where ϕ\phi is a function defined by ϕ(1)=1\phi(1)=1, ϕ(2)=32\phi(2)=\frac{3}{2}, and ϕ(n)=n(1+n2)/(n+n2)\phi(n)=n(1+\lfloor\frac{n}{2}\rfloor)/(n+\lfloor\frac{n}{2}\rfloor) for any n3n\geq 3. Note that θ2+θ12θ1<θ\frac{\theta^{2}+\theta-1}{2\theta-1}<\theta holds whenever θ>1\theta>1.

[htbp] General  Uniform   Marriage  RR-side ML Maximum gap Λ()\Lambda({\cal I}) (i.e., Approx. factor of arbitrary tie-breaking GS) n+1n+1 θ\theta 22 n+1n+1 Approx. factor of Double Proposal  ϕ(n)(n+23)\phi(n)~{}(\sim\frac{n+2}{3}) θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} 1.51.5 11 Inapproximability n14ϵn^{\frac{1}{4}-\epsilon}* 3θ+42θ+4ϵ\frac{3\theta+4}{2\theta+4}-\epsilon\,\dagger 98ϵ\frac{9}{8}-\epsilon\dagger

  • *

    Under P \not= NP

  • \dagger

    Under the Unique Games Conjecture

Maximum gap Λ()\Lambda({\cal I}), approximation factor of Double Proposal, and inapproximability of HRT-MSLQ for four fundamental scenarios {\cal I}.

Techniques.  Our algorithm Double Proposal is based on the resident-oriented Gale–Shapley algorithm and is inspired by previous research on approximation algorithms [26, 18] for another NP-hard problem called MAX-SMTI. Unlike in the conventional Gale–Shapley algorithm, our algorithm allows each resident rr to make proposals twice to each hospital. Among the hospitals in the top tie of the current preference list, rr prefers hospitals to which rr has not yet proposed to those which rr has already proposed to once. When a hospital hh receives a new proposal from rr, hospital hh may accept or reject it, and in the former case, hh may reject a currently assigned resident to accommodate rr. In contrast to the conventional Gale–Shapley algorithm, a rejection may occur even if hh is not full. If at least (h)\ell(h) residents are currently assigned to hh and at least one of them has not been rejected by hh so far, then hh rejects such a resident, regardless of its preference. This process can be considered as the algorithm dynamically finding a tie-breaking in rr’s preference list.

The main difficulty in our problem originates from the complicated form of our objective function s(M)=hHmin{1,|M(h)|(h)}s(M)=\sum_{h\in H}\min\{1,\frac{|M(h)|}{\ell(h)}\}. In particular, non-linearity of s(M)s(M) makes the analysis of the approximation factor of Double Proposal considerably hard. We therefore introduce some new ideas and techniques to analyze the maximum gap Λ\Lambda and approximation factor of our algorithm, which is one of the main novelties of this paper.

To estimate the approximation factor of the algorithm, we need to compare objective values of a stable matching MM output by the algorithm and an (unknown) optimal stable matching NN. A typical technique used to compare two matchings is to consider a graph of their union. In the marriage model, the connected components of the union are paths and cycles, both of which are easy to analyze; however, this is not the case in a general many-to-one matching model. For some problems, this approach still works via “cloning,” which transforms an instance of HR into that of the marriage model by replacing each hospital hh with an upper quota of u(h)u(h) by u(h)u(h) hospitals with an upper quota of 11. Unfortunately, however, in HRT-MSLQ there seems to be no simple way to transform the general model into the marriage model because of the non-linearity of the objective function.

In our analysis of the uniform model, the union graph of MM and NN may have a complex structure. We categorize hospitals using a procedure like breadth-first search starting from the set of hospitals hh with the satisfaction ratio sN(h)s_{N}(h) larger than sM(h)s_{M}(h), which allows us to provide a tight bound on the approximation factor. For the general model, instead of using the union graph, we define two vectors that distribute the values s(M)s(M) and s(N)s(N) to the residents. By making use of the local optimality of MM proven in Section 3, we compare such two vectors and give a tight bound on the approximation factor.

We finally remark that the improvement of Double Proposal over the maximum gap shows that our problem exhibits a different phenomenon from that of MAX-SMTI because the approximation factor of MAX-SMTI cannot be improved from a naive tie-breaking method if strategy-proofness is imposed [18].

Related Work.  Recently, the Hospitals/Residents problems with lower quotas are quite popular in the literature; however, most of these studies are on settings without ties. The problems related to HRT-MSLQ can be classified into three models. The model by Hamada et al. [17], denoted by HR-LQ-2 in [29], is the closest to ours. The input of this model is the same as ours, but the hard and soft constraints are different from ours; their solution must satisfy both upper and lower quotas, the objective being to maximize the stability (e.g., to minimize the number of blocking pairs). Another model, introduced by Biró et al. [6] and denoted by HR-LQ-1 in [29], allows some hospitals to be closed; a closed hospital is not assigned any resident. They showed that it is NP-complete to determine the existence of a stable matching. This model was further studied by Boehmer and Heeger [7] from a parameterized complexity perspective. Huang [20] introduced the classified stable matching model, in which each hospital defines a family of subsets RR of residents and each subset of RR has an upper and lower quota. This model was extended by Fleiner and Kamiyama [11] to a many-to-many matching model where both sides have upper and lower quotas. Apart from these, several matching problems with lower quotas have been studied in the literature, whose solution concepts are different from stability [4, 12, 27, 28, 35].

Paper Organization.  The rest of the paper is organized as follows. Section 2 formulates our problem HRT-MSLQ, and Section 3 describes our algorithm Double Proposal for HRT-MSLQ. Section 4 shows the strategy-proofness of Double Proposal. Section 5 is devoted to proving the maximum gaps Λ\Lambda and approximation factors of algorithm Double Proposal for the several scenarios mentioned above. Finally, Section 6 provides hardness results such as NP-hardness and inapproximability for several scenarios. Some proofs are deferred to appendices.

2 Problem Definition

Let R={r1,r2,,rn}R=\{r_{1},r_{2},\ldots,r_{n}\} be a set of residents and H={h1,h2,,hm}H=\{h_{1},h_{2},\ldots,h_{m}\} be a set of hospitals. Each hospital hh has a lower quota (h)\ell(h) and an upper quota u(h)u(h) such that (h)u(h)n\ell(h)\leq u(h)\leq n. We sometimes denote a hospital hh’s quota pair as [(h),u(h)][\ell(h),u(h)] for simplicity. Each resident has a preference list over hospitals, which is complete and may contain ties. If a resident rr prefers a hospital hih_{i} to hjh_{j}, we write hirhjh_{i}\succ_{r}h_{j}. If rr is indifferent between hih_{i} and hjh_{j} (including the case that hi=hjh_{i}=h_{j}), we write hi=rhjh_{i}=_{r}h_{j}. We use the notation hirhjh_{i}\succeq_{r}h_{j} to signify that hirhjh_{i}\succ_{r}h_{j} or hi=rhjh_{i}=_{r}h_{j} holds. Similarly, each hospital has a preference list over residents and the same notations as above are used. In this paper, a preference list is denoted by one row, from left to right according to the preference order. When two or more agents are of equal preference, they are enclosed in parentheses. For example, “r1r_{1}:  h3h_{3}   (  h2h_{2}  h4h_{4}  )  h1h_{1}” is a preference list of resident r1r_{1} such that h3h_{3} is the top choice, h2h_{2} and h4h_{4} are the second choice with equal preference, and h1h_{1} is the last choice.

An assignment is a subset of R×HR\times H. For an assignment MM and a resident rr, let M(r)M(r) be the set of hospitals hh such that (r,h)M(r,h)\in M. Similarly, for a hospital hh, let M(h)M(h) be the set of residents rr such that (r,h)M(r,h)\in M. An assignment MM is called a matching if |M(r)|1|M(r)|\leq 1 for each resident rr and |M(h)|u(h)|M(h)|\leq u(h) for each hospital hh. For a matching MM, a resident rr is called matched if |M(r)|=1|M(r)|=1 and unmatched otherwise. If (r,h)M(r,h)\in M, we say that rr is assigned to hh and hh is assigned rr. We sometimes abuse notation M(r)M(r) to denote the unique hospital where rr is assigned. A hospital hh is called deficient or sufficient if |M(h)|<(h)|M(h)|<\ell(h) or (h)|M(h)|u(h)\ell(h)\leq|M(h)|\leq u(h), respectively. Additionally, a hospital hh is called full if |M(h)|=u(h)|M(h)|=u(h) and undersubscribed otherwise.

A resident–hospital pair (r,h)(r,h) is called a blocking pair for a matching MM (or we say that (r,h)(r,h) blocks MM) if (i) rr is either unmatched in MM or prefers hh to M(r)M(r) and (ii) hh is either undersubscribed in MM or prefers rr to at least one resident in M(h)M(h). A matching is called stable if it admits no blocking pair. Recall that the satisfaction ratio of a hospital hh (which is also called the score of hh) in a matching MM is defined by sM(h)=min{1,|M(h)|(h)}s_{M}(h)=\min\{1,\frac{|M(h)|}{\ell(h)}\}, where we define sM(h)=1s_{M}(h)=1 if (h)=0\ell(h)=0. The total satisfaction ratio (also called the score) of a matching MM, is the sum of the scores of all hospitals, that is, s(M)=hHsM(h)s(M)=\sum_{h\in H}s_{M}(h). The Hospitals/Residents problem with Ties to Maximally Satisfy Lower Quotas, denoted by HRT-MSLQ, is to find a stable matching MM that maximizes the score s(M)s(M). The optimal score of an instance II is denoted by OPT(I){\rm OPT}(I).

Note that if |R|hHu(h)|R|\geq\sum_{h\in H}u(h), then all hospitals are full in any stable matching (recall that preference lists are complete). Hence, all stable matchings have the same score |H||H|, and the problem is trivial. Therefore, throughout this paper, we assume |R|<hHu(h)|R|<\sum_{h\in H}u(h). In this setting, all residents are matched in any stable matching as an unmatched resident forms a blocking pair with an undersubscribed hospital.

3 Algorithm

In this section, we present our algorithm Double Proposal for HRT-MSLQ along with a few of its basic properties. Its strategy-proofness and approximation factors for several models are presented in the following sections.

Our proposed algorithm Double Proposal is based on the resident-oriented Gale–Shapley algorithm but allows each resident rr to make proposals twice to each hospital. Here, we explain the ideas underlying this modification.

Let us apply the ordinary resident-oriented Gale–Shapley algorithm to HRT-MSLQ, which starts with an empty matching MM\coloneqq\emptyset and repeatedly updates MM by a proposal-acceptance/rejection process. In each iteration, the algorithm takes a currently unassigned resident rr and lets her propose to the hospital at the top of her current list. If the preference list of resident rr contains ties, the proposal order of rr depends on how to break the ties in her list. Hence, we need to define a priority rule for hospitals that are in a tie. Recall that our objective function is given by s(M)=hHmin{1,|M(h)|(h)}s(M)=\sum_{h\in H}\min\{1,\frac{|M(h)|}{\ell(h)}\}. This value immediately increases by 1(h)\frac{1}{\ell(h)} if rr proposes to a deficient hospital hh, whereas it does not increase if rr proposes to a sufficient hospital hh^{\prime}, although the latter may cause a rejection of some resident if hh^{\prime} is full. Therefore, a naive greedy approach is to let rr first prioritize deficient hospitals over sufficient hospitals and then prioritize those with small lower quotas among deficient hospitals. This approach is useful for attaining a larger objective value for some instances; however, it is not enough to improve the approximation factor in the sense of worst case analysis, as a deficient hospital hh in some iteration might become sufficient later and it might be better if rr had made a proposal to a hospital other than hh in the tie. Furthermore, this naive approach sacrifices strategy-proofness as demonstrated in Appendix A.2. This failure of strategy-proofness follows from the adaptivity of this tie-breaking rule, in the sense that the proposal order of each resident is affected by the other residents’ behaviors.

In our algorithm Double Proposal, each resident can propose twice to each hospital. If the head of rr’s preference list is a tie when rr makes a proposal, then the hospitals to which rr has not yet proposed are prioritized. This idea was inspired by an algorithm of [18]. Recall that each hospital hh has an upper quota u(h)u(h) and a lower quota (h)\ell(h). In our algorithm, we use (h)\ell(h) as a dummy upper quota. Whenever |M(h)|<(h)|M(h)|<\ell(h), a hospital hh accepts any proposal. If hh receives a new proposal from rr when |M(h)|(h)|M(h)|\geq\ell(h), then hh checks whether there is a resident in M(h){r}M(h)\cup\{r\} who has not been rejected by hh so far. If such a resident exists, hh rejects that resident regardless of the preference of hh. Otherwise, we apply the usual acceptance/rejection operation, i.e., hh accepts rr if |M(h)|<u(h)|M(h)|<u(h) and otherwise replaces rr with the worst resident rr^{\prime} in M(h)M(h). Roughly speaking, the first proposals are used to implement priority on deficient hospitals, and the second proposals are used to guarantee stability.

Formally, our algorithm Double Proposal is described in Algorithm 1. For convenience, in the preference list, a hospital hh that is not included in any tie is regarded as a tie consisting of hh only. We say that a resident is rejected by a hospital hh if she is chosen as rr^{\prime} in Lines 12 or 17. To argue strategy-proofness, we need to make the algorithm deterministic. To this end, we remove arbitrariness using indices of agents as follows. If there are multiple hospitals (resp., residents) satisfying the condition to be chosen at Lines 5 or 7 (resp., at Lines 12 or 17), take the one with the smallest index (resp., with the largest index). Furthermore, when there are multiple unmatched residents at Line 3, take the one with the smallest index. In this paper, Double Proposal always refers to this deterministic version.

Algorithm 1  Double Proposal
0:  An instance II where each hHh\in H has quotas [(h),u(h)][\ell(h),u(h)].
0:  A stable matching MM.
1:  MM\coloneqq\emptyset
2:  while there is an unmatched resident do
3:     Let rr be any unmatched resident and TT be the top tie of rr’s list.
4:     if TT contains a hospital to which rr has not proposed yet then
5:        Let hh be such a hospital with minimum (h)\ell(h).
6:     else
7:        Let hh be a hospital with minimum (h)\ell(h) in TT.
8:     end if
9:     if |M(h)|<(h)|M(h)|<\ell(h) then
10:        Let MM{(r,h)}M\coloneqq M\cup\{(r,h)\}.
11:     else if there is a resident in M(h){r}M(h)\cup\{r\} who has not been rejected by hh then
12:        Let rr^{\prime} be such a resident (possibly r=rr^{\prime}=r).
13:        Let M(M{(r,h)}){(r,h)}M\coloneqq(M\cup\{(r,h)\})\setminus\{(r^{\prime},h)\}.
14:     else if |M(h)|<u(h)|M(h)|<u(h) then
15:        MM{(r,h)}M\coloneqq M\cup\{(r,h)\}.
16:     else {i.e., when |M(h)|=u(h)|M(h)|=u(h) and all residents in M(h){r}M(h)\cup\{r\} have been rejected by hh once}
17:        Let rr^{\prime} be any resident that is worst in M(h){r}M(h)\cup\{r\} for hh (possibly r=rr^{\prime}=r).
18:        Let M(M{(r,h)}){(r,h)}M\coloneqq(M\cup\{(r,h)\})\setminus\{(r^{\prime},h)\}.
19:        Delete hh from rr^{\prime}’s list.
20:     end if
21:  end while
22:  Output MM and halt.
Lemma 1.

Algorithm Double Proposal runs in linear time and outputs a stable matching.

Proof.

Clearly, the size of the input is O(|R||H|)O(|R||H|). As each resident proposes to each hospital at most twice, the while loop is iterated at most 2|R||H|2|R||H| times. At Lines 5 and 7, a resident prefers hospitals with smaller (h)\ell(h), and hence we need to sort hospitals in each tie in an increasing order of the values of \ell. Since 0(h)n0\leq\ell(h)\leq n for each hHh\in H, \ell has only |R|+1|R|+1 possible values. Therefore, the required sorting can be done in O(|R||H|)O(|R||H|) time as a preprocessing step using a method like bucket sort. Thus, our algorithm runs in linear time.

Observe that a hospital hh is deleted from rr’s list only if hh is full. Additionally, once hh becomes full, it remains so afterward. Since each resident has a complete preference list and |R|<hHu(h)|R|<\sum_{h\in H}u(h), the preference list of each resident never becomes empty. Therefore, all residents are matched in the output MM.

Suppose, to the contrary, that MM is not stable, i.e., there is a pair (r,h)(r,h) such that (i) rr prefers hh to M(r)M(r) and (ii) hh is either undersubscribed or prefers rr to at least one resident in M(h)M(h). By the algorithm, (i) implies that rr is rejected by hh twice. Just after the second rejection, hh is full, and all residents in M(h)M(h) have once been rejected by hh and are no worse than rr for hh. Since M(h)M(h) is monotonically improving for hh, at the end of the algorithm hh is still full and no resident in M(h)M(h) is worse than rr, which contradicts (ii). ∎

In addition to stability, the output of Double Proposal satisfies the following property, which plays a key role in the analysis of the approximation factors in Section 5.

Lemma 2.

Let MM be the output of Double Proposal, rr be a resident, and hh and hh^{\prime} be hospitals such that h=rhh=_{r}h^{\prime} and M(r)=hM(r)=h. Then, we have the following conditions:

  1. (i)

    If (h)>(h)\ell(h)>\ell(h^{\prime}), then |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}).

  2. (ii)

    If |M(h)|>(h)|M(h)|>\ell(h), then |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}).

Proof.

(i) Since h=rhh=_{r}h^{\prime}, (h)>(h)\ell(h)>\ell(h^{\prime}), and rr is assigned to hh in MM, the definition of the algorithm (Lines 4, 5, and 7) implies that rr proposed to hh^{\prime} and was rejected by hh^{\prime} before she proposes to hh. Just after this rejection occurred, |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}) holds. Since |M(h)||M(h^{\prime})| is monotonically increasing, we also have |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}) at the end.

(ii) Since |M(h)|>(h)|M(h)|>\ell(h), the value of |M(h)||M(h)| changes from (h)\ell(h) to (h)+1\ell(h)+1 at some moment of the algorithm. By Line 11 of the algorithm, at any point after this, M(h)M(h) consists only of residents who have once been rejected by hh. Since M(r)=hM(r)=h for the output MM, at some moment rr must have made the second proposal to hh. By Line 4 of the algorithm, h=rhh=_{r}h^{\prime} implies that rr has been rejected by hh^{\prime} at least once, which implies that |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}) at this moment and also at the end. ∎

Lemma 2 states some local optimality of Double Proposal. Suppose that we reassign rr from hh to hh^{\prime}. Then, hh may lose and hh^{\prime} may gain score, but Lemma 2 says that the objective value does not increase. To see this, note that if the objective value were to increase, hh^{\prime} must gain score and hh would either not lose score or lose less score than hh^{\prime} would gain. The former and the latter are the “if” parts of (ii) and (i), respectively, and in either case the conclusion |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}) implies that hh^{\prime} cannot gain score by accepting one more resident.

4 Strategy-proofness

An algorithm is called strategy-proof for residents if it gives residents no incentive to misrepresent their preferences. The precise definition follows. An algorithm that always outputs a matching deterministically can be regarded as a mapping from instances of HRT-MSLQ into matchings. Let AA be an algorithm. We denote by A(I)A(I) the matching returned by AA for an instance II. For any instance II, let rRr\in R be any resident, who has a preference r\succeq_{r}. Additionally, let II^{\prime} be an instance of HRT-MSLQ which is obtained from II by replacing r\succeq_{r} with some other r\succeq^{\prime}_{r}. Furthermore, let MA(I)M\coloneqq A(I) and MA(I)M^{\prime}\coloneqq A(I^{\prime}). Then, AA is strategy-proof if M(r)rM(r)M(r)\succeq_{r}M^{\prime}(r) holds regardless of the choices of II, rr, and r\succeq^{\prime}_{r}.

In the setting without ties, it is known that the resident-oriented Gale–Shapley algorithm is strategy-proof for residents (even if preference lists are incomplete) [9, 16, 33]. Furthermore, it has been proved that no algorithm can be strategy-proof for both residents and hospitals [33]. As in many existing papers on two-sided matching, we use the term “strategy-proofness” to refer to strategy-proofness for residents.

Before proving the strategy-proofness of Double Proposal, we remark that the exact optimization and strategy-proofness are incompatible even if a computational issue is set aside. The following fact is demonstrated in Appendix A.1.

Proposition 3.

There is no algorithm that is strategy-proof for residents and returns an optimal solution for any instance of HRT-MSLQ. The statement holds even for the uniform and marriage models.

This proposition implies that, if we require strategy-proofness for an algorithm, then we should consider approximation even in the absence of computational constraints. Now, we show the strategy-proofness of our approximation algorithm.

Theorem 4.

Algorithm Double Proposal is strategy-proof for residents.

Proof.

To establish the strategy-proofness, we show that an execution of Double Proposal for an instance II can be described as an application of the resident-oriented Gale–Shapley algorithm to an auxiliary instance II^{*}. The construction of II^{*} is based on the proof of Lemma 8 in [18]; however, we need nontrivial extensions.

Let RR and HH be the sets of residents and hospitals in II, respectively. An auxiliary instance II^{*} is an instance of the Hospitals/Residents problem that has neither lower quotas nor ties and allows incomplete lists. The set of residents in II^{*} is RDR^{\prime}\cup D, where R={r1,r2,,rn}R^{\prime}=\{r^{\prime}_{1},r^{\prime}_{2},\dots,r^{\prime}_{n}\} is a copy of RR and D={dj,p|j=1,2,,m,p=1,2,,u(hj)}D=\set{d_{j,p}}{j=1,2,\dots,m,~{}p=1,2,\dots,u(h_{j})} is a set of j=1mu(hj)\sum_{j=1}^{m}u(h_{j}) dummy residents. The set of hospitals in II^{*} is HHH^{\scalebox{0.8}{$\circ$}}\cup H^{\scalebox{0.7}{$\bullet$}}, where each of H={h1,h2,,hm}H^{\scalebox{0.8}{$\circ$}}=\{h^{\scalebox{0.8}{$\circ$}}_{1},h^{\scalebox{0.8}{$\circ$}}_{2},\dots,h^{\scalebox{0.8}{$\circ$}}_{m}\} and H={h1,h2,,hm}H^{\scalebox{0.7}{$\bullet$}}=\{h^{\scalebox{0.7}{$\bullet$}}_{1},h^{\scalebox{0.7}{$\bullet$}}_{2},\dots,h^{\scalebox{0.7}{$\bullet$}}_{m}\} is a copy of HH. Each hospital hjHh^{\scalebox{0.8}{$\circ$}}_{j}\in H^{\scalebox{0.8}{$\circ$}} has an upper quota u(hj)u(h_{j}) while each hjHh^{\scalebox{0.7}{$\bullet$}}_{j}\in H^{\scalebox{0.7}{$\bullet$}} has an upper quota (hj)\ell(h_{j}).

For each resident riRr^{\prime}_{i}\in R^{\prime}, her preference list is defined as follows. Consider any tie (hj1hj2hjk)(h_{j_{1}}h_{j_{2}}\cdots h_{j_{k}}) in rir_{i}’s preference list. Let j1j2jkj^{\prime}_{1}\,j^{\prime}_{2}\cdots j^{\prime}_{k} be a permutation of j1j2jkj_{1}\,j_{2}\cdots j_{k} such that (hj1)(hj2)(hjk)\ell(h_{j^{\prime}_{1}})\leq\ell(h_{j^{\prime}_{2}})\leq\cdots\leq\ell(h_{j^{\prime}_{k}}), and for each jp,jqj^{\prime}_{p},j^{\prime}_{q} with (hjp)=(hjq)\ell(h_{j^{\prime}_{p}})=\ell(h_{j^{\prime}_{q}}), p<qp<q implies jp<jqj^{\prime}_{p}<j^{\prime}_{q}. We replace the tie (hj1hj2hjk)(h_{j_{1}}h_{j_{2}}\cdots h_{j_{k}}) with a strict order of 2k2k hospitals hj1hj2hjkhj1hj2hjkh^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}_{1}}h^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}_{2}}\cdots h^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}_{k}}h^{\scalebox{0.8}{$\circ$}}_{j^{\prime}_{1}}h^{\scalebox{0.8}{$\circ$}}_{j^{\prime}_{2}}\cdots h^{\scalebox{0.8}{$\circ$}}_{j^{\prime}_{k}}. The preference list of rir^{\prime}_{i} is obtained by applying this operation to all ties in rir_{i}’s list, where a hospital not included in any tie is regarded as a tie of length one. The following is an example of the correspondence between the preference lists of rir_{i} and rir^{\prime}_{i}:

ri:(h2h4h5)h3(h1h6) where (h4)=(h5)<(h2) and (h6)<(h1)\displaystyle r_{i}:\quad(\ h_{2}\ h_{4}\ h_{5}\ )\ h_{3}\ (\ h_{1}\ h_{6}\ )\text{~{}~{}~{} where~{}~{} $\ell(h_{4})=\ell(h_{5})<\ell(h_{2})$ and $\ell(h_{6})<\ell(h_{1})$}
ri:h4h5h2h4h5h2h3h3h6h1h6h1\displaystyle r^{\prime}_{i}:~{}\quad h^{\scalebox{0.7}{$\bullet$}}_{4}\ h^{\scalebox{0.7}{$\bullet$}}_{5}\ h^{\scalebox{0.7}{$\bullet$}}_{2}\ h^{\scalebox{0.8}{$\circ$}}_{4}\ h^{\scalebox{0.8}{$\circ$}}_{5}\ h^{\scalebox{0.8}{$\circ$}}_{2}\ h^{\scalebox{0.7}{$\bullet$}}_{3}\ h^{\scalebox{0.8}{$\circ$}}_{3}\ h^{\scalebox{0.7}{$\bullet$}}_{6}\ h^{\scalebox{0.7}{$\bullet$}}_{1}\ h^{\scalebox{0.8}{$\circ$}}_{6}\ h^{\scalebox{0.8}{$\circ$}}_{1}

For each j=1,2,,mj=1,2,\dots,m, the dummy residents dj,p(p=1,2,,u(hj))d_{j,p}~{}(p=1,2,\dots,u(h_{j})) have the same list:

dj,p:hjhj\displaystyle d_{j,p}:~{}~{}h^{\scalebox{0.8}{$\circ$}}_{j}~{}~{}h^{\scalebox{0.7}{$\bullet$}}_{j}

For j=1,2,,mj=1,2,\dots,m, let P(hj)P(h_{j}) be the preference list of hjh_{j} in II and let Q(hj)Q(h_{j}) be the strict order on RR^{\prime} obtained by replacing residents rir_{i} with rir^{\prime}_{i} and breaking ties so that residents in the same tie of P(hj)P(h_{j}) are ordered in ascending order of indices. The preference lists of hospitals hjh^{\scalebox{0.8}{$\circ$}}_{j} and hjh^{\scalebox{0.7}{$\bullet$}}_{j} are then defined as follows:

hj:Q(hi)dj,1dj,2dj,u(hj)\displaystyle h^{\scalebox{0.8}{$\circ$}}_{j}:\quad\quad\qquad Q(h_{i})\quad\quad\quad~{}~{}d_{j,1}~{}d_{j,2}~{}\cdots~{}d_{j,u(h_{j})}
hj:dj,1dj,2dj,u(hj)r1r2rn\displaystyle h^{\scalebox{0.7}{$\bullet$}}_{j}:\quad d_{j,1}~{}d_{j,2}~{}\cdots~{}d_{j,u(h_{j})}~{}\quad r^{\prime}_{1}~{}~{}r^{\prime}_{2}~{}~{}\cdots~{}r^{\prime}_{n}

Let MM be the output of Double Proposal applied to II. For each resident rir_{i}, there are two cases: she has never been rejected by M(ri)M(r_{i}), and she had been rejected once by M(ri)M(r_{i}) and accepted upon her second proposal. Let M1M_{1} be the set of pairs (ri,M(ri))(r_{i},M(r_{i})) of the former case and M2M_{2} be that of the latter. Note that |M1(hj)|(hj)|M_{1}(h_{j})|\leq\ell(h_{j}) for any hjh_{j}. Define a matching MM^{*} of II^{*} by

M=\displaystyle M^{*}= {(ri,hj)|(ri,hj)M2}{(ri,hj)|(ri,hj)M1}\displaystyle\set{(r^{\prime}_{i},h^{\scalebox{0.8}{$\circ$}}_{j})}{(r_{i},h_{j})\in M_{2}}\cup\set{(r^{\prime}_{i},h^{\scalebox{0.7}{$\bullet$}}_{j})}{(r_{i},h_{j})\in M_{1}}
{(dj,p,hj)|1pu(hj)|M2(hj)|}\displaystyle\cup\set{(d_{j,p},h^{\scalebox{0.8}{$\circ$}}_{j})}{~{}1\leq p\leq u(h_{j})-|M_{2}(h_{j})|~{}}
{(dj,p,hj)|u(hj)|M2(hj)|<pmin{u(hj)|M(hj)|+(hj),u(hj)}}.\displaystyle\cup\set{(d_{j,p},h^{\scalebox{0.7}{$\bullet$}}_{j})}{~{}u(h_{j})-|M_{2}(h_{j})|<p\leq\min\{u(h_{j})-|M(h_{j})|+\ell(h_{j}),~{}u(h_{j})\}~{}}.

Then, the following holds.

Lemma 5.

MM^{*} coincides with the output of the resident-oriented Gale–Shapley algorithm applied to the auxiliary instance II^{*}.

This lemma is proved in Appendix B. We now complete the proof of the theorem.

Given an instance II, suppose that some resident rir_{i} changes her preference list from ri\succeq_{r_{i}} to some other ri\succeq^{\prime}_{r_{i}}. Let JJ be the resultant instance. Define an auxiliary instance JJ^{*} from JJ in the manner described above. Let NN be the output of Double Proposal for JJ and NN^{*} be a matching defined from NN as we defined MM^{*} from MM. By Lemma 5, the resident-oriented Gale–Shapley algorithm returns MM^{*} and NN^{*} for II^{*} and JJ^{*}, respectively. Note that all residents except rir^{\prime}_{i} have the same preference lists in II^{*} and JJ^{*} and so do all hospitals. Therefore, by the strategy-proofness of the Gale–Shapley algorithm, we have M(ri)riN(ri)M^{*}(r^{\prime}_{i})\succeq_{r^{\prime}_{i}}N^{*}(r^{\prime}_{i}). By the definitions of II^{*}, JJ^{*}, MM^{*}, and NN^{*}, we have M(ri)riN(ri)M(r_{i})\succeq_{r_{i}}N(r_{i}), which means that rir_{i} is no better off in NN than in MM with respect to her true preference ri\succeq_{r_{i}}. Thus, Double Proposal is strategy-proof for residents. ∎

5 Maximum Gaps and Approximation Factors of Double Proposal

In this section, we analyze the approximation factors of our algorithm, together with the maximum gaps Λ\Lambda for the four models mentioned in Section 1. All results in this section are summarized in the first and second rows of Table 1 in Section 1. Most of the proofs are deferred to Appendix C, which gives the full version of this section.

For an instance II of HRT-MSLQ, let OPT(I){\rm OPT}(I) and WST(I){\rm WST}(I) respectively denote the maximum and minimum scores over all stable matchings of II, and let ALG(I){\rm ALG}(I) be the score of the output of our algorithm Double Proposal. Proposition 20 in Appendix C shows that WST(I){\rm WST}(I) can be the score of the output of the algorithm that first breaks ties arbitrarily and then applies the Gale–Shapley algorithm for the resultant instance . Therefore, the maximum gap is equivalent to the approximation factor of such arbitrary tie-breaking GS algorithm.

For a model \cal I (i.e., subfamily of problem instances of HRT-MSLQ), let

Λ()=maxIOPT(I)WST(I) and APPROX()=maxIOPT(I)ALG(I).\Lambda({\cal I})=\max_{I\in{\cal I}}\frac{{\rm OPT}(I)}{{\rm WST}(I)}\ \ \mbox{ and }\ \ {\rm APPROX}({\cal I})=\max_{I\in{\cal I}}\frac{{\rm OPT}(I)}{{\rm ALG}(I)}.

In subsequent subsections, we provide exact values of Λ()\Lambda({\cal I}) and APPROX(){\rm APPROX}({\cal I}) for the four fundamental models. Recall our assumptions that preference lists are complete, |R|<hHu(h)|R|<\sum_{h\in H}u(h), and (h)u(h)n\ell(h)\leq u(h)\leq n for each hHh\in H.

5.1 General Model

Let Gen{\cal I}_{\rm Gen} denote the family of all instances of HRT-MSLQ, which we call the general model.

Proposition 6.

The maximum gap for the general model satisfies Λ(Gen)=n+1\Lambda({\cal I}_{\rm Gen})=n+1. Moreover, this equality holds even if residents have a master list, and preference lists of hospitals contain no ties.

We next obtain the value of APPROX(Gen){\rm APPROX}({\cal I}_{\rm Gen}). Recall that ϕ\phi is a function of n=|R|n=|R| defined by ϕ(1)=1\phi(1)=1, ϕ(2)=32\phi(2)=\frac{3}{2}, and ϕ(n)=n(1+n2)/(n+n2)\phi(n)=n(1+\lfloor\frac{n}{2}\rfloor)/(n+\lfloor\frac{n}{2}\rfloor) for n3n\geq 3.

Theorem 7.

The approximation factor of Double Proposal for the general model satisfies APPROX(Gen)=ϕ(n){\rm APPROX}({\cal I}_{\rm Gen})=\phi(n).

We provide a full proof in Appendix C, where Proposition 27 provides an instance IGenI\in{\cal I}_{\rm Gen} such that OPT(I)ALG(I)=ϕ(n)\frac{{\rm OPT}(I)}{{\rm ALG}(I)}=\phi(n). Here, we present the ideas to show the inequality OPT(I)ALG(I)ϕ(n)\frac{{\rm OPT}(I)}{{\rm ALG}(I)}\leq\phi(n) for any IGenI\in{\cal I}_{\rm Gen}.

Proof sketch of Theorem 7.

Let MM be the output of the algorithm and NN be an optimal stable matching. We define vectors pMp_{M} and pNp_{N} on RR, which distribute the scores to residents. For each hHh\in H, among residents in M(h)M(h), we set pM(r)=1(h)p_{M}(r)=\frac{1}{\ell(h)} for min{(h),|M(h)|}\min\{\ell(h),|M(h)|\} residents and pM(r)=0p_{M}(r)=0 for the remaining |M(h)|min{(h),|M(h)|}|M(h)|-\min\{\ell(h),|M(h)|\} residents. Similarly, we define pNp_{N} from NN. We write pM(A)rApM(r)p_{M}(A)\coloneqq\sum_{r\in A}p_{M}(r) for any ARA\subseteq R. By definition, pM(M(h))=sM(h)p_{M}(M(h))=s_{M}(h) and pN(N(h))=sN(h)p_{N}(N(h))=s_{N}(h) for each hHh\in H, and hence s(M)=hHsM(h)=pM(R)s(M)=\sum_{h\in H}s_{M}(h)=p_{M}(R) and s(N)=hHsN(h)=pN(R)s(N)=\sum_{h\in H}s_{N}(h)=p_{N}(R). Thus, pN(R)pM(R)=s(N)s(M)\frac{p_{N}(R)}{p_{M}(R)}=\frac{s(N)}{s(M)}, which needs to be bounded.

Let R={r1,r2,,rn}R^{\prime}=\{r^{\prime}_{1},r^{\prime}_{2},\dots,r^{\prime}_{n}\} be a copy of RR and identify pNp_{N} as a vector on RR^{\prime}. Consider a bipartite graph G=(R,R;E)G=(R,R^{\prime};E) whose edge set is E{(ri,rj)R×R|pM(ri)pN(rj)}E\coloneqq\set{(r_{i},r^{\prime}_{j})\in R\times R^{\prime}}{p_{M}(r_{i})\geq p_{N}(r^{\prime}_{j})}. For any matching XEX\subseteq E in GG, denote by (X)RR\partial(X)\subseteq R\cup R^{\prime} the set of vertices covered by XX. Then, pM(R(X))pN(R(X))p_{M}(R\cap\partial(X))\geq p_{N}(R^{\prime}\cap\partial(X)) holds since each edge (ri,rj)XE(r_{i},r^{\prime}_{j})\in X\subseteq E satisfies pM(ri)pN(rj)p_{M}(r_{i})\geq p_{N}(r^{\prime}_{j}). In addition, the value of pN(R(X))pM(R(X))p_{N}(R^{\prime}\setminus\partial(X))-p_{M}(R\setminus\partial(X)) is bounded from above by |R(X)|=|R||X|=n|X||R\setminus\partial(X)|=|R|-|X|=n-|X| because pN(r)1p_{N}(r^{\prime})\leq 1 for any rRr^{\prime}\in R^{\prime} and pM(r)0p_{M}(r)\geq 0 for any rRr\in R. Therefore, the existence of a matching XEX\subseteq E with large |X||X| helps us bound pN(R)pM(R)\frac{p_{N}(R)}{p_{M}(R)}. Indeed, the following claim plays a key role in our proof: (\star) The graph GG admits a matching XEX\subseteq E with |X|n2|X|\geq\lceil\frac{n}{2}\rceil.

In the proof in Appendix C, the required bound of pN(R)pM(R)\frac{p_{N}(R)}{p_{M}(R)} is obtained using a stronger version of (\star). Here we concentrate on showing (\star). To this end, we divide RR into

R+{rR|M(r)rN(r)},\displaystyle R_{+}\coloneqq\set{r\in R}{M(r)\succ_{r}N(r)},
R{rR|N(r)rM(r) or [M(r)=rN(r),pN(r)>pM(r)]}, and\displaystyle R_{-}\coloneqq\set{r\in R}{N(r)\succ_{r}M(r)\text{ or }[M(r)=_{r}N(r),~{}p_{N}(r)>p_{M}(r)]},\text{ and}
R0{rR|M(r)=rN(r),pM(r)pN(r)}.\displaystyle R_{0}~{}\!\coloneqq\set{r\in R}{M(r)=_{r}N(r),~{}p_{M}(r)\geq p_{N}(r)}.

Let R+,R,R0R^{\prime}_{+},R^{\prime}_{-},R^{\prime}_{0} be the corresponding subsets of RR^{\prime}. We show the following two properties.

  • There is an injection ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime} such that pM(r)=pN(ξ+(r))p_{M}(r)=p_{N}(\xi_{+}(r)) for every rR+r\in R_{+}.

  • There is an injection ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R such that pN(r)=pM(ξ(r))p_{N}(r^{\prime})=p_{M}(\xi_{-}(r^{\prime})) for every rRr^{\prime}\in R^{\prime}_{-}.

We first define ξ+\xi_{+}. For each hospital hh with M(h)R+M(h)\cap R_{+}\neq\emptyset, there is rM(h)R+r\in M(h)\cap R_{+} with h=M(r)rN(r)h=M(r)\succ_{r}N(r). By the stability of NN, hospital hh is full in NN. Then, we can define an injection ξ+h:M(h)R+N(h)\xi_{+}^{h}\colon M(h)\cap R_{+}\to N(h) so that pM(r)=pN(ξ+h(r))p_{M}(r)=p_{N}(\xi_{+}^{h}(r)) for all rM(h)R+r\in M(h)\cap R_{+}. By regarding N(h)N(h) as a subset of RR^{\prime} and taking the direct sum of ξ+h\xi_{+}^{h} for all hospitals hh with M(h)R+M(h)\cap R_{+}\neq\emptyset, we obtain a required injection ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime}.

We next define ξ\xi_{-}. For each hospital hh^{\prime} with N(h)RN(h^{\prime})\cap R^{\prime}_{-}\neq\emptyset, any rN(h)Rr\in N(h^{\prime})\cap R^{\prime}_{-} satisfies either h=N(r)rM(r)h^{\prime}=N(r)\succ_{r}M(r) or [h=N(r)=rM(r),pN(r)>pM(r)][h^{\prime}=N(r)=_{r}M(r),~{}p_{N}(r)>p_{M}(r)]. If some rN(h)Rr\in N(h^{\prime})\cap R^{\prime}_{-} satisfies the former, the stability of MM implies that hh^{\prime} is full in MM. If all rN(h)Rr\in N(h^{\prime})\cap R^{\prime}_{-} satisfy the latter, they all satisfy 0pN(r)=1(h)0\neq p_{N}(r)=\frac{1}{\ell(h^{\prime})}, and hence |N(h)R|(h)|N(h^{\prime})\cap R^{\prime}_{-}|\leq\ell(h^{\prime}). Additionally, pN(r)>pM(r)p_{N}(r)>p_{M}(r) implies either pM(r)=0p_{M}(r)=0 or (h)<(h)\ell(h^{\prime})<\ell(h), where hM(r)h\coloneqq M(r). Observe that pM(r)=0p_{M}(r)=0 implies |M(h)|>(h)|M(h)|>\ell(h). By Lemma 2, each of (h)<(h)\ell(h^{\prime})<\ell(h) and |M(h)|>(h)|M(h)|>\ell(h) implies |M(h)|(h)|N(h)R||M(h^{\prime})|\geq\ell(h^{\prime})\geq|N(h^{\prime})\cap R^{\prime}_{-}|. Then, in any case, we can define an injection ξh:N(h)RM(h)\xi_{-}^{h^{\prime}}\colon N(h^{\prime})\cap R^{\prime}_{-}\to M(h^{\prime}) such that pN(r)=pM(ξh(r))p_{N}(r^{\prime})=p_{M}(\xi_{-}^{h^{\prime}}(r^{\prime})) for all rN(h)Rr^{\prime}\in N(h^{\prime})\cap R^{\prime}_{-}. By taking the direct sum of ξh\xi_{-}^{h^{\prime}} for all hospitals hh^{\prime} with M(h)RM(h^{\prime})\cap R_{-}\neq\emptyset, we obtain ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R.

Let G=(R,R;E)G^{*}=(R,R^{\prime};E^{*}) be a bipartite graph (possibly with multiple edges), where EE^{*} is the disjoint union of E+E_{+}, EE_{-}, and E0E_{0}, defined by

E+\displaystyle E_{+} {(r,ξ+(r))|rR+},E{(ξ(r),r)|rR}, and\displaystyle\coloneqq\set{(r,\xi_{+}(r))}{r\in R_{+}},~{}~{}E_{-}\coloneqq\set{(\xi_{-}(r^{\prime}),r^{\prime})}{r\in R^{\prime}_{-}},\text{ and}
E0\displaystyle E_{0}~{}\! {(r,r)|rR0 and r is the copy of r}.\displaystyle\coloneqq\set{(r,r^{\prime})}{r\in R_{0}\text{ and $r^{\prime}$ is the copy of $r$}}.~{}~{}~{}~{}~{}
Refer to caption
Figure 1: A graph G=(R,R;E)G^{*}=(R,R^{\prime};E^{*})

See Fig. 1 for an example. By the definitions of ξ+\xi_{+}, ξ\xi_{-}, and R0R_{0}, any edge (r,r)(r,r^{\prime}) in EE^{*} belongs to EE, and hence any matching in GG^{*} is also a matching in GG. Since ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime} and ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R are injections, we observe that every vertex in GG^{*} is incident to at most two edges in EE^{*}. Then, EE^{*} is decomposed into paths and cycles, and hence EE^{*} contains a matching of size at least |E|2\lceil\frac{|E^{*}|}{2}\rceil. Since |E|=|R+|+|R|+|R0|=n|E^{*}|=|R_{+}|+|R_{-}|+|R_{0}|=n, this means that there exists a matching XEX\subseteq E with |X|n2|X|\geq\lceil\frac{n}{2}\rceil, as required. ∎

5.2 Uniform Model

Let Uniform{\cal I}_{\rm Uniform} denote the family of uniform problem instances of HRT-MSLQ, where an instance is called uniform if upper and lower quotas are uniform. In the rest of this subsection, we assume that \ell and uu are nonnegative integers to represent the common lower and upper quotas, respectively, and let θu(1)\theta\coloneqq\frac{u}{\ell}~{}(\geq 1). We call Uniform{\cal I}_{\rm Uniform} the uniform model.

Proposition 8.

The maximum gap for the uniform model satisfies Λ(Uniform)=θ\Lambda({\cal I}_{\rm Uniform})=\theta. Moreover, this equality holds even if preference lists of hospitals contain no ties.

Theorem 9.

The approximation factor of Double Proposal for the uniform model satisfies APPROX(uniform)=θ2+θ12θ1{\rm APPROX}({\cal I}_{\rm uniform})=\frac{\theta^{2}+\theta-1}{2\theta-1}.

Note that θ2+θ12θ1<θ\frac{\theta^{2}+\theta-1}{2\theta-1}<\theta whenever <u\ell<u because θθ2+θ12θ1=(θ1)22θ1>0\theta-\frac{\theta^{2}+\theta-1}{2\theta-1}=\frac{(\theta-1)^{2}}{2\theta-1}>0. Here is the ideas to show that OPT(I)ALG(I)θ2+θ12θ1\frac{{\rm OPT}(I)}{{\rm ALG}(I)}\leq\frac{\theta^{2}+\theta-1}{2\theta-1} holds for any IUniformI\in{\cal I}_{\rm Uniform}.

Proof sketch of Theorem 9.

Let MM be the output of the algorithm and NN be an optimal stable matching, and assume s(M)<s(N)s(M)<s(N). Consider a bipartite graph (R,H;MN)(R,H;M\cup N), which may have multiple edges. Take an arbitrary connected component, and let RR^{*} and HH^{*} be the sets of residents and hospitals, respectively, contained in it. It is sufficient to bound sN(H)sM(H)\frac{s_{N}(H^{*})}{s_{M}(H^{*})}.

Let H0H_{0} be the set of all hospitals in HH^{*} having strictly larger scores in NN than in MM, i.e.,

H0{hH|sN(h)>sM(h)}.\displaystyle H_{0}\coloneqq\set{h\in H^{*}}{s_{N}(h)>s_{M}(h)}.

Using this, we sequentially define

R0{rR|N(r)H0},H1{hHH0|rR0:M(r)=h},\displaystyle R_{0}\coloneqq\set{r\in R^{*}}{N(r)\in H_{0}},~{}~{}H_{1}\coloneqq\set{h\in H^{*}\setminus H_{0}}{\exists r\in R_{0}:M(r)=h},
R1{rR|N(r)H1},H2H(H0H1), and R2R(R0R1).\displaystyle R_{1}\coloneqq\set{r\in R^{*}}{N(r)\in H_{1}},~{}~{}H_{2}\coloneqq H^{*}\setminus(H_{0}\cup H_{1}),\text{~{}~{}and~{}~{}}R_{2}\coloneqq R^{*}\setminus(R_{0}\cup R_{1}).
Refer to caption
Figure 2: Example with [,u]=[2,3][\ell,u]=[2,3].

See Fig. 2 for an example. We use scaled score functions vMsMv_{M}\coloneqq\ell\cdot s_{M} and vNsNv_{N}\coloneqq\ell\cdot s_{N} and write vM(A)=hAvM(h)v_{M}(A)=\sum_{h\in A}v_{M}(h) for any AHA\subseteq H. We bound vN(H)vM(H)\frac{v_{N}(H^{*})}{v_{M}(H^{*})}, which equals sN(H)sM(H)\frac{s_{N}(H^{*})}{s_{M}(H^{*})}. Note that the set of residents assigned to HH^{*} is RR^{*} in both MM and NN. The scores differ depending on how efficiently those residents are assigned. In this sense, we may think that a hospital hh is assigned residents “efficiently” in MM if |M(h)||M(h)|\leq\ell and is assigned “most redundantly” if |M(h)|=u|M(h)|=u. Since vM(h)=min{,|M(h)|}v_{M}(h)=\min\{\ell,|M(h)|\}, we have vM(h)=|M(h)|v_{M}(h)=|M(h)| in the former case and vM(h)=1θ|M(h)|v_{M}(h)=\frac{1}{\theta}\cdot|M(h)| in the latter. We show that hospitals in H1H_{1} provide us with advantage of MM; any hospital in H1H_{1} is assigned residents either efficiently in MM or most redundantly in NN.

For any hH0h\in H_{0}, sM(h)<sN(h)s_{M}(h)<s_{N}(h) implies |M(h)|<|M(h)|<\ell. Then, the stability of MM implies M(r)rN(r)M(r)\succeq_{r}N(r) for any rR0r\in R_{0}. Hence, the following {H1,H1=}\{H_{1}^{\succ},H_{1}^{=}\} defines a bipartition of H1H_{1}:

H1{hH1|rM(h)R0:hrN(r)},\displaystyle H_{1}^{\succ}\coloneqq\set{h\in H_{1}}{\exists r\in M(h)\cap R_{0}:h\succ_{r}N(r)},
H1={hH1|rM(h)R0:h=rN(r)}.\displaystyle H_{1}^{=}\coloneqq\set{h\in H_{1}}{\forall r\in M(h)\cap R_{0}:h=_{r}N(r)}.

For each hH1h\in H_{1}^{\succ}, as some rr satisfies hrN(r)h\succ_{r}N(r), the stability of NN implies that hh is full, i.e., hh is assigned residents most redundantly, in NN. Note that any hH1h\in H_{1}^{\succ} satisfies vM(h)vN(h)v_{M}(h)\geq v_{N}(h) because hH0h\not\in H_{0}, and hence vM(h)=vN(h)=v_{M}(h)=v_{N}(h)=\ell. Then, |N(h)|=u=θvN(h)=(θ1)vM(h)+vN(h)|N(h)|=u=\theta\cdot v_{N}(h)=(\theta-1)\cdot v_{M}(h)+v_{N}(h) for each hH1h\in H^{\succ}_{1}. Additionally, for any hHh\in H^{*}, we have |N(h)|min{,|N(h)|}=vN(h)|N(h)|\geq\min\{\ell,|N(h)|\}=v_{N}(h). Since |R|=hH|N(h)||R^{*}|=\sum_{h\in H^{*}}|N(h)|, we have

|R|(θ1)vM(H1)+vN(H1)+vN(HH1)=(θ1)vM(H1)+vN(H).\displaystyle|R^{*}|\geq(\theta-1)\cdot v_{M}(H^{\succ}_{1})+v_{N}(H^{\succ}_{1})+v_{N}(H^{*}\setminus H^{\succ}_{1})=(\theta-1)\cdot v_{M}(H_{1}^{\succ})+v_{N}(H^{*}).

For each hH1=h\in H_{1}^{=}, there is rR0r\in R_{0} with M(r)=h=rN(r)M(r)=h=_{r}N(r). As rR0r\in R_{0}, the hospital hN(r)h^{\prime}\coloneqq N(r) belongs to H0H_{0}, and hence |M(h)|<|M(h^{\prime})|<\ell. Then, Lemma 2(ii) implies |M(h)||M(h)|\leq\ell, i.e., hh is assigned residents efficiently in MM. Note that any hH0h\in H_{0} satisfies vM(h)<vN(h)v_{M}(h)<v_{N}(h)\leq\ell. Then, the number of residents assigned to H0H1=H_{0}\cup H_{1}^{=} is vM(H0H1=)v_{M}(H_{0}\cup H_{1}^{=}). Additionally, the number of residents assigned to H1H2H_{1}^{\succ}\cup H_{2} is at most θvM(H1H2)\theta\cdot v_{M}(H_{1}^{\succ}\cup H_{2}). Thus, we have

|R|vM(H0H1=)+θvM(H1H2)=vM(H)+(θ1)vM(H1H2).\displaystyle|R^{*}|\leq v_{M}(H_{0}\cup H_{1}^{=})+\theta\cdot v_{M}(H_{1}^{\succ}\cup H_{2})=v_{M}(H^{*})+(\theta-1)\cdot v_{M}(H_{1}^{\succ}\cup H_{2}).

From these two estimations of |R||R^{*}|, we obtain vN(H)(θ1)vM(H2)+vM(H)v_{N}(H^{*})\leq(\theta-1)\cdot v_{M}(H_{2})+v_{M}(H^{*}), which gives us a relationship between vM(H)v_{M}(H^{*}) and vN(H)v_{N}(H^{*}). Combining this with other inequalities, we can obtain the required upper bound of vN(H)vM(H)\frac{v_{N}(H^{*})}{v_{M}(H^{*})}. ∎

5.3 Marriage Model

Let Marriage{\cal I}_{\rm Marriage} denote the family of instances of HRT-MSLQ, in which each hospital has an upper quota of 11. We call Marriage{\cal I}_{\rm Marriage} the marriage model. By definition, [(h),u(h)][\ell(h),u(h)] in this model is either [0,1][0,1] or [1,1][1,1] for each hHh\in H. Since this is a one-to-one matching model, the union of two stable matchings can be partitioned into paths and cycles. By applying standard arguments used in other stable matching problems, we can obtain Λ(Marriage)=2\Lambda({\cal I}_{\rm Marriage})=2 and APPROX(Marriage)=1.5{\rm APPROX}({\cal I}_{\rm Marriage})=1.5.

As shown in Example 15 in Appendix A.1, there is no strategy-proof algorithm that can achieve an approximation factor better than 1.51.5 even in the marriage model. Therefore, we cannot improve this ratio without sacrificing strategy-proofness.

5.4 Resident-side Master List Model

Let R-ML{\cal I}_{\scriptsize\mbox{R-ML}} denote the family of instances of HRT-MSLQ in which all residents have the same preference list. This is well studied in literature on stable matching [8, 22, 23, 24]. We call R-ML{\cal I}_{\scriptsize\mbox{R-ML}} the R-side ML model. We have already shown in Proposition 6 that Λ(R-ML)=n+1\Lambda({\cal I}_{\scriptsize\mbox{R-ML}})=n+1. Our algorithm, however, solves this model exactly.

Note that this is not the case for the hospital-side master list model, which is NP-hard as shown in Theorem 14 below. This difference highlights the asymmetry of two sides in HRT-MSLQ.

6 Hardness Results

We obtain various hardness and inapproximability results for HRT-MSLQ. First, we show that HRT-MSLQ in the general model is inapproximable and that we cannot hope for a constant factor approximation.

Theorem 10.

HRT-MSLQ is inapproximable within a ratio n14ϵn^{\frac{1}{4}-\epsilon} for any ϵ>0\epsilon>0 unless P=NP.

Proof.

We show the theorem by way of a couple of reductions, one from the maximum independent set problem (MAX-IS ) to the maximum 2-independent set problem (MAX-2-IS ), and the other from MAX-2-IS to HRT-MSLQ.

For an undirected graph G=(V,E)G=(V,E), a subset SVS\subseteq V is an independent set of GG if no two vertices in SS are adjacent. SS is a 2-independent set of GG if the distance between any two vertices in SS is at least 3. MAX-IS (resp. MAX-2-IS) asks to find an independent set (resp. 2-independent set) of maximum size. Let us denote by IS(G){\rm IS}(G) and IS2(G){\rm IS}_{2}(G), respectively, the sizes of optimal solutions of MAX-IS and MAX-2-IS for GG. We assume without loss of generality that input graphs are connected. It is known that, unless P=NP, there is no polynomial-time algorithm, given a graph G1=(V1,E1)G_{1}=(V_{1},E_{1}), to distinguish between the two cases IS(G1)|V1|ϵ1{\rm IS}(G_{1})\leq|V_{1}|^{\epsilon_{1}} and IS(G1)|V1|1ϵ1{\rm IS}(G_{1})\geq|V_{1}|^{1-\epsilon_{1}}, for any constant ϵ1>0\epsilon_{1}>0 [36].

Now, we give the first reduction, which is based on the NP-hardness proof of the minimum maximal matching problem [19]. Let G1=(V1,E1)G_{1}=(V_{1},E_{1}) be an instance of MAX-IS. We construct an instance G2=(V2,E2)G_{2}=(V_{2},E_{2}) of MAX-2-IS as V2=V1E1{s}V_{2}=V_{1}\cup E_{1}\cup\{s\} and E2={(v,e)|vV1,eE1,e is incident to v in G1}{(s,e)|eE1}E_{2}=\set{(v,e)}{v\in V_{1},~{}e\in E_{1},e\mbox{ is incident to }v\mbox{ in }G_{1}}\cup\set{(s,e)}{e\in E_{1}}, where ss is a new vertex not in V1E1V_{1}\cup E_{1}. For any two vertices uu and vv in V1V_{1}, if their distance in G1G_{1} is at least 2 then that in G2G_{2} is at least 4. Hence, any independent set in G1G_{1} is also a 22-independent set in G2G_{2}. Conversely, for any 22-independent set SS in G2G_{2}, SV1S\cap V_{1} is independent in G1G_{1} and |S(V2V1)|1|S\cap(V_{2}\setminus V_{1})|\leq 1. These facts imply that IS2(G2){\rm IS}_{2}(G_{2}) is either IS(G1){\rm IS}(G_{1}) or IS(G1)+1{\rm IS}(G_{1})+1. Since |E2|=3|E1|3|V1|2|E_{2}|=3|E_{1}|\leq 3|V_{1}|^{2}, distinguishing between IS2(G2)|E2|ϵ2{\rm IS}_{2}(G_{2})\leq|E_{2}|^{\epsilon_{2}} and IS2(G2)|E2|1/2ϵ2{\rm IS}_{2}(G_{2})\geq|E_{2}|^{1/2-\epsilon_{2}} for some constant ϵ2>0\epsilon_{2}>0 would imply distinguishing between IS(G1)|V1|ϵ1{\rm IS}(G_{1})\leq|V_{1}|^{\epsilon_{1}} and IS(G1)|V1|1ϵ1{\rm IS}(G_{1})\geq|V_{1}|^{1-\epsilon_{1}} for some constant ϵ1>0\epsilon_{1}>0, which in turn implies P=NP.

We then proceed to the second reduction. Let G2=(V2,E2)G_{2}=(V_{2},E_{2}) be an instance of MAX-2-IS. Let n2=|V2|n_{2}=|V_{2}|, m2=|E2|m_{2}=|E_{2}|, V2={v1,v2,,vn2}V_{2}=\{v_{1},v_{2},\dots,v_{n_{2}}\}, and E2={e1,e2,,em2}E_{2}=\{e_{1},e_{2},\dots,e_{m_{2}}\}. We construct an instance II of HRT-MSLQ as follows. For an integer pp which will be determined later, define the set of residents of II as R={ri,j|1in2,1jp}R=\set{r_{i,j}}{1\leq i\leq n_{2},~{}1\leq j\leq p}, where ri,jr_{i,j} corresponds to the jjth copy of vertex viV2v_{i}\in V_{2}. Next, define the set of hospitals of II as HYH\cup Y, where H={hk|1km2}H=\set{h_{k}}{1\leq k\leq m_{2}} and Y={yi,j|1in2,1jp}Y=\set{y_{i,j}}{1\leq i\leq n_{2},~{}1\leq j\leq p}. The hospital hkh_{k} corresponds to the edge ekE2e_{k}\in E_{2} and the hospital yi,jy_{i,j} corresponds to the resident ri,jr_{i,j}.

We complete the reduction by giving preference lists and quotas in Fig. 3, where 1in21\leq i\leq n_{2}, 1jp1\leq j\leq p, and 1km21\leq k\leq m_{2}. Here, N(vi)={hk|ek is incident to vi in G2}N(v_{i})=\set{h_{k}}{e_{k}\mbox{ is incident to }v_{i}\mbox{ in }G_{2}} and “(  N(vi)N(v_{i})  )” denotes the tie consisting of all hospitals in N(vi)N(v_{i}). Similarly, N(ek)={ri,j|ek is incident to vi in G2,1jp}N(e_{k})=\set{r_{i,j}}{e_{k}\mbox{ is incident to }v_{i}\mbox{ in }G_{2},~{}1\leq j\leq p} and “(  N(ek)N(e_{k})  )” is the tie consisting of all residents in N(ek)N(e_{k}). The notation “\cdots” denotes an arbitrary strict order of all agents missing in the list.

ri,jr_{i,j}: (  N(vi)N(v_{i})  ) yi,jy_{i,j} \cdots hkh_{k} [0,p][0,p]: (  N(ek)N(e_{k})  ) \cdots
yi,jy_{i,j} [1,1][1,1]: ri,jr_{i,j} \cdots
Figure 3: Preference lists of residents and hospitals.

We will show that OPT(I)=m2+pIS2(G2){\rm OPT}(I)=m_{2}+p\cdot{\rm IS}_{2}(G_{2}). To do so, we first see a useful property. Let G3=(V3,E3)G_{3}=(V_{3},E_{3}) be the subdivision graph of G2G_{2}, i.e., V3=V2E2V_{3}=V_{2}\cup E_{2} and E3={(v,e)|vV2,eE2,e is incident to v in G2}E_{3}=\set{(v,e)}{v\in V_{2},e\in E_{2},e\mbox{ is incident to }v\mbox{ in }G_{2}}. Then, the family 2(G2){\cal I}_{2}(G_{2}) of 2-independent sets in G2G_{2} is characterized as follows [19]:

2(G2)={V2eM{endpoints of e}|M is a maximal matching of G3}.{\cal I}_{2}(G_{2})=\left\{\,V_{2}\setminus\bigcup_{e\in M}\{\mbox{endpoints of }e\}\ \middle|\,M\mbox{ is a maximal matching of }G_{3}\right\}.

In other words, for a maximal matching MM of G3G_{3}, if we remove all vertices matched in MM from V2V_{2}, then the remaining vertices form a 2-independent set of G2G_{2}, and conversely, any 2-independent set of G2G_{2} can be obtained in this manner for some maximal matching MM of G3G_{3}.

Let SS be an optimal solution of G2G_{2} in MAX-2-IS, i.e., a 2-independent set of size IS2(G2){\rm IS}_{2}(G_{2}). Let M~\tilde{M} be a maximal matching of G3G_{3} corresponding to SS. We construct a matching MM of II as M=M1M2M=M_{1}\cup M_{2}, where M1={(ri,j,hk)|(vi,ek)M~,1jp}M_{1}=\set{(r_{i,j},h_{k})}{(v_{i},e_{k})\in\tilde{M},~{}1\leq j\leq p} and M2={(ri,j,yi,j)|viS,1jp}M_{2}=\set{(r_{i,j},y_{i,j})}{v_{i}\in S,~{}1\leq j\leq p}. It is not hard to see that each resident is matched by exactly one of M1M_{1} and M2M_{2} and that no hospital exceeds its upper quota.

We then show the stability of MM. Each resident matched by M1M_{1} is assigned to a first-choice hospital, so if there were a blocking pair, then it would be of the form (ri,j,hk)(r_{i,j},h_{k}) where M(ri,j)=yi,jM(r_{i,j})=y_{i,j} and hkN(vi)h_{k}\in N(v_{i}). Then, viv_{i} is unmatched in M~\tilde{M}. Additionally, all residents assigned to hkh_{k} (if any) are its first choice; hence, hkh_{k} must be undersubscribed in MM. Then, eke_{k} is unmatched in M~\tilde{M}. hkN(vi)h_{k}\in N(v_{i}) implies that there is an edge (vi,ek)E3(v_{i},e_{k})\in E_{3}, so M~{(vi,ek)}\tilde{M}\cup\{(v_{i},e_{k})\} is a matching of G3G_{3}, contradicting the maximality of M~\tilde{M}. Hence, MM is stable in II.

A hospital in HH has a lower quota of 0, so it obtains a score of 11. The number of hospitals in YY that are assigned a resident is |M2|=p|S|=pIS2(G2)|M_{2}|=p|S|=p\cdot{\rm IS}_{2}(G_{2}). Hence, s(M)=m2+pIS2(G2)s(M)=m_{2}+p\cdot{\rm IS}_{2}(G_{2}). Therefore, we have OPT(I)s(M)=m2+pIS2(G2){\rm OPT}(I)\geq s(M)=m_{2}+p\cdot{\rm IS}_{2}(G_{2}).

Conversely, let MM be an optimal solution for II, i.e., a stable matching of score OPT(I){\rm OPT}(I). Note that each ri,jr_{i,j} is assigned to a hospital in N(vi){yi,j}N(v_{i})\cup\{y_{i,j}\} as otherwise (ri,j,yi,j)(r_{i,j},y_{i,j}) blocks MM. We construct a bipartite multi-graph GM=(V2,E2;F)G_{M}=(V_{2},E_{2};F) where V2={v1,v2,,vn2}V_{2}=\{v_{1},v_{2},\ldots,v_{n_{2}}\} and E2={e1,e2,,em2}E_{2}=\{e_{1},e_{2},\ldots,e_{m_{2}}\} are identified as vertices and edges of G2G_{2}, respectively, and an edge (vi,ek)jF(v_{i},e_{k})_{j}\in F if and only if (ri,j,hk)M(r_{i,j},h_{k})\in M. Here, a subscript jj of edge (vi,ek)j(v_{i},e_{k})_{j} is introduced to distinguish the multiplicity of edge (vi,ek)(v_{i},e_{k}). The degree of each vertex of GMG_{M} is at most pp, so by Kőnig’s edge coloring theorem [25], GMG_{M} is pp-edge colorable and each color class cc induces a matching McM_{c} (1cp1\leq c\leq p) of GMG_{M}. Each McM_{c} is a matching of G3G_{3}, and by the stability of MM, we can show that it is in fact a maximal matching of G3G_{3}. Let MM_{*} be a minimum cardinality one among them.

Define a subset SS of V2V_{2} by removing vertices that are matched in MM_{*} from V2V_{2}. By the above observation, SS is a 2-independent set of G2G_{2}. We will bound its size. Note that s(M)=OPT(I)s(M)={\rm OPT}(I) and each hospital in HH obtains the score of 1, so MM assigns residents to OPT(I)m2{\rm OPT}(I)-m_{2} hospitals in YY and each such hospital receives one resident. There are pn2pn_{2} residents in total, among which OPT(I)m2{\rm OPT}(I)-m_{2} ones are assigned to hospitals in YY, so the remaining pn2(OPT(I)m2)pn_{2}-({\rm OPT}(I)-m_{2}) ones are assigned to hospitals in HH. Thus FF contains this number of edges and so |M|pn2(OPT(I)m2)p=n2OPT(I)m2p|M_{*}|\leq\frac{pn_{2}-({\rm OPT}(I)-m_{2})}{p}=n_{2}-\frac{{\rm OPT}(I)-m_{2}}{p}. Since |V2|=n2|V_{2}|=n_{2} and exactly one endpoint of each edge in MM_{*} belongs to V2V_{2}, we have that |S|=|V2||M|OPT(I)m2p|S|=|V_{2}|-|M_{*}|\geq\frac{{\rm OPT}(I)-m_{2}}{p}. Therefore IS2(G2)|S|OPT(I)m2p{\rm IS}_{2}(G_{2})\geq|S|\geq\frac{{\rm OPT}(I)-m_{2}}{p}. Hence, we obtain OPT(I)=m2+pIS2(G2){\rm OPT}(I)=m_{2}+p\cdot{\rm IS}_{2}(G_{2}) as desired. Now we let p=m2p=m_{2}, and have OPT(I)=m2(1+IS2(G2)){\rm OPT}(I)=m_{2}(1+{\rm IS}_{2}(G_{2})).

Therefore distinguishing between OPT(I)(m2)1+δ{\rm OPT}(I)\leq(m_{2})^{1+\delta} and OPT(I)(m2)3/2δ{\rm OPT}(I)\geq(m_{2})^{3/2-\delta} for some δ\delta would distinguish between IS2(G2)(m2)ϵ2{\rm IS}_{2}(G_{2})\leq(m_{2})^{\epsilon_{2}} and IS2(G2)(m2)1/2ϵ2{\rm IS}_{2}(G_{2})\geq(m_{2})^{1/2-\epsilon_{2}} for some constant ϵ2>0\epsilon_{2}>0. Since n=|R|=n2m2(m2)2n=|R|=n_{2}m_{2}\leq(m_{2})^{2}, a polynomial-time n1/4ϵn^{1/4-\epsilon}-approximation algorithm for HRT-MSLQ can distinguish between the above two cases for a constant δ<ϵ/2\delta<\epsilon/2. Hence, the existence of such an algorithm implies P=NP. This completes the proof. ∎

We then show inapproximability results for the uniform model and the marriage model under the Unique Games Conjecture (UGC).

Theorem 11.

Under UGC, HRT-MSLQ in the uniform model is not approximable within a ratio 3θ+32θ+4ϵ\frac{3\theta+3}{2\theta+4}-\epsilon for any positive ϵ\epsilon.

Theorem 12.

Under UGC, HRT-MSLQ in the marriage model is not approximable within a ratio 98ϵ\frac{9}{8}-\epsilon for any positive ϵ\epsilon.

Furthermore, we give two examples showing that HRT-MSLQ is NP-hard even in very restrictive settings. The first is a marriage model for which ties appear in one side only.

Theorem 13.

HRT-MSLQ in the marriage model is NP-hard even if there is a master preference list of hospitals and ties appear only in preference lists of residents or only in preference lists of hospitals.

The other is a setting like the capacitated house allocation problem, where all hospitals are indifferent among residents.

Theorem 14.

HRT-MSLQ in the uniform model is NP-hard even if all the hospitals quotas are [1,2][1,2], preferences lists of all residents are strict, and all hospitals are indifferent among all residents (i.e., there is a master list of hospitals consisting of a single tie).

7 Concluding Remarks

We proposed the Hospitals/Residents problem with Ties to Maximally Satisfy Lower Quotas. We showed the difficulty of this problem from computational and strategic aspects; we provided NP-hardness and inapproximability results and showed that the exact optimization is incompatible with strategy-proofness. We presented a single algorithm Double Proposal and tightly showed its approximation factor for four fundamental scenarios, which is better than that of a naive method using arbitrary tie-breaking.

There remain several open questions and future research directions for HRT-MSLQ. Clearly, it is a major open problem to close a gap between the upper and lower bounds of the approximation factor for each scenario. This problem has two variants depending on whether we restrict ourselves to strategy-proof algorithms or not.

In this paper, we assumed the completeness of the preference lists of agents. The proofs for the stability and the strategy-proofness of our algorithm extend to the setting with incomplete lists, but we used this assumption in the analysis of the maximum gap and the approximation factors. Considering the setting with incomplete lists may be an interesting future direction.

References

  • [1] Atila Abdulkadiroğlu, Parag A. Pathak, and Alvin E. Roth: The New York city high school match, American Economic Review, 95 (2005), pp. 364–367.
  • [2] Atila Abdulkadiroğlu, Parag A. Pathak, Alvin E. Roth, and Tayfun Sönmez: The Boston public school match, American Economic Review, 95 (2005), pp. 368–371.
  • [3] David J. Abraham, Katarína Cechlárová, David F. Manlove, and Kurt Mehlhorn: Pareto optimality in house allocation problems, Xiaotie Deng and Ding-Zhu Du, editors, Algorithms and Computation, 16th International Symposium, ISAAC 2005, Sanya, Hainan, China, December 19-21, 2005, Proceedings 3827 of Lecture Notes in Computer Science, Springer, 2005, pp. 1163–1175.
  • [4] Ashwin Arulselvan, Ágnes Cseh, Martin Groß, David F. Manlove, and Jannik Matuschke: Matchings with lower quotas: Algorithms and complexity, Algorithmica, 80 (2018), pp. 185–208.
  • [5] Itai Ashlagi, Amin Saberi, and Ali Shameli: Assignment mechanisms under distributional constraints, Oper. Res., 68 (2020), pp. 467–479.
  • [6] Péter Biró, Tamás Fleiner, Robert W. Irving, and David F. Manlove: The College Admissions problem with lower and common quotas, Theor. Comput. Sci., 411 (2010), pp. 3136–3153.
  • [7] Niclas Boehmer and Klaus Heeger: A fine-grained view on stable many-to-one matching problems with lower and upper quotas, Xujin Chen, Nikolai Gravin, Martin Hoefer, and Ruta Mehta, editors, Web and Internet Economics - 16th International Conference, WINE 2020, Beijing, China, December 7-11, 2020, Proceedings 12495 of Lecture Notes in Computer Science, Springer, 2020, pp. 31–44.
  • [8] Robert Bredereck, Klaus Heeger, Dusan Knop, and Rolf Niedermeier: Multidimensional stable roommates with master list, Xujin Chen, Nikolai Gravin, Martin Hoefer, and Ruta Mehta, editors, Web and Internet Economics - 16th International Conference, WINE 2020, Beijing, China, December 7-11, 2020, Proceedings 12495 of Lecture Notes in Computer Science, Springer, 2020, pp. 59–73.
  • [9] Lester E. Dubins and David A. Freedman: Machiavelli and the Gale-Shapley algorithm, The American Mathematical Monthly, 88 (1981), pp. 485–494.
  • [10] Szymon Dudycz, Mateusz Lewandowski, and Jan Marcinkowski: Tight approximation ratio for minimum maximal matching, Andrea Lodi and Viswanath Nagarajan, editors, Integer Programming and Combinatorial Optimization - 20th International Conference, IPCO 2019, Ann Arbor, MI, USA, May 22-24, 2019, Proceedings 11480 of Lecture Notes in Computer Science, Springer, 2019, pp. 181–193.
  • [11] Tamás Fleiner and Naoyuki Kamiyama: A matroid approach to stable matchings with lower quotas, Math. Oper. Res., 41 (2016), pp. 734–744.
  • [12] Daniel Fragiadakis, Atsushi Iwasaki, Peter Troyan, Suguru Ueda, and Makoto Yokoo: Strategyproof matching with minimum quotas, ACM Trans. Economics and Comput., 4 (2015), pp. 6:1–6:40.
  • [13] David Gale and Lloyd S. Shapley: College admissions and the stability of marriage, The American Mathematical Monthly, 69 (1962), pp. 9–15.
  • [14] David Gale and Marilda Sotomayor: Some remarks on the stable matching problem, Discret. Appl. Math., 11 (1985), pp. 223–232.
  • [15] Masahiro Goto, Atsushi Iwasaki, Yujiro Kawasaki, Ryoji Kurata, Yosuke Yasuda, and Makoto Yokoo: Strategyproof matching with regional minimum and maximum quotas, Artificial Intelligence, 235 (2016), pp. 40–57.
  • [16] Dan Gusfield and Robert W. Irving: Foundations of computing series, The Stable marriage problem - structure and algorithms., MIT Press, 1989.
  • [17] Koki Hamada, Kazuo Iwama, and Shuichi Miyazaki: The Hospitals/Residents problem with lower quotas, Algorithmica, 74 (2016), pp. 440–465.
  • [18] Koki Hamada, Shuichi Miyazaki, and Hiroki Yanagisawa: Strategy-proof approximation algorithms for the stable marriage problem with ties and incomplete lists, Pinyan Lu and Guochuan Zhang, editors, 30th International Symposium on Algorithms and Computation, ISAAC 2019, December 8-11, 2019, Shanghai University of Finance and Economics, Shanghai, China 149 of LIPIcs, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2019, pp. 9:1–9:14.
  • [19] Joseph D. Horton and Kyriakos Kilakos: Minimum edge dominating sets, SIAM J. Discret. Math., 6 (1993), p. 375–387.
  • [20] Chien-Chung Huang: Classified stable matching, Moses Charikar, editor, Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2010, Austin, Texas, USA, January 17-19, 2010, SIAM, 2010, pp. 1235–1253.
  • [21] Robert W. Irving: Stable marriage and indifference, Discret. Appl. Math., 48 (1994), pp. 261–272.
  • [22] Robert W. Irving, David F. Manlove, and Sandy Scott: The stable marriage problem with master preference lists, Discret. Appl. Math., 156 (2008), pp. 2959–2977.
  • [23] Naoyuki Kamiyama: Stable matchings with ties, master preference lists, and matroid constraints, Martin Hoefer, editor, Algorithmic Game Theory - 8th International Symposium, SAGT 2015, Saarbrücken, Germany, September 28-30, 2015, Proceedings 9347 of Lecture Notes in Computer Science, Springer, 2015, pp. 3–14.
  • [24] Naoyuki Kamiyama: Many-to-many stable matchings with ties, master preference lists, and matroid constraints, Edith Elkind, Manuela Veloso, Noa Agmon, and Matthew E. Taylor, editors, Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS ’19, Montreal, QC, Canada, May 13-17, 2019, International Foundation for Autonomous Agents and Multiagent Systems, 2019, pp. 583–591.
  • [25] Dénes Kőnig: Über graphen und ihre anwendung auf determinantentheorie und mengenlehre, Mathematische, 77 (1916), p. 453–465.
  • [26] Zoltán Király: Linear time local approximation algorithm for maximum stable marriage, Algorithms, 6 (2013), pp. 471–484.
  • [27] Prem Krishnaa, Girija Limaye, Meghana Nasre, and Prajakta Nimbhorkar: Envy-freeness and relaxed stability: Hardness and approximation algorithms, Tobias Harks and Max Klimm, editors, Algorithmic Game Theory - 13th International Symposium, SAGT 2020, Augsburg, Germany, September 16-18, 2020, Proceedings 12283 of Lecture Notes in Computer Science, Springer, 2020, pp. 193–208.
  • [28] Krishnapriya A. M., Meghana Nasre, Prajakta Nimbhorkar, and Amit Rawat: How good are popular matchings?, Gianlorenzo D’Angelo, editor, 17th International Symposium on Experimental Algorithms, SEA 2018, June 27-29, 2018, L’Aquila, Italy 103 of LIPIcs, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2018, pp. 9:1–9:14.
  • [29] David F. Manlove 2 of Series on Theoretical Computer Science: Algorithmics of Matching Under Preferences, WorldScientific, 2013.
  • [30] David F. Manlove, Robert W. Irving, Kazuo Iwama, Shuichi Miyazaki, and Yasufumi Morita: Hard variants of stable marriage, Theor. Comput. Sci., 276 (2002), pp. 261–279.
  • [31] Alvin Roth: The evolution of the labor market for medical interns and residents: A case study in game theory, Journal of Political Economy, 92 (1984), pp. 991–1016.
  • [32] Alvin Roth: On the allocation of residents to rural hospitals: A general property of two-sided matching markets, Econometrica, 54 (1986), pp. 425–27.
  • [33] Alvin E. Roth: The economics of matching: Stability and incentives, Mathematics of Operations Research, 7 (1982), pp. 617–628.
  • [34] Alexander Schrijver: Combinatorial Optimization: Polyhedra and Efficiency, Springer, 2003.
  • [35] Yu Yokoi: Envy-free matchings with lower quotas, Algorithmica, 82 (2020), pp. 188–211.
  • [36] David Zuckerman: Linear degree extractors and the inapproximability of max clique and chromatic number, Theory Comput., 3 (2007), pp. 103–128.

Appendix A Examples

We give some examples that show the difficulty of implementing strategy-proof algorithms for HRT-MSLQ.

A.1 Incompatibility between Optimization and Strategy-proofness

Here, we provide two examples that show that solving HRT-MSLQ exactly is incompatible with strategy-proofness even if we ignore computational efficiency. This incompatibility holds even for restrictive models. The first example is an instance in the marriage model in which ties appear only in preference lists of hospitals. The second example is an instance in the uniform model in which ties appear only in preference lists of residents.

Example 15.

Consider the following instance II, consisting of two residents and three hospitals.

r1r_{1}: h1h_{1} h2h_{2} h3h_{3} h1h_{1} [1,1][1,1]: (r1r_{1} r2r_{2})
r2r_{2}: h1h_{1} h2h_{2} h3h_{3} h2h_{2} [1,1][1,1]: (r1r_{1} r2r_{2})
h3h_{3} [0,1][0,1]: (r1r_{1} r2r_{2})

Then, II has two stable matchings M1={(r1,h1),(r2,h2)}M_{1}=\{(r_{1},h_{1}),(r_{2},h_{2})\} and M2={(r1,h2),(r2,h1)}M_{2}=\{(r_{1},h_{2}),(r_{2},h_{1})\}, both of which have a score of 33. Let AA be an algorithm that outputs a stable matching with a maximum score for any instance of HRT-MSLQ. Without loss of generality, suppose that AA returns M1M_{1}. Let II^{\prime} be obtained from II by replacing r2r_{2}’s list with “r2:h1h3h2r_{2}:\,h_{1}\ h_{3}\ h_{2}.” Then, the stable matchings for II^{\prime} are M3={(r1,h1),(r2,h3)}M_{3}=\{(r_{1},h_{1}),(r_{2},h_{3})\} and M4={(r1,h2),(r2,h1)}M_{4}=\{(r_{1},h_{2}),(r_{2},h_{1})\}, which have scores 22 and 33, respectively. Since AA should return one with a maximum score, the output is M4M_{4}, in which r2r_{2} is assigned to h1h_{1} while she is assigned to h2h_{2} in M1M_{1}. As h1r3h2h_{1}\succ_{r_{3}}h_{2} in her true preference, this is a successful manipulation for r2r_{2}, and AA is not strategy-proof.

Example 15 shows that there is no strategy-proof algorithm for HRT-MSLQ that attains an approximation factor better than 1.51.5 even if there are no computational constraints.

Example 16.

Consider the following instance II, consisting of six residents and five hospitals, where the notation “\cdots” at the tail of lists denotes an arbitrary strict order of all agents missing in the list.

r1r_{1}: h1h_{1} \cdots h1h_{1} [1,2][1,2]: r1r_{1} r2r_{2} r6r_{6} \cdots
r2r_{2}: h3h_{3} h2h_{2} h1h_{1} \cdots h2h_{2} [1,2][1,2]: r2r_{2} \cdots
r3r_{3}: h3h_{3} \cdots h3h_{3} [1,2][1,2]: r3r_{3} r4r_{4} r2r_{2} \cdots
r4r_{4}: (h3h_{3} h4h_{4}) \cdots h4h_{4} [1,2][1,2]: r5r_{5} r4r_{4} r6r_{6} \cdots
r5r_{5}: h4h_{4} \cdots h5h_{5} [1,2][1,2]: r6r_{6} \cdots
r6r_{6}: h4h_{4} h5h_{5} h1h_{1} \cdots

This instance II has two stable matchings

M1={(r1,h1),(r2,h2),(r3,h3),(r4,h3),(r5,h4),(r6,h4)}, and\displaystyle M_{1}=\{(r_{1},h_{1}),(r_{2},h_{2}),(r_{3},h_{3}),(r_{4},h_{3}),(r_{5},h_{4}),(r_{6},h_{4})\},\text{ and }
M2={(r1,h1),(r2,h3),(r3,h3),(r4,h4),(r5,h4),(r6,h5)},\displaystyle M_{2}=\{(r_{1},h_{1}),(r_{2},h_{3}),(r_{3},h_{3}),(r_{4},h_{4}),(r_{5},h_{4}),(r_{6},h_{5})\},

both of which have a score of 4. Let AA be an algorithm that outputs an optimal solution for any input. Then, AA must output either M1M_{1} or M2M_{2}.

Suppose that AA outputs M1M_{1}. Let II^{\prime} be an instance obtained by replacing r2r_{2}’s preference list from “r2:h3h2h1r_{2}:\,h_{3}\ h_{2}\ h_{1}\cdots” to “r2:h3h1h2r_{2}:\,h_{3}\ h_{1}\ h_{2}\cdots.” Then, the stable matchings II^{\prime} admits are M2M_{2} and M1={(r1,h1),(r2,h1),(r3,h3),(r4,h3),(r5,h4),(r6,h4)}M^{\prime}_{1}=\{(r_{1},h_{1}),(r_{2},h_{1}),(r_{3},h_{3}),(r_{4},h_{3}),(r_{5},h_{4}),(r_{6},h_{4})\}, whose score is 3. Hence, AA must output M2M_{2}. As a result, r2r_{2} is assigned to a better hospital h3h_{3} than h2h_{2}, so this manipulation is successful.

If AA outputs M2M_{2}, then r6r_{6} can successfully manipulate the result by changing her list from “r6:h4h5h1r_{6}:\,h_{4}\ h_{5}\ h_{1}\cdots” to “r6:h4h1h5r_{6}:\,h_{4}\ h_{1}\ h_{5}\cdots.” The instance obtained by this manipulation has two stable matchings M1M_{1} and M2={(r1,h1),(r2,h3),(r3,h3),(r4,h4),(r5,h4),(r6,h1)}M^{\prime}_{2}=\{(r_{1},h_{1}),(r_{2},h_{3}),(r_{3},h_{3}),(r_{4},h_{4}),(r_{5},h_{4}),(r_{6},h_{1})\}, whose score is 3. Hence, AA must output M1M_{1} and r6r_{6} is assigned to h4h_{4}, which is better than h5h_{5}.

A.2 Absence of Strategy-proofness in Adaptive Tie-breaking

We provide an example that demonstrates that introducing a greedy tie-breaking method into the resident-oriented Gale–Shapley algorithm in an adaptive manner destroys the strategy-proofness for residents.

Example 17.

Consider the following instance II (in the uniform model), consisting of five residents and three hospitals.

r1r_{1}: h1h_{1} h2h_{2} h3h_{3} h1h_{1} [1,2][1,2]: r2r_{2} r3r_{3} r5r_{5} r1r_{1} r4r_{4}
r2r_{2}: (h1h_{1} h2h_{2}) h3h_{3} h2h_{2} [1,2][1,2]: r2r_{2} r4r_{4} r1r_{1} r3r_{3} r5r_{5}
r3r_{3}: h1h_{1} h2h_{2} h3h_{3} h3h_{3} [1,2][1,2]: r1r_{1} r2r_{2} r3r_{3} r4r_{4} r5r_{5}
r4r_{4}: h2h_{2} h1h_{1} h3h_{3}
r5r_{5}: h1h_{1} h3h_{3} h2h_{2}

Consider an algorithm that is basically the resident-oriented Gale–Shapley algorithm and let each resident prioritize deficient hospitals over sufficient hospitals among the hospitals in the same tie. Its one possible execution is as follows. First, r1r_{1} proposes to h1h_{1} and is accepted. Next, as h1h_{1} is sufficient while h2h_{2} is deficient, r2r_{2} proposes to h2h_{2} and is accepted. If we apply the ordinary Gale–Shapley procedure afterward, then we obtain a matching {(r1,h3),(r2,h2),(r3,h1),(r4,h2),(r5,h1)}\{(r_{1},h_{3}),(r_{2},h_{2}),(r_{3},h_{1}),(r_{4},h_{2}),(r_{5},h_{1})\}. Thus, r1r_{1} is assigned to her third choice.

Let II^{\prime} be an instance obtained by swapping h1h_{1} and h2h_{2} in r1r_{1}’s preference list. If we run the same algorithm for II^{\prime}, then r1r_{1} first proposes to h2h_{2}. Next, as h2h_{2} is sufficient while h1h_{1} is deficient, r2r_{2} proposes to h1h_{1} and is accepted. By applying the ordinary Gale–Shapley procedure afterward, we obtain {(r1,h2),(r2,h1),(r3,h1),(r4,h2),(r5,h3)}\{(r_{1},h_{2}),(r_{2},h_{1}),(r_{3},h_{1}),(r_{4},h_{2}),(r_{5},h_{3})\}. Thus, r1r_{1} is assigned to a hospital h2h_{2}, which is her second choice in her original list. Therefore, this manipulation is successful for r1r_{1}.

Appendix B Proof of Lemma 5

Let MM^{*} and II^{*} be defined as in the proof of Theorem 4 in Section 4. We show that the matching MM^{*} coincides with the output of the resident-oriented Gale–Shapley algorithm applied to the auxiliary instance II^{*}. Since it is known that the resident-oriented Gale–Shapley algorithm outputs the resident-optimal stable matching (see e.g., [16]), it suffices to show the stability and resident-optimality of MM^{*}.

The analysis goes as follows: Although matchings M1M_{1}, M2M_{2}, and MM^{*} are defined for the final matching MM of II, we also refer to them for a temporal matching MM at any step of the execution of Double Proposal. When some event occurs in Double Proposal, we remove some pairs from the instance II^{*}, where removing (r,h)(r,h) from II^{*} means to remove rr from hh’s list and hh from rr’s list. The removal operations are defined shortly. We then investigate MM, MM^{*}, and II^{*} at this moment and observe that some property holds for MM^{*} and II^{*}. This property is used to show the stability and resident-optimality of the final matching MM^{*}.

Here are the definitions of removal operations.

  • Case (1) rir_{i} is rejected by hjh_{j} for the first time. In this case, we remove (ri,hj)(r^{\prime}_{i},h^{\scalebox{0.7}{$\bullet$}}_{j}) from II^{*}. Just after this happens, by the priority rule on indices at Line 12, we have (i) |M(hj)|(hj)|M(h_{j})|\geq\ell(h_{j}) and (ii) every rkM1(hj)r_{k}\in M_{1}(h_{j}) satisfies k<ik<i. Note that (i) implies min{u(hj)|M(hj)|+(hj),u(hj)}=u(hj)|M(hj)|+(hj)\min\{u(h_{j})-|M(h_{j})|+\ell(h_{j}),~{}u(h_{j})\}=u(h_{j})-|M(h_{j})|+\ell(h_{j}), and hence MM^{*} assigns (hj)|M1(hj)|\ell(h_{j})-|M_{1}(h_{j})| dummy residents to hjh^{\scalebox{0.7}{$\bullet$}}_{j}. Additionally, MM^{*} assigns |M1(hj)||M_{1}(h_{j})| residents rkr^{\prime}_{k} to hjh^{\scalebox{0.7}{$\bullet$}}_{j} and (ii) implies that k<ik<i for every kk. Thus, at this moment, the hospital hjh^{\scalebox{0.7}{$\bullet$}}_{j} is full in MM^{*} with residents better than rir^{\prime}_{i}.

  • Case (2) rir_{i} is rejected by hjh_{j} for the second time. In this case, we remove (ri,hj)(r^{\prime}_{i},h^{\scalebox{0.8}{$\circ$}}_{j}) from II^{*}. Just after this happens, by Lines 16, 17, and the priority rule on indices, M(hj)=M2(hj)M(h_{j})=M_{2}(h_{j}), |M(hj)|=u(hj)|M(h_{j})|=u(h_{j}), and every rkM2(hj)r_{k}\in M_{2}(h_{j}) satisfies either (a) rkhjrir_{k}\succ_{h_{j}}r_{i} or (b) rk=hjrir_{k}=_{h_{j}}r_{i} and k<ik<i, each of which implies rkhjrir^{\prime}_{k}\succ_{h^{\scalebox{0.8}{$\circ$}}_{j}}r^{\prime}_{i}. Thus, at this moment, the hospital hjh^{\scalebox{0.8}{$\circ$}}_{j} is full in MM^{*} with residents better than rir^{\prime}_{i}.

  • Case (3) |M𝟐(hj)||M_{2}(h_{j})| increases by 1, from u(hj)pu(h_{j})-p to u(hj)p+𝟏u(h_{j})-p+1 for some pp (𝟏pu(hj)1\leq p\leq u(h_{j})). In this case, we remove one or two pairs depending on pp.

    We first remove (dj,p,hj)(d_{j,p},h^{\scalebox{0.8}{$\circ$}}_{j}) from II^{*}. Just after this happens, we have u(hj)|M2(hj)|=p1u(h_{j})-|M_{2}(h_{j})|=p-1 and hence M(hj)={ri|riM2(hj)}{dj,1,dj,2,,dj,p1}M^{*}(h^{\scalebox{0.8}{$\circ$}}_{j})=\set{r^{\prime}_{i}}{r_{i}\in M_{2}(h_{j})}\cup\set{d_{j,1},d_{j,2},\dots,d_{j,p-1}}. Thus, at this moment, the hospital hjh^{\scalebox{0.8}{$\circ$}}_{j} is full in MM^{*} with residents better than dj,pd_{j,p}.

    If, furthermore, pp satisfies 1pu(hj)(hj)1\leq p\leq u(h_{j})-\ell(h_{j}), we remove (dj,(hj)+p,hj)(d_{j,\ell(h_{j})+p},h^{\scalebox{0.7}{$\bullet$}}_{j}) from II^{*}. Just after this happens, we have |M2(hj)|=u(hj)p+1>(hj)|M_{2}(h_{j})|=u(h_{j})-p+1>\ell(h_{j}). Note that, by Lines 1113, when |M(hj)||M(h_{j})| exceeds (hj)\ell(h_{j}), any resident in M(hj)M(h_{j}) is once rejected by hjh_{j}, and this invariant is maintained till the end of the algorithm. Hence, |M1(hj)|=0|M_{1}(h_{j})|=0 and |M2(hj)|=|M(hj)||M_{2}(h_{j})|=|M(h_{j})| hold. Then, M(hj)={dj,p,dj,p+1,,dj,p+(hj)1}M^{*}(h^{\scalebox{0.7}{$\bullet$}}_{j})=\set{d_{j,p},d_{j,p+1},\dots,d_{j,p+\ell(h_{j})-1}}. Thus, at this moment, the hospital hjh^{\scalebox{0.7}{$\bullet$}}_{j} is full in MM^{*} with residents better than dj,(hj)+pd_{j,\ell(h_{j})+p}.

Now we will see two properties of MM^{*} at the termination of Double Proposal.

Claim 18.

If (r,h)(r,h) is removed from II^{*} by Double Proposal, hh is full in MM^{*} with residents better than rr.

Proof.

In all Cases (1)–(3) of the removal operation, we have observed that, just after (r,h)(r,h) is removed from II^{*}, hh is full in MM^{*} with residents better than rr. We will show that this property is maintained afterward, which completes the proof.

Note that MM^{*} changes only when MM changes and this occurs at Lines 10, 13, 15, and 18. Let rir_{i} be the resident chosen at Line 3 and hjh_{j} be the hospital chosen at Line 5 or 7. We show that, for each of the above cases, if the condition is satisfied before updating MM, it is also satisfied after the update.

Suppose that MM changes as MM{(ri,hj)}M\coloneqq M\cup\{(r_{i},h_{j})\} at Line 10. Before application of Line 10, |M(hj)|<(hj)|M(h_{j})|<\ell(h_{j}). This implies that M1(hj)=M(hj)M_{1}(h_{j})=M(h_{j}) and M2(hj)=M_{2}(h_{j})=\emptyset, so M(hj)={dj,1,,dj,u(hj)}M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})=\{d_{j,1},\ldots,d_{j,u(h_{j})}\} and M(hj)M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}}) consists of less than (hj)\ell(h_{j}) residents in RR^{\prime}. Since we assume that hh is full, hh cannot be hjh_{j}^{\scalebox{0.7}{$\bullet$}}. When Line 10 is applied, M(hj)M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}}) does not change so we are done.

Suppose that MM changes as M(M{(ri,hj)}){(ri,hj)}M\coloneqq(M\cup\{(r_{i},h_{j})\})\setminus\{(r_{i^{\prime}},h_{j})\} at Line 13. If i=ii^{\prime}=i, MM is unchanged, so suppose that iii^{\prime}\neq i. Note that rir_{i^{\prime}} is not rejected by hjh_{j} yet. If rir_{i} is not rejected by hjh_{j} yet, M2M_{2} does not change, M1M_{1} changes as M1(M1{(ri,hj)}){(ri,hj)}M_{1}\coloneqq(M_{1}\cup\{(r_{i},h_{j})\})\setminus\{(r_{i^{\prime}},h_{j})\}, and i>ii^{\prime}>i. Hence, M(hj)M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}}) does not change and M(hj)(M(hj){ri}){ri}M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}})\coloneqq(M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}})\cup\{r^{\prime}_{i}\})\setminus\{r^{\prime}_{i^{\prime}}\}. If rir_{i} is once rejected by hjh_{j}, M2M2{(ri,hj)}M_{2}\coloneqq M_{2}\cup\{(r_{i},h_{j})\} and M1M1{(ri,hj)}M_{1}\coloneqq M_{1}\setminus\{(r_{i^{\prime}},h_{j})\}. Then, M(hj)(M(hj){ri}){dj,k}M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})\coloneqq(M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})\cup\{r^{\prime}_{i}\})\setminus\{d_{j,k}\} and M(hj)(M(hj){dj,k}){ri}M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}})\coloneqq(M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}})\cup\{d_{j,k}\})\setminus\{r^{\prime}_{i^{\prime}}\} for some kk. Hence, the condition is satisfied for both hjh_{j}^{\scalebox{0.8}{$\circ$}} and hjh_{j}^{\scalebox{0.7}{$\bullet$}}.

Suppose that MM changes as MM{(ri,hj)}M\coloneqq M\cup\{(r_{i},h_{j})\} at Line 15. By the condition of this case, M1(hj)=M_{1}(h_{j})=\emptyset and M2(hj)=M(hj)M_{2}(h_{j})=M(h_{j}) before the application of Line 15. Then, by application of Line 15, M1M_{1} does not change and M2M2{(ri,hj)}M_{2}\coloneqq M_{2}\cup\{(r_{i},h_{j})\}. Then, M(hj)(M(hj){ri}){dj,k}M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})\coloneqq(M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})\cup\{r^{\prime}_{i}\})\setminus\{d_{j,k}\} and M(hj)(M(hj){dj,k}){dj,k+}M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}})\coloneqq(M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}})\cup\{d_{j,k}\})\setminus\{d_{j,k+\ell}\} for some kk. Hence, the condition is satisfied for both hjh_{j}^{\scalebox{0.8}{$\circ$}} and hjh_{j}^{\scalebox{0.7}{$\bullet$}}.

Suppose that MM changes as M(M{(ri,hj)}){(ri,hj)}M\coloneqq(M\cup\{(r_{i},h_{j})\})\setminus\{(r_{i^{\prime}},h_{j})\} at Line 18. If i=ii^{\prime}=i, MM is unchanged, so suppose that iii^{\prime}\neq i. By the condition of this case, M1(hj)=M_{1}(h_{j})=\emptyset and M2(hj)=M(hj)M_{2}(h_{j})=M(h_{j}) before the application of Line 18. Then, by application of Line 18, M1M_{1} does not change, M2(M2{(ri,hj)}){(ri,hj)}M_{2}\coloneqq(M_{2}\cup\{(r_{i},h_{j})\})\setminus\{(r_{i^{\prime}},h_{j})\}, and either (rihjrir_{i}\succ_{h_{j}}r_{i^{\prime}}) or (ri=hjrir_{i}=_{h_{j}}r_{i^{\prime}} and i<ii<i^{\prime}). Then, M(hj)M^{*}(h_{j}^{\scalebox{0.7}{$\bullet$}}) does not change, so the condition is satisfied for hjh_{j}^{\scalebox{0.7}{$\bullet$}}. Additionally, M(hj)(M(hj){ri}){ri}M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})\coloneqq(M^{*}(h_{j}^{\scalebox{0.8}{$\circ$}})\cup\{r^{\prime}_{i}\})\setminus\{r^{\prime}_{i^{\prime}}\} and rihjrir^{\prime}_{i}\succ_{h_{j}^{\scalebox{0.8}{$\circ$}}}r^{\prime}_{i^{\prime}}, so the condition is satisfied for hjh_{j}^{\scalebox{0.8}{$\circ$}}. ∎

Claim 19.

If a resident rr is matched in MM^{*}, then M(r)M^{*}(r) is at the top of rr’s preference list in the final II^{*}. If a resident rr is unmatched in MM^{*}, then rr’s preference list is empty.

Proof.

First note that, for every ii, since rir_{i} is matched in MM, rir^{\prime}_{i} is matched in MM^{*}. Consider a resident rir^{\prime}_{i} such that (ri,hj)M(r^{\prime}_{i},h^{\scalebox{0.7}{$\bullet$}}_{j})\in M^{*} for some jj. Then, (ri,hj)M1(r_{i},h_{j})\in M_{1}. Since rir_{i} is not rejected by hjh_{j}, the pair (ri,hj)(r^{\prime}_{i},h^{\scalebox{0.7}{$\bullet$}}_{j}) is not removed. Consider a hospital hh such that hrihjh\succ_{r^{\prime}_{i}}h^{\scalebox{0.7}{$\bullet$}}_{j}. If hh is hjh^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} or hjh^{\scalebox{0.8}{$\circ$}}_{j^{\prime}} for some jj^{\prime} such that hjrihjh_{j^{\prime}}\succ_{r_{i}}h_{j} in II, rir_{i} is rejected by hjh_{j^{\prime}} twice, and both hjh^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} and hjh^{\scalebox{0.8}{$\circ$}}_{j^{\prime}} are removed from rir^{\prime}_{i}’s list. If h=hjh=h^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} for some jj^{\prime} such that hj=rihjh_{j^{\prime}}=_{r_{i}}h_{j} in II, then ((hj)<(hj)\ell(h_{j^{\prime}})<\ell(h_{j})) or ((hj)=(hj)\ell(h_{j^{\prime}})=\ell(h_{j}) and j<jj^{\prime}<j), so rir_{i} must have proposed to and been rejected by hjh_{j^{\prime}} before. Therefore hjh^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} is removed from rir^{\prime}_{i}’s list.

Consider a resident rir^{\prime}_{i} such that (ri,hj)M(r^{\prime}_{i},h^{\scalebox{0.8}{$\circ$}}_{j})\in M^{*} for some jj. Then, (ri,hj)M2(r_{i},h_{j})\in M_{2}. Since rir_{i} is rejected by hjh_{j} only once, (ri,hj)(r^{\prime}_{i},h^{\scalebox{0.8}{$\circ$}}_{j}) is not removed. Consider a hospital hh such that hrihjh\succ_{r^{\prime}_{i}}h^{\scalebox{0.8}{$\circ$}}_{j}. If hh is hjh^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} or hjh^{\scalebox{0.8}{$\circ$}}_{j^{\prime}} for some jj^{\prime} such that hjrihjh_{j^{\prime}}\succ_{r_{i}}h_{j} in II, then the same argument as above holds. If h=hjh=h^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} for some jj^{\prime} such that hj=rihjh_{j^{\prime}}=_{r_{i}}h_{j} in II, rir_{i} is rejected by hjh_{j^{\prime}} once, and hence hjh^{\scalebox{0.7}{$\bullet$}}_{j^{\prime}} is removed from rir^{\prime}_{i}’s list. If h=hjh=h^{\scalebox{0.8}{$\circ$}}_{j^{\prime}} for some jj^{\prime} such that hj=rihjh_{j^{\prime}}=_{r_{i}}h_{j} in II, then ((hj)<(hj)\ell(h_{j^{\prime}})<\ell(h_{j})) or ((hj)=(hj)\ell(h_{j^{\prime}})=\ell(h_{j}) and j<jj^{\prime}<j), so rir_{i} is rejected by hjh_{j^{\prime}} twice. Therefore hjh^{\scalebox{0.8}{$\circ$}}_{j^{\prime}} is removed from rir^{\prime}_{i}’s list.

Next we consider dummy residents. Consider a pair (dj,q,hj)M(d_{j,q},h^{\scalebox{0.8}{$\circ$}}_{j})\in M^{*}. By the definition of MM^{*}, we have 1qu(hj)|M2(hj)|1\leq q\leq u(h_{j})-|M_{2}(h_{j})|, and hence |M2(hj)|u(hj)q|M_{2}(h_{j})|\leq u(h_{j})-q. Thus |M2(hj)||M_{2}(h_{j})| never reaches u(hj)q+1u(h_{j})-q+1 so this qq does not satisfy the condition of pp in Case (3) of the removal operation. Therefore Case (3) is not executed for this qq so (dj,q,hj)(d_{j,q},h^{\scalebox{0.8}{$\circ$}}_{j}) is not removed. Since hjh^{\scalebox{0.8}{$\circ$}}_{j} is already at the top of dj,qd_{j,q}’s list, we are done.

Consider a pair (dj,q,hj)M(d_{j,q},h^{\scalebox{0.7}{$\bullet$}}_{j})\in M^{*}. By the definition of MM^{*}, we have u(hj)|M2(hj)|<qmin{u(hj)|M(hj)|+(hj),u(hj)}u(h_{j})-|M_{2}(h_{j})|<q\leq\min\{u(h_{j})-|M(h_{j})|+\ell(h_{j}),~{}u(h_{j})\}. The first inequality implies |M2(hj)|>u(hj)q|M_{2}(h_{j})|>u(h_{j})-q. This means that |M2(hj)||M_{2}(h_{j})| reaches u(hj)q+1u(h_{j})-q+1 at some point, so qq satisfies the condition of pp in Case (3). Therefore Case (3) is executed for this qq and hence (dj,q,hj)(d_{j,q},h^{\scalebox{0.8}{$\circ$}}_{j}) is removed. If (dj,q,hj)(d_{j,q},h^{\scalebox{0.7}{$\bullet$}}_{j}) were removed, by the condition of Case (3), |M2(hj)||M_{2}(h_{j})| would reach u(hj)(q(hj))+1u(h_{j})-(q-\ell(h_{j}))+1, so we would have |M2(hj)|>u(hj)(q(hj))|M_{2}(h_{j})|>u(h_{j})-(q-\ell(h_{j})). Additionally, as described in the explanation of Case (3), we would have |M2(hj)|=|M(hj)||M_{2}(h_{j})|=|M(h_{j})|, and then the second inequality implies qu(hj)|M2(hj)|+(hj)q\leq u(h_{j})-|M_{2}(h_{j})|+\ell(h_{j}), i.e., |M2(hj)|u(hj)(q(hj))|M_{2}(h_{j})|\leq u(h_{j})-(q-\ell(h_{j})), a contradiction. So (dj,q,hj)(d_{j,q},h^{\scalebox{0.7}{$\bullet$}}_{j}) is not removed.

Finally, if dj,qd_{j,q} is unmatched in MM^{*}, then we have q>min{u(hj)|M(hj)|+(hj),u(hj)}q>\min\{u(h_{j})-|M(h_{j})|+\ell(h_{j}),~{}u(h_{j})\}. If u(hj)|M(hj)|+(hj)u(hj)u(h_{j})-|M(h_{j})|+\ell(h_{j})\geq u(h_{j}), we have q>u(hj)q>u(h_{j}) but this is a contradiction. Hence, we have q>u(hj)|M(hj)|+(hj)q>u(h_{j})-|M(h_{j})|+\ell(h_{j}). Then, |M2(hj)|=|M(hj)||M1(hj)|>u(hj)q+(hj)(hj)=u(hj)q|M_{2}(h_{j})|=|M(h_{j})|-|M_{1}(h_{j})|>u(h_{j})-q+\ell(h_{j})-\ell(h_{j})=u(h_{j})-q, as |M1(hj)|(hj)|M_{1}(h_{j})|\leq\ell(h_{j}). This satisfies the condition of Case (3), so (dj,q,hj)(d_{j,q},h^{\scalebox{0.8}{$\circ$}}_{j}) is removed. Recall that qu(hj)q\leq u(h_{j}) holds by definition. Additionally, since |M(hj)|u(hj)|M(h_{j})|\leq u(h_{j}), the condition q>u(hj)|M(hj)|+(hj)q>u(h_{j})-|M(h_{j})|+\ell(h_{j}) implies q>(hj)q>\ell(h_{j}). Hence, we have 1q(hj)u(hj)(hj)1\leq q-\ell(h_{j})\leq u(h_{j})-\ell(h_{j}) and so (dj,q,hj)(d_{j,q},h^{\scalebox{0.7}{$\bullet$}}_{j}) is removed. ∎

We now show the nonexistence of a blocking pair in MM^{*} at the end of the algorithm. Suppose that hrM(r)h\succ_{r}M^{*}(r) for some rRDr\in R^{\prime}\cup D and hHHh\in H^{\scalebox{0.8}{$\circ$}}\cup H^{\scalebox{0.7}{$\bullet$}}. By Claim 19, hrM(r)h\succ_{r}M^{*}(r) implies that (r,h)(r,h) is removed during the course of Double Proposal. Then, by Claim 18, hh is full in MM^{*} with residents better than rr, so (r,h)(r,h) cannot block MM^{*}.

Finally, we show that MM^{*} is resident-optimal. Suppose, to the contrary, that there is a stable matching NN^{*} of II^{*} such that the set R{rRD|N(r)rM(r)}R^{*}\coloneqq\set{r\in R^{\prime}\cup D}{N^{*}(r)\succ_{r}M^{*}(r)} is nonempty. By Claim 19, for each rRr\in R^{*}, the pair (r,N(r))(r,N^{*}(r)) is removed at some point of the algorithm. Let r0Rr^{0}\in R^{*} be a resident such that (r0,h0)(r^{0},h^{0}) (where h0N(r0)h_{0}\coloneqq N^{*}(r^{0})) is removed first during the algorithm. Let M0M^{*}_{0} be the matching just after this removal. Then, by recalling the argument in the definitions of removal operations (1)–(3), we can see that h0h^{0} is full in M0M^{*}_{0} with residents better than r0r^{0}. Note that M0(h0)N(h0)M^{*}_{0}(h^{0})\setminus N^{*}(h^{0})\neq\emptyset because |M0(h0)||N(h0)||M^{*}_{0}(h^{0})|\geq|N^{*}(h^{0})| and r0N(h0)M0(h0)r^{0}\in N^{*}(h^{0})\setminus M^{*}_{0}(h^{0}). Take any resident r1M0(h0)N(h0)r^{1}\in M^{*}_{0}(h^{0})\setminus N^{*}(h^{0}) and let h1N(r1)h^{1}\coloneqq N^{*}(r^{1}). Since h0h^{0} is at the top of r1r^{1}’s current list and (r1,h1)(r^{1},h^{1}) is not yet removed by the choice of r0r^{0}, h0r1h1h^{0}\succ_{r^{1}}h^{1} holds. Then, (r1,h0)(r^{1},h^{0}) blocks NN^{*}, which contradicts the stability of NN^{*}.

Appendix C Full Version of Section 5 (Maximum Gaps and Approximation Factors of Double Proposal)

This is a full version of Section 5. Here we analyze the approximation factors of our algorithm Double Proposal, together with the maximum gaps Λ\Lambda for several models mentioned in Section 2.

For an instance II of HRT-MSLQ, let OPT(I){\rm OPT}(I) and WST(I){\rm WST}(I) respectively denote the maximum and minimum scores over all stable matchings of II, and let ALG(I){\rm ALG}(I) be the score of the output of our algorithm. For a model \cal I (i.e., subfamily of problem instances of HRT-MSLQ), let

Λ()=maxIOPT(I)WST(I) and APPROX()=maxIOPT(I)ALG(I).\Lambda({\cal I})=\max_{I\in{\cal I}}\frac{{\rm OPT}(I)}{{\rm WST}(I)}\ \ \mbox{ and }\ \ {\rm APPROX}({\cal I})=\max_{I\in{\cal I}}\frac{{\rm OPT}(I)}{{\rm ALG}(I)}.

The maximum gap Λ()\Lambda({\cal I}) represents a worst approximation factor of a naive algorithm that first breaks ties arbitrarily and then apply the resident-oriented Gale–Shapley algorithm. Let us first confirm this fact. For this purpose, it suffices to show that the worst objective value is indeed realized by the output of such an algorithm.

Proposition 20.

Let II be an instance of HRT-MSLQ. There exists an instance II^{\prime} such that (i) II^{\prime} is obtained by breaking the ties in II and (ii) the residents-oriented Gale–Shapley algorithm applied to II^{\prime} outputs a matching MM^{\prime} with s(M)=WST(I)s(M^{\prime})={\rm WST}(I).

To see this, we remind the following two known results. They are originally shown for the Hospitals/Residents model, but it is easy to see that they hold for HRT-MSLQ too.

Theorem 21 ([30]).

Let II be an instance of HRT-MSLQ and let MM be a matching in II. Then, MM is (weakly) stable in II if and only if MM is stable in some instance II^{\prime} of HRT-MSLQ without ties obtained by breaking the ties in II.

The following claim is a part of the famous rural hospitals theorem. The original version states stronger conditions for the case with incomplete preference lists.

Theorem 22 ([14, 31, 32]).

For an instance II^{\prime} of HRT-MSLQ that has no ties, the number of residents assigned to each hospital does not change over all stable matchings of II^{\prime}.

Proof of Proposition 20.

Let MM be a stable matching of II that attains WST(I){\rm WST}(I). By Theorem 21, there is an instance II^{\prime} of HRT-MSLQ without ties such that it is obtained by breaking the ties in II and MM is a stable matching of II^{\prime}. Let MM^{\prime} be the output of the resident-oriented Gale–Shapley algorithm applied to II^{\prime}. Since both MM^{\prime} and MM are stable matchings of II^{\prime}, which has no ties, Theorem 22 implies that any hospital is assigned the same number of residents in MM^{\prime} and MM. Thus, s(M)=s(M)=WST(I)s(M^{\prime})=s(M)={\rm WST}(I) holds. ∎

In the rest, we analyze Λ()\Lambda({\cal I}) and APPROX(){\rm APPROX}({\cal I}) for each model. All results in this section are summarized in Table 1, which is a refinement of the first and second rows of Table 1. Here, we split each model into three sub-models according to on which side ties are allowed to appear. The ratio for Λ()\Lambda({\cal I}) when ties appear only in hospitals’ side, which is the same as APPROX(){\rm APPROX}({\cal I}) for all four cases, is derived by observing the proofs of the approximation factor of our algorithm. In Table 1, nn represents the number of residents and a function ϕ\phi is defined by ϕ(1)=1\phi(1)=1, ϕ(2)=32\phi(2)=\frac{3}{2}, and ϕ(n)=n(1+n2)/(n+n2)\phi(n)=n(1+\lfloor\frac{n}{2}\rfloor)/(n+\lfloor\frac{n}{2}\rfloor) for any n3n\geq 3. In the uniform model, we write θ=u(h)(h)\theta=\frac{u(h)}{\ell(h)} for the ratio of the upper and lower quotas, which is common to all hospitals. Note that θ2+θ12θ1<θ\frac{\theta^{2}+\theta-1}{2\theta-1}<\theta holds whenever θ>1\theta>1.

General Uniform Marriage RR-side ML
HH RR Both HH RR Both HH RR Both HH RR Both
Max gap Λ()\Lambda({\cal I}) ϕ(n)\phi(n) n+1n+1 θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} θ\theta 1.51.5 22 11 n+1n+1
(ATB+GS) (Cor.28) (Prop.6) (Cor.30) (Prop.8) (Cor.33) (Prop.31) (Cor.36) (Prop.6)
APPROX(){\rm APPROX}({\cal I}) ϕ(n)\phi(n) θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} 1.51.5 11
(Double Proposal) (Thm.7) (Thm.9) (Thm.32) (Thm.34)
Table 1: Maximum gap Λ()\Lambda({\cal I}) (equivalently, approximation factor of the arbitrarily tie-breaking Gale–Shapley algorithm) and approximation factor of Double Proposal of HRT-MSLQ for four fundamental models {\cal I}. Here HH and RR represent the restrictions in which ties appear only in preference lists of residents and hospitals, respectively.

Recall that the following conditions are commonly assumed in all models: all agents have complete preference lists, (h)u(h)n\ell(h)\leq u(h)\leq n for each hospital hHh\in H, and |R|<hHu(h)|R|<\sum_{h\in H}u(h). From these, it follows that in any stable matching any resident is assigned to some hospital.

C.1 General Model

We first analyze our model without any additional assumption. Before evaluating our algorithm, we provide a worst case analysis of a tie-breaking algorithm.

Proposition 23.

The maximum gap for general model satisfies Λ(Gen)=n+1\Lambda({\cal I}_{\rm Gen})=n+1. Moreover, this equality holds even if residents have a master list, and the preference lists of hospitals contain no ties.

Proof.

We first show OPT(I)WST(I)n+1\frac{{\rm OPT}(I)}{{\rm WST}(I)}\leq n+1 for any instance II of HRT-MSLQ. Let NN and MM be stable matchings with s(N)=OPT(I)s(N)={\rm OPT}(I) and s(M)=WST(I)s(M)={\rm WST}(I), respectively. Recall that (h)n\ell(h)\leq n is assumed for any hospital hh. Let H0HH_{0}\subseteq H be the set of hospitals hh with (h)=0\ell(h)=0. Then

s(N)=|H0|+hHH0min{1,|N(h)|(h)}|H0|+hHH0min{1,|N(h)|1}|H0|+n,\displaystyle\textstyle s(N)=|H_{0}|+\sum_{h\in H\setminus H_{0}}\min\{1,\frac{|N(h)|}{\ell(h)}\}\leq|H_{0}|+\sum_{h\in H\setminus H_{0}}\min\{1,\frac{|N(h)|}{1}\}\leq|H_{0}|+n,
s(M)=|H0|+hHH0min{1,|M(h)|(h)}|H0|+hHH0min{1,|M(h)|n}.\displaystyle\textstyle s(M)=|H_{0}|+\sum_{h\in H\setminus H_{0}}\min\{1,\frac{|M(h)|}{\ell(h)}\}\geq|H_{0}|+\sum_{h\in H\setminus H_{0}}\min\{1,\frac{|M(h)|}{n}\}.

In case |H0|=0|H_{0}|=0, we have hHH0min{1,|M(h)|n}=hHmin{1,|M(h)|n}1\sum_{h\in H\setminus H_{0}}\min\{1,\frac{|M(h)|}{n}\}=\sum_{h\in H}\min\{1,\frac{|M(h)|}{n}\}\geq 1, and hence s(N)s(M)n1=n\frac{s(N)}{s(M)}\leq\frac{n}{1}=n. In case |H0|1|H_{0}|\geq 1, we have s(M)|H0|s(M)\geq|H_{0}|, and s(N)s(M)|H0|+n|H0|=1+n|H0|1+n\frac{s(N)}{s(M)}\leq\frac{|H_{0}|+n}{|H_{0}|}=1+\frac{n}{|H_{0}|}\leq 1+n. Thus, OPT(I)WST(I)n+1\frac{{\rm OPT}(I)}{{\rm WST}(I)}\leq n+1 for any instance II.

We next show that there exists an instance II with OPT(I)WST(I)=n+1\frac{{\rm OPT}(I)}{{\rm WST}(I)}=n+1 that satisfies the conditions required in the statement. Let II be an instance consisting of nn residents {r1,r2,,rn}\{r_{1},r_{2},\dots,r_{n}\} and n+1n+1 hospitals {h1,h2,,hn+1}\{h_{1},h_{2},\dots,h_{n+1}\} such that

  • the preference list of every resident consists of a single tie containing all hospitals,

  • the preference list of every hospital is an arbitrary complete list without ties, and

  • [(hi),u(hi)]=[1,1][\ell(h_{i}),u(h_{i})]=[1,1] for i=1,2,,ni=1,2,\dots,n and [(hn+1),u(hn+1)]=[0,n][\ell(h_{n+1}),u(h_{n+1})]=[0,n].

This instance satisfies the conditions in the statement. Since any resident is indifferent among all hospitals, a matching is stable whenever all residents are assigned. Let N={(ri,hi)|i=1,2,,n}N=\set{(r_{i},h_{i})}{i=1,2,\dots,n} and M={(ri,hn+1)|i=1,2,,n}M=\set{(r_{i},h_{n+1})}{i=1,2,\dots,n}. Then, s(N)=n+1s(N)=n+1 while s(M)=1s(M)=1. Thus we obtain OPT(I)WST(I)=n+1\frac{{\rm OPT}(I)}{{\rm WST}(I)}=n+1. ∎

We next show that the approximation factor of our algorithm is ϕ(n)\phi(n). Recall that ϕ\phi is a function of n=|R|n=|R| defined by

ϕ(n)={1n=1,32n=2,n(1+n2)n+n2n3.\phi(n)=\begin{cases}1&n=1,\\ \frac{3}{2}&n=2,\\ \frac{n(1+\lfloor\frac{n}{2}\rfloor)}{n+\lfloor\frac{n}{2}\rfloor}&n\geq 3.\end{cases}
Theorem 7.

The approximation factor of Double Proposal for the general model satisfies APPROX(Gen)=ϕ(n){\rm APPROX}({\cal I}_{\rm Gen})=\phi(n).

Proof.

Here we only show APPROX(Gen)ϕ(n){\rm APPROX}({\cal I}_{\rm Gen})\leq\phi(n), since this together with Proposition 27 shown later implies the required equality.

Let MM be the output of the algorithm and let NN be an optimal stable matching. We define vectors pMp_{M} and pNp_{N} on RR, which are distributions of scores to residents. For each hospital hHh\in H, its scores in MM and NN are sM(h)=min{1,|M(h)|(h)}s_{M}(h)=\min\{1,\frac{|M(h)|}{\ell(h)}\} and sN(h)=min{1,|M(h)|(h)}s_{N}(h)=\min\{1,\frac{|M(h)|}{\ell(h)}\}, respectively. We set {pM(r)}rM(h)\{p_{M}(r)\}_{r\in M(h)} and {pN(r)}rN(h)\{p_{N}(r)\}_{r\in N(h)} as follows. Among M(h)N(h)M(h)\cap N(h), take min{(h),|M(h)N(h)|}\min\{\ell(h),|M(h)\cap N(h)|\} residents arbitrarily and set pM(r)=pN(r)=1(h)p_{M}(r)=p_{N}(r)=\frac{1}{\ell(h)} for them. If |M(h)N(h)|>(h)|M(h)\cap N(h)|>\ell(h), set pM(r)=pN(r)=0p_{M}(r)=p_{N}(r)=0 for the remaining residents in M(h)N(h)M(h)\cap N(h). If |M(h)N(h)|<(h)|M(h)\cap N(h)|<\ell(h), then among M(h)N(h)M(h)\setminus N(h), take min{(h)|M(h)N(h)|,|M(h)N(h)|}\min\{\ell(h)-|M(h)\cap N(h)|,|M(h)\setminus N(h)|\} residents arbitrarily and set pM(r)=1(h)p_{M}(r)=\frac{1}{\ell(h)} for them. If there still remains a resident rr in M(h)N(h)M(h)\setminus N(h) with undefined pM(r)p_{M}(r), set pM(r)=0p_{M}(r)=0 for all such residents. Similarly, define pN(r)p_{N}(r) for residents in N(h)M(h)N(h)\setminus M(h).

By definition, for each hHh\in H, we have pM(M(h))=sM(h)p_{M}(M(h))=s_{M}(h) and pN(N(h))=sN(h)p_{N}(N(h))=s_{N}(h), where the notation pM(A)p_{M}(A) is defined as pM(A)=rApM(r)p_{M}(A)=\sum_{r\in A}p_{M}(r) for any ARA\subseteq R and pN(A)p_{N}(A) is defined similarly. Since each of {M(h)}hH\{M(h)\}_{h\in H} and {N(h)}hH\{N(h)\}_{h\in H} is a partition of RR, we have

s(M)=pM(R),s(N)=pN(R).s(M)=p_{M}(R),\quad s(N)=p_{N}(R).

Thus, what we have to prove is pN(R)pM(R)ϕ(n)\frac{p_{N}(R)}{p_{M}(R)}\leq\phi(n), where n=|R|n=|R|.

Note that, for any resident rRr\in R, the condition M(r)=N(r)M(r)=N(r) means that rM(h)N(h)r\in M(h)\cap N(h) for some hHh\in H. Then, the above construction of pMp_{M} and pNp_{N} implies the following condition for any rRr\in R, which will be used in the last part of the proof (in the proof of Claim 26).

M(r)=N(r)pM(r)=pN(r).M(r)=N(r)\implies p_{M}(r)=p_{N}(r). (1)

For the convenience of the analysis below, let R={r1,r2,,rn}R^{\prime}=\{r^{\prime}_{1},r^{\prime}_{2},\dots,r^{\prime}_{n}\} be the copy of RR and identify pNp_{N} as a vector on RR^{\prime}. Consider a bipartite graph G=(R,R;E)G=(R,R^{\prime};E), where the edge set EE is defined by E={(ri,rj)R×R|pM(ri)pN(rj)}E=\set{(r_{i},r^{\prime}_{j})\in R\times R^{\prime}}{p_{M}(r_{i})\geq p_{N}(r^{\prime}_{j})}. For a matching XEX\subseteq E (i.e., a subset of EE covering each vertex at most once), we denote by (X)RR\partial(X)\subseteq R\cup R^{\prime} the set of vertices covered by XX. Then, we have |R(X)|=|R(X)|=|X||R\cap\partial(X)|=|R^{\prime}\cap\partial(X)|=|X|.

Lemma 23.

G=(R,R;E)G=(R,R^{\prime};E) admits a matching XX such that |X|n2|X|\geq\lceil\frac{n}{2}\rceil. Furthermore, in case s(M)<2s(M)<2, such a matching XX can be chosen so that pM(R(X))1p_{M}(R\cap\partial(X))\geq 1 holds and any rR(X)r\in R\setminus\partial(X) satisfies pM(r)0p_{M}(r)\neq 0.

We postpone the proof of this lemma and now complete the proof of Theorem 7. There are two cases (i) s(M)2s(M)\geq 2 and (ii) s(M)<2s(M)<2.

We first consider the case (i). Assume s(M)2s(M)\geq 2. By Lemma 23, there is a matching XEX\subseteq E such that |X|n2|X|\geq\lceil\frac{n}{2}\rceil. The definition of EE implies pM(R(X))pN(R(X))p_{M}(R\cap\partial(X))\geq p_{N}(R^{\prime}\cap\partial(X)). We then have pN(R)=pN(R(X))+pN(R(X))pM(R(X))+pN(R(X))={pM(R)pM(R(X))}+pN(R(X))p_{N}(R^{\prime})=p_{N}(R^{\prime}\cap\partial(X))+p_{N}(R^{\prime}\setminus\partial(X))\leq p_{M}(R\cap\partial(X))+p_{N}(R^{\prime}\setminus\partial(X))=\{p_{M}(R)-p_{M}(R\setminus\partial(X))\}+p_{N}(R^{\prime}\setminus\partial(X)), which implies the first inequality of the following consecutive inequalities, where others are explained below.

s(N)s(M)\displaystyle\frac{s(N)}{s(M)} =\displaystyle= pN(R)pM(R)\displaystyle\frac{p_{N}(R^{\prime})}{p_{M}(R)}
\displaystyle\leq pM(R)pM(R(X))+pN(R(X))pM(R)\displaystyle\frac{p_{M}(R)-p_{M}(R\setminus\partial(X))+p_{N}(R^{\prime}\setminus\partial(X))}{p_{M}(R)}
\displaystyle\leq pM(R)+|R(X)|pM(R)\displaystyle\frac{p_{M}(R)+|R^{\prime}\setminus\partial(X)|}{p_{M}(R)}
\displaystyle\leq 2+|R(X)|2\displaystyle\frac{2+|R^{\prime}\setminus\partial(X)|}{2}
\displaystyle\leq 2+n22\displaystyle\frac{2+\lfloor\frac{n}{2}\rfloor}{2}
\displaystyle\leq ϕ(n).\displaystyle\phi(n).

The second inequality uses the facts that pM(r)0p_{M}(r)\geq 0 for any rRr\in R and pN(r)1p_{N}(r^{\prime})\leq 1 for any rRr^{\prime}\in R^{\prime}. The third follows from pM(R)=s(M)2p_{M}(R)=s(M)\geq 2. The fourth follows from |X|n2|X|\geq\lceil\frac{n}{2}\rceil as it implies |R(X)|=|R||X|nn2=n2|R^{\prime}\setminus\partial(X)|=|R^{\prime}|-|X|\leq n-\lceil\frac{n}{2}\rceil=\lfloor\frac{n}{2}\rfloor. The last one 2+n22ϕ(n)\frac{2+\lfloor\frac{n}{2}\rfloor}{2}\leq\phi(n) can be checked for n=1,2n=1,2 easily and for n3n\geq 3 as follows:

ϕ(n)2+n22=n(1+n2)n+n22+n22=n2(n2n2)2(n+n2)=n2(n22)2(n+n2)0.\phi(n)-\frac{2+\lfloor\frac{n}{2}\rfloor}{2}=\frac{n(1+\lfloor\frac{n}{2}\rfloor)}{n+\lfloor\frac{n}{2}\rfloor}-\frac{2+\lfloor\frac{n}{2}\rfloor}{2}=\frac{\lfloor\frac{n}{2}\rfloor(n-2-\lfloor\frac{n}{2}\rfloor)}{2(n+\lfloor\frac{n}{2}\rfloor)}=\frac{\lfloor\frac{n}{2}\rfloor(\lceil\frac{n}{2}\rceil-2)}{2(n+\lfloor\frac{n}{2}\rfloor)}\geq 0.

Thus, we obtain s(N)s(M)ϕ(n)\frac{s(N)}{s(M)}\leq\phi(n) as required.

We next consider the case (ii). Assume s(M)<2s(M)<2. By Lemma 23, then there is a matching XEX\subseteq E such that |X|n2|X|\geq\lceil\frac{n}{2}\rceil, pM(R(X))1p_{M}(R\cap\partial(X))\geq 1, and pM(r)0p_{M}(r)\neq 0 for any rR(X)r\in R\setminus\partial(X). Again, by the definition of EE, we have pM(R(X))pN(R(X))p_{M}(R\cap\partial(X))\geq p_{N}(R^{\prime}\cap\partial(X)), which implies the first inequality of the following consecutive inequalities, where others are explained below.

s(N)s(M)\displaystyle\frac{s(N)}{s(M)} =\displaystyle= pN(R)pM(R)\displaystyle\frac{p_{N}(R^{\prime})}{p_{M}(R)}
\displaystyle\leq pM(R(X))+pN(R(X))pM(R(X))+pM(R(X))\displaystyle\frac{p_{M}(R\cap\partial(X))+p_{N}(R^{\prime}\setminus\partial(X))}{p_{M}(R\cap\partial(X))+p_{M}(R\setminus\partial(X))}
\displaystyle\leq pM(R(X))+|R(X)|pM(R(X))+1n|R(X)|\displaystyle\frac{p_{M}(R\cap\partial(X))+|R^{\prime}\setminus\partial(X)|}{p_{M}(R\cap\partial(X))+\frac{1}{n}|R\setminus\partial(X)|}
\displaystyle\leq 1+|R(X)|1+1n|R(X)|\displaystyle\frac{1+|R^{\prime}\setminus\partial(X)|}{1+\frac{1}{n}|R\setminus\partial(X)|}
\displaystyle\leq 1+n21+1nn2=ϕ(n).\displaystyle\frac{1+\lfloor\frac{n}{2}\rfloor}{1+\frac{1}{n}\lfloor\frac{n}{2}\rfloor}\ =\ \phi(n).

The second inequality follows from the facts that pN(r)1p_{N}(r^{\prime})\leq 1 for any rRr^{\prime}\in R^{\prime} and pM(r)0p_{M}(r)\neq 0 for any rR(X)r\in R\setminus\partial(X). Note that pM(r)0p_{M}(r)\neq 0 implies pM(r)=1(h)1np_{M}(r)=\frac{1}{\ell(h)}\geq\frac{1}{n} where hM(r)h\coloneqq M(r). The third follows from pM(R(X))1p_{M}(R\cap\partial(X))\geq 1. The last one follows from |R(X)|=|R(X)|=n|X|n2|R^{\prime}\setminus\partial(X)|=|R\setminus\partial(X)|=n-|X|\leq\lfloor\frac{n}{2}\rfloor. Thus, we obtain s(N)s(M)ϕ(n)\frac{s(N)}{s(M)}\leq\phi(n) also for this case. ∎

Proof of Lemma 23.

To show the first claim of the lemma, we intend to construct a matching in GG of size at least n2\lceil\frac{n}{2}\rceil. We need some preparation for this construction.

Divide the set RR of residents into three parts:

R+{rR|M(r)rN(r)},\displaystyle R_{+}\coloneqq\set{r\in R}{M(r)\succ_{r}N(r)},
R{rR|N(r)rM(r) or [M(r)=rN(r),pN(r)>pM(r)]}, and\displaystyle R_{-}\coloneqq\set{r\in R}{N(r)\succ_{r}M(r)\text{ or }[M(r)=_{r}N(r),~{}p_{N}(r)>p_{M}(r)]},\mbox{ and}
R0{rR|M(r)=rN(r),pM(r)pN(r)}.\displaystyle R_{0}~{}\!\coloneqq\set{r\in R}{M(r)=_{r}N(r),~{}p_{M}(r)\geq p_{N}(r)}.

Let R+,R,R0R^{\prime}_{+},R^{\prime}_{-},R^{\prime}_{0} be the corresponding subsets of RR^{\prime}.

Claim 24.

There is an injection ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime} such that pM(r)=pN(ξ+(r))p_{M}(r)=p_{N}(\xi_{+}(r)) for every rR+r\in R_{+}. There is an injection ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R such that pN(r)=pM(ξ(r))p_{N}(r^{\prime})=p_{M}(\xi_{-}(r^{\prime})) for every rRr^{\prime}\in R^{\prime}_{-}.

Proof.

We first construct ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime}. Set M(R+){M(r)|rR+}M(R_{+})\coloneqq\set{M(r)}{r\in R_{+}}. For each hospital hM(R+)h\in M(R_{+}), any rM(h)R+r\in M(h)\cap R_{+} satisfies h=M(r)rN(r)h=M(r)\succ_{r}N(r). By the stability of NN, then hh is full in NN. Therefore, in N(h)N(h), there are (h)\ell(h) residents with pNp_{N} value 1(h)\frac{1}{\ell(h)} and u(h)(h)u(h)-\ell(h) residents with pNp_{N} value 0. Since |M(h)|u(h)|M(h)|\leq u(h) and pMp_{M} values are 1(h)\frac{1}{\ell(h)} for min{|M(h)|,(h)}\min\{|M(h)|,\ell(h)\} residents, we can define an injection ξ+h:M(h)R+N(h)\xi_{+}^{h}\colon M(h)\cap R_{+}\to N(h) such that pM(r)=pN(ξ+h(r))p_{M}(r)=p_{N}(\xi_{+}^{h}(r)) for every rM(h)R+r\in M(h)\cap R_{+}. By regarding N(h)N(h) as a subset of RR^{\prime} and taking the direct sum of ξ+h\xi_{+}^{h} for all hM(R+)h\in M(R_{+}), we obtain an injection ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime} such that pM(r)=pN(ξ+(r))p_{M}(r)=p_{N}(\xi_{+}(r)) for every rR+r\in R_{+}.

We next construct ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R. Define N(R){N(r)|rR}N(R^{\prime}_{-})\coloneqq\set{N(r^{\prime})}{r^{\prime}\in R^{\prime}_{-}}. For each hN(R)h^{\prime}\in N(R^{\prime}_{-}), any resident rN(h)Rr\in N(h^{\prime})\cap R^{\prime}_{-} satisfies either h=N(r)rM(r)h^{\prime}=N(r)\succ_{r}M(r) or [h=N(r)=rM(r),pN(r)>pM(r)][h^{\prime}=N(r)=_{r}M(r),~{}p_{N}(r)>p_{M}(r)]. In case some resident rr satisfies h=N(r)rM(r)h^{\prime}=N(r)\succ_{r}M(r), the stability of MM implies that hh^{\prime} is full in MM. Then, we can define an injection ξh:N(h)RM(h)\xi_{-}^{h^{\prime}}\colon N(h^{\prime})\cap R^{\prime}_{-}\to M(h^{\prime}) in the manner we defined ξ+h\xi_{+}^{h} above and pN(r)=pM(ξh(r))p_{N}(r^{\prime})=p_{M}(\xi_{-}^{h^{\prime}}(r^{\prime})) holds for any rN(h)Rr^{\prime}\in N(h^{\prime})\cap R^{\prime}_{-}. We then assume that all residents rN(h)Rr\in N(h^{\prime})\cap R^{\prime}_{-} satisfy [h=N(r)=rM(r),pN(r)>pM(r)][h^{\prime}=N(r)=_{r}M(r),~{}p_{N}(r)>p_{M}(r)]. Then, all those residents satisfy 0pN(r)=1(h)0\neq p_{N}(r)=\frac{1}{\ell(h^{\prime})}, and hence |N(h)R|(h)|N(h^{\prime})\cap R^{\prime}_{-}|\leq\ell(h^{\prime}). Additionally, pN(r)>pN(r)p_{N}(r)>p_{N}(r) implies either pM(r)=0p_{M}(r)=0 or (h)<(h)\ell(h^{\prime})<\ell(h), where hM(r)h\coloneqq M(r). Observe that pM(r)=0p_{M}(r)=0 implies |M(h)|>(h)|M(h)|>\ell(h). As we have h=rhh=_{r}h^{\prime} and M(r)=hM(r)=h, by Lemma 2, each of (h)<(h)\ell(h^{\prime})<\ell(h) and |M(h)|>(h)|M(h)|>\ell(h) implies |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}), and hence there are (h)\ell(h^{\prime}) residents whose pMp_{M} values are 1(h)\frac{1}{\ell(h^{\prime})}. Since pN(r)=1(h)p_{N}(r)=\frac{1}{\ell(h^{\prime})} for all residents rN(h)Rr\in N(h^{\prime})\cap R^{\prime}_{-}, we can define an injection ξh:N(h)RM(h)\xi_{-}^{h^{\prime}}\colon N(h^{\prime})\cap R^{\prime}_{-}\to M(h^{\prime}) such that pN(r)=pM(ξh(r))=1(h)p_{N}(r^{\prime})=p_{M}(\xi_{-}^{h^{\prime}}(r^{\prime}))=\frac{1}{\ell(h^{\prime})} for every rN(h)Rr^{\prime}\in N(h^{\prime})\cap R^{\prime}_{-}. By taking the direct sum of ξh\xi_{-}^{h^{\prime}} for all hM(R)h^{\prime}\in M(R^{\prime}_{-}), we obtain an injection ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R such that pN(r)=pM(ξ(r))p_{N}(r^{\prime})=p_{M}(\xi_{-}(r^{\prime})) for every rRr^{\prime}\in R^{\prime}_{-}. ∎

We now define a bipartite graph which may have multiple edges. Let G=(R,R;E)G^{*}=(R,R^{\prime};E^{*}), where EE^{*} is the disjoint union of E+E_{+}, EE_{-}, and E0E_{0}, where

E+\displaystyle E_{+} {(r,ξ+(r))|rR+},\displaystyle\coloneqq\set{(r,\xi_{+}(r))}{r\in R_{+}},
E\displaystyle E_{-} {(ξ(r),r)|rR}, and\displaystyle\coloneqq\set{(\xi_{-}(r^{\prime}),r^{\prime})}{r\in R^{\prime}_{-}},\mbox{ and}
E0\displaystyle E_{0}~{}\! {(r,r)|rR0 and r is the copy of r}.\displaystyle\coloneqq\set{(r,r^{\prime})}{r\in R_{0}\text{ and $r^{\prime}$ is the copy of $r$}}.
Refer to caption
Figure 4: A graph G=(R,R;E)G^{*}=(R,R^{\prime};E^{*}). The upper and lower rectangles represent RR and RR^{\prime}, respectively. The edge sets E+E_{+}, EE_{-}, and E0E_{0} are respectively represented by downward directed edges, upward directed edges, and undirected edges. (The same figure as Fig. 1.)

See Fig. 4 for an example. Note that EE^{*} can have multiple edges between rr and rr^{\prime} if (r,r)=(r,ξ+(r))=(ξ(r),r)(r,r^{\prime})=(r,\xi_{+}(r))=(\xi_{-}(r^{\prime}),r^{\prime}). By the definitions of ξ+\xi_{+}, ξ\xi_{-}, and R0R_{0}, any edge (r,r)(r,r^{\prime}) in EE^{*} satisfies pM(r)pN(r)p_{M}(r)\geq p_{N}(r^{\prime}). Since E={(r,r)|pM(r)pN(r)}E=\set{(r,r^{\prime})}{p_{M}(r)\geq p_{N}(r^{\prime})}, any matching in GG^{*} is also a matching in GG.

Then, the following claim completes the first statement of the lemma.

Claim 25.

GG^{*} admits a matching whose size is at least n2\lceil\frac{n}{2}\rceil, and so does GG.

Proof.

Since ξ+:R+R\xi_{+}\colon R_{+}\to R^{\prime} and ξ:RR\xi_{-}\colon R^{\prime}_{-}\to R are injections, every vertex in GG^{*} is incident to at most two edges in EE^{*} as follows: Each vertex in R+R_{+} (resp., RR^{\prime}_{-}) is incident to exactly one edge in E+E_{+} (resp., EE_{-}) and at most one edge in EE_{-} (resp., E+E_{+}). Each vertex in RR_{-} (resp., R+R^{\prime}_{+}) is incident to at most one edge in EE_{-} (resp., E+E_{+}). Each vertex in R0R_{0} (resp., R0R^{\prime}_{0}) is incident to exactly one edge in E=E_{=} and at most one edge in EE_{-} (resp., E+E_{+}).

Since EE^{*} is the disjoint union of E+E_{+}, EE_{-}, and E0E_{0}, we have E=|E+|+|E|+|E=|=|R+|+|R|+|R0|=nE^{*}=|E_{+}|+|E_{-}|+|E_{=}|=|R_{+}|+|R_{-}|+|R_{0}|=n. As every vertex is incident to at most two edges in EE^{*}, each connected component KK of GG^{*} forms a path or a cycle. In KK, we can take a matching that contains at least a half of the edges in KK. (Take edges alternately along a path or a cycle. For a path with odd edges, let the end edges be contained.) The union of such matchings in all components forms a matching in GG^{*} whose size is at least n2\lceil\frac{n}{2}\rceil. ∎

In the rest, we show the second claim of Lemma 23. Suppose that there is a matching YY in GG. Then, there is a maximum matching XX in GG such that (Y)(X)\partial(Y)\subseteq\partial(X). This follows from the behavior of the augmenting path algorithm to compute a maximum matching in a bipartite graph (see e.g., [34]). In this algorithm, a matching, say XX, is repeatedly updated to reach the maximum size. Through the algorithm, (X)\partial(X) is monotone increasing. Therefore, if we initialize XX by YY, it finds a maximum matching with (Y)(X)\partial(Y)\subseteq\partial(X). Additionally, note that (Y)(X)\partial(Y)\subseteq\partial(X) implies pM(R(Y))pM(R(X))p_{M}(R\cap\partial(Y))\leq p_{M}(R\cap\partial(X)) as pMp_{M} is an nonnegative vector. Therefore, the following claim completes the proof of the second claim of the lemma.

Claim 26.

If s(M)<2s(M)<2, then there is a matching YY in GG such that pM(R(Y))1p_{M}(R\cap\partial(Y))\geq 1 holds and any rR(Y)r\in R\setminus\partial(Y) satisfies pM(r)0p_{M}(r)\neq 0.

Proof.

We first consider the case where pM(r)=0p_{M}(r^{*})=0 for some rRr^{*}\in R. For hM(r)h\coloneqq M(r^{*}) we have |M(h)|>(h)|M(h)|>\ell(h), and hence pM(M(h))=1p_{M}(M(h))=1. Since s(M)<2s(M)<2, any hospital other than hh should be deficient. Therefore, the value of pMp_{M} can be 0 only for residents in M(h)M(h).

  • If M(h)R+M(h)\cap R_{+}\neq\emptyset, then as shown in the proof of Claim 24, hh is full in NN. Then, there is an injection ξ:M(h)N(h)\xi:M(h)\to N(h) such that pM(r)=pN(r)p_{M}(r)=p_{N}(r) for any rM(h)r\in M(h). Hence, Y{(r,ξ(r))|rM(h)}EY\coloneqq\set{(r,\xi(r))}{r\in M(h)}\subseteq E is a matching in GG satisfying the required conditions.

  • If M(h)RM(h)\cap R_{-}\neq\emptyset, take any rM(h)Rr\in M(h)\cap R_{-} and set hN(r)h^{\prime}\coloneqq N(r). As shown in the proof of Claim 24, then |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}), i.e., hh^{\prime} is sufficient. Note that we have pM(r)pN(r)p_{M}(r)\neq p_{N}(r) only if M(r)N(r)M(r)\neq N(r) by the condition (1). Since rRr\in R_{-} implies either hrhh^{\prime}\succ_{r}h or pN(r)>pM(r)p_{N}(r)>p_{M}(r), we have hhh^{\prime}\neq h, which contradicts the fact that hh is the unique sufficient hospital. Thus, M(h)RM(h)\cap R_{-}\neq\emptyset cannot happen.

  • If M(h)R+=M(h)\cap R_{+}=\emptyset and M(h)R=M(h)\cap R_{-}=\emptyset, we have M(h)R0M(h)\subseteq R_{0}. Then, by connecting each resident in M(h)M(h) to its copy in RR^{\prime}, we can obtain a matching satisfying the required conditions.

We next consider the case where pM(r)0p_{M}(r)\neq 0 for any rRr\in R. Then, our task is to find a matching YEY\subseteq E with pM(R(Y))1p_{M}(R\cap\partial(Y))\geq 1. With this assumption, for any resident rRr\in R with h=M(r)h=M(r), we always have pM(r)=1(h)p_{M}(r)=\frac{1}{\ell(h)}.

  • If R+R_{+}\neq\emptyset, then M(R+){M(r)|rR+}M(R_{+})\coloneqq\set{M(r)}{r\in R_{+}}\neq\emptyset. Since hM(R+)|M(h)R+|=|R+|\sum_{h\in M(R_{+})}|M(h)\cap R_{+}|=|R_{+}| and hM(R+)|N(h)R+||R+|\sum_{h\in M(R_{+})}|N(h)\cap R_{+}|\leq|R_{+}|, there is at least one hospital hM(R+)h\in M(R_{+}) such that |M(h)R+||N(h)R+||M(h)\cap R_{+}|\geq|N(h)\cap R_{+}|. Let hh be such a hospital. Since hM(R+)h\in M(R_{+}), as shown in the proof of Claim 24, hh is full in NN and there are (h)\ell(h) residents rr with pN(r)=1(h)p_{N}(r)=\frac{1}{\ell(h)}. We intend to show that there are at least (h)\ell(h) residents rr with pM(r)1(h)p_{M}(r)\geq\frac{1}{\ell(h)}, which implies the existence of a required YY. Regard N(h)N(h) as a subset of RR^{\prime}. If there is some rN(h)Rr^{\prime}\in N(h)\cap R^{\prime}_{-}, as seen in the proof of Claim 24, there are (h)\ell(h) residents rr with pM(r)=1(h)p_{M}(r)=\frac{1}{\ell(h)}, and we are done. So, assume N(h)R=N(h)\cap R^{\prime}_{-}=\emptyset, which implies N(h)R+R0N(h)\subseteq R^{\prime}_{+}\cup R^{\prime}_{0}. Since |M(h)R+||N(h)R+||M(h)\cap R_{+}|\geq|N(h)\cap R_{+}|, at least |N(h)R+||N(h)\cap R_{+}| residents in R+R_{+} belongs to M(h)M(h). As pMp_{M} is positive, then at least |N(h)R+||N(h)\cap R_{+}| residents rM(h)r\in M(h) satisfy pM(r)=1(h)p_{M}(r)=\frac{1}{\ell(h)}. Additionally, by the definition of E0E_{0}, each rN(h)R0r\in N(h)\cap R_{0} satisfies pM(r)pN(r)p_{M}(r)\geq p_{N}(r), where pN(r)=1(h)p_{N}(r)=\frac{1}{\ell(h)} for at least (h)|N(h)R+|\ell(h)-|N(h)\cap R_{+}| residents in N(h)R0N(h)\cap R_{0}. Thus, at least (h)\ell(h) residents rRr\in R satisfy pM(r)1(h)p_{M}(r)\geq\frac{1}{\ell(h)}.

  • If RR_{-}\neq\emptyset, then N(R){N(r)|rR}N(R_{-})\coloneqq\set{N(r)}{r\in R_{-}}\neq\emptyset. Similarly to the argument above, there is at least one hospital hN(R)h\in N(R_{-}) such that |N(h)R||M(h)R||N(h)\cap R_{-}|\geq|M(h)\cap R_{-}|. Let hh be such a hospital. Since hN(R)h\in N(R_{-}), as shown in the proof of Claim 24, hh is sufficient in MM and there are (h)\ell(h) residents rr with pM(r)=1(h)p_{M}(r)=\frac{1}{\ell(h)}. We intend to show that there are at least (h)\ell(h) residents rr with pN(r)1(h)p_{N}(r)\leq\frac{1}{\ell(h)}. If there is some rM(h)R+r\in M(h)\cap R_{+}, we are done as in the previous case. So, assume M(h)R+=M(h)\cap R_{+}=\emptyset, which implies M(h)RR0M(h)\subseteq R_{-}\cup R_{0}. Since |N(h)R||M(h)R||N(h)\cap R_{-}|\geq|M(h)\cap R_{-}|, at least |M(h)R||M(h)\cap R_{-}| residents rr in RR_{-} belongs to N(h)N(h), and satisfies pN(r){1(h),0}p_{N}(r)\in\{\frac{1}{\ell(h)},0\}. Additionally, by the definition of E0E_{0}, each rM(h)R0r\in M(h)\cap R_{0} satisfies pN(r)pM(r)=1(h)p_{N}(r)\leq p_{M}(r)=\frac{1}{\ell(h)}. Thus, at least (h)\ell(h) residents rRr\in R satisfy pN(r)1(h)p_{N}(r)\leq\frac{1}{\ell(h)}.

  • If R+=R_{+}=\emptyset and R=R_{-}=\emptyset, then R0=RR_{0}=R and E0E_{0} forms a matching and we have (E)R=R\partial(E)\cap R=R. Since pM(r)=1(h)1np_{M}(r)=\frac{1}{\ell(h)}\geq\frac{1}{n} for any hHh\in H and rM(h)r\in M(h), we have pM((E)R)1p_{M}(\partial(E)\cap R)\geq 1.

Thus, in any case, we can find a matching with required conditions. ∎

Thus we completed the proof of the second claim of the lemma. ∎

The above analysis for Theorem 7 is tight as shown by the following proposition.

Proposition 27.

For any natural number nn, there is an instance II with nn residents such that OPT(I)ALG(I)=ϕ(n)\frac{{\rm OPT}(I)}{{\rm ALG}(I)}=\phi(n). This holds even if ties appear only in preference lists of hospitals or only in preference lists of residents.

Proof.

As the upper bound is shown in Theorem 7, it suffices to give an instance II with OPT(I)ALG(I)ϕ(n)\frac{{\rm OPT}(I)}{{\rm ALG}(I)}\geq\phi(n). Recall that ϕ(1)=1\phi(1)=1, ϕ(2)=32\phi(2)=\frac{3}{2}, and ϕ(n)=n(1+n2)/(n+n2)\phi(n)=n(1+\lfloor\frac{n}{2}\rfloor)/(n+\lfloor\frac{n}{2}\rfloor) for n3n\geq 3.

Case n=1n=1 is trivial because OPT(I)ALG(I)\frac{{\rm OPT}(I)}{{\rm ALG}(I)} is always at least 1. For n2n\geq 2, we construct instances I1I_{1} and I2I_{2} such that OPT(I1)ALG(I1)ϕ(n)\frac{{\rm OPT}(I_{1})}{{\rm ALG}(I_{1})}\geq\phi(n), OPT(I2)ALG(I2)ϕ(n)\frac{{\rm OPT}(I_{2})}{{\rm ALG}(I_{2})}\geq\phi(n), and in I1I_{1} (resp., in I2I_{2}) ties appear only in preference lists of hospitals (resp., residents).

In case n=2n=2, consider the following instance I1I_{1} with two residents and three hospitals.

r1r_{1}: h1h_{1} h2h_{2} h3h_{3} h1h_{1} [1,1][1,1]: r1r_{1} r2r_{2} )
r2r_{2}: h1h_{1} h3h_{3} h2h_{2} h2h_{2} [1,1][1,1]: r1r_{1} r2r_{2}
h3h_{3} [0,1][0,1]: r1r_{1} r2r_{2}

Recall that we delete arbitrariness in Double Proposal using the priority rules defined by indices. Then, h1h_{1} prefers the second proposal of r1r_{1} to that of r2r_{2}. Therefore, Double Proposal returns {(r1,h1),(r2,h3)}\{(r_{1},h_{1}),(r_{2},h_{3})\}, whose score is 22. Since {(r1,h2),(r2,h1)}\{(r_{1},h_{2}),(r_{2},h_{1})\} is also a stable matching and has the score 33, we obtain OPT(I1)ALG(I1)=32=ϕ(2)\frac{{\rm OPT}(I_{1})}{{\rm ALG}(I_{1})}=\frac{3}{2}=\phi(2).

The instance I2I_{2} for n=2n=2 is given as follows.

r1r_{1}: h1h_{1} h2h_{2} ) h3h_{3} h1h_{1} [0,1][0,1]: r1r_{1} r2r_{2}
r2r_{2}: h2h_{2} h3h_{3} h1h_{1} h2h_{2} [1,1][1,1]: r1r_{1} r2r_{2}
h3h_{3} [1,1][1,1]: r1r_{1} r2r_{2}

The algorithm proceeds as follows. First, r1r_{1} makes the first proposal to h1h_{1} as its lower quota is smaller than that of h2h_{2}. Since (h1)=0\ell(h_{1})=0, this proposal is immediately rejected. Then, r1r_{1} makes the first proposal to h2h_{2}, and it is accepted. Next, r2r_{2} makes the fist proposal to h2h_{2}. Since neither of r1r_{1} and r2r_{2} have been rejected by h2h_{2}, the one with larger index, i.e., r2r_{2} is rejected. Then, r2r_{2} makes the second proposal to h2h_{2}, and then r1r_{1} is rejected by h2h_{2} because r1r_{1} has not been rejected by h2h_{2}. Then, r1r_{1} goes into the second round of the top tie and makes the second proposal to h1h_{1}. As h1h_{1} has upper quota 11 and is currently assigned no resident, this proposal is accepted, and the algorithm terminates with the output {(r1,h1),(r2,h2)}\{(r_{1},h_{1}),(r_{2},h_{2})\}, whose score is 22. On the other hand, a matching {(r1,h2),(r2,h3)}\{(r_{1},h_{2}),(r_{2},h_{3})\} is stable and has a score 33. Thus OPT(I2)ALG(I2)=32=ϕ(2)\frac{{\rm OPT}(I_{2})}{{\rm ALG}(I_{2})}=\frac{3}{2}=\phi(2).

In the rest, we show the claim for n3n\geq 3. In both I1I_{1} and I2I_{2}, the set of residents is given as R=RR′′R=R^{\prime}\cup R^{\prime\prime} where R={r1,r2,,rn2}R^{\prime}=\{r^{\prime}_{1},r^{\prime}_{2},\dots,r^{\prime}_{\lceil\frac{n}{2}\rceil}\} and R′′={r1′′,r2′′,,rn2′′}R^{\prime\prime}=\{r^{\prime\prime}_{1},r^{\prime\prime}_{2},\dots,r^{\prime\prime}_{\lfloor\frac{n}{2}\rfloor}\} and the set of hospital is given as H={h1,h2,hn}{x,y}H=\{h_{1},h_{2}\dots,h_{n}\}\cup\{x,y\}. Then, |R|=n|R|=n and |H|=n+2|H|=n+2.

The preference lists in I1I_{1} are given as follows. Here “(  RR  )” represents the tie consisting of all residents and “[  RR  ]” denotes an arbitrary strict order of all residents. The notation “\cdots” at the tail of lists means an arbitrary strict order of all agents missing in the list.

rir^{\prime}_{i}: xx hih_{i} \cdots xx [n2,n2][\lceil\frac{n}{2}\rceil,\lceil\frac{n}{2}\rceil]: (   RR   )
ri′′r^{\prime\prime}_{i}: xx yy \cdots yy [n,n][n,n]: [   RR   ]
hih_{i} [1,1][1,1]: [   RR   ]

As each resident has a strict preference order, she makes two proposals to the same hospital sequentially. If indices are defined so that residents in RR^{\prime} have smaller indices than those in R′′R^{\prime\prime}, then we can observe that our algorithm Double Proposal returns the matching

M1={(ri,x)|i=1,2,,n2}{(ri′′,y)|i=1,2,,n2}.\textstyle M_{1}=\set{(r^{\prime}_{i},x)}{i=1,2,\dots,\lceil\frac{n}{2}\rceil}\cup\set{(r^{\prime\prime}_{i},y)}{i=1,2,\dots,\lfloor\frac{n}{2}\rfloor}.

Its score is s(M1)=sM1(x)+sM1(y)=1+n2n=1n(n+n2)s(M_{1})=s_{M_{1}}(x)+s_{M_{1}}(y)=1+\frac{\lfloor\frac{n}{2}\rfloor}{n}=\frac{1}{n}(n+\lfloor\frac{n}{2}\rfloor). Next, define N1N_{1} by

N1={(ri,hi)|i=1,2,,n2}{(ri′′,x)|i=1,2,,n2}\textstyle N_{1}=\set{(r^{\prime}_{i},h_{i})}{i=1,2,\dots,\lceil\frac{n}{2}\rceil}\cup\set{(r^{\prime\prime}_{i},x)}{i=1,2,\dots,\lfloor\frac{n}{2}\rfloor}

and let N~1N1\tilde{N}_{1}\coloneqq N_{1} if nn is even and N~1(N1{(r1,h1)}){(r1,x)}\tilde{N}_{1}\coloneqq(N_{1}\setminus\{(r^{\prime}_{1},h_{1})\})\cup\{(r^{\prime}_{1},x)\} if nn is odd. We can check that N~1\tilde{N}_{1} is a stable matching and its score is s(N~1)=1+n2s(\tilde{N}_{1})=1+\lfloor\frac{n}{2}\rfloor. Therefore, OPT(I1)ALG(I1)s(N~1)s(M1)=ϕ(n)\frac{{\rm OPT}(I_{1})}{{\rm ALG}(I_{1})}\geq\frac{s(\tilde{N}_{1})}{s(M_{1})}=\phi(n).

The preference lists in I2I_{2} are given as follows. Similarly to the notation “[  RR  ],” we denote by “[ RR^{\prime} ]” and “[ R′′R^{\prime\prime} ]” arbitrary strict orders of all residents in RR^{\prime} and R′′R^{\prime\prime}, respectively.

rir^{\prime}_{i}: xx yy ) \cdots xx [n2,n2][\lceil\frac{n}{2}\rceil,\lceil\frac{n}{2}\rceil]: RR^{\prime} ] [ R′′R^{\prime\prime} ]
ri′′r^{\prime\prime}_{i}: xx hih_{i} \cdots yy [n,n][n,n]: [   RR   ]
hih_{i} [1,1][1,1]: [   RR   ]

Then, we can observe that Double Proposal returns a matching M~2\tilde{M}_{2} which is defined as follows. First, define M2M_{2} by

M2={(ri,y)|i=1,2,,n2}{(ri′′,x)|i=1,2,,n2}\textstyle M_{2}=\set{(r^{\prime}_{i},y)}{i=1,2,\dots,\lceil\frac{n}{2}\rceil}\cup\set{(r^{\prime\prime}_{i},x)}{i=1,2,\dots,\lfloor\frac{n}{2}\lfloor}

and let M~2M2\tilde{M}_{2}\coloneqq M_{2} if nn is even and M2~(M2{(r1,y)}){(r1,x)}\tilde{M_{2}}\coloneqq(M_{2}\setminus\{(r^{\prime}_{1},y)\})\cup\{(r^{\prime}_{1},x)\} if nn is odd. Its score is s(M~2)=sM~2(x)+sM~2(y)=1+n2n=1n(n+n2)s(\tilde{M}_{2})=s_{\tilde{M}_{2}}(x)+s_{\tilde{M}_{2}}(y)=1+\frac{\lfloor\frac{n}{2}\rfloor}{n}=\frac{1}{n}(n+\lfloor\frac{n}{2}\rfloor). Next, define N2N_{2} by

N2={(ri,x)|i=1,2,,n2}{(ri′′,hi)|i=1,2,,n2}.\textstyle N_{2}=\set{(r^{\prime}_{i},x)}{i=1,2,\dots,\lceil\frac{n}{2}\rceil}\cup\set{(r^{\prime\prime}_{i},h_{i})}{i=1,2,\dots,\lfloor\frac{n}{2}\rfloor}.

We can observe that N2N_{2} is a stable matching and its score is s(N2)=1+n2s(N_{2})=1+\lfloor\frac{n}{2}\rfloor. Thus, OPT(I2)ALG(I2)s(N2)s(M~2)=ϕ(n)\frac{{\rm OPT}(I_{2})}{{\rm ALG}(I_{2})}\geq\frac{s(N_{2})}{s(\tilde{M}_{2})}=\phi(n). ∎

Corollary 28.

Among instances in which ties appear only in preference lists of hospitals, maxIOPT(I)WST(I)=ϕ(n)\max_{I}\frac{{\rm OPT}(I)}{{\rm WST}(I)}=\phi(n).

Proof.

From the proof of Theorem 7, we can observe that the inequality s(N)s(M)ϕ(n)\frac{s(N)}{s(M)}\leq\phi(n) is obtained if both MM and NN are stable and MM satisfies the properties in Lemma 2. Note that the properties in Lemma 2 are satisfied by any stable matching if there is no ties in the preference lists of residents, because h=rhh=_{r}h^{\prime} cannot happen for any rRr\in R and h,hHh,h^{\prime}\in H. Therefore, the maximum value of OPT(I)WST(I)\frac{{\rm OPT}(I)}{{\rm WST}(I)} is at most ϕ(n)\phi(n). Further the instance I1I_{1} in the proof of Proposition 27 shows that the value is at least ϕ(n)\phi(n). ∎

C.2 Uniform Model

In the uniform model, upper quotas and lower quotas are same for all hospitals. Let \ell and uu be the common lower and upper quotas, respectively, and let θu(1)\theta\coloneqq\frac{u}{\ell}~{}(\geq 1). We first provide a worst case analysis of a tie-breaking algorithm.

Proposition 29.

The maximum gap for the uniform model satisfies Λ(Uniform)=θ\Lambda({\cal I}_{\rm Uniform})=\theta. Moreover, this equality holds even if preference lists of hospitals contain no ties.

Proof.

We first show OPT(I)WST(I)θ\frac{{\rm OPT}(I)}{{\rm WST}(I)}\leq\theta for any instance II of the uniform model with θ=u\theta=\frac{u}{\ell}. Let NN and MM be stable matchings with s(N)=OPT(I)s(N)={\rm OPT}(I) and s(M)=WST(I)s(M)={\rm WST}(I). Clearly,

s(N)=hHmin{1,|N(h)|}hH|M(h)|=|R|.\textstyle s(N)=\sum_{h\in H}\min\{1,\frac{|N(h)|}{\ell}\}\leq\sum_{h\in H}\frac{|M(h)|}{\ell}=\frac{|R|}{\ell}.

Note that |M(h)|u|M(h)|\leq u implies min{1,|M(h)|}=|M(h)|min{1|M(h)|,1}|M(h)|u\min\{1,\frac{|M(h)|}{\ell}\}=|M(h)|\cdot\min\{\frac{1}{|M(h)|},\frac{1}{\ell}\}\geq\frac{|M(h)|}{u}. Then

s(M)=hHmin{1,|M(h)|}hH|M(h)|u=|R|u.\textstyle s(M)=\sum_{h\in H}\min\{1,\frac{|M(h)|}{\ell}\}\geq\sum_{h\in H}\frac{|M(h)|}{u}=\frac{|R|}{u}.

Therefore, we have s(N)s(M)u=θ\frac{s(N)}{s(M)}\leq\frac{u}{\ell}=\theta.

Next, we provide an instance II with OPT(I)WST(I)=θ\frac{{\rm OPT}(I)}{{\rm WST}(I)}=\theta in which ties appear only in preference lists of residents. Let II be an instance of the uniform model with quotas [,u][\ell,u] consisting of u\ell\cdot u residents and uu hospitals such that

  • the preference list of every resident consists of a single tie containing all hospitals, and

  • the preference list of every hospital is an arbitrary complete list without ties.

Since any resident is indifferent among all hospitals, a matching is stable whenever all residents are assigned. Let MM be a matching that assigns uu residents to \ell hospitals and no resident to uu-\ell hospitals. Additionally, let NN be a matching that assigns \ell residents to all uu hospitals. Then, s(M)=s(M)=\ell while s(N)=us(N)=u. Thus we obtain OPT(I)WST(I)=u=θ\frac{{\rm OPT}(I)}{{\rm WST}(I)}=\frac{u}{\ell}=\theta. ∎

We show that the approximation factor of our algorithm is θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} for this model. Since θθ2+θ12θ1=θ22θ+12θ1=(θ1)22θ1\theta-\frac{\theta^{2}+\theta-1}{2\theta-1}=\frac{\theta^{2}-2\theta+1}{2\theta-1}=\frac{(\theta-1)^{2}}{2\theta-1}, we see θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} is strictly smaller than θ\theta whenever <u\ell<u.

Theorem 9.

The approximation factor of Double Proposal for the uniform model satisfies APPROX(uniform)=θ2+θ12θ1{\rm APPROX}({\cal I}_{\rm uniform})=\frac{\theta^{2}+\theta-1}{2\theta-1}.

Proof.

Here we only show APPROX(Gen)θ2+θ12θ1{\rm APPROX}({\cal I}_{\rm Gen})\leq\frac{\theta^{2}+\theta-1}{2\theta-1}, since this together with Proposition 29 shown later implies the required equality.

Since any stable matching is optimal when =u\ell=u, we assume in the following that <u\ell<u, which implies that θ>1\theta>1.

Let MM be the output of the algorithm and let NN be an optimal stable matching. Suppose s(N)>s(M)s(N)>s(M) since otherwise the claim is trivial. Consider a bipartite graph (R,H;MN)(R,H;M\cup N), which may have multiple edges. To complete the proof, it is sufficient to show that the approximation factor is attained in each component of the graph. Take any connected component and let RR^{*} and HH^{*} respectively denote the set of residents and hospitals in the component. We define a partition {H0,H1,H2}\{H_{0},H_{1},H_{2}\} of HH^{*} and a partition {R0,R1,R2}\{R_{0},R_{1},R_{2}\} of RR^{*} as follows (See Fig. 5). First, we set

H0{hH|sN(h)>sM(h)} and\displaystyle H_{0}\coloneqq\set{h\in H^{*}}{s_{N}(h)>s_{M}(h)}\mbox{ and}
R0{rR|N(r)H0}.\displaystyle R_{0}\coloneqq\set{r\in R^{*}}{N(r)\in H_{0}}.

That is, H0H_{0} is the set of all hospitals in the component for which the optimal stable matching NN gets scores larger than MM. The set R0R_{0} consists of residents assigned to H0H_{0} in the optimal matching NN. We then define

H1{hHH0|rR0:M(r)=h},\displaystyle H_{1}\coloneqq\set{h\in H^{*}\setminus H_{0}}{\exists r\in R_{0}:M(r)=h},
R1{rR|N(r)H1},\displaystyle R_{1}\coloneqq\set{r\in R^{*}}{N(r)\in H_{1}},
H2H(H0H1), and\displaystyle H_{2}\coloneqq H^{*}\setminus(H_{0}\cup H_{1}),\text{~{}~{}and~{}~{}}
R2R(R0R1).\displaystyle R_{2}\coloneqq R^{*}\setminus(R_{0}\cup R_{1}).
Refer to caption
Figure 5: An example of a connected component in NMN\cup M for the case [,u]=[2,3][\ell,u]=[2,3]. Hospitals and residents are represented by squares and circles, respectively. The matchings NN and MM are represented by solid (black) lines and dashed (red) lines, respectively. (The same figure as Fig. 2.)

For convenience, we use a scaled score function vM(h)sM(h)=min{,|M(h)|}v_{M}(h)\coloneqq\ell\cdot s_{M}(h)=\min\{\ell,|M(h)|\} for each hHh\in H and write vM(H)hHvM(h)v_{M}(H^{\prime})\coloneqq\sum_{h\in H^{\prime}}v_{M}(h) for any HHH^{\prime}\subseteq H. We define vNsN(h)=min{,|N(h)|}v_{N}\coloneqq\ell\cdot s_{N}(h)=\min\{\ell,|N(h)|\} similarly. We now show the following inequality, which completes the proof:

vN(H0H1H2)vM(H0H1H2)θ2+θ12θ1.\frac{v_{N}(H_{0}\cup H_{1}\cup H_{2})}{v_{M}(H_{0}\cup H_{1}\cup H_{2})}\leq\frac{\theta^{2}+\theta-1}{2\theta-1}. (2)

Let αvN(H0)vM(H0)>0\alpha\coloneqq v_{N}(H_{0})-v_{M}(H_{0})>0. Then, αvN(H0)=hH0min{,|N(h)|}hH0|N(h)|=|R0|\alpha\leq v_{N}(H_{0})=\sum_{h\in H_{0}}\min\{\ell,|N(h)|\}\leq\sum_{h\in H_{0}}|N(h)|=|R_{0}|. Note that MM assigns each resident in R0R_{0} to a hospital in H0H_{0} or H1H_{1} by the definition of H1H_{1}. Then, hH0H1|M(h)||R0|α\sum_{h\in H_{0}\cup H_{1}}|M(h)|\geq|R_{0}|\geq\alpha. Since 1θ|M(h)|\ell\geq\frac{1}{\theta}|M(h)| and |M(h)|1θ|M(h)||M(h)|\geq\frac{1}{\theta}|M(h)|, we have that vM(H0H1)=hH0H1min{,|M(h)|}hH0H11θ|M(h)|αθv_{M}(H_{0}\cup H_{1})=\sum_{h\in H_{0}\cup H_{1}}\min\{\ell,|M(h)|\}\geq\sum_{h\in H_{0}\cup H_{1}}\frac{1}{\theta}|M(h)|\geq\frac{\alpha}{\theta}, i.e.,

vM(H0H1)αθ.v_{M}(H_{0}\cup H_{1})\geq\frac{\alpha}{\theta}. (3)

Let βvN(H1H2)vM(H1)\beta\coloneqq v_{N}(H_{1}\cup H_{2})-v_{M}(H_{1}). Then, we have

vN(H0H1H2)=α+β+vM(H0H1).v_{N}(H_{0}\cup H_{1}\cup H_{2})=\alpha+\beta+v_{M}(H_{0}\cup H_{1}). (4)

We separately consider two cases: (i) βαθ1\beta\geq\frac{\alpha}{\theta-1} and (ii) βαθ1\beta\leq\frac{\alpha}{\theta-1}.

First, consider the case (i). Since vN(h)vM(h)v_{N}(h)\leq v_{M}(h) for any hH1H2h\in H_{1}\cup H_{2}, we have vM(H0H1H2)vM(H0)+vN(H1H2)=β+vM(H0H1)v_{M}(H_{0}\cup H_{1}\cup H_{2})\geq v_{M}(H_{0})+v_{N}(H_{1}\cup H_{2})=\beta+v_{M}(H_{0}\cup H_{1}). Combining this with the equation (4), we obtain (2) in this case.

vN(H0H1H2)vM(H0H1H2)\displaystyle\frac{v_{N}(H_{0}\cup H_{1}\cup H_{2})}{v_{M}(H_{0}\cup H_{1}\cup H_{2})} \displaystyle\leq α+β+vM(H0H1)β+vM(H0H1)\displaystyle\frac{\alpha+\beta+v_{M}(H_{0}\cup H_{1})}{\beta+v_{M}(H_{0}\cup H_{1})}
=\displaystyle= 1+αβ+vM(H0H1)\displaystyle 1+\frac{\alpha}{\beta+v_{M}(H_{0}\cup H_{1})}
\displaystyle\leq 1+ααθ1+αθ\displaystyle 1+\frac{\alpha}{\frac{\alpha}{\theta-1}+\frac{\alpha}{\theta}}
=\displaystyle= θ2+θ12θ1.\displaystyle\frac{\theta^{2}+\theta-1}{2\theta-1}.

Here the second inequality follows from the inequality (3) and the condition (i).

We next consider the case (ii) βαθ1\beta\leq\frac{\alpha}{\theta-1}, which is the main part of the proof. Since any hH0h\in H_{0} satisfies vN(h)>vM(h)v_{N}(h)>v_{M}(h), we have vM(h)<uv_{M}(h)<\ell\leq u for any hH0h\in H_{0}, i.e., any hH0h\in H_{0} is undersubscribed in MM. Then, any rR0={rR|N(r)H0}r\in R_{0}=\set{r\in R^{*}}{N(r)\in H_{0}} satisfies M(r)rN(r)M(r)\succeq_{r}N(r) since otherwise (r,N(r))(r,N(r)) blocks MM, which contradicts the stability of MM. Partition H1H_{1} into two sets:

H1{hH1|rR0:M(r)=hrN(r)} and\displaystyle H_{1}^{\succ}\coloneqq\set{h\in H_{1}}{\exists r\in R_{0}:M(r)=h\succ_{r}N(r)}\mbox{ and}
H1=H1H1.\displaystyle H_{1}^{=}\coloneqq H_{1}\setminus H_{1}^{\succ}.

Then, for any hH1=h\in H_{1}^{=}, all residents rR0r\in R_{0} with M(r)=hM(r)=h satisfy M(r)=rN(r)M(r)=_{r}N(r). We claim that

vM(H2)1θ(α+β),v_{M}(H_{2})\geq\frac{1}{\theta}(\alpha+\beta), (5)

which can be proven by estimating |R||R^{*}| in two ways.

For the first estimation, we further partition R1R_{1} into R1{rR1|N(r)H1}R_{1}^{\succ}\coloneqq\set{r\in R_{1}}{N(r)\in H_{1}^{\succ}} and R1={rR1|N(r)H1=}R_{1}^{=}\coloneqq\set{r\in R_{1}}{N(r)\in H_{1}^{=}}. By the stability of NN, each hH1h\in H_{1}^{\succ} is full in NN, since there exists a resident rR0r\in R_{0} with hrN(r)h\succ_{r}N(r), implying that |N(h)|=u|N(h)|=u and vN(h)=v_{N}(h)=\ell. Thus we have |R1|=u|H1|=uvN(H1)=θvN(H1)|R_{1}^{\succ}|=u\cdot|H_{1}^{\succ}|=\frac{u}{\ell}\cdot v_{N}(H_{1}^{\succ})=\theta\cdot v_{N}(H_{1}^{\succ}). Additionally, since each hH1h\in H_{1}^{\succ} satisfies vM(h)vN(h)v_{M}(h)\geq v_{N}(h) by hH0h\not\in H_{0}, we have vM(h)=vN(h)=v_{M}(h)=v_{N}(h)=\ell, which implies vM(H1)=vN(H1)v_{M}(H_{1}^{\succ})=v_{N}(H_{1}^{\succ}). We therefore represent |R1||R_{1}^{\succ}| as |R1|=(θ1)vM(H1)+vN(H1)|R_{1}^{\succ}|=(\theta-1)\cdot v_{M}(H_{1}^{\succ})+v_{N}(H_{1}^{\succ}). Further, by definition, we have |R0|vN(H0)|R_{0}|\geq v_{N}(H_{0}), |R1=|vN(H1=)|R_{1}^{=}|\geq v_{N}(H_{1}^{=}), and |R2|vN(H2)|R_{2}|\geq v_{N}(H_{2}). Combining them together, we obtain

|R|=|R0|+|R1|+|R2|\displaystyle|R^{*}|=|R_{0}|+|R_{1}|+|R_{2}| vN(H0)+(θ1)vM(H1)+vN(H1)+vN(H1=)+vN(H2)\displaystyle\geq v_{N}(H_{0})+(\theta-1)\cdot v_{M}(H_{1}^{\succ})+v_{N}(H_{1}^{\succ})+v_{N}(H_{1}^{=})+v_{N}(H_{2})
=α+β+vM(H0H1)+(θ1)vM(H1).\displaystyle=\alpha+\beta+v_{M}(H_{0}\cup H_{1})+(\theta-1)\cdot v_{M}(H_{1}^{\succ}). (6)

For the second estimation of |R||R^{*}|, we define another partition {S0,S1=,Srest}\{S_{0},S_{1}^{=},S_{\rm rest}\} of RR^{*} depending on the matching MM:

S0{rR|M(r)H0},\displaystyle S_{0}\coloneqq\set{r\in R^{*}}{M(r)\in H_{0}},
S1={rR|M(r)H1=}, and\displaystyle S_{1}^{=}\coloneqq\set{r\in R^{*}}{M(r)\in H_{1}^{=}},\mbox{ and}
SrestR(S0S1=).\displaystyle S_{\rm rest}\coloneqq R^{*}\setminus(S_{0}\cup S_{1}^{=}).

We show that |S0|=vM(H0)|S_{0}|=v_{M}(H_{0}), |S1=|=vM(H1=)|S_{1}^{=}|=v_{M}(H_{1}^{=}), and |Srest|θvM(H1H2)|S_{\rm rest}|\leq\theta\cdot v_{M}(H_{1}^{\succ}\cup H_{2}). Since any hH0h\in H_{0} satisfies vN(h)>vM(h)\ell\geq v_{N}(h)>v_{M}(h), we have vM(h)=|M(h)|v_{M}(h)=|M(h)|, which proves the first equality |S0|=vM(H0)|S_{0}|=v_{M}(H_{0}). For the second equality, recall that, for each hH1=h\in H_{1}^{=}, there exists a resident rR0r\in R_{0} with M(r)=hM(r)=h and M(r)=rN(r)M(r)=_{r}N(r). Since for any rR0r\in R_{0}, the hospital hN(r)h^{\prime}\coloneqq N(r) belongs to H0H_{0}, we have vN(h)>vM(h)=min{,|M(h)|}v_{N}(h^{\prime})>v_{M}(h^{\prime})=\min\{\ell,|M(h^{\prime})|\}, which implies |M(h)|<|M(h^{\prime})|<\ell. From this together with Lemma 2, we have |M(h)||M(h)|\leq\ell, which shows that vM(h)=|M(h)|v_{M}(h)=|M(h)| for each hH1=h\in H_{1}^{=}, i.e., the second equality. The third equality follows from the fact that all residents in SrestS_{\rm rest} are assigned to H1H2H_{1}^{\succ}\cup H_{2}.

By the three equalities above, we have

|R|=|S0|+|S1=|+|Srest|\displaystyle|R^{*}|=|S_{0}|+|S_{1}^{=}|+|S_{\rm rest}| vM(H0)+vM(H1=)+θvM(H1H2)\displaystyle\leq v_{M}(H_{0})+v_{M}(H_{1}^{=})+\theta\cdot v_{M}(H_{1}^{\succ}\cup H_{2})
=vM(H0H1)+(θ1)vM(H1)+θvM(H2),\displaystyle=v_{M}(H_{0}\cup H_{1})+(\theta-1)\cdot v_{M}(H_{1}^{\succ})+\theta\cdot v_{M}(H_{2}), (7)

which together with (6) proves our claim (5).

By using (4) and (5), we obtain the required inequality (2) also for the case (ii):

vN(H0H1H2)vM(H0H1H2)\displaystyle\frac{v_{N}(H_{0}\cup H_{1}\cup H_{2})}{v_{M}(H_{0}\cup H_{1}\cup H_{2})} \displaystyle\leq α+β+vM(H0H1)1θ(α+β)+vM(H0H1)\displaystyle\frac{\alpha+\beta+v_{M}(H_{0}\cup H_{1})}{\frac{1}{\theta}(\alpha+\beta)+v_{M}(H_{0}\cup H_{1})}
=\displaystyle= 1+(θ1)α+(θ1)βα+β+θvM(H0H1)\displaystyle 1+\frac{(\theta-1)\alpha+(\theta-1)\beta}{\alpha+\beta+\theta\cdot v_{M}(H_{0}\cup H_{1})}
\displaystyle\leq 1+(θ1)α+(θ1)β2α+β\displaystyle 1+\frac{(\theta-1)\alpha+(\theta-1)\beta}{2\alpha+\beta}
=\displaystyle= 1+θ12+θ12β2α+β\displaystyle 1+\frac{\theta-1}{2}+\frac{\theta-1}{2}\cdot\frac{\beta}{2\alpha+\beta}
\displaystyle\leq θ+12+θ1212(θ1)+1\displaystyle\frac{\theta+1}{2}+\frac{\theta-1}{2}\cdot\frac{1}{2(\theta-1)+1}
=\displaystyle= θ2+θ12θ1.\displaystyle\frac{\theta^{2}+\theta-1}{2\theta-1}.

Here the second inequality follows from the inequality (3), and the third inequality follows from the condition βαθ1\beta\leq\frac{\alpha}{\theta-1} of this case (ii) and the condition 2α+β>02\alpha+\beta>0, where the latter is obtained from 2α+β>α+β=vN(H0H1H2)vM(H0H1)>02\alpha+\beta>\alpha+\beta=v_{N}(H_{0}\cup H_{1}\cup H_{2})-v_{M}(H_{0}\cup H_{1})>0.

The above analysis for Theorem 9 is tight, as seen from the following proposition.

Proposition 29.

There is an instance II of the uniform model such that OPT(I)ALG(I)=θ2+θ12θ1\frac{{\rm OPT}(I)}{{\rm ALG}(I)}=\frac{\theta^{2}+\theta-1}{2\theta-1}. This holds even if ties appear only in preference lists of hospitals or only in preference lists of residents.

Proof.

As the upper bound is shown in Theorem 9, it suffices to give an instance II with OPT(I)ALG(I)θ2+θ12θ1\frac{{\rm OPT}(I)}{{\rm ALG}(I)}\geq\frac{\theta^{2}+\theta-1}{2\theta-1}. Further, since the case θ=1\theta=1 is trivial, we assume θ>1\theta>1, i.e., <u\ell<u. We construct two instances I1I_{1} and I2I_{2} each of which satisfies this inequality and in I1I_{1} (resp., in I2I_{2}) ties appear only in preference lists of hospitals (resp., residents).

Both I1I_{1} and I2I_{2} consist of 2(u)u+u2(u-\ell)u+\ell u residents and (u)u+(u)+u(u-\ell)u+(u-\ell)+u hospitals. The set of residents is R=ABCR=A\cup B\cup C where A={ai,j|1iu,1ju}A=\set{a_{i,j}}{1\leq i\leq u-\ell,~{}1\leq j\leq u}, B={bi,j|1iu,1ju}B=\set{b_{i,j}}{1\leq i\leq u-\ell,~{}1\leq j\leq u}, and C={ci,j|1iu,1j}C=\set{c_{i,j}}{1\leq i\leq u,~{}1\leq j\leq\ell}. The set of hospitals is H=XYZH=X\cup Y\cup Z where X={xi,j|1iu,1ju}X=\set{x_{i,j}}{1\leq i\leq u-\ell,~{}1\leq j\leq u}, Y={yi|1iu}Y=\set{y_{i}}{1\leq i\leq u-\ell}, and Z={zi|1iu}Z=\set{z_{i}}{1\leq i\leq u}. (Fig. 6 shows a pictorial representation of a small example.)

The preference lists in I1I_{1} are given as follows, where “(R)(~{}~{}R~{}~{})” and “[R][~{}~{}R~{}~{}]” respectively represent a single tie containing all members of RR and an arbitrary strict order on RR.

ai,ja_{i,j}: yiy_{i} xi,jx_{i,j} \cdots xi,jx_{i,j} [,u][\ell,u]: [   RR   ]
bi,jb_{i,j}: yiy_{i} zjz_{j} \cdots yiy_{i} [,u][\ell,u]: (   RR   )
ci,jc_{i,j}: ziz_{i} \cdots ziz_{i} [,u][\ell,u]: [   RR   ]

Note that, for each i=1,2,,ui=1,2,\dots,u-\ell, the hospital yiy_{i} is the first choice of 2u2u residents {ai,j,bi,j|1ju}\set{a_{i,j},b_{i,j}}{1\leq j\leq u}. Recall that we delete arbitrariness in Double Proposal using the priority rules defined by indices. If we set indices on residents so that residents in AA have smaller indices than those in BB, then yiy_{i} prioritizes residents in AA over those in BB. We then observe that the output of the algorithm is

M=\displaystyle M= {(ai,j,yi)|1iu,1ju}{(bi,j,zj)|1iu,1ju}\displaystyle\set{(a_{i,j},y_{i})}{1\leq i\leq u-\ell,~{}1\leq j\leq u}\cup\set{(b_{i,j},z_{j})}{1\leq i\leq u-\ell,~{}1\leq j\leq u}
{(ci,j,zi)|1iu,1j}.\displaystyle\cup\set{(c_{i,j},z_{i})}{1\leq i\leq u,~{}1\leq j\leq\ell}.

In MM, the hospitals xi,jx_{i,j}, yiy_{i}, and ziz_{i} are assigned 0, uu, and uu residents, respectively. Then, their scores in MM are 0, 11, and 11, respectively. Hence, we obtain s(M)=|Y|+|Z|=(u)+us(M)=|Y|+|Z|=(u-\ell)+u.

Refer to caption
Figure 6: An example with [,u]=[3,5][\ell,u]=[3,5]. An optimal matching NN is represented by solid (black) lines while the output MM of the algorithm is represented by dashed (red) lines.

Next, define a matching NN by

N=\displaystyle N= {(ai,j,xi,j)|1iu,1ju}{(bi,j,yi)|1iu,1ju}\displaystyle\set{(a_{i,j},x_{i,j})}{1\leq i\leq u-\ell,~{}1\leq j\leq u}\cup\set{(b_{i,j},y_{i})}{1\leq i\leq u-\ell,~{}1\leq j\leq u}
{(ci,j,zi)|1iu,1j}.\displaystyle\cup\set{(c_{i,j},z_{i})}{1\leq i\leq u,~{}1\leq j\leq\ell}.

It is straightforward to see that this is a stable matching. In NN, the hospitals xi,jx_{i,j}, yiy_{i}, and ziz_{i} are assigned 11, uu, and \ell residents, respectively. Then, their scores in NN are 1\frac{1}{\ell}, 11, and 11, respectively. Hence, s(N)=1|X|+|Y|+|Z|=(u)u+(u)+us(N)=\frac{1}{\ell}|X|+|Y|+|Z|=\frac{(u-\ell)u}{\ell}+(u-\ell)+u. From these, we obtain

OPT(I1)ALG(I1)s(N1)s(M1)=(u)u+(u)+u(u)+u=(u)u+(u)+u(u)+u=u2+u22u2=θ2+θ12θ1.\textstyle\frac{{\rm OPT}(I_{1})}{{\rm ALG}(I_{1})}\geq\frac{s(N_{1})}{s(M_{1})}=\frac{\frac{(u-\ell)u}{\ell}+(u-\ell)+u}{(u-\ell)+u}=\frac{(u-\ell)u+\ell(u-\ell)+\ell u}{\ell(u-\ell)+\ell u}=\frac{u^{2}+\ell u-\ell^{2}}{2\ell u-\ell^{2}}=\frac{\theta^{2}+\theta-1}{2\theta-1}.

Next, we define I2I_{2}. The preference lists in I2I_{2} are given as follows. Similarly to the notation “[  RR  ],” we denote by “[ BB ]” and “[ ACA\cup C ]” arbitrary strict orders of all residents in BB and ACA\cup C, respectively.

ai,ja_{i,j}: yiy_{i} xi,jx_{i,j} \cdots xi,jx_{i,j} [,u][\ell,u]: [   RR   ]
bi,jb_{i,j}: yiy_{i} zjz_{j} ) \cdots yiy_{i} [,u][\ell,u]: BB ][ ACA\cup C ]
ci,jc_{i,j}: ziz_{i} \cdots ziz_{i} [,u][\ell,u]: [   RR   ]

If we set indices on hospitals so that those in ZZ have smaller indices than those in YY, in Double Proposal, each bi,jb_{i,j} makes (the second) proposal to zjz_{j} before to yiy_{i}. Then, we can observe the output of the algorithm coincides with the matching MM defined above. Additionally, we see that the matching NN defined above is a stable matching of I2I_{2}. Therefore, we can obtain OPT(I2)ALG(I2)s(N)s(M)=θ2+θ12θ1\frac{{\rm OPT}(I_{2})}{{\rm ALG}(I_{2})}\geq\frac{s(N)}{s(M)}=\frac{\theta^{2}+\theta-1}{2\theta-1}. ∎

Corollary 30.

Among instances of the uniform model in which ties appear only in preference lists of hospitals, maxIOPT(I)WST(I)=θ2+θ12θ1\max_{I}\frac{{\rm OPT}(I)}{{\rm WST}(I)}=\frac{\theta^{2}+\theta-1}{2\theta-1}.

Proof.

From the proof of Theorem 9, we can observe that the inequality s(N)s(M)θ2+θ12θ1\frac{s(N)}{s(M)}\leq\frac{\theta^{2}+\theta-1}{2\theta-1} is obtained if both MM and NN are stable and MM satisfies the property given as Lemma 2(ii). Note that this property is satisfied by any stable matching if there is no ties in the preference lists of residents, because h=rhh=_{r}h^{\prime} cannot happen for any rRr\in R and h,hHh,h^{\prime}\in H. Therefore, the maximum value of OPT(I)WST(I)\frac{{\rm OPT}(I)}{{\rm WST}(I)} is at most θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1}. Further the instance I1I_{1} in the proof of Proposition 29 shows that this bound is tight. ∎

C.3 Marriage Model

In the marriage model, the upper quota of each hospital is 11. Therefore, [(h),u(h)][\ell(h),u(h)] is either [0,1][0,1] or [1,1][1,1] for each hHh\in H. We first provide a worst case analysis of a tie-breaking algorithm.

Proposition 31.

The maximum gap for the marriage model satisfies Λ(Marriage)=2\Lambda({\cal I}_{\rm Marriage})=2. Moreover, this equality holds even if ties appear only in preference lists of residents.

Proof.

We first show OPT(I)WST(I)2\frac{{\rm OPT}(I)}{{\rm WST}(I)}\leq 2 for any instance II of the marriage model. Let NN and MM be stable matchings with s(N)=OPT(I)s(N)={\rm OPT}(I) and s(M)=WST(I)s(M)={\rm WST}(I). Consider a bipartite graph G=(R,H:NM)G=(R,H:N\cup M), where we consider an edge used in both NN and MM as a length-two cycle in GG. Since NN and MM are one-to-one matchings in which all residents are assigned, each component is an alternating cycle or an alternating path whose two end vertices are both in HH.

Take any connected component. It suffices to show that the sum of the scores of the hospitals in this component in NN is at most twice of that in MM. The case of a cycle is trivial since every hospital in it has the score of 1. Therefore, consider a path. Then, one of two terminal hospitals, say h1h_{1}, is incident only to NN and the other, say h2h_{2}, is only to MM. We then have sN(h1)=1s_{N}(h_{1})=1 and sM(h2)=1s_{M}(h_{2})=1. The value sM(h1)s_{M}(h_{1}) is 11 if (h1)=0\ell(h_{1})=0 and 0 otherwise. Similarly, sN(h2)s_{N}(h_{2}) is 11 if (h2)=0\ell(h_{2})=0 and 0 otherwise. For any non-terminal hospital hh, we have sN(h)=sM(h)=1s_{N}(h)=s_{M}(h)=1. If there are kk non-terminal hospitals, then the sum of scores in this component in NN is 1+sN(h2)+k1+s_{N}(h_{2})+k while that in MM is 1+sM(h1)+k1+s_{M}(h_{1})+k. Since k0k\geq 0, sN(h2)1s_{N}(h_{2})\leq 1, and sM(h1)0s_{M}(h_{1})\geq 0, we have 1+sN(h2)+k1+sM(h1)+k2\frac{1+s_{N}(h_{2})+k}{1+s_{M}(h_{1})+k}\leq 2.

Next, we provide an instance II with OPT(I)WST(I)=2\frac{{\rm OPT}(I)}{{\rm WST}(I)}=2. Let II be an instance containing one resident rr and two hospitals h1h_{1} and h2h_{2} such that rr is indifferent between h1h_{1} and h2h_{2} and quotas are defined as [(h1),u(h1)]=[1,1][\ell(h_{1}),u(h_{1})]=[1,1] and [(h2),u(h2)]=[0,1][\ell(h_{2}),u(h_{2})]=[0,1]. Then, N={(r,h1)}N=\{(r,h_{1})\} and M={(r,h2)}M=\{(r,h_{2})\} are both stable matchings and we have s(N)=2s(N)=2 while s(M)=1s(M)=1. ∎

Theorem 32.

The approximation factor of Double Proposal for the marriage model satisfies APPROX(Marriage)=1.5{\rm APPROX}({\cal I}_{\rm Marriage})=1.5. Moreover, this is best possible for the marriage model, if strategy-proofness is required.

Proof.

We first show that OPT(I)ALG(I)1.5\frac{{\rm OPT}(I)}{{\rm ALG}(I)}\leq 1.5 holds for any instance II of the marriage model. Let MM be the output of Double Proposal and let NN be an optimal stable matching. By the arguments in the proof of Proposition 31, it suffices to show that there is no component of G=(R,H;NM)G=(R,H;N\cup M) that forms a path with two edges (r,h1)N(r,h_{1})\in N, (r,h2)M(r,h_{2})\in M with (h1)=1\ell(h_{1})=1 and (h2)=0\ell(h_{2})=0. Suppose conversely that there is such a path. As h1h_{1} is assigned no resident in MM, we have h2=M(r)rh1h_{2}=M(r)\succeq_{r}h_{1} by the stability of MM. Similarly, the stability of NN implies h1=N(r)rh2h_{1}=N(r)\succeq_{r}h_{2}, and hence h1=rh2h_{1}=_{r}h_{2}. Since |M(h2)|=1>(h2)|M(h_{2})|=1>\ell(h_{2}), Lemma 2(ii) implies |M(h1)|(h1)=1|M(h_{1})|\geq\ell(h_{1})=1, which contradicts |M(h1)|=0|M(h_{1})|=0.

To see that maxIOPT(I)ALG(I)1.5\max_{I}\frac{{\rm OPT}(I)}{{\rm ALG}(I)}\geq 1.5 even if ties appear only in preference lists of hospitals or only in that of residents, see the instances I1I_{1} and I2I_{2} defined for n=2n=2 in Proposition 27. These two are instances of the marriage model and satisfy OPT(I1)ALG(I1)=OPT(I2)ALG(I2)=ϕ(2)=1.5\frac{{\rm OPT}(I_{1})}{{\rm ALG}(I_{1})}=\frac{{\rm OPT}(I_{2})}{{\rm ALG}(I_{2})}=\phi(2)=1.5. ∎

It is worth mentioning that, for the marriage model, our algorithm attains the best approximation factor in the domain of strategy-proof algorithms. As shown in Example 15, there is no strategy-proof algorithm that achieves an approximation factor better than 1.51.5 even in the marriage model. Therefore, we cannot improve this ratio without harming strategy-proofness for residents.

Corollary 33.

Among instances of the marriage model in which ties appear only in preference lists of hospitals, maxIOPT(I)WST(I)=1.5\max_{I}\frac{{\rm OPT}(I)}{{\rm WST}(I)}=1.5.

Proof.

If the preference lists of the residents have no ties, the proof of Theorem 32 works for any pair of stable matchings, since it cannot be h1=rh2h_{1}=_{r}h_{2}. Hence, the upper bound follows. The lower bound follows from the instance I1I_{1} mentioned there. ∎

C.4 Resident-Side Master List Model

In the resident-side master list case, the preference lists of all residents are the same. Even with this restriction, the maximum value of OPT(I)WST(I)\frac{{\rm OPT}(I)}{{\rm WST}(I)} can be n+1n+1 as shown in Proposition 6. Our algorithm, however, solves this special case exactly.

Theorem 34.

The approximation factor of Double Proposal for the R-side master list model satisfies APPROX(R-ML)=1{\rm APPROX}({\cal I}_{\scriptsize\mbox{R-ML}})=1, i.e., Double Proposal can solve the R-side master list model exactly.

Proof.

Let II be an instance and PP be the master preference list of residents over hospitals. For convenience, we suppose that PP is a strictly ordered list of ties T1,T2,,TkT_{1},T_{2},\dots,T_{k}, by regarding a hospital that does not belong to any tie as a tie of length one. For each ii, let u(Ti)=hTiu(h)u(T_{i})=\sum_{h\in T_{i}}u(h). Let zz be the index (if any) such that i=1z1u(Ti)<|R|\sum_{i=1}^{z-1}u(T_{i})<|R| and i=1zu(Ti)>|R|\sum_{i=1}^{z}u(T_{i})>|R|. If there is an integer pp such that i=1pu(Ti)=|R|\sum_{i=1}^{p}u(T_{i})=|R|, we define z=p+0.5z=p+0.5.

A tie TiT_{i} is called full if 1i<z1\leq i<z and empty if z<ikz<i\leq k. In case zz is an integer, the tie TzT_{z} is called intermediate. The following lemma gives a necessary condition for a matching to be stable in II.

Lemma 35.

Any stable matching of II assigns u(h)u(h) residents to each hospital hh in a full tie and no resident to each hospital in an empty tie.

Proof.

Let MM be a stable matching. Suppose that |M(h)|<u(h)|M(h)|<u(h) holds for a hospital hh in a full tie. Then, there must be a resident rr such that M(r)=hM(r)=h^{\prime} and hrhh\succ_{r}h^{\prime}. Thus (r,h)(r,h) blocks MM, a contradiction. Suppose that |M(h)|>0|M(h)|>0 holds for a hospital hh in an empty tie. Let rr be a resident in M(h)M(h). Then, there must be an undersubscribed hospital hh^{\prime} such that hrhh^{\prime}\succ_{r}h. Thus (r,h)(r,h^{\prime}) blocks MM, a contradiction. ∎

Let MM be the output of Double Proposal and NN be an optimal solution. For contradiction, suppose that s(M)<s(N)s(M)<s(N). By Lemma 35, sM(h)=sN(h)s_{M}(h)=s_{N}(h) for any hospital hh in a full tie or in an empty tie. Hence, the difference of the scores of MM and NN is caused by hospitals in the intermediate tie. In the following, we concentrate on the intermediate tie, and if we refer to a hospital, it always means a hospital in the intermediate tie.

Suppose that there is a hospital hh such that |M(h)|>(h)|M(h)|>\ell(h). Then, by Lemma 2(ii), |M(h)|>(h)|M(h^{\prime})|>\ell(h^{\prime}) holds for any hospital hh^{\prime} in this tie. Therefore, the score of each hospital is 1 in MM and it is impossible that s(M)<s(N)s(M)<s(N). Hence, in the following, we assume that |M(h)|(h)|M(h)|\leq\ell(h) for each hospital hh.

Suppose that there are qq different lower quotas for hospitals in the intermediate tie, and let them be 1,2,q\ell_{1},\ell_{2}\ldots,\ell_{q} such that 1<2<<q\ell_{1}<\ell_{2}<\cdots<\ell_{q}. For 1iq1\leq i\leq q, let LiL_{i} be the set of hospitals whose lower quota is i\ell_{i}. For 1iq1\leq i\leq q, let SM(i)=hL1L2LisM(h)S_{M}(i)=\sum_{h\in L_{1}\cup L_{2}\cup\cdots\cup L_{i}}s_{M}(h) and SN(i)=hL1L2LisN(h)S_{N}(i)=\sum_{h\in L_{1}\cup L_{2}\cup\cdots\cup L_{i}}s_{N}(h). By the assumption s(M)<s(N)s(M)<s(N), there exists an index ii such that SM(i)<SN(i)S_{M}(i)<S_{N}(i) and let ii^{*} be the minimum one. Then, there is a hospital hLih^{\prime}\in L_{i^{*}} such that |M(h)|<(h)|M(h^{\prime})|<\ell(h^{\prime}), as otherwise all hospitals in LiL_{i^{*}} have the score 1 in MM and this contradicts the choice of ii^{*}. Since |M(h)|(h)|M(h)|\leq\ell(h) for each hospital hh, NN assigns strictly more residents to hospitals in L1L2LiL_{1}\cup L_{2}\cup\cdots\cup L_{i^{*}} than MM, as otherwise SM(i)<SN(i)S_{M}(i^{*})<S_{N}(i^{*}) would not hold. Then, there is a resident rr and a hospital h~Li\tilde{h}\in L_{i} (i>ii>i^{*}) such that M(r)=h~M(r)=\tilde{h}. Since h~=rh\tilde{h}=_{r}h^{\prime} and (h~)>(h)\ell(\tilde{h})>\ell(h^{\prime}), Lemma 2(i) implies that |M(h)|(h)|M(h^{\prime})|\geq\ell(h^{\prime}), but this contradicts the fact we have derived above. ∎

Corollary 36.

If there is a master preference list of residents that contains no ties, then any stable matching is optimal.

Proof.

This corollary is easily derived from the proof of Theorem 34. Since there are no ties in the master preference list, the intermediate tie (if any) consists of a single hospital. Hence, Lemma 35 implies that the number of residents assigned to each hospital does not depend on the choice of a stable matching. This completes the proof. ∎

Appendix D Proofs of Hardness Results

In this section, we give omitted proofs of hardness results given in Section 6. For readability, we give proofs of Theorems 12 and 11 in this order.

D.1 Proof of Theorem 12

We show the theorem by a reduction from the minimum maximal matching problem (MMM for short). In this problem, we are given an undirected graph GG and are asked to find a maximal matching of minimum size, denoted by OPT(G){\rm OPT}(G). It is known that under UGC, there is no polynomial-time algorithm to distinguish between the following two cases: (i) OPT(G)(12+δ)n{\rm OPT}(G)\leq(\frac{1}{2}+\delta)n and (ii) OPT(G)(23δ)n{\rm OPT}(G)\geq(\frac{2}{3}-\delta)n for any positive constant δ\delta, even for bipartite graphs with nn vertices in each part [10]:

Let G=(U,V;E)G=(U,V;E) (|U|=|V|=n|U|=|V|=n) be an instance of MMM, where U={u1,u2,,un}U=\{u_{1},u_{2},\ldots,u_{n}\} and V={v1,v2,,vn}V=\{v_{1},v_{2},\ldots,v_{n}\}. We will construct an instance II of HRT-MSLQ in the marriage model. II consists of nn residents U={u1,u2,,un}U=\{u_{1},u_{2},\ldots,u_{n}\} and 2n2n hospitals V={v1,v2,,vn}V=\{v_{1},v_{2},\ldots,v_{n}\} and Y={y1,y2,,yn}Y=\{y_{1},y_{2},\ldots,y_{n}\}. For convenience, we use uiu_{i} and viv_{i} as the names of vertices in GG and agents in II interchangeably.

Preference lists and quotas of hospitals are defined in Fig. 7. Here, N(ui)N(u_{i}) is the set of neighbors of uiu_{i} in GG, namely, N(ui)={vj|(ui,vj)E}N(u_{i})=\set{v_{j}}{(u_{i},v_{j})\in E} and “(  N(ui)N(u_{i})  )” is the tie consisting of all hospitals in N(ui)N(u_{i}). The notation “(  N(vi)N(v_{i})  )” in viv_{i}’s list is defined similarly. The notation “\cdots” at the tail of lists means an arbitrary strict order of all agents missing in the list.

uiu_{i}: (  N(ui)N(u_{i})  ) yiy_{i} \cdots viv_{i} [0,1][0,1]: (  N(vi)N(v_{i})  ) \cdots
yiy_{i} [1,1][1,1]: uiu_{i}  \cdots
Figure 7: Preference lists of residents and hospitals.

In the following, we show that OPT(I)=2nOPT(G){\rm OPT}(I)=2n-{\rm OPT}(G). If so, the above mentioned hardness implies that it is UG-hard to distinguish between the cases (i) OPT(I)(32δ)n{\rm OPT}(I)\geq(\frac{3}{2}-\delta)n and (ii) OPT(I)(43+δ)n{\rm OPT}(I)\leq(\frac{4}{3}+\delta)n. This in turn implies that an approximation algorithm with an approximation factor smaller than 96δ8+6δ\frac{9-6\delta}{8+6\delta} would refute UGC. Since δ\delta can be taken arbitrarily small, if we set δ<12ϵ179ϵ\delta<\frac{12\epsilon}{17-9\epsilon}, the theorem is proved.

We first show that OPT(I)2nOPT(G){\rm OPT}(I)\geq 2n-{\rm OPT}(G). Let LL be an optimal solution of GG, i.e., a maximal matching of GG of size OPT(G){\rm OPT}(G). Then, we construct a matching MM of II as M=M1M2M=M_{1}\cup M_{2} where M1={(ui,vj)|(ui,vj)L}M_{1}=\set{(u_{i},v_{j})}{(u_{i},v_{j})\in L} and M2={(ui,yi)|ui is unmatched in L}M_{2}=\set{(u_{i},y_{i})}{u_{i}\mbox{ is unmatched in }L}.

We show that MM is stable. If a resident uiu_{i} is matched in M1M_{1}, she is matched with a top choice hospital so she cannot be a part of a blocking pair. Suppose that a resident uiu_{i} who is matched in M2M_{2} (with yiy_{i}) forms a blocking pair. Then, uiu_{i} is unmatched in LL and the counterpart of the blocking pair must be some vjN(ui)v_{j}\in N(u_{i}). Note that vjv_{j} is unmatched in MM since, by construction of MM, if a hospital vjv_{j} is matched, then it is assigned a top choice resident and hence cannot form a blocking pair. From the above arguments, we have that (ui,vj)E(u_{i},v_{j})\in E but both uiu_{i} and vjv_{j} are unmatched in LL, which contradicts maximality of LL.

The score of each viv_{i} is 1 because its lower quota is 0. Since all residents are matched in MM, |M1|+|M2|=n|M_{1}|+|M_{2}|=n. By construction |M1|=|L||M_{1}|=|L| holds, so the total score of hospitals in YY is |M2|=n|M1|=n|L|=nOPT(G)|M_{2}|=n-|M_{1}|=n-|L|=n-{\rm OPT}(G). Hence, we have that OPT(I)s(M)=n+|M2|=2nOPT(G){\rm OPT}(I)\geq s(M)=n+|M_{2}|=2n-{\rm OPT}(G).

Next, we show that OPT(I)2nOPT(G){\rm OPT}(I)\leq 2n-{\rm OPT}(G). Let MM be an optimal solution for II, a stable matching of II whose score is OPT(I){\rm OPT}(I). Since each viv_{i}’s score is 1 without depending on the matching, OPT(I)n{\rm OPT}(I)\geq n and we can write OPT(I)=n+k{\rm OPT}(I)=n+k for a nonnegative integer kk. Here, kk coincides the number of hospitals of YY matched in MM.

As mentioned in Sec. 2, every resident is matched in MM. Note that, for any ii, resident uiu_{i} is not matched with any hospital in “\cdots” part, as otherwise, (ui,yi)(u_{i},y_{i}) blocks MM, a contradiction (note that uiu_{i} is the unique first choice of yiy_{i}). Among nn residents, kk ones are matched with hospitals in YY, so the remaining nkn-k ones are matched with hospitals in VV.

Let us define a matching LL of GG as L={(ui,vj)|(ui,vj)M}L=\set{(u_{i},v_{j})}{(u_{i},v_{j})\in M}. Then, from the above observations, |L|=nk=2nOPT(I)|L|=n-k=2n-{\rm OPT}(I). We show that LL is maximal in GG. Suppose not and that (ui,vj)E(u_{i},v_{j})\in E but both uiu_{i} and vjv_{j} are unmatched in LL. Then, uiu_{i} is matched with yiy_{i} and vjv_{j} is unmatched in MM, which implies that (ui,vj)(u_{i},v_{j}) blocks MM, contradicting the stability of MM. Therefore, OPT(G)|L|=2nOPT(I){\rm OPT}(G)\leq|L|=2n-{\rm OPT}(I), which completes the proof.

D.2 Proof of Theorem 11

The proof is a nontrivial extension of that of Theorem 12. As a reduction source, we use MMM for bipartite graphs (see the proof of Theorem 12 for definition). Let G=(U,V,E)G=(U,V,E) be an input bipartite graph for MMM where |U|=|V|=n|U|=|V|=n. We will construct an instance II for HRT-MSLQ in the uniform model. The set of residents is XRX\cup R where X={xi,j1in,1j}X=\{x_{i,j}\mid 1\leq i\leq n,1\leq j\leq\ell\} and R={ri,j1in,1ju}R=\{r_{i,j}\mid 1\leq i\leq n,1\leq j\leq u-\ell\}. The set of hospitals is HYH\cup Y where H={hi1in}H=\{h_{i}\mid 1\leq i\leq n\} and Y={yi,j1in,1ju}Y=\{y_{i,j}\mid 1\leq i\leq n,1\leq j\leq u-\ell\}.

Preference lists of agents are given in Fig. 8. Here, N(ui)N(u_{i}) is defined as N(ui)={hj(ui,vj)E}N(u_{i})=\{h_{j}\mid(u_{i},v_{j})\in E\} and “(  N(ui)N(u_{i})  )” denotes the tie consisting of all hospitals in N(ui)N(u_{i}). N(vi)N(v_{i}) is defined as N(vi)={rj,k(uj,vi)E,1ku}N(v_{i})=\{r_{j,k}\mid(u_{j},v_{i})\in E,1\leq k\leq u-\ell\} and “(  N(vi)N(v_{i})  )” is the tie consisting of all residents in N(vi)N(v_{i}). As before, the notation “\cdots” at the tail of the list means an arbitrary strict order of all agents missing in the list.

xi,jx_{i,j}: hih_{i}  \cdots hih_{i} [,u][\ell,u]: xi,1x_{i,1} \cdots xi,lx_{i,l} (  N(vi)N(v_{i})  ) \cdots
ri,jr_{i,j}: (  N(ui)N(u_{i})  ) yi,jy_{i,j} \cdots yi,jy_{i,j} [,u][\ell,u]: ri,jr_{i,j} \cdots
Figure 8: Preference lists of residents and hospitals.

It would be helpful to informally explain here an idea behind the reduction. The uu-\ell residents ri,jr_{i,j} (1ju1\leq j\leq u-\ell) correspond to the vertex uiUu_{i}\in U of GG, and a hospital hih_{i} corresponds to the vertex viVv_{i}\in V of GG. The first choice of the \ell residents xi,jx_{i,j} (1j1\leq j\leq\ell) is hih_{i} and hih_{i}’s first \ell choices are xi,jx_{i,j} (1j1\leq j\leq\ell), so all xi,jx_{i,j}s are assigned to hih_{i} in any stable matching. These xi,jx_{i,j}s fill the \ell positions of hih_{i}, so hih_{i}’s score is 1 and there remains uu-\ell positions. Then, the residents in RR (uu-\ell copies of vertices of UU) and the hospitals in HH (corresponding to vertices of VV) form a matching that simulates a maximal matching of GG. A resident ri,jr_{i,j} unmatched in this maximal matching will be assigned to yi,jy_{i,j}, by which yi,jy_{i,j} obtains a score of 1\frac{1}{\ell}. Thus a smaller maximal matching of GG can produce a stable matching of II of larger score.

Formally, we will prove that the equation OPT(I)=n+u(nOPT(G))(=n+(θ1)(nOPT(G))){\rm OPT}(I)=n+\frac{u-\ell}{\ell}(n-{\rm OPT}(G))(=n+(\theta-1)(n-{\rm OPT}(G))) holds. Then, by (i) and (ii) in the proof of Theorem 12, it is UG-hard to distinguish between the cases (i′′) OPT(I)(1+(θ1)(12δ))n{\rm OPT}(I)\geq(1+(\theta-1)(\frac{1}{2}-\delta))n and (ii′′) OPT(I)(1+(θ1)(13+δ))n{\rm OPT}(I)\leq(1+(\theta-1)(\frac{1}{3}+\delta))n. This implies that existence of a polynomial-time approximation algorithm with an approximation factor smaller than 1+(θ1)(12δ)1+(θ1)(13+δ)=3θ+36(θ1)δ2θ+4+6(θ1)δ\frac{1+(\theta-1)(\frac{1}{2}-\delta)}{1+(\theta-1)(\frac{1}{3}+\delta)}=\frac{3\theta+3-6(\theta-1)\delta}{2\theta+4+6(\theta-1)\delta} would refute UGC. Then, the theorem holds by setting δ<215ϵ<(2θ+4)2ϵ6(θ1)((5θ+4)(2θ+4)ϵ)\delta<\frac{2}{15}\epsilon<\frac{(2\theta+4)^{2}\epsilon}{6(\theta-1)((5\theta+4)-(2\theta+4)\epsilon)}.

First, we show that OPT(I)n+u(nOPT(G)){\rm OPT}(I)\geq n+\frac{u-\ell}{\ell}(n-{\rm OPT}(G)). Let LL be a minimum maximal matching of GG. Let us define a matching MM of II as M=M1M2M3M=M_{1}\cup M_{2}\cup M_{3}, where M1={(xi,j,hi)1in,1j}M_{1}=\{(x_{i,j},h_{i})\mid 1\leq i\leq n,1\leq j\leq\ell\}, M2={(ri,j,hk)(ui,vk)L,1ju}M_{2}=\{(r_{i,j},h_{k})\mid(u_{i},v_{k})\in L,1\leq j\leq u-\ell\}, and M3={(ri,j,yi,j)ui is unmatched in L,1ju}M_{3}=\{(r_{i,j},y_{i,j})\mid u_{i}\mbox{ is unmatched in }L,1\leq j\leq u-\ell\}.

We show that MM is stable. Since all residents in XX are assigned to a top choice hospital, none of them can be a part of a blocking pair. This also holds for a resident ri,jr_{i,j} if she is assigned to a hospital in N(ui)N(u_{i}). Hence, only a resident ri,jr_{i,j} who is assigned to yi,jy_{i,j} can form a blocking pair with some hkN(ui)h_{k}\in N(u_{i}). This means that (ui,vk)E(u_{i},v_{k})\in E but uiu_{i} is unmatched in LL. Observe that, by construction of MM, hkh_{k} is assigned \ell residents xk,1,,xk,x_{k,1},\ldots,x_{k,\ell}, and is either assigned uu-\ell more residents in N(vk)N(v_{k}), in which case hkh_{k} is full, or assigned no more residents, in which case hkh_{k} is undersubscribed. In the former case, hkh_{k} cannot prefer ri,jr_{i,j} to any of residents in M(hk)M(h_{k}), so the latter case must hold. This implies that vkv_{k} is unmatched in LL. Thus L{(ui,vk)}L\cup\{(u_{i},v_{k})\} is a matching of GG, contradicting the maximality of LL.

Since each hih_{i} is assigned \ell residents in M1M_{1}, hih_{i}’s score is 1. Hence, the total score of hospitals in HH is nn. Among (u)n(u-\ell)n residents in RR, (u)|L|(u-\ell)|L| ones are assigned to hospitals in HH, so the remaining (u)(n|L|)(u-\ell)(n-|L|) ones are assigned to hospitals in YY. Such residents are assigned to different hospitals, so the total score of hospitals in YY is 1(u)(n|L|)\frac{1}{\ell}(u-\ell)(n-|L|). Thus the score of MM is n+1(u)(n|L|)=n+u(nOPT(G))n+\frac{1}{\ell}(u-\ell)(n-|L|)=n+\frac{u-\ell}{\ell}(n-{\rm OPT}(G)), so it results that OPT(I)n+u(nOPT(G)){\rm OPT}(I)\geq n+\frac{u-\ell}{\ell}(n-{\rm OPT}(G)).

Next, we show that OPT(I)n+u(nOPT(G)){\rm OPT}(I)\leq n+\frac{u-\ell}{\ell}(n-{\rm OPT}(G)). Let MM be a stable matching of maximum score, that is, s(M)=OPT(I)s(M)={\rm OPT}(I). As observed above, each hih_{i} is assigned \ell residents xi,1,xi,2,,xi,x_{i,1},x_{i,2},\ldots,x_{i,\ell} of XX and hence the score of each hospital hih_{i} is 1. Additionally, we can see that ri,jr_{i,j} is not assigned to any hospital in “\cdots” part, as otherwise, (ri,j,yi,j)(r_{i,j},y_{i,j}) blocks MM. We construct a bipartite (multi-)graph GM=(U,V;F)G_{M}=(U,V;F) where U={u1,u2,,un}U=\{u_{1},u_{2},\ldots,u_{n}\} and V={v1,v2,,vn}V=\{v_{1},v_{2},\ldots,v_{n}\} are identified with vertices of GG, and we have an edge (ui,vk)jF(u_{i},v_{k})_{j}\in F if and only if (ri,j,hk)M(r_{i,j},h_{k})\in M. Here, a subscript jj of edge (ui,vk)j(u_{i},v_{k})_{j} is introduced to distinguish multiplicity of edge (ui,vk)(u_{i},v_{k}). The degree of each vertex of GMG_{M} is at most uu-\ell, so by Kőnig’s edge coloring theorem [25], GMG_{M} is (u)(u-\ell)-edge colorable and each color class cc induces a matching McM_{c} (1cu1\leq c\leq u-\ell) of GMG_{M}. Note that each McM_{c} is a matching of GG because (ui,vk)Mc(u_{i},v_{k})\in M_{c} means (ui,vk)jF(u_{i},v_{k})_{j}\in F for some jj, which implies (ri,j,hk)M(r_{i,j},h_{k})\in M, which in turn implies hkN(ui)h_{k}\in N(u_{i}) and hence (ui,vk)E(u_{i},v_{k})\in E. We then show that McM_{c} is a maximal matching of GG. Suppose not and that Mc{(ua,vb)}M_{c}\cup\{(u_{a},v_{b})\} is a matching of GG. Then, uau_{a} in GMG_{M} is not incident to an edge of color cc, but since there are uu-\ell colors in total, uau_{a}’s degree in GMG_{M} is less than uu-\ell. This implies that ra,pr_{a,p} for some pp is not assigned to any hospital in N(ua)N(u_{a}) (and hence unmatched or assigned to ya,py_{a,p}) in MM. A similar argument shows that vbv_{b}’s degree in GMG_{M} is less than uu-\ell and hence hbh_{b} is undersubscribed in MM. These imply that (ra,p,hb)(r_{a,p},h_{b}) blocks MM, a contradiction.

Hence, we have shown that each McM_{c} is a maximal matching of GG, so its size is at least OPT(G){\rm OPT}(G). Since {Mc1cu}\{M_{c}\mid 1\leq c\leq u-\ell\} is a partition of FF, we have that |F|(u)OPT(G)|F|\geq(u-\ell){\rm OPT}(G). There are (u)n(u-\ell)n residents in RR and at least (u)OPT(G)(u-\ell){\rm OPT}(G) of them are assigned to a hospital in HH, so at most (u)(nOPT(G))(u-\ell)(n-{\rm OPT}(G)) residents are assigned to hospitals in YY. Each such resident contributes 1\frac{1}{\ell} for a hospital’s score and hence the total score of hospitals in YY is at most u(nOPT(G))\frac{u-\ell}{\ell}(n-{\rm OPT}(G)). Since, as observed above, the total score of hospitals in HH is nn, we have that OPT(I)n+u(nOPT(G)){\rm OPT}(I)\leq n+\frac{u-\ell}{\ell}(n-{\rm OPT}(G)).

D.3 Proof of Theorem 14

We give a reduction from the minimum Pareto optimal matching problem (MIN-POM) [3] defined as follows. An instance of MIN-POM consists of a set S={s1,s2,,sn}S=\{s_{1},s_{2},\ldots,s_{n}\} of agents and a set T={t1,t2,,tm}T=\{t_{1},t_{2},\ldots,t_{m}\} of houses. Each agent has a strict and possibly incomplete preference list over houses, but houses have no preference. A matching MM is an assignment of houses to agents such that each agent is assigned at most one house and each house is assigned to at most one agent. We write M(s)M(s) the house assigned to ss by MM if any. An agent ss strictly prefers a matching MM to a matching MM^{\prime} if ss prefers M(s)M(s) to M(s)M^{\prime}(s) or ss is assigned a house in MM but not in MM^{\prime}. An agent ss is indifferent between MM and MM^{\prime} if M(s)=M(s)M(s)=M^{\prime}(s) or ss is unassigned in both MM and MM^{\prime}. An agent ss weakly prefers MM to MM^{\prime} if ss strictly prefers MM to MM^{\prime} or is indifferent between them. A matching MM Pareto dominates a matching MM^{\prime} if every agent weakly prefers and at least one agent strictly prefers MM to MM^{\prime}. A matching MM is Pareto optimal if there is no matching that dominates MM. MIN-POM asks to find a Pareto optimal matching of minimum size. It is known that MIN-POM is NP-hard [3].

We now show the reduction. Let II be an instance of MIN-POM as above. We can assume without loss of generality that the number of agents and the number of houses are the same, as otherwise, we may add either dummy agents with empty preference list or dummy houses which no agent includes in a preference list, without changing the set of Pareto optimal matchings. We let |S|=|T|=n|S|=|T|=n. We will construct an instance II^{\prime} of HRT-MSLQ from II. The set of residents of II^{\prime} is R=ABCR=A\cup B\cup C, where A={a1,a2,,an}A=\{a_{1},a_{2},\ldots,a_{n}\}, B={b1,b2,,bn}B=\{b_{1},b_{2},\ldots,b_{n}\}, and C={c1,c2,,cn}C=\{c_{1},c_{2},\ldots,c_{n}\}, and the set of hospitals is H=XYZH=X\cup Y\cup Z, where X={x1,x2,,xn}X=\{x_{1},x_{2},\ldots,x_{n}\}, Y={y1,y2,,yn}Y=\{y_{1},y_{2},\ldots,y_{n}\}, and Z={z1,z2,,zn}Z=\{z_{1},z_{2},\ldots,z_{n}\}. The set AA of residents and the set XX of hospitals correspond, respectively, to the set SS of agents and the set TT of houses of II. The upper and the lower quotas of each hospital is [1,2][1,2] and each hospital’s preference list is a single tie including all residents, as stated in the theorem. We then show preference lists of residents. For each agent siSs_{i}\in S of II, let P(si)P(s_{i}) be her preference list and define P(ai)P^{\prime}(a_{i}) as the list obtained from P(si)P(s_{i}) by replacing the occurrence of each tjt_{j} by xjx_{j}. The preference lists of residents are shown in Fig. 9, where for a set DD, the notation [D][D] denotes an arbitrary strict order of the hospitals in DD. For later use, some symbols are written in boldface.

aia_{i}: 𝑷(𝒂𝒊)P^{\prime}(a_{i})    𝒚𝒊y_{i}   [(XP(ai))(Y{yi})][(X\setminus P^{\prime}(a_{i}))\cup(Y\setminus\{y_{i}\})]   [Z][Z]
bib_{i}: 𝒙𝒊x_{i}   [𝒁][Z]   [(XY){xi}][(X\cup Y)\setminus\{x_{i}\}]
cic_{i}: 𝒛𝒊z_{i}   [𝒁{𝒛𝒊}][Z\setminus\{z_{i}\}]   [XY][X\cup Y]
Figure 9: Preference lists of residents.

Here we briefly explain an idea behind the reduction. For the correctness, we give a relationship between optimal solutions of II and II^{\prime}. To this aim, we show that there is an optimal solution of II^{\prime} such that a resident bib_{i} is assigned to a hospital xix_{i} for each ii and a resident cic_{i} is assigned to a hospital ziz_{i} for each ii. Then, each hospital in XZX\cup Z obtains the score of 1. Note that each hospital in XX has the remaining capacity of 1. Residents in AA and hospitals in XX simulates a matching between SS and TT of II, i.e., a matching between AA and XX is stable if and only if a matching between SS and TT is Pareto optimal. If this matching is small, then unmatched aia_{i} can go to the hospital yiy_{i}, by which yiy_{i} obtains the score of 1. Hence, a Pareto optimal matching of II of smaller size gives us a stable matching of II^{\prime} of higher score.

Now we proceed to a formal proof. Let OPT(I){\rm OPT}(I) and OPT(I){\rm OPT}(I^{\prime}) be the values of optimal solutions for II and II^{\prime}, respectively. Our goal is to show OPT(I)=3nOPT(I){\rm OPT}(I)=3n-{\rm OPT}(I^{\prime}).

Let MM be an optimal solution for II, i.e., a Pareto optimal matching of size OPT(I){\rm OPT}(I). We define a matching MM^{\prime} of II^{\prime} as M={(ai,xj)(si,tj)M}{(ai,yi)si is unmatched in M}{(bi,xi)1in}{(ci,zi)1in}M^{\prime}=\{(a_{i},x_{j})\mid(s_{i},t_{j})\in M\}\cup\{(a_{i},y_{i})\mid s_{i}\mbox{ is unmatched in }M\}\cup\{(b_{i},x_{i})\mid 1\leq i\leq n\}\cup\{(c_{i},z_{i})\mid 1\leq i\leq n\}. We show that MM^{\prime} is stable. Since residents in BCB\cup C are assigned to the top choice hospital and each resident aiAa_{i}\in A is assigned to a hospital in P(ai){yi}P^{\prime}(a_{i})\cup\{y_{i}\}, if there were a blocking pair for MM^{\prime}, it is of the form (ai,xj)(a_{i},x_{j}) for some xjP(ai)x_{j}\in P^{\prime}(a_{i}). Hence, sis_{i}’s preference list includes tjt_{j}. As xjx_{j}’s preference list is a single tie of all residents, xjx_{j} must be unmatched in MM^{\prime}, which implies that tjt_{j} is unmatched in MM. If (ai,yi)M(a_{i},y_{i})\in M^{\prime} then sis_{i} is unmatched in MM, so M{(si,tj)}M\cup\{(s_{i},t_{j})\} Pareto dominates MM, a contradiction. If (ai,xk)M(a_{i},x_{k})\in M^{\prime} for some kk, then xjaixkx_{j}\succ_{a_{i}}x_{k}, so (si,tk)M(s_{i},t_{k})\in M and tjsitkt_{j}\succ_{s_{i}}t_{k}. Thus (M{(si,tk)}){(si,tj)}(M\setminus\{(s_{i},t_{k})\})\cup\{(s_{i},t_{j})\} Pareto dominates MM, a contradiction. The numbers of hospitals in XX, YY, and ZZ that are assigned at least one resident are nn, nOPT(I)n-{\rm OPT}(I), and nn, respectively, so s(M)=3nOPT(I)s(M^{\prime})=3n-{\rm OPT}(I). Hence, we have that OPT(I)s(M)=3nOPT(I){\rm OPT}(I^{\prime})\geq s(M^{\prime})=3n-{\rm OPT}(I).

For the other direction, we first show that there exists an optimal solution for II^{\prime}, i.e., a stable matching of score OPT(I){\rm OPT}(I^{\prime}), that contains (bi,xi)(b_{i},x_{i}) and (ci,zi)(c_{i},z_{i}) for all ii (1in1\leq i\leq n). Let us call this property Π\Pi.

We begin with observing that in any stable matching, any resident is assigned to a hospital written in boldface in Fig. 9. Let MM^{\prime} be any stable matching for II^{\prime}. First, observe that (i) (ai,zj)M(a_{i},z_{j})\not\in M^{\prime} for any ii and jj. This is because there are at least 2n2n hospitals in XYX\cup Y preferred to zjz_{j} by aia_{i}, and their 4n4n positions cannot be fully occupied by only 3n3n residents. Next, we show that (ii) (bi,h)M(b_{i},h)\not\in M^{\prime} for any ii and h(XY){xi}h\in(X\cup Y)\setminus\{x_{i}\}. The reason is similar to that for (i): If (bi,h)M(b_{i},h)\in M^{\prime}, then all hospitals in ZZ must be full. But as shown above, no resident in AA is assigned a hospital in ZZ, so these 2n2n positions must be occupied by residents in BCB\cup C. However, this is impossible by 2n12n-1 residents (note that bib_{i} is assumed to be assigned to a hospital not in ZZ). The same argument as (ii) shows that (iii) (ci,h)M(c_{i},h)\not\in M^{\prime} for any ii and hXYh\in X\cup Y. We then show that (iv) (ai,h)M(a_{i},h)\not\in M^{\prime} for any ii and h(XP(ai))(Y{yi})h\in(X\setminus P^{\prime}(a_{i}))\cup(Y\setminus\{y_{i}\}). For contradiction, suppose that there are kk such residents ai1,,aika_{i_{1}},\ldots,a_{i_{k}}. Then, to avoid blocking pairs, all hospitals yi1,,yiky_{i_{1}},\ldots,y_{i_{k}} must be full, but by (ii) and (iii) above, no resident in BCB\cup C can be assigned to a hospital in YY. It then results that these 2k2k positions are occupied by kk agents ai1,,aika_{i_{1}},\ldots,a_{i_{k}}, a contradiction.

Now let MM^{\prime} be an optimal solution for II^{\prime}. Of course MM^{\prime} satisfies (i)–(iv) above. We show that MM^{\prime} can be modified to satisfy the property Π\Pi without breaking the stability and decreasing the score.

  1. (1)

    (Re)assign every bib_{i} to xix_{i} and every cic_{i} to ziz_{i} and let M1M^{\prime}_{1} be the resulting assignment. The followings are properties of M1M^{\prime}_{1}. Every resident is assigned to one hospital. By property (i), each hospital ziz_{i} is assigned one resident cic_{i}. By properties (i) and (iv), each hospital yiy_{i} is assigned at most one resident aia_{i}. A hospital xix_{i} is assigned a resident bib_{i} and at most two other residents of AA who are assigned to it by MM^{\prime}. Hence, xix_{i} is assigned at most three residents.

  2. (2)

    Let xix_{i} be a hospital that is assigned three residents in M1M^{\prime}_{1}. Then, M1(xi)={aj,ak,bi}M^{\prime}_{1}(x_{i})=\{a_{j},a_{k},b_{i}\} for some jj and kk. We choose either one agent of AA from M1(xi)M^{\prime}_{1}(x_{i}), say aja_{j}, and delete (aj,xi)(a_{j},x_{i}) from M1M^{\prime}_{1}. We do this for all such hospitals xix_{i} and let M2M^{\prime}_{2} be the resulting assignment. Note that M2M^{\prime}_{2} satisfies all upper quotas and hence is a matching. Note also that any hospital in XX that is full in MM^{\prime} is also full in M2M^{\prime}_{2}.

  3. (3)

    Let A2AA_{2}\subseteq A be the set of residents who is unmatched in M2M^{\prime}_{2}. Order residents in A2A_{2} arbitrarily, and apply the serial dictatorship algorithm in this order, i.e., in a resident aia_{i}’s turn, assign aia_{i} to the most preferred hospital that is undersubscribed. Note that in this process aia_{i} is assigned to a hospital in P(ai){yi}P^{\prime}(a_{i})\cup\{y_{i}\} because yiy_{i} is unassigned in M2M^{\prime}_{2}. Let M3M^{\prime}_{3} be the resulting matching.

It is not hard to see that M3M^{\prime}_{3} satisfies the property Π\Pi. Note that any hospital in XZX\cup Z is assigned at least one resident in M3M^{\prime}_{3}. Additionally, by the properties (i)–(iv), if a hospital yiy_{i} is assigned a resident in MM^{\prime} then she is aia_{i}, and by the modifications (1)–(3), the pair (ai,yi)(a_{i},y_{i}) is still in M3M^{\prime}_{3}. Hence, we have that c(M3)c(M)c(M^{\prime}_{3})\geq c(M^{\prime}). It remains to show the stability of M3M^{\prime}_{3}. Residents in BCB\cup C are assigned to the first choice hospital, so they do not participate in a blocking pair. Consider a resident aa in AA. If M3(a)=M(a)M^{\prime}_{3}(a)=M^{\prime}(a) then aa cannot form a blocking pair because MM^{\prime} is stable and any hospital hXh\in X that is full in MM^{\prime} is also full in M3M^{\prime}_{3}. If M3(a)M(a)M^{\prime}_{3}(a)\neq M^{\prime}(a), then aa is reassigned at the modification (3). By the assignment rule of (3), any hospital preferred to M3(a)M^{\prime}_{3}(a) by aa is full. Therefore aa cannot form a blocking pair. Thus we have shown that M3M^{\prime}_{3} is an optimal solution that satisfies property Π\Pi.

We construct a matching MM of II from M3M^{\prime}_{3} as M{(si,tj)(ai,xj)M3}M\coloneqq\{(s_{i},t_{j})\mid(a_{i},x_{j})\in M^{\prime}_{3}\}. MM is actually a matching because in (bj,xj)M3(b_{j},x_{j})\in M^{\prime}_{3} for each jj so xjx_{j} can be assigned at most one resident from AA. As noted above, all hospitals in XZX\cup Z are assigned in M3M^{\prime}_{3}, yielding a score of 2n2n. Hence, we can write OPT(I)=2n+α{\rm OPT}(I^{\prime})=2n+\alpha for some nonnegative integer α\alpha. Note that α\alpha is the number of hospitals in YY that are assigned at least one resident in M3M^{\prime}_{3}, and equivalently, the number of residents in AA assigned to a hospital in YY. Since each resident in AA is assigned to a hospital xjXx_{j}\in X for some jj or a hospital yiYy_{i}\in Y, we have that |M|=nα=3nOPT(I)|M|=n-\alpha=3n-{\rm OPT}(I^{\prime}).

We can see that MM is maximal since if M{(sp,tq)}M\cup\{(s_{p},t_{q})\} is a matching of II, then M3(ap)=ypM^{\prime}_{3}(a_{p})=y_{p} and xqx_{q} is undersubscribed in M3M^{\prime}_{3}, so (ap,xq)(a_{p},x_{q}) blocks M3M^{\prime}_{3}, contradicting the stability of M3M^{\prime}_{3}. We can see that MM is trade-in-free, i.e., there is no pair of an agent sps_{p} and a house tqt_{q} such that tqt_{q} is unmatched in MM, sps_{p} is matched in MM, and sps_{p} prefers tqt_{q} to M(sp)M(s_{p}). This is because if such a pair exists, then xqx_{q} is undersubscribed in M3M^{\prime}_{3} and apa_{p} prefers xqx_{q} to M3(ap)M^{\prime}_{3}(a_{p}), contradicting the stability of M3M^{\prime}_{3}. A coalition of MM is defined as a set of agents C={a0,a1,,ak1}C=\{a_{0},a_{1},\ldots,a_{k-1}\} for some k2k\geq 2 such that each aia_{i} prefers M(ai+1)M(a_{i+1}) to M(ai)M(a_{i}), where i+1i+1 is taken modulo kk. Satisfying a coalition CC means updating MM as M(M{(ai,M(ai))0ik1}){(ai,M(ai+1))0ik1}M\coloneqq(M\setminus\{(a_{i},M(a_{i}))\mid 0\leq i\leq k-1\})\cup\{(a_{i},M(a_{i+1}))\mid 0\leq i\leq k-1\}. Note that satisfying a coalition maintains the matching size, maximality, and trade-in-freeness. As long as there is a coalition, we satisfy it. This sequence of satisfying operations eventually terminates because at each operation at least two residents will be strictly improved (and none will be worse off). Then, the resulting matching MM is maximal, trade-in-free, and coalition-free. It is known (Proposition 2 of [3]) that a matching is Pareto optimal if and only if it is maximal, trade-in-free, and coalition-free. Hence, MM is Pareto optimal and |M|=3nOPT(I)|M|=3n-{\rm OPT}(I^{\prime}). Thus OPT(I)|M|=3nOPT(I){\rm OPT}(I)\leq|M|=3n-{\rm OPT}(I^{\prime}).

We have shown that OPT(I)=3nOPT(I){\rm OPT}(I)=3n-{\rm OPT}(I^{\prime}), which means that computing OPT(I){\rm OPT}(I^{\prime}) in polynomial time implies computing OPT(I){\rm OPT}(I) in polynomial time.

D.4 Proof of Theorem 13

We give a reduction from a decision problem COM-SMTI-2ML. An instance of this problem consists of the same number nn of men and women. There are two master lists, both of which may contain ties; one is a master list of men that includes all women, and the other is a master list of women that includes all men. Each man’s preference list is derived from the master list of men by deleting some women (and keeping the relative order of the remaining women), and each woman’s preference list is derived similarly from the master list of women. (However, in the following argument, we do not use the fact that there is a master list of men.) If ww is included in mm’s preference list, we say that ww is acceptable to mm. Without loss of generality, we assume that acceptability is mutual, i.e., mm is acceptable to ww if and only if ww is acceptable to mm. The problem COM-SMTI-2ML asks if there exists a weakly stable matching of size nn. It is known that COM-SMTI-2ML is NP-complete even if ties appear in preference lists of one side only (Theorem 3.1 of [22]).

Let II be an instance of COM-SMTI-2ML consisting of nn men m1,,mnm_{1},\ldots,m_{n} and nn women w1,,wnw_{1},\ldots,w_{n}. We will construct an instance II^{\prime} of HRT-MSLQ as follows. The set of residents is R={ri1in}R=\{r_{i}\mid 1\leq i\leq n\} and the set of hospitals is H={hi1in+1}H=\{h_{i}\mid 1\leq i\leq n+1\}. Hospitals hih_{i} (1in1\leq i\leq n) has quotas [1,1][1,1] and the hospital hn+1h_{n+1} has quotas [0,1][0,1]. A preference list of each hospital is derived from the master list of women in II by replacing a man mim_{i} with a resident rir_{i} for each ii. Let P(mi)P(m_{i}) be a (possibly incomplete) preference list of mim_{i} in II. We construct P(ri)P^{\prime}(r_{i}) from P(mi)P(m_{i}) by replacing each woman wjw_{j} with a hospital hjh_{j}. Then, the preference list of rir_{i} is defined as

rir_{i}: P(ri)P^{\prime}(r_{i}) hn+1h_{n+1} \cdots

where “\cdots” means an arbitrary strict order of all hospitals missing in the list. Now the reduction is completed. Note that if preference lists of men (resp. women) of II do not contain ties, the preference lists of residents (resp. hospitals) of II^{\prime} do not contain ties.

For the correctness, we show that II is an yes-instance if and only if II^{\prime} admits a stable matching of score n+1n+1. If II is an yes-instance, there exists a perfect stable matching MM of II. We construct a matching MM^{\prime} of II^{\prime} in such a way that (ri,hj)M(r_{i},h_{j})\in M^{\prime} if and only if (mi,wj)M(m_{i},w_{j})\in M. It is easy to see that the score of MM^{\prime} is n+1n+1. We show that MM^{\prime} is stable. Suppose not and let (ri,hj)(r_{i},h_{j}) be a blocking pair for MM^{\prime}. Note that jn+1j\neq n+1 because each resident is assigned to a better hospital than hn+1h_{n+1}. Then, we have that hjriM(ri)h_{j}\succ_{r_{i}}M^{\prime}(r_{i}) and rihjM(hj)r_{i}\succ_{h_{j}}M^{\prime}(h_{j}). By construction of MM^{\prime}, M(ri)M^{\prime}(r_{i}) is in P(ri)P^{\prime}(r_{i}) and hence hjh_{j} is also in P(ri)P^{\prime}(r_{i}), meaning that wjw_{j} is acceptable to mim_{i}. The fact hjriM(ri)h_{j}\succ_{r_{i}}M^{\prime}(r_{i}) in II^{\prime} implies wjmiM(mi)w_{j}\succ_{m_{i}}M(m_{i}) in II. Since mim_{i} is acceptable to wjw_{j}, the fact rihjM(hj)r_{i}\succ_{h_{j}}M^{\prime}(h_{j}) in II^{\prime} implies miwjM(wj)m_{i}\succ_{w_{j}}M(w_{j}) in II. Thus (mi,wj)(m_{i},w_{j}) blocks MM in II, a contradiction.

Conversely, suppose that there is a stable matching MM^{\prime} of II^{\prime} such that s(M)=n+1s(M^{\prime})=n+1. Since s(M)=n+1s(M^{\prime})=n+1, MM^{\prime} forms a perfect matching between RR and H{hn+1}H\setminus\{h_{n+1}\}. If rir_{i} is assigned to a hospital in the “\cdots” part, then rir_{i} and hn+1h_{n+1} form a blocking pair for MM^{\prime}, a contradiction. Hence, each rir_{i} is assigned to a hospital in P(ri)P^{\prime}(r_{i}). Then, M={(mi,wj)(ri,hj)M}M=\{(m_{i},w_{j})\mid(r_{i},h_{j})\in M^{\prime}\} is a perfect matching of II. It is not hard to see that MM is stable in II because if (mi,wj)(m_{i},w_{j}) blocks MM then (ri,hj)(r_{i},h_{j}) blocks MM^{\prime}.

Appendix E Other Objective Functions

This paper investigates approximability and inapproximability of the problem of maximizing the total satisfaction ratio over the family \mathcal{M} of stable matchings. This is formulated as

maxMhHsM(h), where sM(h)=min{1,|M(h)|(h)}.\max_{M\in\mathcal{M}}\sum_{h\in H}s_{M}(h),\mbox{ ~{}~{}where ~{}~{}}\textstyle{s_{M}(h)=\min\left\{1,\frac{|M(h)|}{\ell(h)}\right\}}.

To formulate the objective of “filling lower quotas of hospitals as much as possible,” other objective functions can be considered. Here we briefly discuss on three alternative objective functions below.

(a) Maximizing the minimum satisfaction ratio:
maxMminhHsM(h).\max_{M\in\mathcal{M}}\min_{h\in H}s_{M}(h).
(b) Maximizing the number of satisfied hospitals:
maxM|{hH|sM(h)=1}|.\max_{M\in\mathcal{M}}|\set{h\in H}{s_{M}(h)=1}|.
(c) Maximizing the number of residents filling lower quotas:
maxMhHvM(h), where vM(h)=min{(h),|M(h)|}.\max_{M\in\mathcal{M}}\sum_{h\in H}v_{M}(h),\mbox{ ~{}~{}where ~{}~{}}v_{M}(h)=\min\{\ell(h),|M(h)|\}.

We first provide a hardness result that is used to show the difficulty of approximation of those alternatives. Let us define a decision problem HRT-D as follows. An input of HRT-D is a pair (I,h)(I,h^{*}) consisting of an HRT instance II and a specified hospital hh^{*} in II, we are asked whether II admits a stable matching in which hh^{*} is assigned at least one resident.

Theorem 37.

The problem HRT-D is NP-complete.

Proof.

Membership in NP is obvious. We give a reduction from an NP-complete problem COM-SMTI [22]. An input of this problem is a stable marriage instance consisting of nn men and nn women, each having an incomplete preference list with ties. The problem asks if there exists a weakly stable matching of size nn.

Let II be an instance of COM-SMTI consisting of nn men m1,,mnm_{1},\ldots,m_{n} and nn women w1,,wnw_{1},\ldots,w_{n}. We will construct an instance II^{\prime} of HRT-D as follows. The set of residents is R={ri1in}{r}R=\{r_{i}\mid 1\leq i\leq n\}\cup\{r\} and the set of hospitals is H={hi1in}{h,h}H=\{h_{i}\mid 1\leq i\leq n\}\cup\{h,h^{*}\}, where hh^{*} is the specified hospital. An upper quota of each hospital is 1.

Let P(mi)P(m_{i}) and P(wi)P(w_{i}) be the preferences lists of mim_{i} and wiw_{i} in II, respectively. Then, we define P(ri)P^{\prime}(r_{i}) as the list obtained from P(mi)P(m_{i}) by replacing each wjw_{j} by hjh_{j}. Similarly, let P(hi)P^{\prime}(h_{i}) be the list obtained from P(wi)P(w_{i}) by replacing each mjm_{j} by rjr_{j}. The preference lists of agents in II^{\prime} are as follows, where “\cdots” denotes an arbitrary strict order of all agents missing in the list.

rir_{i}: P(ri)P^{\prime}(r_{i}) \cdots hh^{*} hih_{i} [1][1]: P(hi)P^{\prime}(h_{i}) rr \cdots
rr: \cdots hh^{*} hh hh [1][1]: \cdots
hh^{*} [1][1]: rr \cdots

Suppose that II admits a weakly stable matching MM of size nn. Then, M={(ri,hj)(mi,wj)M}{(r,h)}M^{\prime}=\{(r_{i},h_{j})\mid(m_{i},w_{j})\in M\}\cup\{(r,h^{*})\} is a stable matching of II^{\prime} in which hh^{*} is assigned.

Conversely, suppose that II^{\prime} admits a stable matching MM^{\prime} in which hh^{*} is assigned. Since the preference lists are all complete and the number of hospitals exceeds that of residents by one, all agents but one hospital are assigned in MM^{\prime}. If M(ri)=hM^{\prime}(r_{i})=h^{*} for some ii, then rir_{i} forms a blocking pair with the unassigned hospital; hence we have that M(r)=hM^{\prime}(r)=h^{*}. Then, each hih_{i} must be assigned a resident in P(hi)P^{\prime}(h_{i}), as otherwise (r,hi)(r,h_{i}) blocks MM^{\prime}. Thus MM^{\prime} defines a perfect matching between {ri1in}\{r_{i}\mid 1\leq i\leq n\} and {hi1in}\{h_{i}\mid 1\leq i\leq n\}. It is not hard to see that M={(mi,wj)(ri,hj)M}M=\{(m_{i},w_{j})\mid(r_{i},h_{j})\in M^{\prime}\} is a weakly stable matching of II of size nn. ∎

Proposition 38.

For the objective function (a), there is no polynomial-time algorithm whose approximation factor is bounded unless P=NP.

Proof.

We show the claim by a reduction from HRT-D. Given an instance (I,h)(I,h^{*}) of HRT-D, let II^{\prime} be an instance of HRT-MSLQ obtained from II by setting lower quotas as (h)=1\ell(h^{*})=1 and (h)=0\ell(h)=0 for any hH{h}h\in H\setminus\{h^{*}\}. Note that the sets of stable matchings in II and II^{\prime} are the same. Then, the optimal value of II^{\prime} is 11 if (I,h)(I,h^{*}) is a yes instance and 0 otherwise. Hence, any algorithm with a bounded approximation factor can distinguish these two cases. ∎

The proof of Proposition 38 utilizes the fact that assigning a resident to a hospital with lower quota of 0 does not contribute to the objective function at all. However, even without such hospitals, approximation of this objective function is impossible.

Proposition 39.

Proposition 38 holds even if lower quotas of all hospitals are positive.

Proof.

We modify the proof of Proposition 38 as follows. In the construction of II^{\prime}, for each hH{h}h\in H\setminus\{h^{*}\}, we set (h)=1\ell(h)=1 and increase the upper quota of hh by 11 from that in II. Hence, all lower quotas are 1 in II^{\prime}. We also add |H||H| dummy residents {dhhH{h}}\set{d_{h}\mid h\in H\setminus\{h^{*}\}}. Each hH{h}h\in H\setminus\{h^{*}\} adds dhd_{h} at the top of its preference list and adds other |H|1|H|-1 dummy residents at the bottom of its list in an arbitrary order. Each dummy resident dhd_{h} puts hh at the top of her list, followed by the other hospitals in any order. We can observe that, for any hH{h}h\in H\setminus\{h^{*}\}, (dh,h)(d_{h},h) is a pair in any stable matching. Thus, the problem reduces to the one in Proposition 38 and our claim is proved. ∎

We then turn to the objective function (b).

Proposition 40.

Solving the problem with objective function (b) exactly is NP-hard. There exists an algorithm whose approximation factor is at most nn. (Recall that nn is the number |R||R| of residents.)

Proof.

The first claim easily follows from a reduction from HRT-D. Given an instance (I,h)(I,h^{*}) of HRT-D, let II^{\prime} be an HRT-MSLQ instance obtained from II by setting lower quotas as (h)=1\ell(h^{*})=1 and (h)=0\ell(h)=0 for any hH{h}h\in H\setminus\{h^{*}\}. Then, the optimal value of II^{\prime} is |H||H| if (I,h)(I,h^{*}) is a yes instance and |H|1|H|-1 otherwise.

For the second claim, we show that the following naive algorithm attains an approximation factor of nn. Given an HRT-MSLQ instance II consisting of RR and HH, the algorithm first constructs a bipartite graph (R,H;E)(R,H;E) with E={(r,h)R×H|h is included in the top tie of r}E=\set{(r,h)\in R\times H}{\text{$h$ is included in the top tie of $r$}}. Let d(h)d(h) be the degree of each hospital hHh\in H in this graph. If d(h)<(h)d(h)<\ell(h) for every hHh\in H, then the algorithm returns an arbitrary stable matching; otherwise, the algorithm takes any hh^{*} with d(h)(h)d(h^{*})\geq\ell(h^{*}), breaks the ties of II so that hh^{*} has the highest rank in any tie including hh^{*}, and returns any stable matching of the resultant instance.

In the former case, we can easily see that any stable matching is a subset of EE; hence the optimal value is 0. Hence, any stable matching is optimal. In the latter case, the hospital hh^{*} is assigned at least (h)\ell(h^{*}) residents, and hence the objective value of the output matching is at least max{1,|{hH|(h)=0}|}\max\{1,|\set{h\in H}{\ell(h)=0}|\}. As the optimal value is at most n+|{hH|(h)=0}|n+|\set{h\in H}{\ell(h)=0}|, the approximation factor of this algorithm is at most nn. ∎

Note that the approximation factor mentioned in Proposition 40 cannot be attained by our algorithm Double Proposal: it may return a stable matching of value 0 even when there exists a stable matching of positive objective value. As the algorithm in the above proof is just a simple greedy algorithm and there is no inapproximability result for this problem, its approximability may be worth investigating further.

Finally, we consider the objective function (c).

Proposition 41.

For the objective function (c), there is no polynomial-time algorithm whose approximation factor is bounded unless P=NP.

Proof.

By the reduction used to show Proposition 38, we see that it is NP-hard to distinguish the two cases where the optimal objective value is 0 and 1. ∎

As shown above, the problem with the objective function (c) is inapproximable. Fortunately, however, it is approximable if all hospitals have positive lower quotas, in contrast to the objective function (a). We show that our algorithm Double Proposal presented in Section 5 attains an approximation factor better than the arbitrary tie-breaking GS algorithm.

Proposition 42.

For the objective function (c), Double Proposal attains the approximation factor shown in the second row of Table E if all hospitals have positive lower quotas.

Proof.

We show the approximation factors: For the R-side ML model, the output of Double Proposal is an optimal solution by Lemma 2(ii) and Lemma 35. For the marriage model, the assumption that all lower quotas are positive implies that any hospitals has quotas [1,1][1,1]. Then, every stable matching MM satisfies hHvM(h)=n\sum_{h\in H}v_{M}(h)=n, and hence the approximation factor of Double Proposal is clearly 11. For the uniform model, the approximation factor of Double Proposal for the objective function hHvM(h)\sum_{h\in H}v_{M}(h) is equivalent to that for hHsM(h)\sum_{h\in H}s_{M}(h), which is θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} by Theorem 9. For the general model, the approximation factor is obtained by combining Claims 43 and 44 below. ∎

In Table E below, we also present the maximum gap for the objective function (c) when all lower quotas are positive. The values for the marriage and uniform models follow from the above arguments. The maximum gap of nn for the general and R-side ML models can be obtained by modifying the proof of Proposition 6 (note that H0:={hH|(h)=0}=H_{0}:=\set{h\in H}{\ell(h)=0}=\emptyset under the assumption of positive lower quotas).

[htbp] General  Uniform  Marriage  RR-side ML Maximum gap Λ()\Lambda({\cal I}) (i.e., Approx. factor of arbitrary tie-breaking GS) nn θ\theta 11 nn Approx. factor of Double Proposal  n2+12\frac{\lceil\frac{n}{2}\rceil+1}{2} θ2+θ12θ1\frac{\theta^{2}+\theta-1}{2\theta-1} 11 11 Maximum gap Λ()\Lambda({\cal I}) and approximation factor of Double Proposal for the objective function (c) when lower quotas of all hospitals are positive.

Claim 43.

For the objective function (c), if all hospitals have positive lower quotas, the approximation factor of Double Proposal is at least n2+12\frac{\lceil\frac{n}{2}\rceil+1}{2} for the general model.

Proof.

We provide a family of instances each of which admits a stable matching with objective value n2+12\frac{\lceil\frac{n}{2}\rceil+1}{2} times as large as that of the output of Double Proposal. Let R=RR′′R=R^{\prime}\cup R^{\prime\prime} where R={r1,r2,,rn2}R^{\prime}=\{r^{\prime}_{1},r^{\prime}_{2},\dots,r^{\prime}_{\lfloor\frac{n}{2}\rfloor}\} and R′′={r1′′,r2′′,,rn2′′}R^{\prime\prime}=\{r^{\prime\prime}_{1},r^{\prime\prime}_{2},\dots,r^{\prime\prime}_{\lceil\frac{n}{2}\rceil}\} and the set of hospitals is given as H={h1,h2,hn}{x,y}H=\{h_{1},h_{2}\dots,h_{n}\}\cup\{x,y\}. Then, |R|=n|R|=n and |H|=n+2|H|=n+2. The preference lists are given as follows, where “(  RR  )” represents the tie consisting of all residents.

rir^{\prime}_{i}: xx hih_{i} \cdots xx [1,n2][1,\lfloor\frac{n}{2}\rfloor]: (  RR  )
ri′′r^{\prime\prime}_{i}: xx yy \cdots yy [1,n][1,n]: (  RR  )
hih_{i} [1,1][1,1]: (  RR  )

If indices are defined so that residents in RR^{\prime} have smaller indices compared with those in R′′R^{\prime\prime}, then Double Proposal returns M={(ri,x)|i=1,2,,n2}{(ri′′,y)|i=1,2,,n2}M=\set{(r^{\prime}_{i},x)}{i=1,2,\dots,\lfloor\frac{n}{2}\rfloor}\cup\set{(r^{\prime\prime}_{i},y)}{i=1,2,\dots,\lceil\frac{n}{2}\rceil}, whose objective value is vM(x)+vM(y)=2v_{M}(x)+v_{M}(y)=2. Define NN^{\prime} by N={(ri,hi)|i=1,2,,n2}{(ri′′,x)|i=1,2,,n2}N^{\prime}=\set{(r^{\prime}_{i},h_{i})}{i=1,2,\dots,\lfloor\frac{n}{2}\rfloor}\cup\set{(r^{\prime\prime}_{i},x)}{i=1,2,\dots,\lfloor\frac{n}{2}\rfloor} and let N=NN=N^{\prime} if nn is even and N=N{(rn2′′,y)}N=N^{\prime}\cup\{(r^{\prime\prime}_{\lceil\frac{n}{2}\rceil},y)\} if nn is odd. Then, NN is a stable matching whose objective value is n2+1\lceil\frac{n}{2}\rceil+1. ∎

Claim 44.

For the objective function (c), if all hospitals have positive lower quotas, the approximation factor of Double Proposal is at most n2+12\frac{\lceil\frac{n}{2}\rceil+1}{2} for the general model.

Proof.

Take any instance and let NN be an optimal solution and MM be the output of Double Proposal. We use the notation vM(H)=hHvM(h)v_{M}(H^{\prime})=\sum_{h\in H^{\prime}}v_{M}(h) for any HHH^{\prime}\subseteq H and define vN(H)v_{N}(H^{\prime}) similarly. Consider a bipartite graph (R,H;MN)(R,H;M\cup N), which may have multiple edges. Take an arbitrary connected component and let RR^{*} and HH^{*} be the sets of residents and hospitals, respectively, contained in it. It is sufficient to bound vN(H)vM(H)\frac{v_{N}(H^{*})}{v_{M}(H^{*})}.

For any hHh\in H, let rM(h)=|M(h)|vM(h)r_{M}(h)=|M(h)|-v_{M}(h), which is the number of residents assigned to hh redundantly in MM. We write rM(H)=hHrM(h)r_{M}(H^{\prime})=\sum_{h\in H^{\prime}}r_{M}(h) for any HHH^{\prime}\subseteq H. We define rN(h)r_{N}(h) and rN(H)r_{N}(H^{\prime}) similarly for NN. Define the sets H0,R0,H1,R1,H2,H_{0},R_{0},H_{1},R_{1},H_{2}, and R2R_{2} as in the proof of Theorem 9. By the arguments there, using Lemma 2(ii), we can see that each hH1h\in H_{1} satisfies either |N(h)|=u(h)|N(h)|=u(h) or |M(h)|(h)|M(h)|\leq\ell(h), each of which implies rM(h)rN(h)r_{M}(h)\leq r_{N}(h). Then, we have rM(H1)rN(H1)r_{M}(H_{1})\leq r_{N}(H_{1}). Moreover, the definition of H0H_{0} implies rM(h)=0r_{M}(h)=0 for each hH0h\in H_{0}, and hence rM(H0)rN(H0)r_{M}(H_{0})\leq r_{N}(H_{0}). Thus, we have rM(H0H1)rN(H0H1)0r_{M}(H_{0}\cup H_{1})-r_{N}(H_{0}\cup H_{1})\leq 0.

Note that vN(H)+rN(H)=|R|=vM(H)+rM(H)v_{N}(H^{*})+r_{N}(H^{*})=|R^{*}|=v_{M}(H^{*})+r_{M}(H^{*}), and hence vN(H)=vM(H)+rM(H)rN(H)v_{N}(H^{*})=v_{M}(H^{*})+r_{M}(H^{*})-r_{N}(H^{*}). If H2=H_{2}=\emptyset, then rM(H)rN(H)=rM(H0H1)rN(H0H1)0r_{M}(H^{*})-r_{N}(H^{*})=r_{M}(H_{0}\cup H_{1})-r_{N}(H_{0}\cup H_{1})\leq 0, and hence clearly vN(H)vM(H)1\frac{v_{N}(H^{*})}{v_{M}(H^{*})}\leq 1. We then assume H2H_{2}\neq\emptyset in the rest of the proof.

By the definitions of H1H_{1} and H2H_{2}, at least one resident is assigned to each of them in MM. By the assumption that all hospitals have positive lower quotas, we obtain vM(H1)1v_{M}(H_{1})\geq 1 and vM(H2)1v_{M}(H_{2})\geq 1, and hence vM(H)2v_{M}(H^{*})\geq 2. Thus,

vN(H)vM(H)=vM(H)+rM(H)rN(H)vM(H)2+rM(H)rN(H)2.\frac{v_{N}(H^{*})}{v_{M}(H^{*})}=\frac{v_{M}(H^{*})+r_{M}(H^{*})-r_{N}(H^{*})}{v_{M}(H^{*})}\leq\frac{2+r_{M}(H^{*})-r_{N}(H^{*})}{2}.

We now bound the value of rM(H)rN(H)r_{M}(H^{*})-r_{N}(H^{*}). By the definitions of rMr_{M} and rNr_{N}, we have

rM(H2)rN(H2)=hH2|M(h)|vM(H2){hH2|N(h)|vN(H2)}.\textstyle r_{M}(H_{2})-r_{N}(H_{2})=\sum_{h\in H_{2}}|M(h)|-v_{M}(H_{2})-\left\{\sum_{h\in H_{2}}|N(h)|-v_{N}(H_{2})\right\}.

Additionally, vM(H2)+vN(H2)0-v_{M}(H_{2})+v_{N}(H_{2})\leq 0, hH2|N(h)|=|R2|\sum_{h\in H_{2}}|N(h)|=|R_{2}|, and hH2|M(h)||R2|+|R1|\sum_{h\in H_{2}}|M(h)|\leq|R_{2}|+|R_{1}| by the definitions of H0H_{0}, H1H_{1}, and H2H_{2}. Substituting them, we obtain rM(H2)rN(H2)|R1|r_{M}(H_{2})-r_{N}(H_{2})\leq|R_{1}|. We also have |R1|=vN(H1)+rN(H1)vM(H1)+rN(H1)=hH1|M(h)|rM(H1)+rN(H1)|R_{1}|=v_{N}(H_{1})+r_{N}(H_{1})\leq v_{M}(H_{1})+r_{N}(H_{1})=\sum_{h\in H_{1}}|M(h)|-r_{M}(H_{1})+r_{N}(H_{1}) and hH1|M(h)|nhH2|M(h)|=nvM(H2)rM(H2)n1rM(H2)\sum_{h\in H_{1}}|M(h)|\leq n-\sum_{h\in H_{2}}|M(h)|=n-v_{M}(H_{2})-r_{M}(H_{2})\leq n-1-r_{M}(H_{2}). Combining them, we obtain

rM(H2)rN(H2)n1rM(H2)rM(H1)+rN(H1),r_{M}(H_{2})-r_{N}(H_{2})\leq n-1-r_{M}(H_{2})-r_{M}(H_{1})+r_{N}(H_{1}),

which implies rM(H2)12(n1rM(H1)+rN(H2)+rN(H1))r_{M}(H_{2})\leq\frac{1}{2}(n-1-r_{M}(H_{1})+r_{N}(H_{2})+r_{N}(H_{1})). Then,

rM(H)rN(H)\displaystyle r_{M}(H^{*})-r_{N}(H^{*}) \displaystyle\leq rM(H2)+rM(H1)rN(H2)rN(H1)\displaystyle r_{M}(H_{2})+r_{M}(H_{1})-r_{N}(H_{2})-r_{N}(H_{1})
\displaystyle\leq 12(n1+rM(H1)rN(H1)rN(H2)).\displaystyle\frac{1}{2}(n-1+r_{M}(H_{1})-r_{N}(H_{1})-r_{N}(H_{2})).

Since rM(H1)rN(H1)0r_{M}(H_{1})-r_{N}(H_{1})\leq 0 and rN(H2)0-r_{N}(H_{2})\leq 0, we obtain rM(H)rN(H)n12r_{M}(H^{*})-r_{N}(H^{*})\leq\frac{n-1}{2}, which implies rM(H)rN(H)n21r_{M}(H^{*})-r_{N}(H^{*})\leq\lceil\frac{n}{2}\rceil-1 by the integrality of rM(H)rN(H)r_{M}(H^{*})-r_{N}(H^{*}). Thus, we obtain vN(H)vM(H)2+rM(H)rN(H)2n2+12\frac{v_{N}(H^{*})}{v_{M}(H^{*})}\leq\frac{2+r_{M}(H^{*})-r_{N}(H^{*})}{2}\leq\frac{\lceil\frac{n}{2}\rceil+1}{2}. ∎