This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Combating Collusion Rings is Hard but Possible

Niclas Boehmer Technische Universität Berlin, Faculty IV, Algorithmics and Computational Complexity, Berlin, Germany
{niclas.boehmer,andre.nichterlein}@tu-berlin.de
Robert Bredereck Humboldt-Universität zu Berlin, Institut für Informatik, Algorithm Engineering, Berlin, Germany
robert.bredereck@hu-berlin.de
André Nichterlein Technische Universität Berlin, Faculty IV, Algorithmics and Computational Complexity, Berlin, Germany
{niclas.boehmer,andre.nichterlein}@tu-berlin.de
Abstract

A recent report of Littmann [Commun. ACM ’21] outlines the existence and the fatal impact of collusion rings in academic peer reviewing. We introduce and analyze the problem Cycle-Free Reviewing that aims at finding a review assignment without the following kind of collusion ring: A sequence of reviewers each reviewing a paper authored by the next reviewer in the sequence (with the last reviewer reviewing a paper of the first), thus creating a review cycle where each reviewer gives favorable reviews. As a result, all papers in that cycle have a high chance of acceptance independent of their respective scientific merit.

We observe that review assignments computed using a standard Linear Programming approach typically admit many short review cycles. On the negative side, we show that Cycle-Free Reviewing is NP-hard in various restricted cases (i.e., when every author is qualified to review all papers and one wants to prevent that authors review each other’s or their own papers or when every author has only one paper and is only qualified to review few papers). On the positive side, among others, we show that, in some realistic settings, an assignment without any review cycles of small length always exists. This result also gives rise to an efficient heuristic for computing (weighted) cycle-free review assignments, which we show to be of excellent quality in practice.

1 Introduction

As recently pointed out by Littman (2021), the integrity and legitimacy of scientific conference publications (particularly important in the context of computer science) is threatened by so-called “collusion rings”, which are sets of authors that unethically review and support each other while breaking anonymity and hiding conflicts of interest. Despite the fact that details are usually not disclosed for various reasons, it is inevitable that the process of assigning papers to reviewers is the key point to engineer technical barriers against such incidents. Whereas assignments at very small venues could be performed manually, support by (semi-)automatic systems becomes necessary already for medium-size conferences. Today computational support for finding review assignments is well-established and has improved the quality of the reviewing and paper assignment process in many ways (see the surveys of Shah (2021) and Price and Flach (2017) for details). Still there is huge potential for improving processes and further computational support is urgently requested (Price and Flach, 2017; Shah, 2021).

When aiming to prevent collusion rings, one of the most basic properties one can request from a review assignment is that the assignment does not contain any review cycle of length zz, that is, a sequence of zz agents each reviewing a paper authored by the next agent in the sequence (with the last agent reviewing a paper authored by the first). This property is of high practical relevance: For example, in the AAAI’21 review assignment the non-existence of review cycles of length at most z=2z=2 was a soft constraint (Leyton-Brown and Mausam, 2021). Yet, there is a lack of systematic studies concerning the computation of such assignments. Motivated by this, we propose and analyze Cycle-Free Reviewing, the problem of computing an assignment of papers to agents that is free of review cycles of length at most zz, both from a theoretical and practical perspective.

1.1 Related Work

The literature is rich in the general context of peer reviewing (see, e. g., the works of Goldsmith and Sloan (2007); Taylor (2008); Garg et al. (2010); Long et al. (2013); Lian et al. (2018); Kobren et al. (2019); Stelmakh et al. (2021) on computational aspects of finding a “good” review assignment, and the survey of Shah (2021)). Closest to our work are Barrot et al. (2020) and Guo et al. (2018). In the context of product reviewing, among others, Barrot et al. (2020) propose and analyze a restricted case which translates to our setting as follows: Given a set of single-author papers and a set of agents each writing a single paper and each having some conflicts of interest over papers, find a review assignment of papers to agents, where each agent serves as a reviewer providing one review and each paper must receive one review. They show that in this setting finding an assignment without review cycles of length at most zz corresponds to finding a 22-factor without cycles of length at most zz, which is known to be NP-hard for z5z\geq 5 but polynomial-time solvable for z3z\leq 3 (Hell et al., 1988). Closer to our setting is that of Guo et al. (2018), who also consider the computation of cycle-free review assignments. They propose two simple heuristics and conduct experiments measuring the quality of their heuristics and the number of review cycles in a weight-maximizing solution on two instances, mostly focusing on the influence of the number of reviews per paper and per reviewer.

1.2 Outline and Contributions

Our contribution is threefold. First, in Section 3, we show the intractability of Cycle-Free Reviewing in various restricted settings: We show NP-hardness even when just forbidding review cycles of length at most two in “sparse” and “dense” settings (e.g., if each reviewer can review only “few” or can review “almost all” papers, see Theorems 1, 2 and 3). Furthermore, solving a question left open by Barrot et al. (2020), we show NP-hardness if each agent writes just one single-author paper and can review only few papers (Theorem 4).

Second, in Section 4, we develop greedy heuristics. In contrast to Guo et al. (2018) we provide a theoretical analysis for the heuristics. In particular, we prove that, if the considered instance satisfies certain near-realistic conditions (such as that each paper has few authors and that for each paper there are many possible reviewers), then these heuristics are guaranteed to output a zz-cycle-free review assignments in polynomial time.

Third, in Section 5, we present and discuss the results of our experiments. Our core results are:

  1. 1.

    Existing linear-programming-based methods for computing maximum-weight review assignments (as often used in practice) produce assignments where a high fraction (20% or more) of agents and papers belong to some review cycles of length two.

  2. 2.

    For z{2,3,4}z\in\{2,3,4\} maximum-weight zz-cycle-free assignments computed by one of our heuristics (see Section 4) or computed via Integer Linear Programming are almost as good as the maximum-weight review assignments with cycles (solution quality loss less than 4% resp. 1%).

  3. 3.

    Somewhat surprisingly, we show that adding additional reviewers that are authors of some papers to the reviewer pool increases the number of papers that belong to review cycles in maximum-weight (non cycle-free) assignments.

2 Preliminaries

For nn\in\mathds{N}, we set [n]:={1,,n}[n]:=\{1,\ldots,n\}. In an instance of Cycle-Free Reviewing, we are given a set PP of papers and set AA of agents, where each paper pPp\in P is authored by a subset aut(p)A\operatorname{\text{aut}}(p)\subseteq A of agents. Moreover, we are given for each agent aAa\in A a subset rev(a)P\operatorname{\text{rev}}(a)\subseteq P of papers the agent is qualified to review111Being “qualified to review” can encode that the agent is capable of reviewing the paper or that the agent does not have a conflict of interest with one of the co-authors or both.. We capture this information in a bipartite graph (AP,EAEP)(A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E_{A}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) with EA={(a,p)aA,prev(a)}E_{A}=\{(a,p)\mid a\in A,p\in\operatorname{\text{rev}}(a)\} and EP={(p,a)pP,aaut(p)}E_{P}=\{(p,a)\mid p\in P,a\in\operatorname{\text{aut}}(p)\} (see also Table 1 for an overview). A (peer) review assignment EEAE^{\prime}\subseteq E_{A} is a subset of edges from agents to papers, where we say that aa reviews pp in EE^{\prime} if (a,p)E(a,p)\in E^{\prime}. Given a review assignment EEAE^{\prime}\subseteq E_{A}, for an agent aAa\in A, let N+(a,E)={pP(a,p)E}N^{+}(a,E^{\prime})=\{p\in P\mid(a,p)\in E^{\prime}\} be the subset of papers agent aa reviews in EE^{\prime} and, for a paper pPp\in P, let N(p,E)={aA(a,p)E}N^{-}(p,E^{\prime})=\{a\in A\mid(a,p)\in E^{\prime}\} be the subset of agents that review pp in EE^{\prime}. For c,dc,d\in\mathds{N} a review assignment EEAE^{\prime}\subseteq E_{A} is called cc-dd-valid if each agent reviews at most cc papers and each paper is reviewed by dd agents, that is, |N+(a,E)|c|N^{+}(a,E^{\prime})|\leq c for all aAa\in A and |N(p,E)|=d|N^{-}(p,E^{\prime})|=d for all pPp\in P. In a review assignment EEAE^{\prime}\subseteq E_{A}, we say that papers p1,,pzp_{1},\dots,p_{z} and agents a1,,aza_{1},\dots,a_{z} form a review cycle (of length zz) if aia_{i} is an author of pip_{i} ((pi,ai)EP(p_{i},a_{i})\in E_{P}) for all i[z]i\in[z], aia_{i} reviews pi+1p_{i+1} in EE^{\prime} ((ai,pi+1)E(a_{i},p_{i+1})\in E^{\prime}) for i[z1]i\in[z-1] and aza_{z} reviews p1p_{1} in EE^{\prime} ((az,p1)E(a_{z},p_{1})\in E^{\prime}). Notably, a review cycle of length zz in EE^{\prime} corresponds to a directed cycle of length 2z2z in (AP,EEP)(A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) and a review cycle of length one corresponds to an author reviewing one of its own papers. We say that a review assignment EE^{\prime} is zz-cycle free if there is no review cycle of length i[z]i\in[z] in EE^{\prime}.

Table 1: Notation overview
Variable Explanation
V=APV=A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P vertex set consisting of agents AA and papers PP with nA=|A|n_{A}=|A| and nP=|P|n_{P}=|P|
EAE_{A} (a,p)EAA×P(a,p)\in E_{A}\subseteq A\times P shows aa can review pp
EPE_{P} (p,a)EPP×A(p,a)\in E_{P}\subseteq P\times A shows aa authors pp
N(v,E)N^{-}(v,E) in-neighbors of vVv\in V wrt. E(V2)E\subseteq\binom{V}{2}, i. e., N(v,E):={uV(u,v)E}N^{-}(v,E):=\{u\in V\mid(u,v)\in E\}
N+(v,E)N^{+}(v,E) out-neighbors of vVv\in V wrt. E(V2)E\subseteq\binom{V}{2}, i. e., N+(v,E):={uV(v,u)E}N^{+}(v,E):=\{u\in V\mid(v,u)\in E\}
ΔU\Delta^{-}_{U}, ΔU+\Delta^{+}_{U} maximum in- and out-degree in UU resp., e. g., ΔU:=maxuU|N(u,EAEP)|\Delta^{-}_{U}:=\max_{u\in U}|N^{-}(u,E_{A}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})|
δU\delta^{-}_{U}, δU+\delta^{+}_{U} minimum in- and out-degree in UU resp., e. g., δU+:=minuU|N+(u,EAEP)|\delta^{+}_{U}:=\min_{u\in U}|N^{+}(u,E_{A}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})|
ΔA\Delta_{A}^{-}, δA\delta_{A}^{-} maximum resp. minimum number of papers per author
ΔP+\Delta_{P}^{+}, δP+\delta_{P}^{+} maximum resp. minimum number of authors per paper
ΔA+\Delta_{A}^{+}, δA+\delta_{A}^{+} maximum resp. minimum number of papers any author is qualified to review
ΔP\Delta_{P}^{-}, δP\delta_{P}^{-} maximum resp. minimum number of potential reviewers for any paper

Using this notation, we define our central problem and refer to Table 1 for further necessary variable definitions:

[Weighted] Cycle-Free Reviewing
Input: A directed bipartite graph (AP,EAEP)(A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E_{A}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) and non-negative integers creviewerc_{\text{reviewer}}, dpaperd_{\text{paper}}, and zz [and a weight function w:EAw:E_{A}\mapsto\mathbb{Z} and an integer WW].

Question: Is there a creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid and zz-cycle-free review assignment EEAE^{\prime}\subseteq E_{A} [of weight at least WW, i.e., eEw(e)W\sum_{e\in E^{\prime}}w(e)\geq W]?

3 NP-Hardness in Various Restricted Cases

From the work of Barrot et al. (2020, Theorem 4.12) it follows that Cycle-Free Reviewing is NP-hard in the single-author-single-paper setting (ΔA=ΔP+=1\Delta_{A}^{-}=\Delta_{P}^{+}=1) even if creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1 and z=2z=2. However, as in reality instances of Cycle-Free Reviewing are hardly arbitrary but have a quite strong structure, in this section we prove that the NP-hardness of Cycle-Free Reviewing upholds even if the given instance fulfills further quite restrictive conditions, e.g., each agent is qualified to review all papers or our problem specific parameters (ΔA,ΔP+,ΔA+,ΔP,creviewer,dpaper,z\Delta_{A}^{-},\Delta_{P}^{+},\Delta_{A}^{+},\Delta_{P}^{-},c_{\text{reviewer}},d_{\text{paper}},z) are small constants.

3.1 Sparse Review Graph and Small Weights

We start by considering the case where all our parameters are small. Specifically, we show the NP-hardness of Cycle-Free Reviewing for arbitrarily z2z\geq 2 even if each paper is only authored by at most two agents, each agent authors at most two papers, each agent is only qualified to review at most three papers, and for each paper only at most three agents are qualified to review it (see Table 1 for definitions).

Theorem 1.

For any z2z\geq 2, Cycle-Free Reviewing is NP-hard, even if ΔA+=ΔP=3\Delta_{A}^{+}=\Delta_{P}^{-}=3, ΔA=ΔP+=2\Delta_{A}^{-}=\Delta_{P}^{+}=2, nA=nPn_{A}=n_{P}, and creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1. The hardness results still hold if agents are not allowed to review papers of co-authors.

Proof.

We reduce from an NP-hard variant of Satisfiability where each clause consists of exactly three literals and each variable occurs positive in at most two clauses and negative in at most two clauses (Berman et al., 2003).

Construction.

Given an instance of Satisfiability consisting of a set XX of variables and a set CC of clauses, we set dpaper=creviewer=1d_{\text{paper}}=c_{\text{reviewer}}=1 and zz to some integer greater than one. We construct the set AA of agents and the set PP of papers as follows. For each variable xXx\in X, we introduce three agents axa_{x}, ax¯a_{\bar{x}}, and bxb_{x} and three papers pxp_{x}, px¯p_{\bar{x}}, and qxq_{x} (qxq_{x} has no author and can be considered as a dummy paper). Agents axa_{x} and bxb_{x} are qualified to review pxp_{x}, agents ax¯a_{\bar{x}} and bxb_{x} are qualified to review px¯p_{\bar{x}} and agents axa_{x} and ax¯a_{\bar{x}} are qualified to review qxq_{x}. Intuitively, either does axa_{x} review pxp_{x} (which corresponds to setting xx to false) or ax¯a_{\bar{x}} review px¯p_{\bar{x}} (which corresponds to setting xx to true).

For each clause c=123c=\ell_{1}\vee\ell_{2}\vee\ell_{3}, we introduce three agents ac1a_{c}^{1}, ac2a_{c}^{2}, and ac3a_{c}^{3} and three papers pc1p_{c}^{1}, pc2p_{c}^{2}, and pc3p_{c}^{3} where acia_{c}^{i} is qualified to review pcip_{c}^{i} for i[3]i\in[3]. Moreover, we introduce two dummy agents that are both qualified to review pc1p_{c}^{1}, pc2p_{c}^{2}, and pc3p_{c}^{3} and two dummy papers who ac1a_{c}^{1}, ac2a_{c}^{2}, and ac3a_{c}^{3} are all qualified to review. Notably, for one i[3]i\in[3], acia_{c}^{i} needs to review pcip_{c}^{i} (which corresponds to cc being fulfilled because of i\ell_{i}).

Concerning the authors of each paper, for each clause c=123c=\ell_{1}\vee\ell_{2}\vee\ell_{3} and i[3]i\in[3], acia_{c}^{i} is an author of pip_{\ell_{i}} and aia_{\ell_{i}} is an author of pcip^{i}_{c}.

It is easy to see that each agent is only qualified to review at most three papers and that for each paper only at most three agents are qualified to review it. Moreover, as each literal only appears in at most two clauses, every paper has at most two authors and each agent authors at most two papers. Moreover, note that |A|=|P||A|=|P|, implying that each agent has to review exactly one paper.

(\Rightarrow) Let ZZ be the set of variables that are set to true in a satisfying assignment of the given Satisfiability instance. Then, for xZx\in Z, we assign bxb_{x} to pxp_{x}, axa_{x} to qxq_{x}, and ax¯a_{\bar{x}} to px¯p_{\bar{x}}, while for xZx\notin Z, we assign bxb_{x} to px¯p_{\bar{x}}, ax¯a_{\bar{x}} to qxq_{x}, and axa_{x} to pxp_{x}. For a clause c=123c=\ell_{1}\vee\ell_{2}\vee\ell_{3}, let i\ell_{i^{*}} with i[3]i^{*}\in[3] be a literal from cc that is set to true by the given assignment (such a literal exists because the given assignment is satisfying). Then, we set acia_{c}^{i^{*}} to review pcip_{c}^{i^{*}}. The two dummy agents from this clause are assigned arbitrarily to pcip_{c}^{i} for i[3]{i}i\in[3]\setminus\{i^{*}\} and the agents acia_{c}^{i} for i[3]{i}i\in[3]\setminus\{i^{*}\} are assigned arbitrarily to the two dummy papers. To show that the constructed assignment does not contain a review cycle (of arbitrary length) note that only papers that have an author are papers pp_{\ell} for some literal \ell (which are authored by acia_{c}^{i} for some cCc\in C and i[3]i\in[3] where \ell appears in cc as the iith literal) and papers pcip^{i}_{c} for some c=123Cc=\ell_{1}\vee\ell_{2}\vee\ell_{3}\in C and i[3]i\in[3] (which are authored by aia_{\ell_{i}}). Thus, every review cycle of length at least two needs to contain an agent acia_{c}^{i} for some cCc\in C and i[3]i\in[3] and aa_{\ell}, where \ell appears in cc as the iith literal, and acia_{c}^{i} reviews pcip_{c}^{i} and aa_{\ell} reviews pp_{\ell}. For acia_{c}^{i} to review pcip_{c}^{i} it needs to hold that the given assignment satisfies \ell. However, by our construction of the review assignment, aa_{\ell} reviewing pp_{\ell} implies that ¯\bar{\ell} is satisfied. Thus, no review cycle exists.

(\Leftarrow) Assume we are given a 11-11-valid zz-cycle-free review assignment. Let Y:={xXax¯ reviews px¯}Y:=\{x\in X\mid a_{\bar{x}}\text{ reviews }p_{\bar{x}}\}. We claim that the assignment α\alpha which sets all variables in YY to true and all variables in XYX\setminus Y to false satisfies the given formula. Assume for the sake of contradiction that there exists a clause c=123Cc=\ell_{1}\vee\ell_{2}\vee\ell_{3}\in C which is not satisfied by α\alpha. As the given assignment is 11-11-valid and we have the same number of agents and papers in the constructed instance, there is a i[3]i^{*}\in[3] such that acia_{c}^{i^{*}} reviews pcip_{c}^{i^{*}}. Note that by the same reasoning, for each xXx\in X, either does axa_{x} review pxp_{x} or ax¯a_{\bar{x}} review px¯p_{\bar{x}}. Thus, if a literal \ell is not satisfied by α\alpha, then aa_{\ell} reviews pp_{\ell}. As i\ell_{i^{*}} is not satisfied by α\alpha , aia_{\ell_{i^{*}}} reviews pip_{\ell_{i^{*}}}. Thus, acia_{c}^{i} and aia_{\ell_{i^{*}}} form a review cycle of length two, as acia_{c}^{i} reviews pcip_{c}^{i}, which is authored by aia_{\ell^{*}_{i}}, and aia_{\ell^{*}_{i}} reviews pip_{\ell^{*}_{i}}, which is authored by acia_{c}^{i}, a contradiction. ∎

The above reduction crucially relies on the “sparsity” of the qualifications, i.e., that each agent is qualified to review between two and three papers and that for each paper only two or three agents are qualified to review it. Motivated by the observation that, in practice, reviewers are typically qualified to review more than just two or three papers and that for each paper there typically exists more than just two or three qualified reviewers, it is a natural question whether our above hardness result still extends to this case. We answer this question affirmative by proving hardness for arbitrary δA+\delta_{A}^{+} and δP\delta_{P}^{-}, i.e., for the case where each agent is qualified to review at least δA+\delta_{A}^{+} papers and for each paper there exist at least δP\delta_{P}^{-} agents that are qualified to review to:

Proposition 1.

For any z2z\geq 2, δP2δA+\delta_{P}^{-}\geq 2\leq\delta_{A}^{+}, Cycle-Free Reviewing is NP-hard, even if ΔA=ΔP+=2\Delta_{A}^{-}=\Delta_{P}^{+}=2, nA=nPn_{A}=n_{P}, and creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1.

Proof.

Let δ:=max(δA+,δP)\delta:=\max(\delta_{A}^{+},\delta_{P}^{-}). We reduce from the restricted NP-hard variant of Cycle-Free Reviewing considered in Theorem 1. Given an instance =((AP,EAEP),creviewer=1,dpaper=1,z=2)\mathcal{I}=((A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E_{A}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}),c_{\text{reviewer}}=1,d_{\text{paper}}=1,z=2) of Cycle-Free Reviewing with ΔA=ΔP+=2\Delta_{A}^{-}=\Delta_{P}^{+}=2, we modify the instance \mathcal{I} by introducing two sets AA^{\prime} and A′′A^{\prime\prime} of δ\delta agents each and two sets PP^{\prime} and P′′P^{\prime\prime} of δ\delta papers each. All agents from AA^{\prime} are qualified to review all papers from PP^{\prime} and from PP. In addition to being qualified to review some papers from PP (as captured in EAE_{A}), all agents from AA are qualified to review all papers from P′′P^{\prime\prime}. Moreover, all agents from A′′A^{\prime\prime} are qualified to review all papers from P′′P^{\prime\prime}. Thereby, all agents are qualified to review at least δ\delta papers and for each paper at least δ\delta agents are qualified to review it. Notably, we still have |A|=|P||A|=|P|. Thus, as agents from A′′A^{\prime\prime} are only qualified to review papers from P′′P^{\prime\prime} and |A′′|=|P′′||A^{\prime\prime}|=|P^{\prime\prime}|, all papers from P′′P^{\prime\prime} need to be reviewed by agents from A′′A^{\prime\prime} (which is always possible to do in without creating a review cycle as no paper from A′′A^{\prime\prime} has an author). Similarly, as papers from PP^{\prime} can only be reviewed by agents from AA^{\prime} and |A|=|P||A^{\prime}|=|P^{\prime}|, all agents from AA^{\prime} need to review papers from PP^{\prime} (which is always possible to do in without creating a review cycle as no paper from AA^{\prime} has an author). Thus, all agents from AA need to review papers from PP from which the correctness of the reduction directly follows.

Lastly, note that we did not modify the set of authors for any paper from PP and did not add papers with an author. Thus, it still holds in the modified instance that each agent authors at most two papers and each paper has at most two authors (ΔA=ΔP+=2\Delta_{A}^{-}=\Delta_{P}^{+}=2). ∎

While we prove hardness for arbitrary δA+\delta_{A}^{+} and δP\delta_{P}^{-}, in our construction from Proposition 1, there are always agents that are not qualified to review “many” papers (around 23\frac{2}{3}) and always papers that cannot be reviewed by “many” agents (around 23\frac{2}{3}). Thus, interpreting a qualification as the absence of a conflict of interest, for our NP-hardness agents need to have many conflicts. In Section 4, we prove that this does not happen by accident, as if the number of conflicts per agent/paper (and ΔA\Delta_{A}^{-}, ΔP+\Delta_{P}^{+}, creviewerc_{\text{reviewer}}, and dpaperd_{\text{paper}}) are “small”, then Cycle-Free Reviewing always admits a solution.

In Weighted Cycle-Free Reviewing it is possible to encode the “qualifications” of agents into weights: If we modify the reduction from above and give an agent-reviewer pair weight one if the agent is qualified to review the paper and weight zero otherwise, we get that Weighted Cycle-Free Reviewing is NP-hard even if each agent is qualified to review all papers and we have few non-zero weights.

Corollary 1.

For any z2z\geq 2, Weighted Cycle-Free Peer Reviewing is NP-hard, even if each agent is qualified to review all papers, each agent gives only at most three papers a non-zero weight, for each paper at most three agents give it a non-zero weight, ΔP+2ΔA\Delta_{P}^{+}\leq 2\geq\Delta_{A}^{-}, nA=nPn_{A}=n_{P}, and creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1.

3.2 No Conflicts of Interest

We now extend the hardness from Corollary 1 for the case where each agent is qualified to review all papers (no conflicts) to the unweighted case. However, our new reduction relies on the existence of papers with many authors and agents authoring many papers.

To show that Cycle-Free Reviewing is NP-hard even if each agent is qualified to review all papers, nA=nPn_{A}=n_{P}, creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1, and z=2z=2 (Theorem 2), we reduce from Multicolored Independent Set where we are given a graph GG with vertices partitioned into kk sets V1,,VkV^{1},\dots,V^{k} (to which we refer as color classes) and the question is whether there exists a subset of kk vertices, containing one vertex from each class, that are pairwise non-adjacent. We denote as n:=|V1|n:=|V^{1}| the number of vertices in the first color class and assume without loss of generality that n>kn>k and that |Vc|:=n+c1|V^{c}|:=n+c-1 for c[k]c\in[k] (note that we can do so because we can always add vertices that are connected to all other vertices and put them into one of the color classes).

Construction.

Given an instance \mathcal{I} of Multicolored Independent Set G=(V=(V1,,Vk),E)G=(V=(V^{1},\dots,V^{k}),E), we construct an instance \mathcal{I}^{\prime} of Cycle-Free Reviewing as follows. For each color c[k]c\in[k], we add a special agent aca^{c}_{*} and a special paper pcp^{c}_{*}. Moreover, for each vertex vVcv\in V^{c}, we add a vertex agent avca^{c}_{v} and a vertex paper pvcp^{c}_{v}. Further, we add n+c2n+c-2 dummy agents a~1c,,a~n+c2c\tilde{a}^{c}_{1},\dots,\tilde{a}^{c}_{n+c-2} and n+c2n+c-2 dummy papers p~1c,,p~n+c2c\tilde{p}^{c}_{1},\dots,\tilde{p}^{c}_{n+c-2}. Lastly, we insert an agent aa^{*} and a paper pp^{*}.

The paper pp^{*} is authored by all vertex agents and dummy agents. For color c[k]c\in[k], pcp^{c}_{*} is authored by all vertex und dummy agents from colors cc[k]c^{\prime}\neq c\in[k] and agent aa^{*}. Further, all dummy papers p~ic\tilde{p}^{c}_{i} for i[n+c2]i\in[n+c-2] are authored by the special agent aca^{c}_{*}. For a vertex vVcv\in V^{c}, paper pvcp^{c}_{v} is authored by the special agent aca^{c}_{*}, all agents corresponding to vertices from Vc{v}V^{c}\setminus\{v\} or to vertices adjacent to vv in GG, i.e., pvcp^{c}_{v} is authored by agents {ac}{avcvvVc}{avcc[k],vVc,{v,v}E}\{a^{c}_{*}\}\cup\{a^{c}_{v^{\prime}}\mid v\neq v^{\prime}\in V^{c}\}\cup\{a^{c^{\prime}}_{v^{\prime}}\mid c^{\prime}\in[k],v^{\prime}\in V^{c^{\prime}},\{v,v^{\prime}\}\in E\}. Each agent is qualified to review all papers and we set creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1 and z=2z=2.

Lemma 1.

If the given instance \mathcal{I} of Multicolored Independent Set is a YES-instance, then the constructed instance \mathcal{I}^{\prime} of Cycle-Free Reviewing is a YES-instance.

Proof.

Let V={w1,wk}VV^{\prime}=\{w^{1},\dots w^{k}\}\subseteq V be a independent set of size kk in the given Multicolored Independent Set instance \mathcal{I} with wcVcw^{c}\in V^{c} for c[k]c\in[k]. From this we construct a solution for the constructed Cycle-Free Reviewing instance \mathcal{I}^{\prime} as follows. Agent aa^{*} reviews paper pp^{*}. For c[k]c\in[k], special agent aca^{c}_{*} reviews special paper pcp^{c}_{*}. Vertex agents {avcvVc{wc}}\{a^{c}_{v^{\prime}}\mid v^{\prime}\in V^{c}\setminus\{w^{c}\}\} are assigned arbitrarily to dummy papers p~1c,,p~n+c2c\tilde{p}^{c}_{1},\dots,\tilde{p}^{c}_{n+c-2}. Lastly, vertex agent awcca^{c}_{w^{c}} reviews paper pwccp^{c}_{w^{c}} and the dummy agents from class cc are assigned arbitrarily to the remaining vertex papers from this class. Note that by construction, the described assignment is 11-11 valid. Moreover, it is easy to verify that no agent reviews a paper authored by it so it remains to check for reviewing cycles of length two. All special agents are only authors of papers from their color class but review papers authored solely by agents outside their color class. Thus there exist no review cycles involving special agents. All papers aa^{*} wrote are reviewed by special agents so aa^{*} cannot be part of a review cycle. Dummy agents only write papers that are reviewed by special agents and aa^{*} so no dummy agent can be part of a review cycle. Thus, every possible review cycle of length two needs to involve two vertex agents. As no dummy paper is written by a vertex agent, the only vertex agents that review papers authored by other vertex agents are those assigned to vertex papers, i.e., agents {aw11,awkk}\{a^{1}_{w^{1}},\dots a^{k}_{w^{k}}\}. Assume for the sake of contradiction that awiia^{i}_{w_{i}} (which reviews paper pwiip^{i}_{w_{i}}) forms a cycle with reviewer awiia^{i^{\prime}}_{w^{i^{\prime}}} with ii[k]i\neq i^{\prime}\in[k]. However, from this it follows by the definition of a review cycle that awiia^{i^{\prime}}_{w^{i^{\prime}}} is an author of paper pwiip^{i}_{w_{i}}, which implies that {wi,wi}E\{w^{i},w^{i^{\prime}}\}\in E contradicting that VV^{\prime} is an independent set. ∎

We now turn to proving the backwards direction of the reduction. To do this, we first identify several assignments that need to be made in all solutions to the constructed Cycle-Free Reviewing instance. We start by proving that aa^{*} needs to review pp^{*}.

Lemma 2.

In every 11-11 valid 22-cycle-free assignment in the constructed instance \mathcal{I}^{\prime}, aa^{*} reviews pp^{*}.

Proof.

Recall that all agents except all special agents and agent aa^{*} are authors of pp^{*}. So for the sake of contradiction let us assume that special agent aca_{*}^{c} for some c[k]c\in[k] reviews pp^{*}. However, to prevent a reviewing cycle, this implies that only the remaining k1k-1 special agents and aa^{*} can review papers written by aca_{*}^{c}. However, as aca_{*}^{c} is an author of all vertex papers corresponding to vertices from VcV^{c} and we have assumed that each set VcV^{c} consists of more than kk vertices, these kk agents are not enough to review all papers written by aca_{*}^{c}, a contradiction. ∎

We next prove that aca^{c}_{*} reviews pcp^{c}_{*} for all c[k]c\in[k]. For this, we need the following lemma:

Lemma 3.

In every 11-11 valid 22-cycle-free assignment in the constructed instance \mathcal{I}^{\prime}, if aca^{c}_{*} reviews paper pcp^{c^{\prime}}_{*} for c,c[k]c,c^{\prime}\in[k], then only vertex and dummy agents from class cc^{\prime} and special agents can review dummy and vertex papers from class cc.

Proof.

Note that the special agent aca^{c}_{*} is an author of all dummy and vertex papers from color class cc. Moreover, paper pcp^{c^{\prime}}_{*} is authored by all dummy and vertex agents from color classes different from cc^{\prime}. Thus, if aca^{c}_{*} reviews pcp^{c^{\prime}}_{*}, then no vertex or dummy agent from a class different from cc^{\prime} can review papers written by aca^{c}_{*}. As aca^{c}_{*} authors all dummy and vertex papers from class cc, the lemma follows. ∎

Using this, we are able to prove that each special agent reviews the corresponding special paper.

Lemma 4.

In every 11-11 valid 22-cycle-free assignment in the constructed instance \mathcal{I}^{\prime}, for c[k]c\in[k], aca^{c}_{*} reviews pcp^{c}_{*}.

Proof.

By Lemma 2, aa^{*} is assigned to pp^{*}, which is authored by all dummy agents and vertex agents. Thus, to prevent the existence of reviewing cycles, only special agents can review papers written by aa^{*}. As for each c[k]c\in[k], pcp^{c}_{*} is written by aa^{*}, it follows that the set of kk agents {acc[k]}\{a_{*}^{c}\mid c\in[k]\} needs to review the set of kk papers {pcc[k]}\{p_{*}^{c}\mid c\in[k]\}. For the sake of contradiction, let us assume that special agent aca^{c}_{*} reviews paper pcp^{c^{\prime}}_{*} for cc[k]c\neq c^{\prime}\in[k]. We assume without loss of generality that c<cc^{\prime}<c (if there exists a pair where ac~a^{\tilde{c}}_{*} reviews paper pc~p^{\tilde{c}^{\prime}}_{*} with c~<c~\tilde{c}<\tilde{c}^{\prime} there also has to exist one with c<cc^{\prime}<c). By Lemma 3 and as special agents need to review special papers, from this it follows that only dummy and vertex agents from color cc^{\prime} can review the vertex and dummy agents from class cc (which are all written by aca^{c}_{*}). As we have assumed that c<cc^{\prime}<c, the number of these agents (2n+2c32n+2c^{\prime}-3) does not suffices to review all of these papers (2n+2c32n+2c-3), a contradiction. ∎

We are now ready to prove the correctness of the backwards direction of the reduction:

Lemma 5.

If the constructed instance \mathcal{I}^{\prime} of Cycle-Free Reviewing is a YES-instance, then the given instance \mathcal{I} of Multicolored Independent Set is a YES-instance.

Proof.

From Lemma 2, Lemma 3, and Lemma 4 it follows that for each color c[k]c\in[k] every vertex and dummy agent from this color class needs to review a vertex or dummy paper from this color class and that each vertex or dummy paper from this color class needs to be reviewed by a vertex or dummy agent from this color class. As there exist n+c2n+c-2 dummy agents from color class cc but n+c1n+c-1 vertex papers at least one vertex paper from color class cc needs to be reviewed by a vertex agent from color class cc. Note that for each vVcv\in V^{c}, agent avca_{v}^{c} is an author of all vertex papers except pvcp_{v}^{c}. Thus, for each color c[k]c\in[k] there needs to exist (at least) one agent awcca_{w_{c}}^{c} for some wcVcw_{c}\in V^{c} that reviews pwccp_{w_{c}}^{c}. So let aw11,,awkka_{w_{1}}^{1},\cdots,a_{w_{k}}^{k} be a list of those agents (containing one vertex agent from each color class). We claim that {w1,,wk}\{w_{1},\dots,w_{k}\} forms an independent set in GG. For the sake of contradiction assume that {wc,wc}E\{w_{c},w_{c^{\prime}}\}\in E for cc[k]c\neq c^{\prime}\in[k], then by construction it follows that awcca_{w_{c}}^{c} who reviews paper pwccp_{w_{c}}^{c} is an author of paper pwccp_{w_{c^{\prime}}}^{c^{\prime}} and similarly awcca_{w_{c^{\prime}}}^{c^{\prime}} who reviews pwccp_{w_{c^{\prime}}}^{c^{\prime}} is an author of pwccp_{w_{c}}^{c}. Thus, awcca_{w_{c}}^{c} and awcca_{w_{c^{\prime}}}^{c^{\prime}} form a reviewing cycle, a contradiction. ∎

From Lemma 1 and Lemma 5, Theorem 2 directly follows:

Theorem 2.

Cycle-Free Reviewing is NP-hard even if each agent is qualified to review all papers, nA=nPn_{A}=n_{P}, creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1, and z=2z=2.

The reduction from Theorem 2 heavily relies on the possibility that an agent reviews a paper written by an agent with whom she has a joint paper. As some conferences might declare an automatic conflict of interest for co-authors, we now consider the case where an agent is qualified to review all papers that are not authored by one of her co-authors:

Theorem 3.

Cycle-Free Reviewing is NP-hard even if each agent is qualified to review all papers that are not written by one of her co-authors, creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1, and z=2z=2.

Proof.

We reduce from Cycle-Free Reviewing with creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1, and z=2z=2 where agents are not qualified to review papers of co-authors, which is NP-hard as proven in Theorem 1. We assume without loss of generality that for each paper there is one agent who is not qualified to review it.

Construction.

Given an instance =((AP,EAEP),creviewer=1,dpaper=1,z=2)\mathcal{I}=((A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E_{A}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}),c_{\text{reviewer}}=1,d_{\text{paper}}=1,z=2) of Cycle-Free Reviewing, we construct a new instance \mathcal{I}^{\prime} with agents AA^{\prime} and papers PP^{\prime} and creviewer=dpaper=1c_{\text{reviewer}}^{\prime}=d_{\text{paper}}^{\prime}=1 and z=2z^{\prime}=2. We start by setting A=AA^{\prime}=A. Next, we add agents ww, xx, yy, and zz to AA^{\prime}. For each agent aAa\in A and each paper pPp\in P, we insert an agent apa_{p} to AA^{\prime} and add a so called agent paper which is authored by aa and apa_{p} to PP^{\prime}. For each paper pPp\in P, we introduce an agent bpb_{p} to AA^{\prime}. Moreover, we introduce nAnPn_{A}\cdot n_{P} dummy agents d1,,dnAnPd_{1},\dots,d_{n_{A}\cdot n_{P}} to AA^{\prime}. We introduce five different (types of) papers in PP^{\prime}:

  • For each paper pPp\in P, we introduce a paper pp to PP^{\prime} that is written by all authors of pp, agent bpb_{p} and by agents apa_{p} for all agents aAa\in A that are not qualified to review pp in \mathcal{I}.

  • We introduce a paper qq authored by ww and apa_{p} for all aAa\in A and pPp\in P.

  • We introduce a paper qq^{\prime} authored by xx and apa_{p} for all aAa\in A and pPp\in P to PP^{\prime}.

  • We introduce a paper rr authored by yy and all dummy agents d1,,dnAnPd_{1},\cdots,d_{n_{A}\cdot n_{P}} and bpb_{p} for each pPp\in P.

  • We introduce a paper rr^{\prime} authored by zz and all dummy agents d1,,dnAnPd_{1},\cdots,d_{n_{A}\cdot n_{P}} and bpb_{p} for each pPp\in P.

Each agent is qualified to review all papers that are not written by one of her co-authors.

(\Rightarrow) Given a 11-11-valid 22-cycle-free review assignment for \mathcal{I}, we construct a 11-11-valid 22-cycle-free review assignment for \mathcal{I}^{\prime} as follows. All agents from AA still review the same papers as in the given assignment (which are all still qualified to do so because we have not added or removed any papers with two authors from AA apart from copies of papers from PP). Agent ww reviews rr, agent xx reviews rr^{\prime}, yy reviews qq^{\prime}, and zz reviews qq (which are all are qualified to do so). Moreover, the dummy agents are assigned arbitrarily to the agent papers, which they are qualified to review because dummy agents only author papers together with agents {bppP}\{b_{p}\mid p\in P\} and yy and zz.

Concerning review cycles, note that agents {ap,bpaA,\{a_{p},b_{p}\mid a\in A, pP}p\in P\} do not review any paper. Moreover dummy agents only review papers written by agents {ap,aaA,\{a_{p},a\mid a\in A, pP}p\in P\} but are only reviewed by xx and ww and thus cannot be part of a review cycle. Moreover, also no agent from AA can be part of a review cycle because there was no such cycle in the given review assignment and all agents that author a paper reviewed by an agent from AA are not part of a review cycle. Thus, any review cycle needs to consists of ww, xx, yy, and zz. Note that ww reviews a paper of yy, yy reviews a paper of xx, xx reviews a paper of zz, and zz reviews a paper of ww. Thus, these agents form a 44-cycle but no 22-cycle.

(\Leftarrow) Given a 11-11-valid 22-cycle-free review assignment for \mathcal{I^{\prime}}, we claim that this assignment restricted to agents from AA and papers from PP is a solution to the given instance \mathcal{I}. To prove this, we will argue for all agents from AAA^{\prime}\setminus A that they cannot review a paper from PP from which the correctness directly follows, as we have not added any authors from AA to papers from PP. Fix some paper pPp^{\prime}\in P. We now iterate over all agents AAA^{\prime}\setminus A and argue why they cannot review pp^{\prime}. As we have assumed in \mathcal{I} that for all papers there is an agent not qualified to review it, it follows that pp^{\prime} has an author ap{a^{*}}_{p^{\prime}} for some aAa^{*}\in A and author bpb_{p^{\prime}} in \mathcal{I^{\prime}}. For agent ww and xx it holds that both have a joint paper with ap{a^{*}}_{p^{\prime}} and thus cannot review pp^{\prime}. Next, note that as, for each aAa\in A and pPp\in P, apa_{p} is either identical to ap{a^{*}}_{p^{\prime}} or has a joint paper with ap{a^{*}}_{p^{\prime}} none of these agents can review pp^{\prime}. Lastly, did_{i} for some i[nAnP]i\in[n_{A}\cdot n_{P}], bpb_{p} for pP{p}p\in P\setminus\{p^{\prime}\}, and yy and zz have a joint paper with bpb_{p^{\prime}}, which is an author of pp^{\prime}. Thus, all these agents (and thereby no agent from AAA^{\prime}\setminus A) can review pp^{\prime}.

3.3 Single-Author-Single-Paper Setting

In their theoretical analysis, Barrot et al. (2020) focus on Cycle-Free Reviewing where each agent writes a single-author paper (we speak of an agent and its paper interchangeably) and qualifications are symmetric, i.e., if an agent aa is qualified to review agent bb, then bb is qualified to review aa. They prove that this problem is NP-hard for creviewer=dpaper=1c_{\text{reviewer}}=d_{\text{paper}}=1 and z=5z=5 (without bounds on ΔA+\Delta_{A}^{+} or ΔP\Delta_{P}^{-}) but polynomial-time solvable for arbitrary creviewer=dpaperc_{\text{reviewer}}=d_{\text{paper}} for z=2z=2. We close the gap between these two results and extend their general picture by proving that for creviewer=dpaper=2c_{\text{reviewer}}=d_{\text{paper}}=2, Cycle-Free Reviewing is NP-hard for z=3z=3 even if qualifications are symmetric and each agent is only qualified to review four agents, i.e., we need to decide for each agent aa which two of these four agents review aa and which two of these agents will get a review from aa.

Theorem 4.

Cycle-Free Reviewing is NP-hard, even if z=3z=3, creviewer=dpaper=2c_{\text{reviewer}}=d_{\text{paper}}=2, ΔA=ΔP+=1\Delta_{A}^{-}=\Delta_{P}^{+}=1, nA=nPn_{A}=n_{P}, each agent is qualified to review exactly four papers and if an agent aa can review the paper written by agent bb, then bb can review the paper of aa.

To prove Theorem 4, we reduce from Two-in-Four-Satisfiability, a variant of Satisfiability, where given a propositional formula φ\varphi over variables XX where each clause contains four different literals, the question is whether there exists an assignment α\alpha of variables XX such that in each clause exactly two out of four literals are satisfied. As to the best of our knowledge, this variant of Satisfiability has not been considered before, we start by proving that it is NP-hard even if each literal appears exactly twice positive and twice negative:

Proposition 2.

Two-in-Four-Satisfiability is NP-hard, even if each variable appears exactly twice positive and twice negative.

Proof.

In Monotone Not-All-Equal 3-Sat, we are given a propositional formula where each clause contains three different positive literals and the question is whether there is a variable assignment such that in each clause at least one literal is set to true and at least one is set to false. Reducing Monotone Not-All-Equal 3-Sat to Two-in-Four-Satisfiability (without any additional restrictions) is straightforward: Given an instance of Monotone Not-All-Equal 3-Sat, for each clause, we introduce a new variable which we add to the clause. Thereby, we can extend a valid assignment α\alpha for the Monotone Not-All-Equal 3-Sat instance by setting for a clause the newly introduced variable to true if α\alpha originally sets only one literal from this clause to true and to false if α\alpha originally sets only one literal from this clause to false. The reverse direction is immediate. However, to achieve that each variable appears twice positive and twice negative, a slightly more involved approach is needed.

In fact, for simplicity, we reduce from the NP-hard variant of Monotone Not-All-Equal 3-Sat where each variable appears in exactly four clauses (Darmann and Döcker, 2020). Given an instance φ=C1Cm\varphi=C_{1}\wedge\dots\wedge C_{m} over variables XX of Monotone Not-All-Equal 3-Sat, note that mm needs to be even, as there are m=4|X|3m=\frac{4\cdot|X|}{3} and mm needs to be an integer. We now construct a new propositional formula ϕ\phi over variable set XX^{\prime} as follows. For each clause Ci=wxyC_{i}=w\vee x\vee y for i[m]i\in[m], we add variables wiw_{i}, xix_{i}, yiy_{i}, and ziz_{i} to XX^{\prime} and clauses wixiyiziw_{i}\vee x_{i}\vee y_{i}\vee z_{i} and wi¯xi¯yi¯zi¯\overline{w_{i}}\vee\overline{x_{i}}\vee\overline{y_{i}}\vee\overline{z_{i}} to ϕ\phi. Now, every variable appears once negative and once positive. It remains to link the copies of each variable.

We do this for each variable separately. Let xXx\in X be some original variable and let j1,,j4j_{1},\dots,j_{4} denote the list of all clauses where xx appears in φ\varphi. We introduce dummy variables ax1a^{1}_{x} and ax2a^{2}_{x} to XX^{\prime} and add clauses xj1¯xj2ax1ax1¯\overline{x_{j_{1}}}\vee x_{j_{2}}\vee a^{1}_{x}\vee\overline{a^{1}_{x}}, xj2¯xj3ax1ax1¯\overline{x_{j_{2}}}\vee x_{j_{3}}\vee a^{1}_{x}\vee\overline{a^{1}_{x}}, xj3¯xj4ax2ax2¯\overline{x_{j_{3}}}\vee x_{j_{4}}\vee a^{2}_{x}\vee\overline{a^{2}_{x}}, and xj4¯xj1ax2ax2¯\overline{x_{j_{4}}}\vee x_{j_{1}}\vee a^{2}_{x}\vee\overline{a^{2}_{x}} to ϕ\phi. As for each j[3]j\in[3], exactly one of xji¯\overline{x_{j_{i}}} and xji+1x_{j_{i+1}} need to be set to true, these clauses enforce that xj1x_{j_{1}}, xj2x_{j_{2}}, xj3x_{j_{3}}, and xj4x_{j_{4}}, all have the same truth value. Lastly, for i[m2]i\in[\frac{m}{2}], we add twice the clause z2i1z2i1¯z2iz2i¯z_{2i-1}\vee\overline{z_{2i-1}}\vee z_{2i}\vee\overline{z_{2i}} which are always trivially satisfied.

The correctness of the reduction is immediate and all variables appear twice positive and twice negative in ϕ\phi. ∎

Using this, we are now ready to prove Theorem 4:

Proof of Theorem 4.

We reduce from Two-in-Four-Satisfiability where each variable appears exactly twice positive and twice negative.

Construction.

Given an instance of Two-in-Four-Satisfiability consisting of a propositional formula φ=C1Cm\varphi=C_{1}\wedge\dots\wedge C_{m} over variables X={x1,,xn}X=\{x_{1},\dots,x_{n}\}, for i[n]i\in[n], we denote as ti,1post^{\text{pos}}_{i,1} and ti,2post^{\text{pos}}_{i,2} the indices of the two clauses in which variable xix_{i} appears positive and as ti,1negt^{\text{neg}}_{i,1} and ti,2negt^{\text{neg}}_{i,2} the indices of the two clauses in which variable xix_{i} appears negative. From this, we construct an instance of Peer Cycle-Free Reviewing as follows. For i[n]i\in[n], we introduce four agents aiposa^{\text{pos}}_{i}, ainega^{\text{neg}}_{i}, ai1a^{1}_{i}, and ai2a^{2}_{i} (constituting a gadget modeling this variable). Moreover, for j[m]j\in[m], we introduce one agent bjb_{j}. Qualification are symmetric, i.e., if agent aa is qualified to review bb, then bb is qualified to review aa. For i[n]i\in[n], aiposa^{\text{pos}}_{i} is qualified to review ai1a^{1}_{i}, ai2a^{2}_{i}, bti,1posb_{t^{\text{pos}}_{i,1}}, and bti,2posb_{t^{\text{pos}}_{i,2}} (and the other way round). Moreover, ainega^{\text{neg}}_{i} is qualified to review ai1a^{1}_{i}, ai2a^{2}_{i}, bti,1negb_{t^{\text{neg}}_{i,1}}, and bti,2negb_{t^{\text{neg}}_{i,2}} (and the other way round). Lastly, ai1a^{1}_{i} and ai2a^{2}_{i} are qualified to review each other and ai2a^{2}_{i} and ai+11a^{1}_{i+1} are qualified to review each other (where ii is taken modulo nn). We set creviewer=dpaper=2c_{\text{reviewer}}=d_{\text{paper}}=2 and z=3z=3.

(\Rightarrow) Let α\alpha be an assignment of variables in XX that is a solution to the given Two-in-Four-Satisfiability instance. For i[n1]i\in[n-1], we let ai1a^{1}_{i} review ai2a^{2}_{i} and ai2a^{2}_{i} review ai+11a^{1}_{i+1}. Moreover, we let an1a^{1}_{n} review an2a^{2}_{n} and an2a^{2}_{n} review a11a^{1}_{1}.

For i[n]i\in[n] where xix_{i} is set to true by α\alpha, we let ai1a^{1}_{i} and ai2a^{2}_{i} review aiposa^{\text{pos}}_{i} and ainega^{\text{neg}}_{i} review ai1a^{1}_{i} and ai2a^{2}_{i}. Moreover we let aiposa^{\text{pos}}_{i} review bti,1posb_{t^{\text{pos}}_{i,1}} and bti,2posb_{t^{\text{pos}}_{i,2}} and we let bti,1negb_{t^{\text{neg}}_{i,1}} and bti,2posb_{t^{\text{pos}}_{i,2}} review ainega^{\text{neg}}_{i}. Conversely, for i[n]i\in[n] where xix_{i} is set to false by α\alpha, we let ai1a^{1}_{i} and ai2a^{2}_{i} review ainega^{\text{neg}}_{i}, we let aiposa^{\text{pos}}_{i} review ai1a^{1}_{i} and ai2a^{2}_{i}. Moreover, we let ainega^{\text{neg}}_{i} review bti,1negb_{t^{\text{neg}}_{i,1}} and bti,2negb_{t^{\text{neg}}_{i,2}} and let bti,1posb_{t^{\text{pos}}_{i,1}} and bti,2posb_{t^{\text{pos}}_{i,2}} review aiposa^{\text{pos}}_{i}.

As α\alpha sets exactly two literals in each clause to true and two to false, for each j[m]j\in[m], bjb_{j} is reviewed by two agents and reviews two agents. The same also holds for all other agents, implying that the constructed review assignment is 22-22 valid. It is easy to see that there are no 22-cycles. Moreover, as no two agents bib_{i} and bjb_{j} for iji\neq j are qualified to review each other and, for no j[m]j\in[m], are there two agents that are both qualified to review bjb_{j} and that are qualified to review each other, each 33-cycle needs to solely consist of agents from a gadget corresponding to a single variable. So let us fix some i[n]i\in[n]. The only possible 33-cycles consist of aiposa^{\text{pos}}_{i}, ai1a^{1}_{i} and ai2a^{2}_{i} or ainega^{\text{neg}}_{i}, ai1a^{1}_{i} and ai2a^{2}_{i}. However, there is no such 33-cycle, as either aiposa^{\text{pos}}_{i} reviews both ai1a^{1}_{i} and ai2a^{2}_{i} and both ai1a^{1}_{i} and ai2a^{2}_{i} review ainega^{\text{neg}}_{i}, or ainega^{\text{neg}}_{i} reviews both ai1a^{1}_{i} and ai2a^{2}_{i} and both ai1a^{1}_{i} and ai2a^{2}_{i} review aiposa^{\text{pos}}_{i}. Thus, the constructed assignment is 33-cycle-free.

(\Leftarrow) Assume we are given a 22-22-valid 33-cycle-free review assignment in the constructed Cycle-Free Reviewing instance. Assume that an2a^{2}_{n} reviews a11a^{1}_{1} in the given assignment (if a11a^{1}_{1} reviews an2a^{2}_{n} an analgous argument works). We now argue that a11a^{1}_{1} needs to review a12a^{2}_{1}. Assume for the sake of contradiction that this is not the case, then as a11a^{1}_{1} is reviewed by an2a^{2}_{n} and a12a^{2}_{1}, she needs to review a1posa^{\text{pos}}_{1} and a1nega^{\text{neg}}_{1}. However, to prevent a 33-cycle, a12a^{2}_{1} then needs to review a1posa^{\text{pos}}_{1} and a1nega^{\text{neg}}_{1}, a contradiction (as a12a^{2}_{1} gives three reviews). Next, we want to argue that a12a^{2}_{1} reviews a21a^{1}_{2}. For the sake of contradiction, assume that a21a^{1}_{2} reviews a12a^{2}_{1}. Then, a12a^{2}_{1} already gets two reviews and thus needs to review a1nega^{\text{neg}}_{1} and a1posa^{\text{pos}}_{1}. However, as a11a_{1}^{1} already reviews a12a_{1}^{2} either a1nega^{\text{neg}}_{1} or anposa^{\text{pos}}_{n} review a11a_{1}^{1} which leads to a 33-cycle together with a11a_{1}^{1} and a12a_{1}^{2}. Applying the same arguments inductively, it follows that for i[n1]i\in[n-1], ai1a^{1}_{i} review ai2a^{2}_{i} and ai2a^{2}_{i} reviews ai+11a^{1}_{i+1} and that an1a^{1}_{n} reviews an2a^{2}_{n}.

Further, observe that for each i[n]i\in[n] agents ai1a_{i}^{1} and ai2a_{i}^{2} either both review aiposa_{i}^{\text{pos}} or both get reviews from aiposa_{i}^{\text{pos}}. For the sake of contradiction, assume that this is not the case. If ai2a^{2}_{i} reviews aiposa_{i}^{\text{pos}} and aiposa_{i}^{\text{pos}} reviews ai1a^{1}_{i}, then we have a 33-cycle consisting of these three agents. Otherwise, ai1a^{1}_{i} reviews aiposa_{i}^{\text{pos}} and aiposa_{i}^{\text{pos}} reviews ai2a^{2}_{i}. However, as the given assignment is 22-22-valid, from this it follows that ai2a^{2}_{i} reviews ainega^{\text{neg}}_{i} and ainega^{\text{neg}}_{i} reviews ai1a^{1}_{i}, which leads to a 33-cycle consisting of ai1a^{1}_{i}, ai2a^{2}_{i}, and ainega^{\text{neg}}_{i}. Thus, we have reached a contradiction proving our initial claim. Moreover, as the given assignment is 22-22 valid, in case that ai1a_{i}^{1} and ai2a_{i}^{2} both review aiposa_{i}^{\text{pos}}, then ainega_{i}^{\text{neg}} reviews ai1a_{i}^{1} and ai2a_{i}^{2}, and in case that aiposa_{i}^{\text{pos}} reviews both ai1a_{i}^{1} and ai2a_{i}^{2}, then ai1a_{i}^{1} and ai2a_{i}^{2} both review ainega_{i}^{\text{neg}}. We now construct an assignment α\alpha by, for i[n]i\in[n], setting variable xix_{i} to true if ai1a_{i}^{1} and ai2a_{i}^{2} review aiposa_{i}^{\text{pos}} and xix_{i} to false if ai1a_{i}^{1} and ai2a_{i}^{2} review ainega_{i}^{\text{neg}}. Using our argument from above, it follows that α\alpha is well-defined. Moreover, the given assignment is 22-22-valid, if α\alpha sets a literal to true, then the agents corresponding to this literal review the agents corresponding to the two clauses in which the literal appears. Similarly, if α\alpha sets a literal to false, then the agents corresponding to this literal get a review from the two agents corresponding to the two clauses in which the literal appears. Thus, as each agent corresponding to a clause gets and issues two reviews (as the given assignment is 22-22-valid), it follows that α\alpha sets for each clause exactly two literals to true and thus that α\alpha is a solution to the given instance of Two-in-Four-Satisfiability. ∎

4 Polynomial-Time Solvable Special Cases

In this section, we identify conditions under which a short-cycle-free review assignment provably exists and can be computed in polynomial time. As we will see in our experiments, the subsequently presented algorithms provide short-cycle-free review assignments even beyond the theoretical limitations we discuss below. As we are interested in computing zz-cycle-free review assignments for z1z\geq 1, no author is allowed to review one of its own papers. That is why throughout this section we assume that we do not have (a,p)EA(a,p)\in E_{A} and (p,a)EP(p,a)\in E_{P} at the same time.

Our algorithms in this section are based on the following simple observation: Given a partial zz-cycle-free review assignment EE^{\prime} and a paper pPp\in P that requires more assigned reviewers, the number of potential reviewers that would create a zz-cycle–if assigned to review pp–is bounded by a function in zz, the maximum number ΔP+\Delta_{P}^{+} of authors per paper, and the maximum number creviewerc_{\text{reviewer}} of reviews per agent; the precise function is given in the subsequent proofs. Thus, assuming that the minimum number δP\delta_{P}^{-} of potential reviewers for each paper is large compared to zz, ΔP+\Delta_{P}^{+}, dpaperd_{\text{paper}}, and creviewerc_{\text{reviewer}}, for each paper pp there are always reviewers that can be assigned to review pp without creating a zz-cycle. Note that in practice we can expect that zz, dpaperd_{\text{paper}}, and creviewerc_{\text{reviewer}} are quite small. Moreover, while the minimum number of fitting reviewers might be not very large, it is not uncommon to assign papers to reviewers that are not “perfect”. Thus, interpreting δP\delta_{P}^{-} as the number of community members that do not have a conflict of interest actually yields relative large values for δP\delta_{P}^{-} in practice.

We start with a very restrictive setting and then, step by step, generalize the approach and the results. First, each paper is written by exactly one author, each agent has at most one paper and we want a completely cycle-free review assignment (i. e., zz-cycle-free for every zz\in\mathds{N}). This of course implies that some agents cannot be authors of papers and so the number nPn_{P} of papers is smaller than the number nAn_{A} of agents. However, it allows Algorithm 1 to work (implicitly) with the topological ordering of the (acyclic) review assignment while constructing it.

1 E;S0E^{\prime}\leftarrow\emptyset;S_{0}\leftarrowagents without papers
/* ϕi(a)\phi_{i}(a) is the free reviewing capacity of aa before iteration ii of the for-loop from Line 3; each agent reviews at most dpaperd_{\text{paper}} papers */
2 foreach aAa\in A do ϕ0(a)dpaper\phi_{0}(a)\leftarrow d_{\text{paper}}
3
/* Assign reviewers to one paper per iteration: */
4 for i0i\leftarrow 0 to nP1n_{P}-1 do
5    foreach aAa\in A do ϕi+1(a)ϕi(a)\phi_{i+1}(a)\leftarrow\phi_{i}(a)
6    select some (p,a)EP(p,a)\in E_{P} where pp has no reviews yet
    /* collect qualified reviewers and assign dpaperd_{\text{paper}} of them to pp */
7    R{bSi(b,p)EA}R\leftarrow\{b\in S_{i}\mid(b,p)\in E_{A}\}
8    for j1j\leftarrow 1 to dpaperd_{\text{paper}} do
9        arbitrary bRb\in R reviews p:EE{(b,p)}p\colon E^{\prime}\leftarrow E^{\prime}\cup\{(b,p)\}
10        ϕi+1(b)ϕi(b)1\phi_{i+1}(b)\leftarrow\phi_{i}(b)-1; RR{b}R\leftarrow R\setminus\{b\}
   /* collect possible reviewers for next paper */
11    Si+1:={a}{bSiϕi+1(b)>0}S_{i+1}:=\{a\}\cup\{b\in S_{i}\mid\phi_{i+1}(b)>0\}
return EE^{\prime}
Algorithm 1 A greedy algorithm computing a dpaperd_{\text{paper}}-dpaperd_{\text{paper}}-valid completely cycle-free assignment EE^{\prime}.
Proposition 3.

If ΔA1=δP+=ΔP+\Delta_{A}^{-}\leq 1=\delta_{P}^{+}=\Delta_{P}^{+}, dpapercreviewerd_{\text{paper}}\leq c_{\text{reviewer}}, and δPnP+dpaper\delta_{P}^{-}\geq n_{P}+d_{\text{paper}}, then Algorithm 1 computes a dpaperd_{\text{paper}}-dpaperd_{\text{paper}}-valid and completely cycle-free review assignment in linear time.

Proof.

We first show the correctness of Algorithm 1. Clearly, if in each iteration of the loop in Algorithm 1 the set of eligible reviewers RR (see Algorithm 1) is of size at least dpaperd_{\text{paper}}, then a completely cycle-free review assignment is created as each agent only reviews papers from agents “occurring” later during the algorithm. Observe that if |Si|nAδP+dpaper|S_{i}|\geq n_{A}-\delta_{P}^{-}+d_{\text{paper}} for i{0,,nP1}i\in\{0,\ldots,n_{P}-1\}, then in iteration ii we have |R|dpaper|R|\geq d_{\text{paper}}: There are at most nAδPn_{A}-\delta_{P}^{-} agents in SiS_{i} that cannot review pp (the corresponding edge is not in EAE_{A}) and, thus, at least dpaperd_{\text{paper}} agents in SiS_{i} are eligible to review pp. It remains to show that |Si|nAδP+dpaper|S_{i}|\geq n_{A}-\delta_{P}^{-}+d_{\text{paper}} for all i{0,,nP1}i\in\{0,\ldots,n_{P}-1\} follows from our assumptions. By assumption of the lemma we have nPδPdpapern_{P}\leq\delta_{P}^{-}-d_{\text{paper}}. Hence, |S0|=nAnPnAδP+dpaper|S_{0}|=n_{A}-n_{P}\geq n_{A}-\delta_{P}^{-}+d_{\text{paper}}. We next show that |Si||S0||S_{i}|\geq|S_{0}| for all i[nP1]i\in[n_{P}-1]. Observe that at the start we have aS0ϕ0(a)=|S0|dpaper\sum_{a\in S_{0}}\phi_{0}(a)=|S_{0}|\cdot d_{\text{paper}}. Moreover, after the iith iteration of the loop in Algorithm 1 we have aSi+1ϕi+1(a)=aSiϕi(a)\sum_{a\in S_{i+1}}\phi_{i+1}(a)=\sum_{a\in S_{i}}\phi_{i}(a) as each paper gets dpaperd_{\text{paper}} reviews and the reviewer aa in Si+1SiS_{i+1}\setminus S_{i} starts with ϕi+1(a)=dpaper\phi_{i+1}(a)=d_{\text{paper}}. Observe that ϕi(a)dpaper\phi_{i}(a)\leq d_{\text{paper}} for all aAa\in A and i{0,,nP1}i\in\{0,\ldots,n_{P}-1\}. Thus, we have |Si|dpaperaSiϕi(a)=aS0ϕ0(a)=|S0|dpaper|S_{i}|d_{\text{paper}}\geq\sum_{a\in S_{i}}\phi_{i}(a)=\sum_{a\in S_{0}}\phi_{0}(a)=|S_{0}|d_{\text{paper}} and, hence, |Si||S0||S_{i}|\geq|S_{0}|. This completes the correctness proof.

As to the running time, everything outside the loop starting in Algorithm 1 clearly runs in linear time. As to the part inside the loop, note that by keeping just one array of length nAn_{A} we can store the values of ϕ\phi in linear time. Moreover, the reviewers for pp are selected arbitrarily from RR, which is doable in |N(p,EA)||N^{-}(p,E_{A})| time. Hence, the loop in Algorithm 1 can be processed in O(nP+|EA|)O(n_{P}+|E_{A}|) time. Thus, the overall algorithm runs in O(nA+nP+|EA|)O(n_{A}+n_{P}+|E_{A}|), that is, linear time. ∎

For our next result we replace the completely cycle-free property of the resulting review assignment with zz-cycle freeness. This implies that the idea of constructing the review assignment along its topological ordering (as done by Algorithm 1) cannot be employed. Instead, Algorithm 2 constructs greedily a maximal zz-cycle-free assignment and then extends the assignment by replacing one review assignment by two other assignments. The argument behind the replacement strategy is an extension of the argument in Algorithm 1 that there are always enough reviewers to assign in Algorithms 1 to 1.

To keep our arguments simple we first consider the case that each agent reviews at most one paper and each paper requires one review. Moreover, as before, we are in the setting that each paper has one author and each agent authors at most one paper. Formally, we have the following.

Proposition 4.

If ΔA1=δP+=ΔP+=creviewer=dpaper\Delta_{A}^{-}\leq 1=\delta_{P}^{+}=\Delta_{P}^{+}=c_{\text{reviewer}}=d_{\text{paper}}, nAnPn_{A}\geq n_{P}, δA+>z\delta_{A}^{+}>z, δP>z\delta_{P}^{-}>z, and nPδA++δP2zn_{P}\leq\delta_{A}^{+}+\delta_{P}^{-}-2z, then Algorithm 2 computes a creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid zz-cycle-free review assignment in polynomial time.

1 EE^{\prime}\leftarrow\emptyset
2 while pP:|N(p,E)|<dpaper\exists p\in P\colon|N^{-}(p,E^{\prime})|<d_{\text{paper}} do
3    if (a,p)EAE:E{(a,p)}\exists(a,p)\in E_{A}\setminus E^{\prime}\colon E^{\prime}\cup\{(a,p)\} is zz-cycle free and |N+(a,E)|<creviewer|N^{+}(a,E^{\prime})|<c_{\text{reviewer}} then
        /* Case 1: greedy assignment of reviews as long as no zz-cycles are created: */
4        EE{(a,p)}E^{\prime}\leftarrow E^{\prime}\cup\{(a,p)\}
5       
6   else
        /* Case 2: replace one review assignment by two: */
7        pick (a,p)E(a^{\prime},p^{\prime})\in E^{\prime} and aAa\in A so that |N+(a,E)|<creviewer|N^{+}(a,E^{\prime})|<c_{\text{reviewer}}, (a,p),(a,p)EA(a^{\prime},p),(a,p^{\prime})\in E_{A}, and (E{(a,p)}){(a,p),(a,p)}(E^{\prime}\setminus\{(a^{\prime},p^{\prime})\})\cup\{(a^{\prime},p),(a,p^{\prime})\} is zz-cycle free
8        E(E{(a,p)}){(a,p),(a,p)}E^{\prime}\leftarrow(E^{\prime}\setminus\{(a^{\prime},p^{\prime})\})\cup\{(a^{\prime},p),(a,p^{\prime})\}
9       
10   
return EE^{\prime}
Algorithm 2 Greedy algorithm to compute a creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid zz-cycle-free review assignment EE^{\prime}.
Proof.

Obviously, Algorithm 2 terminates after at most nPn_{P} iterations of the while loop as in each iteration the number of assigned reviews increases. Moreover, a creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid zz-cycle-free review assignment is returned if a,a,pa,a^{\prime},p^{\prime} as described in case 2 (Algorithm 2) always exist. To prove their existence, we introduce some notation. For some vAPv\in A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P let Nz+(v,EEP)N_{z}^{+}(v,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) be the zz-out-neighborhood of vv, that is, the set of vertices that can be reached from vv in the review graph (AP,EEP)(A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) via a path of length at least one and at most 2z2z. Similarly, let Nz(v,EEP)N_{z}^{-}(v,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) be the 2z2z-in-neighborhood of vv, that is, the set of vertices that can reach vv in the review graph (AP,EEP)(A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) via a path of length at least one and at most 2z2z. Note that if vNz(v,EEP)v\in N_{z}^{-}(v,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}), then also vNz+(v,EEP)v\in N_{z}^{+}(v,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) and vv is contained in a review cycle of length zz (that is a directed cycle of length 2z2z in (AP,EEP)(A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})). Subsequently, we present upper bounds on the size of Nz(v,EEP)N_{z}^{-}(v,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) and Nz+(v,EEP)N_{z}^{+}(v,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P}) for vAPv\in A\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}P thereby proving the existence of a,a,pa,a^{\prime},p^{\prime}.

Let pPp\in P be the paper without reviewer selected in Algorithm 2 when the algorithm enters case 2. Let ApAA_{p}\subseteq A be the set of agents that could review pp without creating a zz-cycle, that is, Ap:={aA(a,p)EAE{(a,p)}A_{p}:=\{a\in A\mid(a,p)\in E_{A}\land E^{\prime}\cup\{(a,p)\} is zz-cycle free}\}. Since dpaper=creviewer=ΔP+=1d_{\text{paper}}=c_{\text{reviewer}}=\Delta_{P}^{+}=1, there are at most zz agents whose assignment to review pp would create a review cycle, that is, |Nz+(p,EEP)A|z|N_{z}^{+}(p,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})\cap A|\leq z, and thus |Ap|δPz|A_{p}|\geq\delta_{P}^{-}-z. Since we are in case 2, no more review assignments could be added without creating a zz-cycle. Hence, the algorithm assigned the at least δPz\delta_{P}^{-}-z potential reviewers in ApA_{p} to different papers. Let PpP_{p} be the set of these papers. Since dpaper=creviewer=1d_{\text{paper}}=c_{\text{reviewer}}=1 we have |Pp|=|Ap|δPz|P_{p}|=|A_{p}|\geq\delta_{P}^{-}-z.

Let aAa\in A be an arbitrary agent without assigned review, that is, p′′:(a,p′′)E\nexists p^{\prime\prime}\colon(a,p^{\prime\prime})\in E^{\prime}. Since dpaper=creviewer=ΔA=1d_{\text{paper}}=c_{\text{reviewer}}=\Delta_{A}^{-}=1, we have |Nz(a,EEP)P|z|N_{z}^{-}(a,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})\cap P|\leq z. Thus, there are δA+z\delta_{A}^{+}-z papers that aa could review without creating a zz-cycle; let PaP_{a} denote the set of these papers. Since we assume that nPδA++δP2zn_{P}\leq\delta_{A}^{+}+\delta_{P}^{-}-2z, it follows that there is a pPaPpp^{\prime}\in P_{a}\cap P_{p}. By definition of PpP_{p} there is an agent aa^{\prime} with (a,p)E(a^{\prime},p^{\prime})\in E^{\prime} and aApa^{\prime}\in A_{p}. Thus, a,a,pa,a^{\prime},p^{\prime} exist and EE^{\prime} can be updated to (E{(a,p)}){(a,p),(a,p)}(E^{\prime}\setminus\{(a^{\prime},p^{\prime})\})\cup\{(a^{\prime},p),(a,p^{\prime})\} in Algorithm 2. ∎

We now turn our attention to our general case where agents can author and review many papers and papers can have multiple authors and can require several reviews. While the conditions that guarantee the existence of a zz-cycle-free review assignment need adjustments, we can still use Algorithm 2 together with a correctness proof that follows a similar pattern as the proof of Proposition 4.

Theorem 5.

If, nAcreviewernPdpapern_{A}\cdot c_{\text{reviewer}}\geq n_{P}\cdot d_{\text{paper}}, δA+>2(ΔAdpaper)z+creviewer\delta_{A}^{+}>2(\Delta_{A}^{-}\cdot d_{\text{paper}})^{z}+c_{\text{reviewer}}, δP>2(ΔP+creviewer)z+dpaper\delta_{P}^{-}>2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}+d_{\text{paper}}, and nPδA+2(ΔAdpaper)zcreviewer+(creviewer/dpaper)(δP2(ΔP+creviewer)zdpaper)n_{P}\leq\delta_{A}^{+}-2(\Delta_{A}^{-}\cdot d_{\text{paper}})^{z}-c_{\text{reviewer}}+(c_{\text{reviewer}}/d_{\text{paper}})(\delta_{P}^{-}-2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}-d_{\text{paper}}), then Algorithm 2 computes a creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid zz-cycle-free review assignment in polynomial time.

Proof.

We use the same notation as in the proof of Proposition 4 and similarly to this proof we need to show that a,a,pa,a^{\prime},p^{\prime} as described in Algorithm 2 actually always exist.

Let pPp\in P be the paper with a missing review selected in Algorithm 2 and the algorithm entered case 2. Let ApPA_{p}\subseteq P be the set of agents that could review pp without creating a zz-cycle, that is, Ap:={aA(a,p)EAEE{(a,p)}A_{p}:=\{a\in A\mid(a,p)\in E_{A}\setminus E^{\prime}\land E^{\prime}\cup\{(a,p)\} is zz-cycle free}\}. As every paper has at most ΔP+\Delta_{P}^{+} authors and every author has at most creviewerc_{\text{reviewer}} assigned papers to review, it follows that

|Nz+(p,EEP)A|ΔP+i=0z1(ΔP+creviewer)i\displaystyle|N_{z}^{+}(p,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})\cap A|\leq\Delta_{P}^{+}\cdot\sum_{i=0}^{z-1}(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{i}
=\displaystyle={} ΔP+((ΔP+creviewer)z1)/((ΔP+creviewer)1)\displaystyle\Delta_{P}^{+}\cdot((\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}-1)/((\Delta_{P}^{+}\cdot c_{\text{reviewer}})-1)
<\displaystyle<{} 2(ΔP+creviewer)z.\displaystyle 2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}.

Thus, |Ap|>δP2(ΔP+creviewer)zdpaper|A_{p}|>\delta_{P}^{-}-2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}-d_{\text{paper}}, as at most dpaper1d_{\text{paper}}-1 agents are already assigned to pp and at most |Nz+(p,EEP)A|<2(ΔP+creviewer)z|N_{z}^{+}(p,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})\cap A|<2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z} agents cannot review pp because this would cause a review cycle of length at most zz. When case 2 was entered, no more review assignment could be added without creating a zz-cycle. Hence, the algorithm assigned the potential reviewers in ApA_{p} already to different papers. Let PpP_{p} be the set of these papers. Note that |Pp|creviewer|Ap|/dpaper|P_{p}|\geq c_{\text{reviewer}}|A_{p}|/d_{\text{paper}}.

Let aAa\in A be an arbitrary agent that can do one more review, that is, |N+(a,E)|<creviewer|N^{+}(a,E^{\prime})|<c_{\text{reviewer}}. Using a similar argument as above, we can show |Nz(a,EEP)P|<2(ΔAdpaper)z|N_{z}^{-}(a,E^{\prime}\mathbin{\mathchoice{\ooalign{$\displaystyle\cup$\cr$\displaystyle\cdot$}}{\ooalign{$\textstyle\cup$\cr$\textstyle\cdot$}}{\ooalign{$\scriptstyle\cup$\cr$\scriptstyle\cdot$}}{\ooalign{$\scriptscriptstyle\cup$\cr$\scriptscriptstyle\cdot$}}}E_{P})\cap P|<2(\Delta_{A}^{-}\cdot d_{\text{paper}})^{z}. Thus, there are more than δA+2(ΔAdpaper)zcreviewer\delta_{A}^{+}-2(\Delta_{A}^{-}\cdot d_{\text{paper}})^{z}-c_{\text{reviewer}} papers that aa could review additionally without creating a zz-cycle; let PaP_{a} denote the set of these papers. Note that by our assumptions that δA+>2(ΔAdpaper)z+creviewer\delta_{A}^{+}>2(\Delta_{A}^{-}\cdot d_{\text{paper}})^{z}+c_{\text{reviewer}} and δP>2(ΔP+creviewer)z+dpaper\delta_{P}^{-}>2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}+d_{\text{paper}}, PaP_{a} and PpP_{p} are both non-empty. Since nPδA+2(ΔAdpaper)zcreviewer+(creviewer/dpaper)(δP2(ΔP+creviewer)zdpaper)<|Pa|+|Pp|n_{P}\leq\delta_{A}^{+}-2(\Delta_{A}^{-}\cdot d_{\text{paper}})^{z}-c_{\text{reviewer}}+(c_{\text{reviewer}}/d_{\text{paper}})(\delta_{P}^{-}-2(\Delta_{P}^{+}\cdot c_{\text{reviewer}})^{z}-d_{\text{paper}})<|P_{a}|+|P_{p}|, it follows that there is a pPaPpp^{\prime}\in P_{a}\cap P_{p}. By definition of PpP_{p} there is an agent aa^{\prime} with (a,p)E(a^{\prime},p^{\prime})\in E^{\prime} and aApa^{\prime}\in A_{p}. Thus, a,a,pa,a^{\prime},p^{\prime} exist and EE^{\prime} can be updated to (E{(a,p)}){(a,p),(a,p)}(E^{\prime}\setminus\{(a^{\prime},p^{\prime})\})\cup\{(a^{\prime},p),(a,p^{\prime})\} in Algorithm 2. This finishes the correctness proof. ∎

To simplify the statement of Theorem 5 consider a “symmetric” case where nAnPn_{A}\geq n_{P}, δP=δA+\delta_{P}^{-}=\delta_{A}^{+}, and ΔP+=ΔA\Delta_{P}^{+}=\Delta_{A}^{-}. For brevity, set n:=nPn:=n_{P}, δ:=δP\delta:=\delta_{P}^{-}, and Δ:=ΔP+\Delta:=\Delta_{P}^{+}. Let coi\operatorname{coi} be the maximum number of papers any agent is not qualified to review/has a conflict of interest with, that is, coi=nδ\operatorname{coi}=n-\delta. Setting creviewer=6c_{\text{reviewer}}=6 and dpaper=3d_{\text{paper}}=3 as in our experiments we get:

Corollary 2.

If n61.5coi+Δz(6z2+3z)n-6\geq 1.5\cdot\operatorname{coi}+\Delta^{z}(6^{z}\cdot 2+3^{z}), then there always exists a 66-33-valid zz-cycle-free review assignment that can be found in polynomial time.

Considering that AAAI’22 had 9,251 submissions and that there was a submission limit of 1010 papers per author and assuming that each paper has at most ten authors (implying that Δ=10\Delta=10) and that each author has at most 700 conflict of interests, it follows that there is a 66-33-valid 22-cycle-free review assignment computable with Algorithm 2.

As we see in the experiments in the next section, our algorithm returns 2/3/42/3/4-cycle-free review assignments even well beyond the theoretical guarantees given above. We also remark that Algorithm 2 allows for an easy extension to the weighted case which we use in our experiments in the next section. To this end, in the first case (Algorithm 2) we do not pick an arbitrary edge (a,p)(a,p) but a eligible edge of maximum weight to be added to the assignment EE^{\prime}.

5 Experiments

In this section, we compare the weight of review assignments computed using different methods and analyze the occurrences of review cycles.222The code for our experiments is available at github.com/n-boehmer/Combating-Collusion-Rings-is-Hard-but-Possible. For this, we use a dataset from the 2018 International Conference on Learning Representations (ICLR ’18) prepared by Xu et al. (2019). Xu et al. (2019) collected all 911911 papers submitted to ICLR ’18 and the identity of all 24282428 authors. As reviewers identities are unknown, they considered all authors to be reviewers and computed for each author-paper pair a similarity score.333To the best of our knowledge, in all other publicly available datasets, there are similarity scores for reviewer-paper pairs but the link between the identities of authors and reviewers is missing (as this is considered sensitive information).

From the dataset of Xu et al. (2019), we created multiple instances of Weighted Cycle-Free Reviewing as follows. Given a number nPn_{P} of papers and a ratio rAPr_{AP} of the numbers of agents and papers, we sample a subset of nPn_{P} of the 911911 ICLR ’18 papers and set this as our set of papers. Subsequently, we compute the set of all authors of one of these papers and sample a subset of rAPnPr_{AP}\cdot n_{P} authors and set this as our set of agents. Notably, the created instances can be seen as particularly challenging when it comes to avoiding review cycles, as in reality also “uncritical” reviewers, i.e., reviewers that do not author any paper, exist.

As done in other papers using the same dataset, we focus on the case with dpaper=3d_{\text{paper}}=3 and creviewer=6c_{\text{reviewer}}=6, i.e., every paper needs exactly three reviews and each agent can review at most six papers (Xu et al., 2019; Jecmen et al., 2020). We consider three different types of review assignments: As “optimal” we denote a maximum-weight creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid review assignment. Such an assignment can be computed using a simple Linear Program (LP) as, for instance, described by Taylor (2008). As “optimal zz-cycle free” we denote a maximum-weight creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid zz-cycle-free review assignment. This solution can be computed by treating the LP of Taylor (2008) as an Integer Linear Program (ILP) and adding for each possible ii-cycle for i[z]i\in[z] a separate constraint which imposes that at least one of the agent-paper pairs from the cycle is not assigned. We solved all (I)LPs using Gurobi Optimization, LLC (2021). Lastly, as “heuristic zz-cycle free”, we denote a creviewerc_{\text{reviewer}}-dpaperd_{\text{paper}}-valid zz-cycle-free review assignment computed by the weighted variant of Algorithm 2 as described at the end of Section 4.444We could not use the heuristics of Guo et al. (2018) as these are not available and their algorithm details are ambiguous. In all experiments conducted in this section, the heuristic always returned a solution despite the fact that most of the time we are beyond the setting in which Theorem 5 guarantees this behavior of the heuristic. In experiment I presented in the following subsection, for z=2/3/4z=2/3/4, an unoptimized implementation of our heuristic was always able to find a zz-cycle-free review assignment in less than 3030 seconds, being on average around 22 times faster than the “optimal” LP, on average around 3.73.7 times faster than the “optimal 22-cycle free” ILP, and on average more than 100100 times faster than the “optimal 33-cycle free” ILP.

5.1 Experiment I

In this experiment, we focus on the case where the total number of needed reviews is the same as the total number of reviews that can be written, which is in some sense the most “challenging” but probably also one of the more realistic scenarios. Specifically, for nP{150,175,,900}n_{P}\in\{150,175,\dots,900\}, we prepared 100100 instances with rAP=0.5r_{AP}=0.5 as described above and computed for each of these instances the optimal, heuristic 22/33/44-cycle-free, and optimal 22-cycle-free review assignment. Moreover, for all instances with nP225n_{P}\leq 225, we also computed the optimal 33-cycle-free review assignment (for larger instances the ILP solver run out of memory.)

To measure the “price of zz-cycle freeness”, in Figure 3, we display the weights of different cycle-free review assignments divided by the weight of an optimal review assignment. What stands out here is that forbidding the existence of 22-cycles only comes at the cost of decreasing the assignment’s weight by on average at most 0.8%0.8\% (if the optimal 22-cycle-free assignment is used). Turning to the results produced by our heuristic, the quality decrease for 22/33/44-cycle-free assignments lies, on average, around 3.1%3.1\%, 3.2%3.2\%, and 3.3%3.3\%. The weight of assignments computed using our heuristic is thus clearly worse than the weight of the optimal cycle-free assignment, yet still not far away from the the weight of an optimal assignment. What is particularly surprising here is that for both our heuristic and the optimal cycle-free assignment, whether 22, 33 or 44 cycles are forbidden seems to be rather irrelevant for the quality decrease. All in all, it is encouraging that 22/33/44-cycle freeness can be realized at a low cost independent of whether our heuristic or an ILP is used.

2002004004006006008008000.960.9650.970.9750.980.9850.990.995number of papersweight normalized by weight of optimal solution2-cycle freeheuristic cycle free3-cycle freeoptimal cycle free4-cycle free
Figure 1: For different values of zz, weight of an optimal/heuristic zz-cycle-free assignment divided by the weight of an optimal assignment.
2002004004006006008008000.20.30.40.50.60.70.8number of papersfraction of agents in a review cycle2-cyclesoptimal2/3-cyclesheuristic 2-cycle free2/3/4-cyclesheuristic 3-cyclce free
Figure 2: Fraction of agents that are part of a review cycle of at most some length for different types of assignment.
2002004004006006008008000.10.150.20.250.3number of papersfraction of papers in a review cycle2-cyclesoptimal2/3-cyclesheuristic 2-cycle free2/3/4-cyclesheuristic 3-cyclce free
Figure 3: Fraction of papers that are part of a review cycle of at most some length for different types of assignment.

The necessity of dealing with review cycles is underlined by the data displayed in Figure 3. Here, we show the fraction of agents that are contained in at least one review cycle of some length in an optimal assignment and in a heuristic 2/32/3-cycle-free assignment. Overall, as the number of papers increases the fraction of agents contained in review cycles constantly decreases, yet for all considered values of nPn_{P} the results are worrisome. In the optimal assignment for 150150 papers, the fraction of agents contained in a review cycle of length at most 2/3/42/3/4 is, on average, 40%/58%/76%40\%/58\%/76\% , while even for 900900 papers, still 32%/41%/55%32\%/41\%/55\% of agents are contained in a review cycle. Considering heuristic zz-cycle-free assignments, the fraction of agents contained in a cycle of length z+1z+1 is considerably lower than for the optimal solution but still non-negligible (the results for optimal 2/32/3-cycle-free assignments are similar to the displayed results for our heuristic).

We also computed the fraction of papers that are contained in at least one review cycle (see Figure 3). The results are as in Figure 3 with all values roughly halved, e.g, even in the optimal assignment for 900900 papers, 15%/20%/27%15\%/20\%/27\% of papers are contained in a review cycle of length at most 2/3/42/3/4. An intuitive explanation for this difference between agents and papers is that the number of papers is twice the number of agents and that there exist some papers without reviewing authors. Overall, it is striking that even for a high number of papers, in an optimal assignment around 15%15\% of papers could have a considerably higher chance of getting accepted if two agents coordinate to give each others paper better reviews and 32%32\% of reviewers would have an opportunity to participate in such a collusion.

0.60.81.01.21.41.61.82.00.960.9650.970.9750.980.9850.990.995number of agents/number of papers\nicefrac{{\text{number of agents}}}{{\text{number of papers}}}weight normalized by weight of optimal solution2-cycle freeheuristic cycle free3-cycle freeoptimal cycle free4-cycle free
Figure 4: For different values of zz, weight of a optimal/heuristic zz-cycle-free assignment divided by the weight of an optimal assignment.
0.60.81.01.21.41.61.82.00.20.30.40.50.60.7number of agents/number of papers\nicefrac{{\text{number of agents}}}{{\text{number of papers}}}fraction contained in a review cylce2-cyclesfraction agents2/3-cyclesfraction papers2/3/4-cycles
Figure 5: Fraction of agents/papers that are part of a review cycle of at most some length in an optimal assignment for 200200 papers and between 100100 and 400400 agents.

5.2 Experiment II

In this experiment, we analyze how the results from experiment I depend on the assumption that the supply and demand of reviews exactly matches. In particular, as describe before, for rAP{0.5,0.6,,1.9,2}r_{AP}\in\{0.5,0.6,\dots,1.9,2\} we prepared 100100 instances with 200200 papers and rAP200r_{AP}\cdot 200 agents (we also repeated this experiment for 400400 and 600600 papers producing similar results) and computed the different types of review assignments. Considering the assignment weights (see Figure 5), increasing rAPr_{AP} from 0.50.5 to 22, the normalized weight of an optimal 2/32/3-cycle-free assignment decreases by 0.0050.005 to 0.987/0.9850.987/0.985, while the normalized weight of a heuristic 2/3/42/3/4-cycle-free assignment increases by 0.010.01 to 0.982/0.979/0.9760.982/0.979/0.976: our heuristic performs particularly well if there are (considerably) more reviews available then needed; this supports our theoretical statements for our heuristic in Section 4.

Turning to the possible impact of review cycles, we visualize the fraction of agents/papers contained in a review cycle in an optimal assignment in Figure 5.555For readability, we do not display the values for the optimal/heuristic cycle-free assignment, as their relationship to the optimal assignment is again similar as in Figure 3. While the fraction of agents contained in a review cycle constantly and significantly decreases if more and more agents are added, the fraction of papers contained in a cycle constantly increases. The former observation is quite intuitive, as when more and more agents are added, the average review load decreases and even if the number of review cycles remains the same, it is likely that the fraction of agents contained in one gets smaller. The latter observation is less intuitive but probably a consequence of the fact that, starting with rAP=0.5r_{AP}=0.5, for some papers none of the authors is part of the agent set, implying that these papers cannot be part of a review cycle; however, if we start to add more and more agents, more and more papers can potentially be part of a review cycle. Overall, it might be quite counter intuitive that adding more and more reviewers (that are also authors) to the reviewer pool does not decrease the number of papers contained in a review cycle but increases them.

6 Conclusion

Our work provides a first systematic analysis of Cycle-Free Reviewing. On the theoretical side, we show that Cycle-Free Reviewing is a computationally hard problem even in very restricted settings, yet practically relevant polynomial-time solvable special cases exist. In our practical analysis, we could show that in assignments that do not care for review cycles a high fraction of authors and papers will likely be part of a short review cycle. While collusion rings can certainly also emerge without the existence of review cycles, for example, when authors coordinate over multiple conferences (Littman, 2021; Shah, 2021), allowing so many easy opportunities means to leave a huge door unlocked without good reason: Our heuristic significantly improves the situation, since it seems to always find cycle-free review assignment at a very low quality loss.

For future work, it would be valuable to further investigate the limits of our heuristic. While our current bounds are certainly not tight, there are also clear limitations for possible extensions imposed by our NP-hardness results in quite restrictive settings from Section 3. However, a concrete and practically very relevant open question is whether the minimum degree in our analysis can be replaced by the average degree; this would make the results much more robust against outliers. Finally, due to the lack of data, we tested our model on just one dataset. Obtaining more data to test our and other models on would be extremely valuable.

Acknowledgments

NB was supported by the DFG project MaMu (NI 369/19) and by the DFG project ComSoc-MPMS (NI 369/22). RB was partially supported by the DFG project AFFA (BR 5207/1 and NI 369/15). This work was started at the research retreat of the TU Berlin Algorithms and Computational Complexity group held in September 2020.

References

  • Barrot et al. [2020] Nathanaël Barrot, Sylvaine Lemeilleur, Nicolas Paget, and Abdallah Saffidine. Peer reviewing in participatory guarantee systems: Modelisation and algorithmic aspects. In Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS ’20), pages 114–122. IFAAMAS, 2020.
  • Berman et al. [2003] Piotr Berman, Marek Karpinski, and Alex D. Scott. Approximation hardness of short symmetric instances of MAX-3SAT. ECCC, (49), 2003.
  • Darmann and Döcker [2020] Andreas Darmann and Janosch Döcker. On a simple hard variant of not-all-equal 3-sat. Theor. Comput. Sci., 815:147–152, 2020.
  • Garg et al. [2010] Naveen Garg, Telikepalli Kavitha, Amit Kumar, Kurt Mehlhorn, and Julián Mestre. Assigning papers to referees. Algorithmica, 58(1):119–136, 2010.
  • Goldsmith and Sloan [2007] Judy Goldsmith and Robert H. Sloan. The AI conference paper assignment problem. In Proceedings of the 22nd AAAI Conference Workshop on Preference Handling for Artificial Intelligence (MPREF ’07), pages 53–57. AAAI Press, 2007.
  • Guo et al. [2018] Longhua Guo, Jie Wu, Wei Chang, Jun Wu, and Jianhua Li. K-loop free assignment in conference review systems. In Proceedings of the 2018 International Conference on Computing, Networking and Communications (ICNC ’18), pages 542–547. IEEE Computer Society, 2018.
  • Gurobi Optimization, LLC [2021] Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual, 2021. URL https://www.gurobi.com.
  • Hell et al. [1988] Pavol Hell, David G. Kirkpatrick, Jan Kratochvíl, and Igor Kríz. On restricted two-factors. SIAM J. Discret. Math., 1(4):472–484, 1988.
  • Jecmen et al. [2020] Steven Jecmen, Hanrui Zhang, Ryan Liu, Nihar B. Shah, Vincent Conitzer, and Fei Fang. Mitigating manipulation in peer review via randomized reviewer assignments. In Advances in Neural Information Processing Systems 33 (NeurIPS ’20), 2020.
  • Kobren et al. [2019] Ari Kobren, Barna Saha, and Andrew McCallum. Paper matching with local fairness constraints. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD ’19), pages 1247–1257. ACM, 2019.
  • Leyton-Brown and Mausam [2021] K. Leyton-Brown and Mausam. Aaai 2021 - introduction. https://slideslive.com/38952457/aaai-2021-introduction?ref=account-folder-79533-folders, 2021. minute 8 onwards in the video.
  • Lian et al. [2018] Jing Wu Lian, Nicholas Mattei, Renee Noble, and Toby Walsh. The conference paper assignment problem: Using order weighted averages to assign indivisible goods. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI ’18), pages 1138–1145. AAAI Press, 2018.
  • Littman [2021] Michael L. Littman. Collusion rings threaten the integrity of computer science research. Commun. ACM, 64(6):43–44, 2021.
  • Long et al. [2013] Cheng Long, Raymond Chi-Wing Wong, Yu Peng, and Liangliang Ye. On good and fair paper-reviewer assignment. In Proceedings of the 2013 IEEE 13th International Conference on Data Mining (ICDM ’13), pages 1145–1150. IEEE Computer Society, 2013.
  • Price and Flach [2017] Simon Price and Peter A. Flach. Computational support for academic peer review: a perspective from artificial intelligence. Commun. ACM, 60(3):70–79, 2017.
  • Shah [2021] Nihar B. Shah. Systemic challenges and solutions on bias and unfairness in peer review. http://www.cs.cmu.edu/˜nihars/preprints/SurveyPeerReview.pdf, 2021.
  • Stelmakh et al. [2021] Ivan Stelmakh, Nihar Shah, and Aarti Singh. Peerreview4all: Fair and accurate reviewer assignment in peer review. J. Mach. Learn. Res., 22(163):1–66, 2021.
  • Taylor [2008] Camillo J Taylor. On the optimal assignment of conference papers to reviewers. https://repository.upenn.edu/cis˙reports/889, 2008.
  • Xu et al. [2019] Yichong Xu, Han Zhao, Xiaofei Shi, and Nihar B. Shah. On strategyproof conference peer review. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI ’19), pages 616–622. ijcai.org, 2019.