This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Inconsistency in the ordinal pairwise comparisons method
with and without ties

Konrad Kułakowski konrad.kulakowski@agh.edu.pl AGH University of Science and Technology, Krakᅵw, Poland
Abstract

Comparing alternatives in pairs is a well-known method of ranking creation. Experts are asked to perform a series of binary comparisons and then, using mathematical methods, the final ranking is prepared. As experts conduct the individual assessments, they may not always be consistent. The level of inconsistency among individual assessments is widely accepted as a measure of the ranking quality. The higher the ranking quality, the greater its credibility.

One way to determine the level of inconsistency among the paired comparisons is to calculate the value of the inconsistency index. One of the earliest and most widespread inconsistency indices is the consistency coefficient defined by Kendall and Babington Smith. In their work, the authors consider binary pairwise comparisons, i.e., those where the result of an individual comparison can only be: better or worse. The presented work extends the Kendall and Babington Smith index to sets of paired comparisons with ties. Hence, this extension allows the decision makers to determine the inconsistency for sets of paired comparisons, where the result may also be "equal." The article contains a definition and analysis of the most inconsistent set of pairwise comparisons with and without ties. It is also shown that the most inconsistent set of pairwise comparisons with ties represents a special case of the more general set cover problem.

keywords:
pairwise comparisons, consistency coefficient , inconsistency , AHP

1 Introduction

The use of pairwise comparisons (PC) to form judgments has a long history. Probably the first who formally defined and used pairwise comparisons for decision making was Ramon Llull (the XIII century) [6]. He proposed a voting system based on binary comparisons. The subject of comparisons (alternatives) were people - candidates for office. Voters evaluated the candidates in pairs, deciding which one was better. In the XVIII century, Llull’s method was rediscovered by Condorcet [7], then once again reinvented in the middle of the XX century by Copeland [6, 8]. At the beginning of the XX century, Thurstone used the pairwise comparisons method (PC method) quantitatively [39]. In this approach, the result returned does not only contain information about who or what is better, but also indicates how strong the preferences are. Later, both approaches, ordinal (qualitative), as proposed by Llull, and cardinal (quantitative), as used by Thurstone, were developed in parallel. Comparing alternatives in pairs plays an important role in research into decision making systems [14, 17, 29], ranking theory [34, 21], social choice theory [38], voting systems [40, 12, 41] and others.

In general, the PC method is a ranking technique that allows the assessment of the importance (relevance, usefulness, competence level etc.) of a number of alternatives. As it is much easier for people to assess two alternatives at a time than handling all of them at once, the PC method assumes that, first, all the alternatives are compared in pairs, then, by using an appropriate algorithm, the overall ranking is synthesized. The choice of the algorithm is not easy and is still the subject of research and vigorous debate [35, 42, 28]. Of course, it also depends on the nature of the comparisons. The cardinal methods use different algorithms [19, 13] than the ordinal ones [21, 6, 20, 40]. Despite the many differences between ordinal and cardinal pairwise comparisons, both approaches have much in common. For example, both approaches use the idea of inconsistency among individual comparisons. The notion of inconsistency introduced by the pairwise comparisons method is based on the natural expectation that every two comparisons of any three different alternatives should determine the third possible comparison among those alternatives.

To better understand the phenomenon of inconsistency, let us assume that we have to compare three alternatives c1,c2c_{1},c_{2} and c3c_{3} with respect to the same criterion. If after the comparison of c1c_{1} and c2c_{2} it is clear to us that c2c_{2} is more important than c2c_{2}, and similarly, after comparing c2c_{2} and c3c_{3} it is evident that c3c_{3} is more important than c1c_{1} then we may expect that c3c_{3} will also turn out to be more important than c1.c_{1}. The situation in which c1c_{1} is better than c3c_{3} would raise our surprise and concern. That is because it seems natural to assume that the preferential relationship should be transitive. If it is not, we have to deal with inconsistency. As pairwise comparisons are performed by experts, who, like all human beings, sometimes make mistakes, the phenomenon of inconsistency is something natural. The ranking synthesis algorithm must take it into account. On the other hand, if a large number of such “mistakes” can be found in the set of paired comparisons, one can have reasonable doubts as to the credibility of the ranking obtained from such lower quality data.

Both ordinal and cardinal PC methods developed their own solutions for determining the degree of inconsistency. Research into the cardinal PC method resulted in a number of works on inconsistency indices. Probably the most popular inconsistency index was defined by Saaty in his seminal work on the Analytic Hierarchy Process (AHP) [34]. His work prompted others to continue the research [27, 32, 1, 37, 3, 5]. The ordinal PC methods also have their own ways of assessing the level of inconsistency. In their seminal work [26] Kendall and Babington Smith introduced the inconsistency index (called by the authors the consistency coefficient). Their index allows the inconsistency degree of a set composed of binary pairwise comparisons to be determined. The results obtained by the authors were the inspiration for many other researchers in different fields of science [23, 30, 31, 2, 4, 36].

Although the ordinal pairwise comparisons method is a really powerful and handy tool facilitating the right decision, in practice we very often face the problem that the two options seem to be equally important. In such a situation, we can try to get around the problem by a brute force method of breaking ties. For example, we can do this by “instructing the judge to toss a mental coin when he cannot otherwise reach a decision; or, allowing him the comfort of reserving judgment, we can let a physical coin decide for him” [9, p. 94 - 95]. It is clear, however, that instead of relying on more or less arbitrary methods of breaking ties, it is better to accept their existence and incorporate them into the model. Indeed, ties have been inextricably linked with the ranking theory for a long time [6, 25, 9]. The ordinal pairwise comparisons method with ties has its own techniques of synthesizing ranking [15, 10, 40]. In this perspective, research into the inconsistency of ordinal pairwise comparisons with ties is quite poor. In particular, the consistency coefficient as defined by [26] is not suitable for determining the inconsistency of PC with ties. The problem was recognized by Jensen and Hicks [22], and later by Iida [18]. These authors also made attempts to patch up this gap in the ranking theory, however, the fundamental question as to what extent the set of PC with ties can be inconsistent still remains unanswered.

The purpose of the present article is to answer this question, and thus to define the inconsistency index for the ordinal PC with ties in the same manner as Kendall and Babington Smith did [26] for binary PC. The definition of the inconsistency index is accompanied by a thorough study of the most inconsistent sets of pairwise comparisons with and without ties.

The article is composed of eight sections including the introduction and four appendices. The PC with ties is formally introduced in the next section (Sec. 2). For the purpose of modeling PC with ties, a generalized tournament graph has also been defined there. The most inconsistent set of binary PC is studied in (Sec. 3). It is also proven that the number of inconsistent triads in such a graph is determined by Kendall Babington Smith’s consistency coefficient. The next section (Sec. 4) describes how the most inconsistent set of PC with ties may look. Thus, it contains several theorems describing the quantitative relationship between the elements of the generalized tournament graph. Finally, in (Sec. 5) the most inconsistent set of PC with ties is proposed. The generalized inconsistency index for ordinal PC is also defined (Sec. 6). The penultimate section (Sec. 7) contains a discussion of the subject. In particular, the relationship between the maximally inconsistent set of PC and the NP-complete set cover problem [24] is shown. A brief summary is provided in (Sec. 8).

2 Model of inconsistency

Let us suppose we have a number of possible choices (alternatives, concepts) c1,,cnc_{1},\ldots,c_{n} where we are able to decide only whether one is better (more preferred) than the other or whether both alternatives are equally preferred. In the first case, we will write that cicjc_{i}\prec c_{j} to denote that cjc_{j} is more preferred than cic_{i}, whilst in the second case, to express that two alternatives cic_{i} and cjc_{j} are equally preferred we write cicjc_{i}\sim c_{j}. The preference relationship is total. Hence, for every two cic_{i} and cjc_{j} it holds that either cicjc_{i}\prec c_{j}, cjcic_{j}\prec c_{i} or cicjc_{i}\sim c_{j}. The relationship is reflexive and asymmetric. In particular, we will assume that if cicjc_{i}\prec c_{j} then not cjcic_{j}\prec c_{i}, and cicic_{i}\sim c_{i} for every i,j=1,,ni,j=1,\ldots,n. It is convenient to represent the relationship of preferences in the form of an n×nn\times n matrix.

Definition 1

The n×nn\times n matrix M=[mij]M=[m_{ij}] where mij{1,0,1}m_{ij}\in\{-1,0,1\} is said to be the ordinal PC matrix for nn alternatives c1,,cnc_{1},\ldots,c_{n} if a single comparison mijm_{ij} takes the value 11 when cic_{i} wins with cjc_{j} (i.e. cicj)c_{i}\succ c_{j}), 1-1 if, reversely, cjc_{j} is better than cic_{i} (i.e. cjci)c_{j}\succ c_{i}) and 0 in the case of a tie between cic_{i} and cjc_{j} (cicjc_{i}\sim c_{j}). The the diagonal values are 0.

The PC matrix is skew-symmetric except the diagonal, so that for every i,j=1,,ni,j=1,\ldots,n it holds that mij+mji=0m_{ij}+m_{ji}=0. An example of the ordinal PC matrix for five alternatives is given below (1).

M=(0101010111010111110101110)M=\left(\begin{array}[]{ccccc}0&1&0&1&0\\ -1&0&1&1&1\\ 0&-1&0&1&-1\\ -1&-1&-1&0&1\\ 0&-1&1&-1&0\end{array}\right) (1)

The PC matrix can be easily represented in the form of a graph.

Definition 2

A tournament graph (t-graph) with nn vertices is a pair T=(V,Ed)T=(V,E_{d}) where V=c1,,cnV=c_{1},\ldots,c_{n} is a set of vertices and EdV2E_{d}\subset V^{2} is a set of ordered pairs called directed edges, so that for every two distinct vertices cic_{i} and cjc_{j} either (ci,cj)Ed(c_{i},c_{j})\in E_{d} or (cj,ci)Ed(c_{j},c_{i})\in E_{d} .

Let us expand the definition of a tournament graph so that it can also model the collection of pairwise comparisons with ties.

Definition 3

The generalized tournament graph (gt-graph) with nn vertices is a triple G=(V,Eu,Ed)G=(V,E_{u},E_{d}) where V=c1,,cnV=c_{1},\ldots,c_{n} is a set of vertices, Eu2VE_{u}\subset 2^{V} is a set of unordered pairs called undirected edges, and EdV2E_{d}\subset V^{2} is a set of ordered pairs called directed edges, so that for every two distinct vertices cic_{i} and cjc_{j} either (ci,cj)Ed(c_{i},c_{j})\in E_{d} or (cj,ci)Ed(c_{j},c_{i})\in E_{d} or {ci,cj}Eu\{c_{i},c_{j}\}\in E_{u} .

Wherever it increases the readability of the text the directed and undirected edges (ci,cj)(c_{i},c_{j}), (cj,ci)(c_{j},c_{i}), {ci,cj}\{c_{i},c_{j}\} between ci,cjVc_{i},c_{j}\in V are denoted as cicj,cjcic_{i}\rightarrow c_{j},c_{j}\rightarrow c_{i} and cicjc_{i}-c_{j} correspondingly.

It is easy to see that every tournament graph can easily be extended to a generalized tournament graph where Eu=E_{u}=\emptyset. Therefore, it will be assumed that every t-graph is also a gt-graph, but not reversely.

Definition 4

A family of t-graphs with nn vertices will be denoted as 𝒯nt\mathscr{T}_{n}^{t}, where 𝒯nt={(V,Ed) is a t-graph, where|V|=n}\mathscr{T}_{n}^{t}=\{(V,E_{d})\,\textit{ is a t-graph, where}\,\left|V\right|=n\}, and similarly, a family of gt-graphs with nn vertices will be denoted as 𝒯ng\mathscr{T}_{n}^{g}, where 𝒯ng={(V,Eu,Ed) is a gt-graph, where|V|=n}\mathscr{T}_{n}^{g}=\{(V,E_{u},E_{d})\,\textit{ is a gt-graph, where}\,\left|V\right|=n\}

It is obvious that for every n>0n>0 it holds that 𝒯nt𝒯ng\mathscr{T}_{n}^{t}\subsetneq\mathscr{T}_{n}^{g}.

Definition 5

A family of gt-graphs with nn vertices and mm directed edges will be denoted as 𝒯n,mg={(V,Eu,Ed) is a gt-graph, where|V|=nand|Ed|=m}\mathscr{T}_{n,m}^{g}=\{(V,E_{u},E_{d})\,\textit{ is a gt-graph, where}\,\left|V\right|=n\,\textit{and}\,\left|E_{d}\right|=m\}

Definition 6

A gt-graph GM𝒯ngG_{M}\in\mathscr{T}_{n}^{g} is said to correspond to the n×nn\times n ordinal PC matrix M=[mij]M=[m_{ij}] if every directed edge (ci,cj)Ed(c_{i},c_{j})\in E_{d} implies mji=1m_{ji}=1 and mij=1m_{ij}=-1, and every undirected edge {ci,cj}Eu\{c_{i},c_{j}\}\in E_{u} implies mij=0m_{ij}=0.

Definition 7

All three mutually distinct vertices t={ci,ck,cj}Vt=\{c_{i},c_{k},c_{j}\}\subseteq V are said to be a triad. The vertex cc is said to be contained by a triad t={ci,ck,cj}t=\{c_{i},c_{k},c_{j}\} if ctc\in t. A triad t={ci,ck,cj}t=\{c_{i},c_{k},c_{j}\} is said to be covered by the edge (p,q)Ed(p,q)\in E_{d} if p,qtp,q\in t.

Sometimes it will be more convenient to write a triad t={ci,ck,cj}t=\{c_{i},c_{k},c_{j}\} as the set of edges, e.g. {cick,ckcj,cicj}\{c_{i}\rightarrow c_{k},c_{k}\,\text{---}\,c_{j},c_{i}\,\text{---}\,c_{j}\}. However, both notations are equivalent, the latter one allows the reader to immediately identify the type of triad.

Definition 8
Refer to caption
Figure 1: The gt-graph corresponding to the matrix MM, see (1).

In their work, Kendall and Babington Smith dealt with the ordinal pairwise comparisons without ties [26]. Hence, in fact, they do not consider the situation in which cicjc_{i}\sim c_{j}. For the same reason, their ordinal PC matrices had no zeros anywhere outside the diagonal111In fact, those matrices had no zeros as the authors inserted dashes on the diagonal [26].. For the purpose of defining the notion of inconsistency in preferences, they adopt the transitivity of the preference relationship. According to this assumption, every triad ci,ck,cjc_{i},c_{k},c_{j} of three different alternatives can be classified as consistent or inconsistent (contradictory). Providing that there are no ties between alternatives, there are two different kinds of triads (it is easy to verify that any other triad can be simply boiled down to one of these two by simple index changing). The first one cick,ckcjc_{i}\rightarrow c_{k},c_{k}\rightarrow c_{j} and cicjc_{i}\rightarrow c_{j} hereinafter referred to as the consistent triad222Index 33 means that this kind of triad is formed by three directed edges. CT3\textit{CT}_{3}, and cick,ckcjc_{i}\rightarrow c_{k},c_{k}\rightarrow c_{j} and cjcic_{j}\rightarrow c_{i} termed hereinafter as the inconsistent triad IT3\textit{IT}_{3} (Fig. 2).

Refer to caption
(a) CT3\textit{CT}_{3} - a consistent triad covered by three directed edges
Refer to caption
(b) IT3\textit{IT}_{3} - an inconsistent (circular) triad covered by three directed edges
Figure 2: Triads for paired comparisons without ties

Of course, the more inconsistent the triads in the ordinal PC matrix, the more inconsistent the set of preferences, hence the less reliable the conclusions drawn from the set of paired comparisons. To determine how inconsistent the given set of paired comparisons is, Kendall and Babington Smith [26] provide the maximal number of inconsistent triads in the n×nn\times n PC matrix without ties. Denoting the actual number of inconsistent triads in TMT_{M} by |TM|i\left|T_{M}\right|_{i}, and the maximal possible number of inconsistent triads in n×nn\times n PC matrix MM as (n)\mathcal{I}(n), we have 333As every n×nn\times n ordinal PC matrix MM corresponds to some tournament graph TnT_{n}^{\ast} we also use the notation |Tn|i\left|T_{n}^{\ast}\right|_{i} to express the number of inconsistent triads in it.:

(n)={n3n24whenn is oddn34n24whenn is even\mathcal{I}(n)=\left\{\begin{array}[]{ccc}\frac{n^{3}-n}{24}&\text{when}&\text{n is odd}\\ \frac{n^{3}-4n}{24}&\text{when}&\text{n is even}\end{array}\right. (2)

Therefore, the inconsistency index for MM defined in [26] is:

ζ(M)=1|TM|i(n)\zeta(M)=1-\frac{\left|T_{M}\right|_{i}}{\mathcal{I}(n)} (3)

Unfortunately, including ties into consideration significantly complicates the scene. Besides the two types of triads CT3\textit{CT}_{3} and IT3\textit{IT}_{3} we need to take into consideration an additional five:

  • CT0\textit{CT}_{0}

    - consistent triad of three equally preferred alternatives ci,ckc_{i},c_{k} and cjc_{j} such that cick,ckcjc_{i}\sim c_{k},c_{k}\sim c_{j} and cicjc_{i}\sim c_{j}.

  • IT1\textit{IT}_{1}

    - inconsistent triad composed of three alternatives ci,ckc_{i},c_{k} and cjc_{j} such that cick,ckcjc_{i}\sim c_{k},c_{k}\sim c_{j} and cicjc_{i}\prec c_{j}.

  • IT2\textit{IT}_{2}

    - inconsistent triad composed of three alternatives ci,ckc_{i},c_{k} and cjc_{j} such that cick,ckcjc_{i}\sim c_{k},c_{k}\prec c_{j} and cjcic_{j}\prec c_{i}.

  • CT2a\textit{CT}_{2a}

    - consistent triad composed of three alternatives ci,ckc_{i},c_{k} and cjc_{j} such that cick,ckcjc_{i}\sim c_{k},c_{k}\prec c_{j} and cicjc_{i}\prec c_{j}.

  • CT2b\textit{CT}_{2b}

    - consistent triad composed of three alternatives ci,ckc_{i},c_{k} and cjc_{j} such that cick,cjckc_{i}\sim c_{k},c_{j}\prec c_{k} and cjcic_{j}\prec c_{i}.

The above triads can be easily represented as tournament graphs with ties (Fig. 4). With the increased number of different types of triads in a graph, the maximum number of inconsistent triads also increases. For example, according to (2) the maximum number of inconsistent triads in (4)\mathcal{I}(4) without ties is 22. When ties are allowed, the maximal number of inconsistent triads increases to 44, which is the total number of triads in every simple graph (i.e. with only one edge between one pair of vertices) with four vertices.

Refer to caption
Figure 3: (4)\mathcal{I}(4) with four IT1\textit{IT}_{1} triads

Let us analyze the graph in (Fig 3). It is easy to notice that it contains four IT1\textit{IT}_{1} triads which are: {c1c2,c2c3,c3c1c_{1}\rightarrow c_{2},\,c_{2}\,\text{---}\,c_{3},\,c_{3}\,\text{---}\,c_{1}}, {c1c2,c2c4,c4c1c_{1}\rightarrow c_{2},\,c_{2}\,\text{---}\,c_{4},\,c_{4}\,\text{---}\,c_{1}}, {c1c3,c3c4,c4c1c_{1}\text{---}c_{3},\,c_{3}\,\rightarrow\,c_{4},\,c_{4}\,\text{---}\,c_{1}}, and {c2c3,c3c4,c4c1c_{2}\,\text{---}\,c_{3},\,c_{3}\,\text{$\rightarrow$}\,c_{4},\,c_{4}\,\text{---}\,c_{1}}. Thus, it is clear that the formulae (2) and (3) cannot be used to estimate inconsistency in preferences when ties are allowed. The desire to extend those concepts to paired comparisons with ties was the main motivation for writing the work.

Refer to caption
(a) CT0\textit{CT}_{0} - a consistent triad not covered by any directed edge
Refer to caption
(b) IT1\textit{IT}_{1} - an inconsistent triad covered by one directed edge
Refer to caption
(c) IT2\textit{IT}_{2} - an inconsistent triad covered by two directed edges
Refer to caption
(d) CT2a\textit{CT}_{\textit{2a}} - a consistent triad covered by two directed edges. One alternative is more preferred than two others.
Refer to caption
(e) CT2b\textit{CT}_{\textit{2b}} - a consistent triad covered by two directed edges. One alternative is less preferred than two others.
Figure 4: Triads specific for the pairwise comparisons with ties

3 The most inconsistent set of preferences without ties

To construct the most inconsistent set of pairwise preferences without ties, let us introduce a few definitions relating to the degree of vertices. Since every t-graph is also a gt-graph the definitions are formulated for the gt-graph.

Definition 9

Let G=(V,Eu,Ed)G=(V,E_{u},E_{d}) be a gt-graph and c,dVc,d\in V. Then input degree, output degree, undirected degree and degree of a vertex cc are defined as follows: degin(c)=df|{dV:dcEd}|\text{deg}_{\textit{in}}(c)\overset{\textit{df}}{=}\left|\{d\in V\,:\,d\rightarrow c\in E_{d}\}\right|, degout(c)=df|{dV:cdEd}|\text{deg}_{out}(c)\overset{\textit{df}}{=}\left|\{d\in V\,:\,c\rightarrow d\in E_{d}\}\right|, degun(c)=df|{dV:cdEu}|\text{deg}_{un}(c)\overset{\textit{df}}{=}\left|\{d\in V\,:\,c\,\text{---}\,d\in E_{u}\}\right| and deg(c)=dfdegin(c)+degout(c)+degun(c)\text{deg}(c)\overset{\textit{df}}{=}\text{deg}_{\textit{in}}(c)+\text{deg}_{out}(c)+\text{deg}_{un}(c).

Theorem 1

Let G=(V,Eu,Ed)G=(V,E_{u},E_{d}) from 𝒯ng\mathscr{T}_{n}^{g}. Then every vertex cVc\in V, for which degin(c)=k\text{deg}_{\textit{in}}(c)=k is contained by at least (k2)\binom{k}{2} consistent triads of the type CT2a\textit{CT}_{2a} or CT3\textit{CT}_{3}. Those triads are said to be introduced by cc.

Proof 1

Let c1,,ckVc_{1},\ldots,c_{k}\in V be the vertices such that the edges cicc_{i}\rightarrow c are in EdE_{d}. Since TT is a gt-graph with nn vertices, then for every ci,cjc_{i},c_{j} where i,j=1,,ki,j=1,\ldots,k there must exist an edge cicjc_{i}\rightarrow c_{j}, cjcic_{j}\rightarrow c_{i} in EdE_{d} or cicjc_{i}\,\text{---}\,c_{j} in EuE_{u}. In the first two cases, the vertices ci,c,cjc_{i},c,c_{j} make a consistent triad type CT2a\textit{CT}_{2a}, whilst in the latter case the vertices ci,c,cjc_{i},c,c_{j} form a consistent triad type CT3\textit{CT}_{3}. Since there are kk vertices adjacent via the incoming edge to cc there are at least as many different consistent triads containing cc as two-element combinations of c1,,ckc_{1},\ldots,c_{k} i.e. (k2)\binom{k}{2}. See (Fig. 5).

Refer to caption
Figure 5: Consistent triads introduced by the vertex cVc\in V with degin(c)=k\text{deg}_{\textit{in}}(c)=k

In general, the given vertex cc can form more consistent triads than those indicated in the above theorem. This is due to the fact that there may be two or more edges in the form cck+1,,cck+rc\rightarrow c_{k+1},\dots,c\rightarrow c_{k+r}. Thus, in TT there may also be some number of consistent triads CT2b\textit{CT}_{2b} containing cc.

The Theorem 1 is also true for the ordinary tournament graph (without ties). However, since the only consistent triads in such a graph are type CT3\textit{CT}_{3} (i.e. there are no triads of the type CT2a\textit{CT}_{2a} or CT2b\textit{CT}_{2b} containing cc), the only consistent triads containing cc are those introduced by cc. This leads to the following observation:

Corollary 1

Let T=(V,Ed)T=(V,E_{d}) from 𝒯nt\mathscr{T}_{n}^{t}. Then every vertex cVc\in V, for which degin(c)=k\text{deg}_{in}(c)=k is contained by exactly (k2)\binom{k}{2} consistent triads of the type CT3\textit{CT}_{3}.

Thus, if we would like to construct a tournament graph without ties which has the maximal number of inconsistent triads, we have to minimize the number of consistent triads introduced by the vertices, i.e.

|T|c=dfcV(degin(c)2)\left|T\right|_{c}\overset{\textit{df}}{=}\sum_{c\in V}\binom{\text{deg}_{\textit{in}}(c)}{2} (4)

Since there are no other consistent triads in the tournament graph than those introduced by the vertices, the expression (5) denotes, in fact, the number of inconsistent triads in some T𝒯ntT\in\mathscr{T}_{n}^{t}. Thus,

|T|i=(n3)cV(degin(c)2)\left|T\right|_{i}=\binom{n}{3}-\sum_{c\in V}\binom{\text{deg}_{\textit{in}}(c)}{2} (5)

It is commonly known that the sum of degrees in any undirected graph G=(V,E)G=(V,E) equals 2|E|2|E| [11, p. 5]. For the same reason in T𝒯ntT\in\mathscr{T}_{n}^{t} the sum of incoming edges into vertices is444Every directed edge corresponds to one victory. |E|=(n2)|E|=\binom{n}{2}, i.e.:

cVdegin(c)=(n2)\sum_{c\in V}\text{deg}_{\textit{in}}(c)=\binom{n}{2} (6)

Hence, we would like to minimize (5) providing that the expression (6) holds. Intuitively |T|i\left|T\right|_{i} is the largest (5) i.e. |T|c\left|T\right|_{c} is the smallest (4) when the input degrees of vertices in a graph are the most evenly distributed555As it will be explained latter the input degrees are the most evenly distributed if for two different vertices c,dc,d holds that |degin(c)degin(d)|1\left|\deg_{\textit{in}}(c)-\deg_{\textit{in}}(d)\right|\leq 1. .

Definition 10

A gt-graph with nn vertices is said to be maximal with respect to the number of inconsistent triads, or briefly maximal if it has the highest possible number of inconsistent triads among the gt-graphs with the size nn. The fact that the gt-graph is maximal will be denoted G𝒯ng¯G\in\overline{\mathscr{T}_{n}^{g}} or T𝒯nt¯T\in\overline{\mathscr{T}_{n}^{t}}, depending on whether ties are or are not allowed. 𝒯nt¯\overline{\mathscr{T}_{n}^{t}} and 𝒯ng¯\overline{\mathscr{T}_{n}^{g}} denote families of gt-graphs with the highest possible number of inconsistent triads, i.e.

𝒯nt¯={T𝒯ntsuch that|T|i=maxTr𝒯nt|Tr|i}\overline{\mathscr{T}_{n}^{t}}=\{T\in\mathscr{T}_{n}^{t}\,\textit{such that}\,\left|T\right|_{i}=\underset{T_{r}\in\mathscr{T}_{n}^{t}}{\text{max}}\left|T_{r}\right|_{i}\} (7)
𝒯ng¯={G𝒯ngsuch that|G|i=maxGr𝒯ng|Gr|i}\overline{\mathscr{T}_{n}^{g}}=\{G\in\mathscr{T}_{n}^{g}\,\textit{such that}\,\left|G\right|_{i}=\underset{G_{r}\in\mathscr{T}_{n}^{g}}{\text{max}}\left|G_{r}\right|_{i}\} (8)

Before we prove the Theorem (2) about the maximal t-graph let us notice that for r+r\in\mathbb{N}_{+} it holds that:

(2r+12)=r(2r+1)\binom{2r+1}{2}=r\cdot\left(2r+1\right) (9)

and

(2r2)=rr+r(r1)\binom{2r}{2}=r\cdot r+r(r-1) (10)

The above expression (9) means that by adopting n=2r+1n=2r+1 as the number of vertices in a graph, we may assign exactly rr incoming edges to every vertex cc in VV when nn is odd. Similarly (10), providing that n=2rn=2r is even, we can assign rr incoming edges to rr vertices and r1r-1 incoming edges to the next rr vertices.

Theorem 2

The number of inconsistent triads in the t-graph T=(V,Ed)T=(V,E_{d}) is maximal i.e. T𝒯nt¯T\in\overline{\mathscr{T}_{n}^{t}} if and only if

  1. 1.

    for every cc in VV degin(c)=r\text{deg}_{\textit{in}}(c)=r when n=2r+1n=2r+1

  2. 2.

    there are rr vertices c1,,crc_{1},\ldots,c_{r} in VV such that degin(ci)=r\text{deg}_{\textit{in}}(c_{i})=r, and rr vertices cr+1,,cnc_{r+1},\ldots,c_{n} such that degin(cj)=r1\text{deg}_{\textit{in}}(c_{j})=r-1, where n=2rn=2r and 1ir<jn1\leq i\leq r<j\leq n.

Proof 2

To prove the theorem, it is enough to show that (4) is minimized by the distributions of the vertex degrees mentioned in the thesis of the theorem. Let us suppose that n=2r+1n=2r+1 and (4) is minimal but not all the vertices have input degrees equal rr. Thus, there must be at least one ciVc_{i}\in V such that degin(ci)r\text{deg}_{\textit{in}}(c_{i})\neq r. Let us suppose that degin(ci)=p>r\text{deg}_{\textit{in}}(c_{i})=p>r (the second case is symmetric). Formulae (6) and (9) imply that there must also be at least one cjVc_{j}\in V such that degin(cj)=q<r\text{deg}_{\textit{in}}(c_{j})=q<r . Therefore we can decrease pp and increase qq by one without changing the sum (6) just by replacing cjcic_{j}\rightarrow c_{i} to cicjc_{i}\rightarrow c_{j}. Since p+q=zp+q=z and zz is constant, the sum of consistent triads introduced by cic_{i} and cjc_{j} (Theorem 1) is given as:

(p2)+(q2)=(p2)+(zp2)=p(pz)+z(z1)2\binom{p}{2}+\binom{q}{2}=\binom{p}{2}+\binom{z-p}{2}=p(p-z)+\frac{z(z-1)}{2} (11)

Since z(z1)/2z(z-1)/2 is constant let

f(p)=dfp(pz)+z(z1)2f(p)\overset{\textit{df}}{=}p(p-z)+\frac{z(z-1)}{2} (12)

The value f(p)f(p) decreases alongside a decreasing pp if

f(p)f(p1)>0f(p)-f(p-1)>0 (13)

which is true if and only if

2p>(z1)2p>(z-1) (14)

Since p>qp>q and p+q=zp+q=z the last statement is true, which implies that, by decreasing degin(ci)\text{deg}_{\textit{in}}(c_{i}) and increasing degin(cj)\text{deg}_{\textit{in}}(c_{j}) by one, we can decrease the expression (4). This fact is contrary to the assumption that (4) is minimal, but not all the vertices have input degrees equal rr.

The proof for n=2rn=2r is analogous to the case when n=2r+1n=2r+1 except the fact that as cic_{i} we should adopt such a vertex for which degin(ci)r\text{deg}_{\textit{in}}(c_{i})\neq r and degin(ci)r1\text{deg}_{\textit{in}}(c_{i})\neq r-1. Note that there must be one if we reject the second statement of the thesis and, at the same time, we claim that (4) is minimal. \qed

The proof of (Theorem 2) also suggests an algorithm that converts any tournament graph into a graph with the maximal number of inconsistent triads. In every step of such an algorithm, it is enough to find a vertex cic_{i} whose input degree differs from rr (when nn is odd) or differs from rr and r1r-1 (when nn is even) and decreases (or increases) its input degree in parallel with increases (or decreases) in the input degree of cjc_{j}. If it is impossible to find such a pair (ci,cjc_{i},c_{j}) this means that the graph is maximal. The algorithm satisfies the stop condition as with every iteration the number of inconsistent triads in a graph gets higher whilst the total number of triads in a graph is bounded by (n3).\binom{n}{3}.

Kendall and Babington Smith [26] suggest a way of constructing the most inconsistent graph that brings to mind circulant graphs [33]. Namely, first add to a graph the cycle c1c2c3cnc1c_{1}\rightarrow c_{2}\rightarrow c_{3}\rightarrow\ldots\rightarrow c_{n}\rightarrow c_{1} then the cycle c1c3c5cnc2c_{1}\rightarrow c_{3}\rightarrow c_{5}\rightarrow\ldots\rightarrow c_{n}\rightarrow c_{2}\rightarrow\ldots if nn is even or two cycles c1c3cn1c1c_{1}\rightarrow c_{3}\rightarrow\ldots\rightarrow c_{n-1}\rightarrow c_{1} and c2c4cnc2c_{2}\rightarrow c_{4}\rightarrow\ldots\rightarrow c_{n}\rightarrow c_{2} if nn is odd, and so on. Adding cycles with more and more skips needs to be continued until the insertion of all (n2)\binom{n}{2} edges. An example of the maximally inconsistent graphs TX𝒯6tT_{X}\in\mathscr{T}_{6}^{t} and TY𝒯7tT_{Y}\in\mathscr{T}_{7}^{t} can be found in (Fig. 6). Those graphs correspond to the matrices XX and YY (15).

X=(011111101111110111111011111101111110)Y=(0111111101111111011111110111111101111111011111110)X=\left(\begin{array}[]{cccccc}0&1&1&1&-1&-1\\ -1&0&1&1&1&-1\\ -1&-1&0&1&1&1\\ -1&-1&-1&0&1&1\\ 1&-1&-1&-1&0&1\\ 1&1&-1&-1&-1&0\end{array}\right)\,\,\,\,\,Y=\left(\begin{array}[]{ccccccc}0&1&1&1&-1&-1&-1\\ -1&0&1&1&1&-1&-1\\ -1&-1&0&1&1&1&-1\\ -1&-1&-1&0&1&1&1\\ 1&-1&-1&-1&0&1&1\\ 1&1&-1&-1&-1&0&1\\ 1&1&1&-1&-1&-1&0\end{array}\right) (15)

The Theorem 2 clearly indicates the form of the most inconsistent tournament graph, but it does not specify the number of inconsistent triads in such a graph. This number, however, can be easily computed using the formula (2). To see that the results obtained so far are consistent with (2) as defined in [26] let us prove the following theorem.

Theorem 3

For every t-graph T=(V,Ed)T=(V,E_{d}) where T𝒯nt¯T\in\overline{\mathscr{T}_{n}^{t}}, n3n\geq 3 which has the form defined by the Theorem 2 it holds that

|T|i=(n)\left|T\right|_{i}=\mathcal{I}(n) (16)
Proof 3

According to (5)

|T|i=(2r+13)cV(degin(c)2)\left|T\right|_{i}=\binom{2r+1}{3}-\sum_{c\in V}\binom{\text{deg}_{\textit{in}}(c)}{2} (17)

Let n=2r+1n=2r+1 and r+r\in\mathbb{N}_{+}. Then due to (Theorem 2)

|T|i=(2r+13)((r2)++(r2))2r+1\left|T\right|_{i}=\binom{2r+1}{3}-\underset{2r+1}{\left(\underbrace{\binom{r}{2}+\ldots+\binom{r}{2}}\right)} (18)
|T|i=r(2r1)(2r+1)3(r1)r(2r+1)2\left|T\right|_{i}=\frac{r(2r-1)(2r+1)}{3}-\frac{(r-1)r(2r+1)}{2} (19)
|T|i=r(2r2+3r+1)6=(2r+1)3(2r+1)24\left|T\right|_{i}=\frac{r\left(2r^{2}+3r+1\right)}{6}=\frac{(2r+1)^{3}-(2r+1)}{24} (20)
|T|i=(2r+1)3(2r+1)24=n3n24=(n)\left|T\right|_{i}=\frac{(2r+1)^{3}-(2r+1)}{24}=\frac{n^{3}-n}{24}=\mathcal{I}(n) (21)

Similarly, when n=2rn=2r and r+r\in\mathbb{N}_{+}. Then due to (Th. 2)

|T|i=(2r3)((r2)++(r2))𝑟((r12)++(r12))𝑟\left|T\right|_{i}=\binom{2r}{3}-\underset{r}{\left(\underbrace{\binom{r}{2}+\ldots+\binom{r}{2}}\right)}-\underset{r}{\left(\underbrace{\binom{r-1}{2}+\ldots+\binom{r-1}{2}}\right)} (22)
|T|i=r(2r2)(2r1)3(r1)r22(r2)(r1)r2\left|T\right|_{i}=\frac{r(2r-2)(2r-1)}{3}-\frac{(r-1)r^{2}}{2}-\frac{(r-2)(r-1)r}{2} (23)
|T|i=r(r21)3=(2r)34(2r)24=n34n24=(n)\left|T\right|_{i}=\frac{r\left(r^{2}-1\right)}{3}=\frac{(2r)^{3}-4(2r)}{24}=\frac{n^{3}-4n}{24}=\mathcal{I}(n) (24)

which completes the proof of the theorem. \qed

Refer to caption
(a) TX𝒯6t¯T_{X}\in\overline{\mathscr{T}_{6}^{t}}
Refer to caption
(b) TY𝒯7t¯T_{Y}\in\overline{\mathscr{T}_{7}^{t}}
Figure 6: An example of the most inconsistent tournament graphs with six and seven vertices

The above theorem shows that the number of inconsistent triads in the tournament graph in which input degrees of their vertices are most evenly distributed is expressed by the formula provided by Kendall and Babington Smith [26]. This result, of course, is the natural consequence of the fact that such a graph is maximal as regards the number of inconsistent triads, as proven in (Theorem 2).

4 Properties of the most inconsistent set of preferences with ties

The graph representation of the set of paired comparisons with ties is the gt-graph. As it may contain two different types of edges, and hence, essentially more different kinds of triads (Fig. 4), the problem of finding the maximum number of inconsistent triads in such a graph is appropriately more difficult. The reasoning presented in this section is composed of three parts. In the first part, the properties of the gt-graph are discussed. Next, the maximally inconsistent gt-graph is proposed, and then, we prove that the proposed graph is indeed maximal with respect to the number of inconsistent triads.

The most straightforward example of the fully consistent gt-graph is a complete undirected graph of nn vertices (undirected nn-clique). It contains only undirected edges, thus all the triads contained in it are type CT0\textit{CT}_{0}. At first glance it seems that by successive replacing of undirected edges into directed ones we can make the graph more and more inconsistent. At the beginning, we will try to choose isolated edges i.e. those which are not adjacent to any directed edge. It is easy to observe that such edges alone cover n2n-2 different triads. Hence, by replacing isolated undirected edges into directed ones we increase the number of inconsistent triads by n2n-2. Unfortunately, we can insert at most n2\left\lfloor\frac{n}{2}\right\rfloor isolated directed edges (every isolated edge needs two vertices out of nn only for itself). Then we have to replace not isolated undirected edges into directed ones, and finally, we decide to make such replacements, which results in increasing the number of inconsistent triads in a graph, but also increases input degrees for some vertices. After several experiments carried out according to the above scheme, one may observe that it is not easy to choose the edge to replace. However, studying the above greedy algorithm is not useless. The first thing to notice is the fact that every gt-graph containing more than a certain number of edges should always have some number of consistent triads. Another finding is the observation that when constructing a maximal gt-graph one should strive to put at least one directed edge in each triad. Otherwise, the triad remains consistent, increasing the chance that the resulting gt-graph is not maximal. Both intuitive observations lead to the conclusion that the construction of the maximal gt-graph is a matter of finding a balance between too many directed edges resulting in the appearance of consistent triads of the type CT2a\textit{CT}_{2a} and CT2b\textit{CT}_{2b} and too few directed edges resulting in the existence of consistent triads of the type CT0\textit{CT}_{0}. Let us try to formulate this conclusion in a more formal way.

Theorem 4

Each gt-graph G𝒯n,mgG\in\mathscr{T}_{n,m}^{g} contains at least 𝒞(n,m)\mathcal{C}(n,m) consistent triads of the type CT2a\textit{CT}_{2a} or CT3\textit{CT}_{3} where

𝒞(n,m)=12mn(2mnmnn)\mathcal{C}(n,m)=\frac{1}{2}\left\lfloor\frac{m}{n}\right\rfloor\left(2m-n\left\lfloor\frac{m}{n}\right\rfloor-n\right) (25)
Proof 4

The theorem is a straightforward consequence of (Theorem 1 and 2). The first of them estimates the number of triads CT2a\textit{CT}_{2a} or CT3\textit{CT}_{3} for a given vertex, whilst the second one shows that the sum of triads CT2a\textit{CT}_{2a} or CT3\textit{CT}_{3} introduced by the vertices is minimal when the input degrees are evenly distributed. As we would like to determine the lower bound for the number of consistent triads in GG, we therefore have to assume that the input degrees are evenly distributed. Since there are mm directed edges in GG (it occurs that mm times one alternative is better than the other), then the sum of input degrees of vertices is mm. Therefore, adopting an even distribution postulate, every vertex has at least mn\left\lfloor\frac{m}{n}\right\rfloor victories assigned (their input degree is at least mn\left\lfloor\frac{m}{n}\right\rfloor). Of course, the input degree of some of them may be larger by one. In other words, in the considered gt-graph there are pp vertices whose input degree is mn\left\lfloor\frac{m}{n}\right\rfloor and npn-p vertices whose input degree might be mn+1\left\lfloor\frac{m}{n}\right\rfloor+1. According to (Theorem 1) such a graph has at least 𝒞(n,m)\mathcal{C}(n,m) consistent triads, where

𝒞(n,m)=p(mn2)+(np)(mn+12)\mathcal{C}(n,m)=p\binom{\left\lfloor\frac{m}{n}\right\rfloor}{2}+(n-p)\binom{\left\lfloor\frac{m}{n}\right\rfloor+1}{2} (26)

We know that the sum of input degrees of vertices is mm, so

pmn+(np)(mn+1)=mp\left\lfloor\frac{m}{n}\right\rfloor+(n-p)\left(\left\lfloor\frac{m}{n}\right\rfloor+1\right)=m (27)

Hence,

p=n(mn+1)mp=n\left(\left\lfloor\frac{m}{n}\right\rfloor+1\right)-m (28)

Therefore (26) can be written as

𝒞(n,m)=(n(mn+1)m)(mn2)+(mnmn)(mn+12)\mathcal{C}(n,m)=\left(n\cdot\left(\left\lfloor\frac{m}{n}\right\rfloor+1\right)-m\right)\cdot\binom{\left\lfloor\frac{m}{n}\right\rfloor}{2}+\left(m-n\cdot\left\lfloor\frac{m}{n}\right\rfloor\right)\cdot\binom{\left\lfloor\frac{m}{n}\right\rfloor+1}{2} (29)

which, after appropriate transformations leads to (25). \qed

The immediate consequence of (Lemma 4) is the following corollary:

Corollary 2

Each gt-graph G𝒯n,mgG\in\mathscr{T}_{n,m}^{g} contains at most

(n3)𝒞(n,m)\binom{n}{3}-\mathcal{C}(n,m) (30)

inconsistent triads.

For the purpose of further consideration, let us denote by 𝒯\mathcal{T} a set of all the triads in the gt-graph and by 𝒯i\mathcal{T}_{i} - a set of triads covered by i=0,,3i=0,\ldots,3 directed edges. For brevity, we denote the sum 𝒯i𝒯j\mathcal{T}_{i}\cup\mathcal{T}_{j} as 𝒯i,j\mathcal{T}_{i,j}. In particular, it holds that 𝒯=𝒯0𝒯1𝒯2,3\mathcal{T}=\mathcal{T}_{0}\cup\mathcal{T}_{1}\cup\mathcal{T}_{2,3}. This allows the formulation of a quite straightforward but useful observation.

Corollary 3

As every two sets out of 𝒯0,,𝒯3\mathcal{T}_{0},\ldots,\mathcal{T}_{3} are mutually disjoint, then for every gt-graph G𝒯ngG\in\mathscr{T}_{n}^{g} it is true that

(n3)=|𝒯0|+|𝒯1|+|𝒯2,3|\binom{n}{3}=\left|\mathcal{T}_{0}\right|+\left|\mathcal{T}_{1}\right|+\left|\mathcal{T}_{2,3}\right| (31)

Another important piece of information about the gt-graph follows from the number of undirected edges adjacent to particular vertices. Such edges may form the triads CT0\textit{CT}_{0} but may also form the triads IT1\textit{IT}_{1} (Fig. 7). This observation allows the number of both triad types to be estimated.

Refer to caption
Figure 7: Vertex cc where degun(c)=4\text{deg}_{\textit{un}}(c)=4 is contained by 66 different triads. Three of them are CT0\textit{CT}_{0} (dashed edges), the other three are IT1\textit{IT}_{1} (dotted edges).
Lemma 1

For every gt-graph G𝒯ngG\in\mathscr{T}_{n}^{g} where G=(V,Eu,Ed)G=(V,E_{u},E_{d}) it holds that

cV(degun(c)2)=3|𝒯0|+|𝒯1|\sum_{c\in V}\binom{\text{deg}_{\textit{un}}(c)}{2}=3\left|\mathcal{T}_{0}\right|+\left|\mathcal{T}_{1}\right| (32)
Proof 5

Let c1c,,ckcc_{1}\,\text{---}\,c,\dots,c_{k}\,\text{---}\,c be the undirected edges in EuE_{u} adjacent to some cVc\in V. There are (k2)\binom{k}{2} triads that contain cc. The type of triad depends on the edge (ci,cj)(c_{i},c_{j}). If (ci,cj)Eu(c_{i},c_{j})\in E_{u} then the triad belongs to 𝒯0\mathcal{T}_{0} whilst if (ci,cj)Ed(c_{i},c_{j})\in E_{d} then the triad is in 𝒯1\mathcal{T}_{1}. While calculating the sum cV(degun(c)2)\sum_{c\in V}\binom{\text{deg}_{\textit{un}}(c)}{2} every uncovered triad is counted three times as there are three vertices adjacent to two undirected edges forming the triad. For the same reason, the triads covered by one directed edge are taken into account only once. \qed

Similarly as before, we try to generalize the result (32) to all the graphs that have mm directed edges.

Lemma 2

For each gt-graph G𝒯n,mgG\in\mathscr{T}_{n,m}^{g} where G=(V,Eu,Ed)G=(V,E_{u},E_{d}) it holds that

𝒟(n,m)3|𝒯0|+|𝒯1|\mathcal{D}(n,m)\leq 3\left|\mathcal{T}_{0}\right|+\left|\mathcal{T}_{1}\right| (33)

where

𝒟(n,m)=12(n2mn2)(n2+n(2mn1)4m)\mathcal{D}(n,m)=\frac{1}{2}\left(n-\left\lfloor\frac{2m}{n}\right\rfloor-2\right)\left(n^{2}+n\left(\left\lfloor\frac{2m}{n}\right\rfloor-1\right)-4m\right) (34)
Proof 6

Similarly as in (Lemma 4) the left side of (32) is minimal if undirected degrees are evenly distributed among the vertices. As for every cVc\in V it holds that degun(c)=deg(c)degin(c)degout(c)\text{deg}_{\textit{un}}(c)=\text{deg}(c)-\text{deg}_{in}(c)-\text{deg}_{out}(c) then degun(c)=n1(degin(c)+degout(c))\text{deg}_{\textit{un}}(c)=n-1-\left(\text{deg}_{in}(c)+\text{deg}_{out}(c)\right). Thus, undirected degrees of vertices are evenly distributed if and only if the number of directed edges adjacent to the vertices are evenly distributed.

It is easy to see that in a gt-graph having mm directed edges the sum of input and output degrees is 2m2m. Thus, for every graph that minimizes the left side of (32) it holds that:

p2mn+(np)(2mn+1)=2mp\left\lfloor\frac{2m}{n}\right\rfloor+(n-p)\left(\left\lfloor\frac{2m}{n}\right\rfloor+1\right)=2m (35)

The above equality means in particular that in such a graph there are pnp\leq n vertices c1,,cpc_{1},\ldots,c_{p} for which degin(ci)+degout(ci)=2mn\text{deg}_{\textit{in}}(c_{i})+\text{deg}_{out}(c_{i})=\left\lfloor\frac{2m}{n}\right\rfloor and 1ip1\leq i\leq p, and npn-p vertices cp+1,,cnc_{p+1},\ldots,c_{n} for which degin(cj)+degout(cj)=2mn+1\text{deg}_{\textit{in}}(c_{j})+\text{deg}_{out}(c_{j})=\left\lfloor\frac{2m}{n}\right\rfloor+1 and p+1jnp+1\leq j\leq n. This statement also implies that in every graph that minimizes the left side of (32) there are pp vertices c1,,cpc_{1},\ldots,c_{p} for which degun(ci)=n12mn\text{deg}_{un}(c_{i})=n-1-\left\lfloor\frac{2m}{n}\right\rfloor and 1ip1\leq i\leq p, and also npn-p vertices cp+1,,cnc_{p+1},\ldots,c_{n} for which degun(cj)=n22mn\text{deg}_{un}(c_{j})=n-2-\left\lfloor\frac{2m}{n}\right\rfloor and p+1jnp+1\leq j\leq n.

Thus, for every G𝒯n,mgG\in\mathscr{T}_{n,m}^{g} the lower bound of 3|𝒯0|+|𝒯1|3\left|\mathcal{T}_{0}\right|+\left|\mathcal{T}_{1}\right| is:

𝒟(n,m)=p(n12mn2)+(np)(n22mn2)\mathcal{D}(n,m)=p\binom{n-1-\left\lfloor\frac{2m}{n}\right\rfloor}{2}+(n-p)\binom{n-2-\left\lfloor\frac{2m}{n}\right\rfloor}{2} (36)

Since from (35) pp equals

p=n(2mn+1)2mp=n\left(\left\lfloor\frac{2m}{n}\right\rfloor+1\right)-2m (37)

Thus,

𝒟(n,m)=\displaystyle\mathcal{D}(n,m)= (n(2mn+1)2m)(n12mn2)\displaystyle\left(n\left(\left\lfloor\frac{2m}{n}\right\rfloor+1\right)-2m\right)\binom{n-1-\left\lfloor\frac{2m}{n}\right\rfloor}{2}
+(2mn2mn)(n22mn2)\displaystyle+\left(2m-n\left\lfloor\frac{2m}{n}\right\rfloor\right)\binom{n-2-\left\lfloor\frac{2m}{n}\right\rfloor}{2} (38)

The above expression simplifies to

𝒟(n,m)=12(2mn+n2)(n2mn4m+(n1)n)\mathcal{D}(n,m)=\frac{1}{2}\left(-\left\lfloor\frac{2m}{n}\right\rfloor+n-2\right)\left(n\left\lfloor\frac{2m}{n}\right\rfloor-4m+(n-1)n\right) (39)

which completes the proof of the theorem. \qed

Through the analysis of the degree of vertices we can also estimate the value |𝒯2,3|\left|\mathcal{T}_{2,3}\right|.

Lemma 3

For every gt-graph G𝒯ngG\in\mathscr{T}_{n}^{g} where G=(V,Eu,Ed)G=(V,E_{u},E_{d}) it holds that

13cV(degin(c)+degout(c)2)|𝒯2,3|\frac{1}{3}\sum_{c\in V}\binom{\text{deg}_{\textit{in}}(c)+\text{deg}_{out}(c)}{2}\leq\left|\mathcal{T}_{2,3}\right| (40)
Proof 7

Let c1c,cc2,,ckcc_{1}\rightarrow c,c\rightarrow c_{2},\dots,c_{k}\rightarrow c be the directed edges in EdE_{d} adjacent to some cVc\in V. There are (k2)\binom{k}{2} triads that contain cc where k=degin(c)+degout(c)k=\text{deg}_{\textit{in}}(c)+\text{deg}_{\textit{out}}(c), which are covered by two or three directed edges. While calculating the sum cV(degin(c)+degout(c)2)\sum_{c\in V}\binom{\text{deg}_{\textit{in}}(c)+\text{deg}_{\textit{out}}(c)}{2} triads covered by two directed edges are counted once, whilst all the triads covered by three directed edges are counted three times. In the worst case scenario, all the considered triads are covered by three directed edges. Thus, 13cV(degin(c)+degout(c)2)\frac{1}{3}\sum_{c\in V}\binom{\text{deg}_{\textit{in}}(c)+\text{deg}_{\textit{out}}(c)}{2} is the lower bound for |𝒯2,3|\left|\mathcal{T}_{2,3}\right|. This observation completes the proof. \qed

Similarly as before, let us extend the above Lemma to all gt-graphs that have nn vertices and mm directed edges.

Lemma 4

For each gt-graph G𝒯n,mgG\in\mathscr{T}_{n,m}^{g} where G=(V,Eu,Ed)G=(V,E_{u},E_{d}) it holds that

(n,m)|𝒯2,3|\mathcal{E}(n,m)\leq\left|\mathcal{T}_{2,3}\right| (41)

where

(n,m)=162mn(4mn(2mn+1))\mathcal{E}(n,m)=\frac{1}{6}\left\lfloor\frac{2m}{n}\right\rfloor\left(4m-n\left(\left\lfloor\frac{2m}{n}\right\rfloor+1\right)\right) (42)
Proof 8

Similarly as in (Lemma 2) the left side of (40) is minimal if the sum of input and output degrees of the vertices are evenly distributed. It is easy to see that in a gt-graph that has mm directed edges the sum of input and output degrees is 2m2m. Thus, for every graph that minimizes the left side of (40) it holds that (35). This implies that in the gt-graph which minimizes the left side of (40) there should be pp vertices adjacent to 2mn\left\lfloor\frac{2m}{n}\right\rfloor directed edges and npn-p vertices adjacent to 2mn+1\left\lfloor\frac{2m}{n}\right\rfloor+1 directed edges. Based on (40) we conclude that

(n,m)=13(p(2mn2)+(np)(2mn+12))\mathcal{E}(n,m)=\frac{1}{3}\left(p\binom{\left\lfloor\frac{2m}{n}\right\rfloor}{2}+\left(n-p\right)\binom{\left\lfloor\frac{2m}{n}\right\rfloor+1}{2}\right) (43)

Applying (37) we obtain

(n,m)=\displaystyle\mathcal{E}(n,m)= 13{[n(2mn+1)2m](2mn2)\displaystyle\frac{1}{3}\left\{\left[n\left(\left\lfloor\frac{2m}{n}\right\rfloor+1\right)-2m\right]\binom{\left\lfloor\frac{2m}{n}\right\rfloor}{2}\right.
+[n(n(2mn+1)2m)](2mn+12)}\displaystyle+\left.\left[n-\left(n\left(\left\lfloor\frac{2m}{n}\right\rfloor+1\right)-2m\right)\right]\binom{\left\lfloor\frac{2m}{n}\right\rfloor+1}{2}\right\} (44)

Hence,

(n,m)=13{(n2mn+n2m)(2mn2)+(2mn2mn)(2mn+12)}\mathcal{E}(n,m)=\frac{1}{3}\left\{\left(n\left\lfloor\frac{2m}{n}\right\rfloor+n-2m\right)\binom{\left\lfloor\frac{2m}{n}\right\rfloor}{2}\right.\left.+\left(2m-n\left\lfloor\frac{2m}{n}\right\rfloor\right)\binom{\left\lfloor\frac{2m}{n}\right\rfloor+1}{2}\right\} (45)

The above equation simplifies to

(n,m)=162mn(4mn2mnn)\mathcal{E}(n,m)=\frac{1}{6}\left\lfloor\frac{2m}{n}\right\rfloor\left(4m-n\left\lfloor\frac{2m}{n}\right\rfloor-n\right) (46)

Which completes the proof of the Lemma. \qed

The Corollary (3) and Lemmas (1 - 4) allow us to estimate the minimal number of consistent triads which are not covered by any directed edge.

Theorem 5

For each gt-graph G𝒯n,mgG\in\mathscr{T}_{n,m}^{g} where G=(V,Eu,Ed)G=(V,E_{u},E_{d}) holds that

(n,m)|𝒯0|\mathcal{F}(n,m)\leq\left|\mathcal{T}_{0}\right| (47)

where

(n,m)=12(𝒟(n,m)+(n,m)(n3))\mathcal{F}(n,m)=\frac{1}{2}\left(\mathcal{D}(n,m)+\mathcal{E}(n,m)-\binom{n}{3}\right) (48)

which is equivalent to

(n,m)=16(2n2mn2+(8m2n)2mn+(n2)((n1)n6m))\mathcal{F}(n,m)=\frac{1}{6}\left(-2n\left\lfloor\frac{2m}{n}\right\rfloor^{2}+(8m-2n)\left\lfloor\frac{2m}{n}\right\rfloor+(n-2)((n-1)n-6m)\right) (49)
Proof 9

According to (Corollary 3)

(n3)=|𝒯0|+|𝒯1|+|𝒯2,3|\binom{n}{3}=\left|\mathcal{T}_{0}\right|+\left|\mathcal{T}_{1}\right|+\left|\mathcal{T}_{2,3}\right| (50)

Due to (Lemma 2) it holds that

𝒟(n,m)3|𝒯0||𝒯1|\mathcal{D}(n,m)-3\left|\mathcal{T}_{0}\right|\leq\left|\mathcal{T}_{1}\right| (51)

Therefore it is true that

(n3)|𝒯0|+(𝒟(n,m)3|𝒯0|)+|𝒯2,3|=𝒟(n,m)+|𝒯2,3|2|𝒯0|\binom{n}{3}\geq\left|\mathcal{T}_{0}\right|+\left(\mathcal{D}(n,m)-3\left|\mathcal{T}_{0}\right|\right)+\left|\mathcal{T}_{2,3}\right|=\mathcal{D}(n,m)+\left|\mathcal{T}_{2,3}\right|-2\left|\mathcal{T}_{0}\right| (52)

As we know (Lemma 4) that (n,m)|𝒯2,3|\mathcal{E}(n,m)\leq\left|\mathcal{T}_{2,3}\right| it is true that

(n3)𝒟(n,m)+(n,m)2|𝒯0|\binom{n}{3}\geq\mathcal{D}(n,m)+\mathcal{E}(n,m)-2\left|\mathcal{T}_{0}\right| (53)

Hence,

|𝒯0|12(𝒟(n,m)+(n,m)(n3))\left|\mathcal{T}_{0}\right|\geq\frac{1}{2}\left(\mathcal{D}(n,m)+\mathcal{E}(n,m)-\binom{n}{3}\right) (54)

which, after simplifying, leads to

|𝒯0|16((8m2n)2mn2n2mn2+(n2)((n1)n6m))\left|\mathcal{T}_{0}\right|\geq\frac{1}{6}\left((8m-2n)\left\lfloor\frac{2m}{n}\right\rfloor-2n\left\lfloor\frac{2m}{n}\right\rfloor^{2}+(n-2)((n-1)n-6m)\right) (55)

Which completes the proof of the theorem. \qed

One can easily check that for fixed nn the values of (n,m)\mathcal{F}(n,m) decrease to 0 then become negative, whilst |𝒯0|\left|\mathcal{T}_{0}\right| is always a positive integer. Hence, the inequality (47) can also be written as:

max{0,(n,m)}|𝒯0|\max\{0,\left\lceil\mathcal{F}(n,m)\right\rceil\}\leq\left|\mathcal{T}_{0}\right| (56)

Both theorems 4 and 5 provide estimations for the minimal number of consistent triads in a gt-graph. Theorem 4 provides the lower bound 𝒞(n,m)\mathcal{C}(n,m) for the number of triads CT2a\textit{CT}_{2a} and CT3\textit{CT}_{3}, whilst Theorem 5 provides the lower bound for the number of consistent triads CT0\textit{CT}_{0}. Hence, the number of consistent triads in the gt-graph T𝒯n,mgT\in\mathscr{T}_{n,m}^{g} cannot be lower than 𝒢(n,m)\mathcal{G}(n,m) where

𝒢(n,m)=df𝒞(n,m)+max{0,(n,m)}\mathcal{G}(n,m)\stackrel{{\scriptstyle\textit{df}}}{{=}}\mathcal{C}(n,m)+\max\{0,\left\lceil\mathcal{F}(n,m)\right\rceil\} (57)

Of course, its number could be even higher as we do not care about triads CT2b\textit{CT}_{2b}. The immediate consequence of the above expression is the observation that the number of inconsistent triads in the gt-graph cannot be higher than (n,m)\mathcal{H}(n,m) where:

(n,m)=df(n3)𝒢(n,m)\mathcal{H}(n,m)\stackrel{{\scriptstyle\textit{df}}}{{=}}\binom{n}{3}-\mathcal{G}(n,m) (58)

In particular, the most inconsistent gt-graph G𝒯ng¯G\in\overline{\mathscr{T}_{n}^{g}} with some fixed n3n\geq 3 can have as many inconsistent triads as the maximal value of the upper bounding function (n,m)\mathcal{H}(n,m), i.e.

|G|imax0m(n2)(n,m)\left|G\right|_{i}\leq\max_{0\leq m\leq\binom{n}{2}}\mathcal{H}(n,m) (59)

Reversely, a gt-graph G𝒯ngG\in\mathscr{T}_{n}^{g}, which fits that maximum must be maximal i.e. wherever |G|i=max0m(n2)(n,m)\left|G\right|_{i}=\max_{0\leq m\leq\binom{n}{2}}\mathcal{H}(n,m) then G𝒯ng¯G\in\overline{\mathscr{T}_{n}^{g}}. Through the experimental analysis of the upper bounding function (n,m)\mathcal{H}(n,m) we can see that for every fixed nn it has one distinct maximum (Fig. 8).

Refer to caption
(a) (n,m)\mathcal{H}(n,m) for n=10n=10 and m=0,,(102)m=0,\ldots,\binom{10}{2}
Refer to caption
(b) (n,m)\mathcal{H}(n,m) for n=3,,20n=3,\ldots,20 and m=0,,(n2)m=0,\ldots,\binom{n}{2}
Figure 8: The upper bounding function (n,m)\mathcal{H}(n,m)

In the next section we propose the graph which fits the maximum of (n,m)\mathcal{H}(n,m) and formally prove indispensable theorems.

5 The most inconsistent set of preferences with ties

In order to find the maximal gt-graph, let us try to look at the function (n,m)\mathcal{H}(n,m) and the two functions 𝒞(n,m)\mathcal{C}(n,m) and (n,m)\mathcal{F}(n,m) of which it is composed (Fig. 9). 𝒞(n,m)\mathcal{C}(n,m) determines the minimal number of consistent triads covered by more than one directed edge. The more directed edges, the greater the number of consistent triads in a graph. Hence, for some small number of directed edges 𝒞\mathcal{C} equals 0, then slowly begins to grow. The function (n,m)\mathcal{F}(n,m) indicates the minimal number of triads not covered by any directed edge. Those triads are also consistent. With the increase in the number of directed edges, their quantity decreases and eventually reaches 0. Since for the positive ordinates \mathcal{F} decreases faster than 𝒞\mathcal{C} grows, then the function \mathcal{H} reaches the maximum when \mathcal{F} becomes 0. This indicates that in the optimal gt-graph all the triads should be covered by at least one directed edge. This requires the introduction of so many directed edges that the number of triads will become consistent thereby. However, the slope of both functions \mathcal{F} and 𝒞\mathcal{C} indicates that it is more important to cover each triad CT0\textit{CT}_{0} than not to create too many consistent triads CT2a,CT2b\textit{CT}_{2a},\textit{CT}_{2b} or CT3\textit{CT}_{3}.

Refer to caption
Figure 9: Bounding of consistent and inconsistent triads for gt-graph with n=20n=20 vertices.

The considerations in the previous section also indicate that directed edges should be evenly distributed. Otherwise, the gt-graph may not be maximal. The above somewhat intuitive considerations, based on the viewing functions in the figure, lead to the definition of the most inconsistent gt-graph.

Definition 11

A double tournament graph (hereinafter referred to as dt-graph), is a gt-graph G=(V1V2,Ed1Ed2,Eu)G=(V_{1}\cup V_{2},E_{d_{1}}\cup E_{d_{2}},E_{u}) such that (V1,Ed1)(V_{1},E_{d_{1}}) and (V2,Ed2)(V_{2},E_{d_{2}}) are t-graphs, where V1V2=V_{1}\cap V_{2}=\emptyset and Eu={{c,d}:cV1dV2}E_{u}=\{\{c,d\}\,\,:\,\,c\in V_{1}\,\wedge\,d\in V_{2}\}.

It is easy to observe that in every dt-graph all triads are covered by directed edges (Lemma 6). Thus, for every dt-graph it holds that max{0,(n,m)}=0\max\{0,\left\lceil\mathcal{F}(n,m)\right\rceil\}=0. This does not guarantee, however, the minimality of 𝒞(n,m)\mathcal{C}(n,m). Let us propose an improved version of the dt-graph, which, as will be shown later, indeed contains the maximal number of inconsistent triads.

Proposition 1

The dt-graph T=(V1V2,Ed1Ed2,Eu)T=(V_{1}\cup V_{2},E_{d_{1}}\cup E_{d_{2}},E_{u}) is the maximal dt-graph if (V1,Ed1)(V_{1},E_{d_{1}}) and (V2,Ed2)(V_{2},E_{d_{2}}) are maximal t-graphs where |V1|=n2\left|V_{1}\right|=\left\lfloor\frac{n}{2}\right\rfloor and |V2|=n2\left|V_{2}\right|=\left\lceil\frac{n}{2}\right\rceil.

Refer to caption
(a) GX𝒥6g¯G_{X^{*}}\in\overline{\mathcal{\mathcal{J}}_{6}^{g}}
Refer to caption
(b) GY𝒥7g¯G_{Y^{*}}\in\overline{\mathcal{\mathcal{J}}_{7}^{g}}
Figure 10: Two examples of the maximal dt-graphs (undirected edges were dotted).

In other words, we suppose that the dt-graph with nn vertices composed of two maximal t-graphs whose numbers of vertices are identical (when nn is even) or differ by one (when nn is odd) is maximal. Examples of such maximal dt-graph candidates can be found at (Fig. 10). The matrices that correspond to the graphs GXG_{X^{*}} and GYG_{Y^{*}} are given as (60).

X=(011000101000110000000011000101000110)Y=(0111000101100011010001110000000001100001010000110)X^{*}=\left(\begin{array}[]{cccccc}0&-1&1&0&0&0\\ 1&0&-1&0&0&0\\ -1&1&0&0&0&0\\ 0&0&0&0&-1&1\\ 0&0&0&1&0&-1\\ 0&0&0&-1&1&0\end{array}\right)\,\,\,\,Y^{*}=\left(\begin{array}[]{ccccccc}0&-1&1&1&0&0&0\\ 1&0&-1&-1&0&0&0\\ -1&1&0&-1&0&0&0\\ -1&1&1&0&0&0&0\\ 0&0&0&0&0&1&-1\\ 0&0&0&0&-1&0&1\\ 0&0&0&0&1&-1&0\end{array}\right) (60)

Let us denote the number of directed edges in a maximal dt-graph candidate by 𝒳(n)\mathcal{X}(n). It is easy to see that:

𝒳(n)=(n22)+(n22)\mathcal{X}(n)=\binom{\left\lfloor\frac{n}{2}\right\rfloor}{2}+\binom{\left\lceil\frac{n}{2}\right\rceil}{2} (61)
Corollary 4

It can be easily calculated that when nn is even i.e. n=2qn=2q and q+q\in\mathbb{N}_{+} it holds that

𝒳(2q)=q(q1)\mathcal{X}(2q)=q(q-1) (62)

whilst when nn is odd i.e. n=2q+1n=2q+1 and q+q\in\mathbb{N}_{+} it holds that

𝒳(2q+1)=q2\mathcal{X}(2q+1)=q^{2} (63)

To determine the number of consistent/inconsistent triads in this “maximal gt-graph candidate” let us observe that all the consistent triads are in the two maximal tournament subgraphs. This observation can be written in the form of a short Lemma.

Lemma 5

For every dt-graph G=(V1V2,Ed1Ed2,Eu)G=(V_{1}\cup V_{2},E_{d_{1}}\cup E_{d_{2}},E_{u}) and a triad t={vi,vk,vj}t=\{v_{i},v_{k},v_{j}\} if tV1t\cap V_{1}\neq\emptyset and tV2t\cap V_{2}\neq\emptyset then tt is inconsistent.

Proof 10

Since tV1t\cap V_{1}\neq\emptyset and tV2t\cap V_{2}\neq\emptyset, there are two vertices from tt in one of the two sets V1V_{1} and V2V_{2} and one vertex from tt in the other set. Let us suppose that vi,vkV1v_{i},v_{k}\in V_{1} and vjV2v_{j}\in V_{2}. Since (V1,Ed1)(V_{1},E_{d_{1}}) is a t-graph then the edge between viv_{i} and vkv_{k} is directed. Due to the definition of dt-graph both edges (vi,vj)(v_{i},v_{j}) and (vk,vj)(v_{k},v_{j}) are undirected, hence tt is IT1IT_{1}. \qed

The immediate conclusion can be written as the Lemma

Lemma 6

The dt-graph does not contain uncovered triads

Proof 11

Let us consider the dt-graph G=(V1V2,Ed1Ed2,Eu)G=(V_{1}\cup V_{2},E_{d_{1}}\cup E_{d_{2}},E_{u}) and a triad t={vi,vk,vj}t=\{v_{i},v_{k},v_{j}\}. If vi,vkV1v_{i},v_{k}\in V_{1} and vjV2v_{j}\in V_{2} then tt is inconsistent (Lemma 5), hence it cannot be uncovered. If all vi,vk,vjV1v_{i},v_{k},v_{j}\in V_{1} then all three edges are spanned between vi,vkv_{i},v_{k} and vjv_{j}. Hence, tt is covered. The proof is completed as all the other cases are similar. \qed

It is also easy to determine the number of inconsistent triads in the candidate graph. Due to (Theorem 3) the number of consistent triads in the maximal tournament sub-graphs are (n23)(n2)\binom{\left\lfloor\frac{n}{2}\right\rfloor}{3}-\mathcal{I}\left(\left\lfloor\frac{n}{2}\right\rfloor\right) and (n23)(n2)\binom{\left\lceil\frac{n}{2}\right\rceil}{3}-\mathcal{I}\left(\left\lceil\frac{n}{2}\right\rceil\right) correspondingly. Since there are no consistent triads in double tournament graphs, except those that are fully enclosed in the maximal tournament sub-graphs (Lemma 5), the number of inconsistent triads in the maximal gt-graph candidate is given as:

𝒴(n)=(n3)((n23)(n2))((n23)(n2))\mathcal{Y}(n)=\binom{n}{3}-\left(\binom{\left\lfloor\frac{n}{2}\right\rfloor}{3}-\mathcal{I}\left(\left\lfloor\frac{n}{2}\right\rfloor\right)\right)-\left(\binom{\left\lceil\frac{n}{2}\right\rceil}{3}-\mathcal{I}\left(\left\lceil\frac{n}{2}\right\rceil\right)\right) (64)

To confirm that a dt-graph (Proposition 1) is indeed maximal we need to prove that

  • 1.

    the function (n,m)\mathcal{H}(n,m) reaches the maximum when the number of directed edges in a graph equals m=𝒳(n)m=\mathcal{X}(n), and

  • 2.

    the maximum of (n,m)\mathcal{H}(n,m) equals 𝒴(n)\mathcal{Y}(n)

Therefore to make the Proposition 1 a fully fledged claim we prove (Theorem 6). However, before we start (Theorem 6) let us prove a couple of Lemmas which formally confirm what we have seen at (Fig. 9). The aim of the first Lemma (7) is a formal confirmation of the shape of the function \mathcal{F}. In particular, it confirms that \mathcal{F} crosses the x-axis at the same point where \mathcal{H} reaches the maximum i.e. for every fixed n3n\geq 3, \mathcal{F} is positive when 0m<𝒳(n)0\leq m<\mathcal{X}(n), equals 0 when m=𝒳(n)m=\mathcal{X}(n) and it is non-positive for 𝒳(n)m(n2)\mathcal{X}(n)\leq m\leq\binom{n}{2}.

Lemma 7

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 and k+k\in\mathbb{N}_{+} it holds that:

(n,𝒳(n))=0\mathcal{F}(n,\mathcal{X}(n))=0 (65)
(n,𝒳(n)k)1,where   0<k<𝒳(n)\mathcal{F}(n,\mathcal{X}(n)-k)\geq 1,\,\,\,\text{where}\,\,\,0<k<\mathcal{X}(n) (66)
(n,𝒳(n)+k)0,where   0<k(n2)𝒳(n)\mathcal{F}(n,\mathcal{X}(n)+k)\leq 0,\,\,\,\text{where}\,\,\,0<k\leq\binom{n}{2}-\mathcal{X}(n) (67)
Proof 12

Proof of the Lemma, consisting of elementary but time consuming operations, can be found in (A).

The aim of the next Lemma is to show that 𝒞\mathcal{C} is strictly increasing for every mm not smaller than nn and obviously not greater than the maximal number of edges in a gt-graph i.e. (n2)\binom{n}{2} (Fig. 9). Thus, by adding more directed edges than nn we may only increase the minimal number of consistent triads of the types CT2a\textit{CT}_{2a} or CT3\textit{CT}_{3}.

Lemma 8

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 the function 𝒞\mathcal{C}

  1. 1.

    is constant and equals 𝒞(n,m)=0\mathcal{C}(n,m)=0 for every mm such that 0m<n0\leq m<n

  2. 2.

    is strictly increasing for every m+m\in\mathbb{N}_{+} such that nm(n2)n\leq m\leq\binom{n}{2}, i.e.

    𝒞(n,m+1)𝒞(n,m)>0\mathcal{C}(n,m+1)-\mathcal{C}(n,m)>0 (68)
Proof 13

Proof of the Lemma, consisting of elementary but time consuming operations, can be found in (B).

In every gt-graph with nn vertices and mm directed edges there are at least 𝒞(n,m)\mathcal{C}(n,m) consistent triads CT2a\textit{CT}_{2a} or CT3\textit{CT}_{3}. This means that in this graph there are at most (n3)𝒞(n,m)\binom{n}{3}-\mathcal{C}(n,m) inconsistent triads. In particular the Lemma 9 shows that there is no gt-graph with nn vertices and 𝒳(n)\mathcal{X}(n) directed edges which has more inconsistent triads than the maximal gt-graph defined in (Proposition 1).

Lemma 9

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 it holds that

𝒴(n)=(n3)𝒞(n,𝒳(n))\mathcal{Y}(n)=\binom{n}{3}-\mathcal{C}(n,\mathcal{X}(n)) (69)
Proof 14

Proof of the Lemma, composed of elementary but time consuming operations, can be found in (C).

The next Lemma shows that the minimal number of consistent triads in a gt-graph decreases along with adding the next directed edges. Such a decrease continues as long as the number of directed edges does not reach the value 𝒳(n)\mathcal{X}(n). In other words, following the increasing number of directed edges (until there are less than 𝒳(n)\mathcal{X}(n) ) the number of inconsistent triads also increases.

Lemma 10

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 the function 𝒢\mathcal{G} is strictly decreasing for every m+m\in\mathbb{N}_{+} such that 1m𝒳(n)1\leq m\leq\mathcal{X}(n), i.e.

𝒢(n,m)𝒢(n,m+1)>0where   1m<𝒳(n)\mathcal{G}(n,m)-\mathcal{G}(n,m+1)>0\,\,\,\textit{where}\,\,\,1\leq m<\mathcal{X}(n) (70)
Proof 15

Proof of the Lemma, composed of elementary but time consuming operations, can be found in (D).

For every fixed n3n\geq 3 the function \mathcal{H} determines the maximal possible number of inconsistent triads in every gt-graph.

The aim of the theorem below is to confirm that, indeed, the proposed dt-graph (Proposition 1) is a maximal gt-graph.

Theorem 6

For every dt-graph G=(V1V2,Ed1Ed2,Eu)G=(V_{1}\cup V_{2},E_{d_{1}}\cup E_{d_{2}},E_{u}) with nn vertices where (V1,Ed1)(V_{1},E_{d_{1}}) and (V2,Ed2)(V_{2},E_{d_{2}}) are maximal t-graphs and |V1|=n2\left|V_{1}\right|=\left\lfloor\frac{n}{2}\right\rfloor and |V2|=n2\left|V_{2}\right|=\left\lceil\frac{n}{2}\right\rceil and n>3n>3 it holds that:

  1. 1.

    𝒳(n)=m\mathcal{X}(n)=m maximizes (n,m)\mathcal{H}(n,m), i.e.

    (n,𝒳(n))=max0m(n2)(n,m)\mathcal{H}(n,\mathcal{X}(n))=\max_{0\leq m\leq\binom{n}{2}}\mathcal{H}(n,m) (71)
  2. 2.

    𝒴(n)\mathcal{Y}(n) is a maximum of (n,m)\mathcal{H}(n,m)

    (n,𝒳(n))=𝒴(n)\mathcal{H}(n,\mathcal{X}(n))=\mathcal{Y}(n) (72)
Proof 16

As (58) then the first claim of the theorem is equivalent to

𝒢(n,𝒳(n))=min0m(n2)𝒢(n,m)\mathcal{G}(n,\mathcal{X}(n))=\min_{0\leq m\leq\binom{n}{2}}\mathcal{G}(n,m) (73)

As (57) then the function 𝒢\mathcal{G} is the sum of 𝒞(n,m)\mathcal{C}(n,m) and max{0,(n,m)}\max\{0,\left\lceil\mathcal{F}(n,m)\right\rceil\}. From (Lemma 8) we know that 𝒞\mathcal{C} does not decrease with respect to mm. On the other hand, due to the (Lemma 7) (n,𝒳(n)+k)0\mathcal{F}(n,\mathcal{X}(n)+k)\leq 0 for every 0<k(n2)𝒳(n)0<k\leq\binom{n}{2}-\mathcal{X}(n), which translates to the observation that for every m𝒳(n)m\geq\mathcal{X}(n) it holds that max{0,(n,m)}=0\max\{0,\left\lceil\mathcal{F}(n,m)\right\rceil\}=0. Hence, for every m𝒳(n)m\geq\mathcal{X}(n) the function 𝒢\mathcal{G} does not decrease and boils down to 𝒢(n,m)=𝒞(n,m)\mathcal{G}(n,m)=\mathcal{C}(n,m). In other words

𝒢(n,𝒳(n))𝒢(n,𝒳(n)+1)𝒢(n,(n2))\mathcal{G}(n,\mathcal{X}(n))\leq\mathcal{G}(n,\mathcal{X}(n)+1)\leq\ldots\leq\mathcal{G}(n,\binom{n}{2}) (74)

This fact, coupled with (Lemma 10) i.e.

𝒢(n,0)>𝒢(n,1)>>𝒢(n,𝒳(n))\mathcal{G}(n,0)>\mathcal{G}(n,1)>\ldots>\mathcal{G}(n,\mathcal{X}(n)) (75)

implies that indeed

𝒢(n,𝒳(n))=min0m(n2)𝒢(n,m)\mathcal{G}(n,\mathcal{X}(n))=\min_{0\leq m\leq\binom{n}{2}}\mathcal{G}(n,m) (76)

which completes the proof of the first claim (71) of the Theorem 6. To prove the second claim it is enough to recall that for every m𝒳(n)m\geq\mathcal{X}(n) it holds that 𝒢(n,m)=𝒞(n,m)\mathcal{G}(n,m)=\mathcal{C}(n,m). Thus, in particular

(n,𝒳(n))=(n3)𝒞(n,𝒳(n))\mathcal{H}(n,\mathcal{X}(n))=\binom{n}{3}-\mathcal{C}(n,\mathcal{X}(n)) (77)

which satisfies the second claim (72) of the Theorem 6, and which thereby confirms the Proposition 1. \qed

6 Inconsistency indices in paired comparisons with ties

As shown in (Section 2) the inconsistency index (called there “coefficient of consistence”) defined by Kendall and Babington Smith [26, p. 330] cannot be used in the context of ordinal pairwise comparisons with ties. Thus, in (3) (n)\mathcal{I}(n) needs to be replaced by 𝒴(n)\mathcal{Y}(n) - the maximal number of triads in the case when ties are allowed. The generalized inconsistency index that covers pairwise comparisons with ties finally takes the form

ζg(M)=1|GM|i𝒴(n)\zeta_{g}(M)=1-\frac{\left|G_{M}\right|_{i}}{\mathcal{Y}(n)} (78)

where MM is an ordinal PC matrix with ties of the size n×nn\times n (Def. 1) and G is a gt-graph corresponding to MM. The formula (78), although concise, may not be handy in practice. This is due to the use in (64) of the floor x\left\lfloor x\right\rfloor and ceiling x\left\lceil x\right\rceil operations as well as binomial symbol (xy)\binom{x}{y}. For this reason, let us simplify (64) depending on whether nn and n/2\nicefrac{{n}}{{2}} are odd or even. There are four cases that need to be considered:

𝒴(n)={13n324n216n96whenn=4qforq=1,2,3,13n324n219n+3096whenn=4q+1forq=1,2,3,13n324n24n96whenn=4q+2forq=1,2,3,13n324n219n+1896whenn=4q+3forq=0,1,2,\mathcal{Y}(n)=\begin{cases}\frac{13n^{3}-24n^{2}-16n}{96}&\text{when}\,\,\,n=4q\,\,\,\text{for}\,\,\,q=1,2,3,\ldots\\ \frac{13n^{3}-24n^{2}-19n+30}{96}&\text{when}\,\,\,n=4q+1\,\,\,\text{for}\,\,\,q=1,2,3,\ldots\\ \frac{13n^{3}-24n^{2}-4n}{96}&\text{when}\,\,\,n=4q+2\,\,\,\text{for}\,\,\,q=1,2,3,\ldots\\ \frac{13n^{3}-24n^{2}-19n+18}{96}&\text{when}\,\,\,n=4q+3\,\,\,\text{for}\,\,\,q=0,1,2,\ldots\end{cases} (79)

For example, to compute the inconsistency index for the ordinal PC matrix MM (1) (see Fig. 1) first it is necessary to compute the number of inconsistent triads in MM. Since (1) has five inconsistent triads: (A1,A2,A3)(A_{1},A_{2},A_{3}), (A1,A2,A5)(A_{1},A_{2},A_{5}), (A1,A3,A5)(A_{1},A_{3},A_{5}), (A1,A4,A5)(A_{1},A_{4},A_{5}) and (A3,A4,A5)(A_{3},A_{4},A_{5}) then |TM|=5\left|T_{M}\right|=5. On the other hand, 5=41+15=4\cdot 1+1 hence, the value 𝒴(5)\mathcal{Y}(5) is obtained by replacing nn with 55 in the expression 1/96(13n324n219n+30)\nicefrac{{1}}{{96}}\cdot\left(13n^{3}-24n^{2}-19n+30\right), i.e. 𝒴(5)=10\mathcal{Y}(5)=10. In other words, in the considered gt-graph (Fig. 1) five triads out of ten possible ones are inconsistent. The generalized consistency index for MM takes the form:

ζg(M)=1510=12\zeta_{g}(M)=1-\frac{5}{10}=\frac{1}{2} (80)

Hence the inconsistency level for MM (1) is 50%50\%.

As every t-graph is also a gt-graph but not reversely (see Def. 2 and 3) then the generalized inconsistency index ζg\zeta_{g} can also be used to estimate the inconsistency level of paired comparisons without ties. Conversely it is not possible.

Both inconsistency indices ζ\zeta and ζg\zeta_{g} compare the number of inconsistent triads in MM with the maximal number of such triads in a matrix of the same size as MM. Hence, for the maximally inconsistent matrix the index functions will return 11, whilst the inconsistency index for a fully consistent matrix is 0. The maximal value of the inconsistency index, of course, does not automatically imply that all the triads in the given matrix are inconsistent. To capture this phenomenon, let us define the absolute inconsistency index η\eta as a ratio of the number of inconsistent triads to the number of all possible triads in the n×nn\times n matrix MM.

η(M)=df|GM|i(n3)\eta(M)\stackrel{{\scriptstyle\textit{df}}}{{=}}\frac{\left|G_{M}\right|_{i}}{\binom{n}{3}} (81)

Of course, 0η(M)10\leq\eta(M)\leq 1. If, for example, η(M)=0.4\eta(M)=0.4 then it would mean that MM contains 60%60\% consistent triads and 40%40\% inconsistent triads. The maximal value that η(M)\eta(M) may take is limited by (n)/(n3)\mathcal{I}(n)/\binom{n}{3} and 𝒴(n)/(n3)\mathcal{Y}(n)/\binom{n}{3} for t-graphs and gt-graphs correspondingly. Thus, for the larger matrices η(M)\eta(M) may never reach 1. Let us consider the first few values of (n)/(n3)\mathcal{I}(n)/\binom{n}{3} and 𝒴(n)/(n3)\mathcal{Y}(n)/\binom{n}{3} (Fig. 11).

Refer to caption
Figure 11: The maximal values of η(M)\eta(M) for t-graph and gt-graph

We can see that for small graphs the percentage of inconsistent triads is higher than for the larger graphs. In particular, for n=3,,6n=3,\ldots,6 there are such gt-graphs that have all triads inconsistent. However, there is only one t-graph which has all triads inconsistent. It is just a single triad. Although the percentage of inconsistent triads for both t-graph and gt-graph decrease, they seem to never drop below certain values. It is easy to compute that666Expression limn(n)/(n3)=0.25\lim_{n\rightarrow\infty}\mathcal{I}(n)/\binom{n}{3}=0.25 means that both limn(n3n24)/(n3)=limn(n34n24)/(n3)=0.25\lim_{n\rightarrow\infty}\left(\frac{n^{3}-n}{24}\right)/\binom{n}{3}=\lim_{n\rightarrow\infty}\left(\frac{n^{3}-4n}{24}\right)/\binom{n}{3}=0.25. Similarly limn𝒴(n)(n3)=0.8125\lim_{n\rightarrow\infty}\frac{\mathcal{Y}(n)}{\binom{n}{3}}=0.8125 means that all four limits (see 79) equal 0.81250.8125. :

limn(n)(n3)=0.25andlimn𝒴(n)(n3)=0.8125\lim_{n\rightarrow\infty}\frac{\mathcal{I}(n)}{\binom{n}{3}}=0.25\,\,\,\,\,\text{and}\,\,\,\,\,\lim_{n\rightarrow\infty}\frac{\mathcal{Y}(n)}{\binom{n}{3}}=0.8125 (82)

In other words, although in the larger t-graphs (n>3)(n>3) and gt-graphs (n>6)(n>6), there must always be consistent triads. Hence, it is impossible to create a completely inconsistent set of paired comparisons when the alternatives are more than 33 (without ties) and 66 (when ties are allowed). As we can see very often, consistent triads must exist. However, it should be remembered that the “guaranteed” number of consistent triads is limited. The expression (82) implies that at most 75%75\% of triads are “guaranteed” to be consistent without ties, and at most 18.75%18.75\% of triads are “guaranteed” to be consistent when ties are allowed.

Figuratively speaking, the possibility of a tie allows us to be much more inconsistent. However, we rarely have a chance to be completely inconsistent - only when there are “sufficiently few” alternatives. Fortunately, there is no limit to the number of consistent triads in a gt-graph. Hence, we can be as consistent (and as frequently) in our views as we want.

7 Discussion and remarks

To calculate the inconsistency index ζ\zeta or the generalized inconsistency index ζg\zeta_{g} for some ordinal PC MM n×nn\times n matrix we need to determine the number of inconsistent triads in MM. The most straightforward method is to consider every single triad and decide whether it is consistent or not. Since in every complete set of paired comparisons for nn alternatives there are (n3)=n(n1)(n2)3\binom{n}{3}=\frac{n(n-1)(n-2)}{3} different triads, then the running time of such a procedure is O(n3)O(n^{3}). For t-graphs, however, there is a faster way to determine the number of inconsistent triads in a graph. As mentioned earlier, (5) denotes the number of inconsistent triads |T|i\left|T\right|_{i} in some t-graph T=(V,Ed)T=(V,E_{d}). To compute (5) |T|i\left|T\right|_{i} we need to visit every vertex cVc\in V and determine its input degree. Computing degin(c)\text{deg}_{in}(c) for every cVc\in V requires visiting every edge (ci,cj)Ed(c_{i},c_{j})\in E_{d} twice. The first time when calculating degin(ci)\text{deg}_{in}(c_{i}), the second time when degin(cj)\text{deg}_{in}(c_{j}) is calculated. Thus, determining degin(c1),,degin(cn)\text{deg}_{in}(c_{1}),\ldots,\text{deg}_{in}(c_{n}) requires 2|Ed|2\left|E_{d}\right| operations. As |Ed|=n(n1)2\left|E_{d}\right|=\frac{n(n-1)}{2} then the actual running time of computation for (5) is O(n(n1))=O(n2)O(n(n-1))=O(n^{2}). For this reason the inconsistency index ζ\zeta can be determined faster than ζg\zeta_{g}.

Looking at the different types of triads occurring in a gt-graph (Fig. 4), one may notice that a triad not covered by any directed edge is consistent, whilst a triad covered by one directed edge is always inconsistent (see Def. 7). Therefore the question arises as to whether it is possible to cover all triads by one directed edge. If not, what is the minimal number of directed edges covering all triads? Let us try to formally address this question. Denote the set of directed edges of some gt-graph by Ed={(c1,c2),(c1,c3),,(cn1,cn)}E_{d}=\{(c_{1},c_{2}),(c_{1},c_{3}),\ldots,(c_{n-1},c_{n})\} and the set of triads by 𝒯={{c1,c2,c3},{c1,c2,c4},,{cn2,cn1,cn}}\mathcal{T}=\{\{c_{1},c_{2},c_{3}\},\{c_{1},c_{2},c_{4}\},\ldots,\{c_{n-2},c_{n-1},c_{n}\}\}. Of course, |Ed|=(n2)\left|E_{d}\right|=\binom{n}{2} and |𝒯|=(n3)\left|\mathcal{T}\right|=\binom{n}{3}. Then, let B=(V,E)B=(V,E) be a bipartite graph such that V=Ed𝒯V=E_{d}\cup\mathcal{T} and E={(e,t)|(e,t)Ed×𝒯andecoverst}E=\{(e,t)\,\,|\,\,(e,t)\in E_{d}\times\mathcal{T}\,\,\text{and}\,\,e\,\,\text{covers}\,\,t\}. Hence, we would like to select the minimal subset of edges from EdE_{d} whose elements cover (i.e. are connected to) every triad in 𝒯\mathcal{T}.

Let us consider the problem for n=5n=5 (Fig. 12a).

Refer to caption
(a) Bipartite graph corresponding to triads cover problem where n=5n=5
Refer to caption
(b) Maximal dt-graph with 55 vertices (undirected edges are dotted)
Figure 12: Triads cover problem

In such a case Ed={(1,2)E_{d}=\{(1,2), (1,3)(1,3), (1,4)(1,4), (1,5)(1,5), (2,3)(2,3), (2,4)(2,4), (2,5)(2,5), (3,4)(3,4), (3,5)(3,5), (4,5)}(4,5)\} and 𝒯={{1,2,3}\mathcal{T}=\{\{1,2,3\}, {1,2,4}\{1,2,4\}, {1,2,5}\{1,2,5\}, {1,3,4}\{1,3,4\}, {1,3,5}\{1,3,5\}, {2,3,4}\{2,3,4\}, {2,3,5}\{2,3,5\}, {1,4,5}\{1,4,5\}, {2,4,5}\{2,4,5\}, {3,4,5}}\{3,4,5\}\}. As every edge covers three different triads we may form the set S={{ti,tj,tk}|S=\{\{t_{i},t_{j},t_{k}\}\,| t,tj,tk𝒯,\,t,t_{j},t_{k}\in\mathcal{T}, eEdthat coversti,tj,tk}\exists e\in E_{d}\,\text{that covers}\,\,t_{i},t_{j},t_{k}\}. For example, a tripleton {{1,2,3},{1,2,4},{1,2,5}}\{\{1,2,3\},\{1,2,4\},\{1,2,5\}\} is an element of SS as all its elements are covered by edges (1,2)(1,2) etc. Thus, the question about the minimal subset of |Ed|\left|E_{d}\right| whose elements cover all the elements in |𝒯|\left|\mathcal{T}\right|, can be reformulated as follows: what is the minimal subset of SS such that the union of its elements equals 𝒯\mathcal{T}?

In general, we can not provide a satisfactory answer to such a question. The problem we formulate is called a set cover problem777Wikipedia may serve as a quick reference: https://en.wikipedia.org/wiki/Set_cover_problem and is one of Karp’s 21 NP-complete problems formulated in 1972 [24]. Fortunately, we are not dealing with a set cover problem as such, but with its special instance that can be called a “triads cover problem”. In the latter case, a maximal dt-graph comes to the rescue (1). The number of directed edges in the maximal dt-graph is 𝒳(n)\mathcal{X}(n). Due to (Lemma 7) we know that every gt-graph that has less than 𝒳(n)\mathcal{X}(n) directed edges must contain at least one triad of the type CT0\textit{CT}_{0}. On the other hand, any maximal dt-graph does not contain uncovered triads (Lemma 6). This means that a maximal dt-graph is a minimal graph covering all triads by directed edges.

Let us consider the maximal dt-graph for n=5n=5. According to (Proposition 1), such a graph should be composed of two maximal subgraphs having 52=3\left\lfloor\frac{5}{2}\right\rfloor=3 and 52=2\left\lceil\frac{5}{2}\right\rceil=2 vertices. An instance of the first subgraph can be a triad (c1,c2),(c2,c3)(c_{1},c_{2}),\,(c_{2},c_{3}) and (c3,c1)(c_{3},c_{1}) whilst the second subgraph is just a single edge (c4,c5)(c_{4},c_{5}). As the maximal dt-graph with 55 vertices provides a minimal edge covering of triads in 55-clique then the minimal subset of SS that covers the entire 𝒯\mathcal{T} is, for example, {{1,2,3}\{\{1,2,3\}, {1,2,4}\{1,2,4\}, {1,2,5}}\{1,2,5\}\}, {{1,2,3}\{\{1,2,3\}, {1,3,4}\{1,3,4\}, {1,3,5}}\{1,3,5\}\} {{1,2,3}\{\{1,2,3\}, {2,3,4}\{2,3,4\}, {2,3,5}}\{2,3,5\}\} and {{1,4,4}\{\{1,4,4\}, {2,4,5}\{2,4,5\}, {3,4,5}}\{3,4,5\}\} (Fig. 12).

8 Summary

In the presented article, the inconsistency index proposed by Kendall and Babington Smith [26] has been extended to cover pairwise comparisons with ties. For this purpose, the most inconsistent sets of pairwise comparisons with and without ties have been analyzed. To model pairwise comparisons with ties a generalized tournament graph has been defined. An additional absolute consistency index η\eta for pairwise comparisons with and without ties has also been proposed. The relationship between the maximally inconsistent set of pairwise comparisons with ties and the set cover problem has also been shown.

Acknowledgements

I would like to thank Prof. Andrzej Bielecki and Dr. Hab. Adam Sędziwy for their insightful comments, corrections and reading of the first version of this work. Special thanks are due to Ian Corkill for his editorial help. The research is supported by AGH University of Science and Technology, contract no.: 11.11.120.859.

Literature

References

  • [1] J. Aguarón and J. M. Moreno-Jiménez. The geometric consistency index: Approximated thresholds. European Journal of Operational Research, 147(1):137 – 145, 2003.
  • [2] S. Bozóki, L. Dezső, A. Poesz, and J. Temesi. Analysis of pairwise comparison matrices: an empirical research. Annals of Operations Research, 211(1):511–528, February 2013.
  • [3] S. Bozóki, J. Fülöp, and W. W. Koczkodaj. An lp-based inconsistency monitoring of pairwise comparison matrices. Mathematical and Computer Modelling, 54(1-2):789–793, 2011.
  • [4] M. Brunelli. On the conjoint estimation of inconsistency and intransitivity of pairwise comparisons. Operations Research Letters, 44(5):672–675, September 2016.
  • [5] M. Brunelli, L. Canal, and M. Fedrizzi. Inconsistency indices for pairwise comparison matrices: a numerical study. Annals of Operations Research, 211:493–509, February 2013.
  • [6] J. M. Colomer. Ramon Llull: from ‘Ars electionis’ to social choice theory. Social Choice and Welfare, 40(2):317–328, October 2011.
  • [7] M. Condorcet. Essay on the Application of Analysis to the Probability of Majority Decisions. Paris: Imprimerie Royale, 1785.
  • [8] A. H. Copeland. A “reasonable” social welfare function. Seminar on applications of mathematics to social sciences, 1951.
  • [9] H. A. David. The method of paired comparisons. A Charles Griffin Book, 1969.
  • [10] R. R. Davidson. On extending the Bradley-Terry model to accommodate ties in paired comparison experiments. Journal of the American Statistical Association, 65(329):317, 1970.
  • [11] Reinhard Diestel. Graph theory. Springer Verlag, 2005.
  • [12] P. Faliszewski, E. Hemaspaandra, L. A. Hemaspaandra, and J. Rothe. Llull and Copeland Voting Computationally Resist Bribery and Constructive Control. J. Artif. Intell. Res. (JAIR), 35:275–341, 2009.
  • [13] M Fedrizzi and M Brunelli. On the priority vector associated with a reciprocal relation and a pairwise comparison matrix. Journal of Soft Computing, 14(6):639–645, 2010.
  • [14] J. Figueira, M. Ehrgott, and S. Greco, editors. Multiple Criteria Decision Analysis: State of the Art Surveys. Springer, 2005.
  • [15] W A Glenn and H A David. Ties in paired-comparison experiments using a modified Thurstone-Mosteller model. Biometrics, 16(1):86, 1960.
  • [16] R. L. Graham, D. E Knuth, and O. Patashnik. Concrete Mathematics. Addison & Wesley, 1994.
  • [17] S. Greco, B. Matarazzo, and R. Słowiński. Dominance-based rough set approach to preference learning from pairwise comparisons in case of decision under uncertainty. In Eyke Hüllermeier, Rudolf Kruse, and Frank Hoffmann, editors, Computational Intelligence for Knowledge-Based Systems Design, volume 6178 of Lecture Notes in Computer Science, pages 584–594. Springer Berlin Heidelberg, 2010.
  • [18] Y. Iida. Ordinality consistency test about items and notation of a pairwise comparison matrix in AHP. In Proceedings of the international symposium on the …, 2009.
  • [19] A. Ishizaka and M. Lusti. How to derive priorities in AHP: a comparative study. Central European Journal of Operations Research, 14(4):387–400, December 2006.
  • [20] R. Janicki and W. W. Koczkodaj. A weak order approach to group ranking. Comput. Math. Appl., 32(2):51–59, July 1996.
  • [21] R. Janicki and Y. Zhai. On a pairwise comparison-based consistent non-numerical ranking. Logic Journal of the IGPL, 20(4):667–676, 2012.
  • [22] R. E. Jensen and T. E. Hicks. Ordinal data AHP analysis: A proposed coefficient of consistency and a nonparametric test. Math. Comput. Model., 17(4-5):135–150, February 1993.
  • [23] J. B. Kadane. Some Equivalence Classes in Paired Comparisons. The Annals of Mathematical Statistics, 37(2):488–494, April 1966.
  • [24] R. M. Karp. Reducibility among Combinatorial Problems. In Complexity of Computer Computations, pages 85–103. Springer US, Boston, MA, 1972.
  • [25] M. G. Kendall. The treatment of ties in ranking problems. Biometrika, 33:239–251, November 1945.
  • [26] M.G. Kendall and B. Smith. On the method of paired comparisons. Biometrika, 31(3/4):324–345, 1940.
  • [27] W. W. Koczkodaj. A new definition of consistency of pairwise comparisons. Math. Comput. Model., 18(7):79–84, October 1993.
  • [28] K. Kułakowski. On the properties of the priority deriving procedure in the pairwise comparisons method. Fundamenta Informaticae, 139(4):403 – 419, July 2015.
  • [29] K. Kułakowski, K. Grobler-Dębska, and J. Wąs. Heuristic rating estimation: geometric approach. Journal of Global Optimization, 62(3):529–543, 2015.
  • [30] A. Maas, T. Bezembinder, and P. Wakker. On solving intransitivities in repeated pairwise choices. Mathematical Social Sciences, 29(2):83–101, April 1995.
  • [31] E. Parizet. Paired comparison listening tests and circular error rates. Acta acustica united with Acustica, 2002.
  • [32] J.I. Peláez and M.T. Lamata. A new measure of consistency for positive reciprocal matrices. Computers & Mathematics with Applications, 46(12):1839 – 1845, 2003.
  • [33] S. Pemmaraju and S. Skiena. Computational Discrete Mathematics - Combinatorics and Graph Theory with Mathematica. Cambridge University Press, January 2003.
  • [34] T. L. Saaty. A scaling method for priorities in hierarchical structures. Journal of Mathematical Psychology, 15(3):234 – 281, 1977.
  • [35] T. L. Saaty and G. Hu. Ranking by eigenvector versus other methods in the analytic hierarchy process. Applied Mathematics Letters, 11(4):121–125, 1998.
  • [36] S. Siraj, L. Mikhailov, and J. A. Keane. Contribution of individual judgments toward inconsistency in pairwise comparisons. European Journal of Operational Research, 242(2):557–567, April 2015.
  • [37] W. E. Stein and P. J. Mizzi. The harmonic consistency index for the Analytic Hierarchy Process. European Journal of Operational Research, 177(1):488–497, February 2007.
  • [38] K. Suzumura, K. J. Arrow, and A. K. Sen. Handbook of Social Choice & Welfare. Elsevier Science Inc., 2010.
  • [39] L. L. Thurstone. The Method of Paired Comparisons for Social Values. Journal of Abnormal and Social Psychology, pages 384–400, 1927.
  • [40] T. N. Tideman. Independence of clones as a criterion for voting rules. Social Choice and Welfare, 4:185–206, 1987.
  • [41] L. G. Vargas. Voting with Intensity of preferences. In 12th International Symposium on the Analytic Hierarchy Process, Kuala Lumpur, Malaysia, 2013. Creative Decision Foundation.
  • [42] Ying-Ming Wang, C. Parkan, and Y. Luo. Priority estimation in the AHP through maximization of correlation coefficient. Applied Mathematical Modelling, 31(12):2711–2718, December 2007.

Appendix A Proof of Lemma 7

Thesis.

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 and k+k\in\mathbb{N}_{+} it holds that:

(n,𝒳(n))=0\mathcal{F}(n,\mathcal{X}(n))=0
(n,𝒳(n)k)1,where   0<k𝒳(n)\mathcal{F}(n,\mathcal{X}(n)-k)\geq 1,\,\,\,\text{where}\,\,\,0<k\leq\mathcal{X}(n)
(n,𝒳(n)+k)0,where   0<k(n2)𝒳(n)\mathcal{F}(n,\mathcal{X}(n)+k)\leq 0,\,\,\,\textit{where}\,\,\,0<k\leq\binom{n}{2}-\mathcal{X}(n)

Proof. Equation (65), part 1.

Let nn be even i.e. n=2qn=2q where q+q\in\mathbb{N}_{+}. Thus, let us insert to (49) as nn the value 2q2q and as mm the value 𝒳(2q)\mathcal{X}(2q). After a series of elementary transformations applied to (48) we obtain:

(2q,𝒳(2q))\displaystyle\mathcal{F}(2q,\mathcal{X}(2q)) =13(2)q(q2+(12q)q+(q1)q)\displaystyle=\frac{1}{3}(-2)q\left(\lfloor q\rfloor^{2}+(1-2q)\lfloor q\rfloor+(q-1)q\right) (82)

Since q+q\in\mathbb{N}_{+} then

q=q\lfloor q\rfloor=q (83)

Thus,

(2q,𝒳(2q))\displaystyle\mathcal{F}(2q,\mathcal{X}(2q)) =13(2)q(q2+(q1)q+(12q)q)\displaystyle=\frac{1}{3}(-2)q\left(q^{2}+(q-1)q+(1-2q)q\right) (84)

Which after reduction leads to

(2q,𝒳(2q))=0\mathcal{F}(2q,\mathcal{X}(2q))=0 (85)

Proof. Equation (65), part 2.

Let nn be odd i.e. n=2q+1n=2q+1 where q+q\in\mathbb{N}_{+}. Similarly, let us replace n in (49) by 2q+12q+1 and mm by 𝒳(2q+1)\mathcal{X}(2q+1). After elementary transformations we obtain:

(2q+1,𝒳(2q+1))=\displaystyle\mathcal{F}(2q+1,\mathcal{X}(2q+1))= 13(2q+1)2q22q+12\displaystyle-\frac{1}{3}(2q+1)\left\lfloor\frac{2q^{2}}{2q+1}\right\rfloor^{2}
+13(4q22q1)2q22q+1\displaystyle+\frac{1}{3}\left(4q^{2}-2q-1\right)\left\lfloor\frac{2q^{2}}{2q+1}\right\rfloor
+13(2q2+3q1)q\displaystyle+\frac{1}{3}\left(-2q^{2}+3q-1\right)q (86)

Since q+q\in\mathbb{N}_{+}, we can bound 2q2/(2q+1)2q^{2}/\left(2q+1\right) from above

2q22q+1<2q22q=q\frac{2q^{2}}{2q+1}<\frac{2q^{2}}{2q}=q (87)

and below

q1=2(q1)22(q1)<2(q1)22q+1=2q22q+22q+12q22q+1q-1=\frac{2\left(q-1\right)^{2}}{2\left(q-1\right)}<\frac{2\left(q-1\right)^{2}}{2q+1}=\frac{2q^{2}-2q+2}{2q+1}\leq\frac{2q^{2}}{2q+1} (88)

Therefore, when qq is a positive integer it is true that

2q22q+1=(q1)\left\lfloor\frac{2q^{2}}{2q+1}\right\rfloor=(q-1) (89)

By applying (89) to (86) we obtain

(2q+1,𝒳(2q+1))\displaystyle\mathcal{F}(2q+1,\mathcal{X}(2q+1)) =13(4q22q1)(q1)\displaystyle=\frac{1}{3}\left(4q^{2}-2q-1\right)(q-1)
+13q(2q2+3q1)\displaystyle+\frac{1}{3}q\left(-2q^{2}+3q-1\right)
13(2q+1)(q1)2\displaystyle-\frac{1}{3}(2q+1)\left(q-1\right)^{2} (90)

Then, after making further transformations it is easy to verify that:

(2q+1,𝒳(2q+1))=0\mathcal{F}(2q+1,\mathcal{X}(2q+1))=0 (91)

which completes the proof of (65).

Proof. Equation (66), part 1.

Let nn be even i.e. n=2qn=2q where q+q\in\mathbb{N}_{+}. Thus, to prove that (n,𝒳(n)k)\mathcal{F}(n,\mathcal{X}(n)-k) is greater than 0 it is enough to show that for every q2q\geq 2 and 1k<q(q1)1\leq k<q(q-1) it holds that (n,𝒳(n)k)>1\mathcal{F}(n,\mathcal{X}(n)-k)>1. Thus, let us insert to (49) as nn the value 2q2q. After a series of elementary transformations applied to (48) we obtain:

(2q,𝒳(2q)k)=23(qkq2+(2k+q)kq+k(q1))\mathcal{F}(2q,\mathcal{X}(2q)-k)=\frac{2}{3}\left(-q\left\lceil\frac{k}{q}\right\rceil^{2}+(2k+q)\left\lceil\frac{k}{q}\right\rceil+k(q-1)\right) (92)

Let us observe that for the positive integer p=1,2,p=1,2,\ldots if pqk<(p+1)q1p\cdot q\leq k<(p+1)q-1 then kq=p\left\lceil\frac{k}{q}\right\rceil=p. In order to analyze \mathcal{F} let us replace kq\left\lceil\frac{k}{q}\right\rceil by pp and define hh such that

h(q,k)=23(p(2k+q)+k(q1)qp2)h(q,k)=\frac{2}{3}\left(p(2k+q)+k(q-1)-qp^{2}\right) (93)

where pqk<(p+1)q1p\cdot q\leq k<(p+1)q-1 for every p=1,2,,q2p=1,2,\ldots,q-2. Of course, when pqk<(p+1)q1p\cdot q\leq k<(p+1)q-1 it holds that

(2q,𝒳(2q)k)=h(q,k)\mathcal{F}(2q,\mathcal{X}(2q)-k)=h(q,k) (94)

As hh is linear with respect to kk then in order to check whether h(k)>0h(k)>0 it is enough to check whether hh is greater than 0 at both ends of the considered interval. So,

h(q,pq)=23pq(p+q)h(q,p\cdot q)=\frac{2}{3}pq(p+q) (95)

and

h(q,(p+1)q1)=13(2p2q+2pq2+4pq4p+2q24q+2)h(q,(p+1)q-1)=\frac{1}{3}\left(2p^{2}q+2pq^{2}+4pq-4p+2q^{2}-4q+2\right) (96)

Since for p,q=1,2,p,q=1,2,\ldots it holds that 4pq4p4pq\geq 4p and 2p2q+2pq24q2p^{2}q+2pq^{2}\geq 4q then

h(q,(p+1)q1)13(2q2+2)13(2+2)>1h(q,(p+1)q-1)\geq\frac{1}{3}\left(2q^{2}+2\right)\geq\frac{1}{3}\left(2+2\right)>1 (97)

Thus, for every pqk<(p+1)q1p\cdot q\leq k<(p+1)q-1 where p=1,2,,q2p=1,2,\ldots,q-2, h(k)>0h(k)>0. We just need to check hh for k=q(q1)k=q(q-1). In such a case kq=q1\left\lceil\frac{k}{q}\right\rceil=q-1. Thus h(q(q1))h(q(q-1)) takes the form:

h(q,q(q1))=23q(2q23q+1)h(q,q(q-1))=\frac{2}{3}q\left(2q^{2}-3q+1\right) (98)

As q2q\geq 2 then it is easy to verify that h(q,q(q1))>0h(q,q(q-1))>0.

Since h(q,k)>0h(q,k)>0 for every p=1,2,,q2p=1,2,\ldots,q-2, where pqk<(p+1)q1p\cdot q\leq k<(p+1)q-1 and for k=q(q1)k=q(q-1) then also (2q,𝒳(2q)k)>0\mathcal{F}(2q,\mathcal{X}(2q)-k)>0 for n=2qn=2q and 1k<q(q1)1\leq k<q(q-1), which completes the first part of the proof.

Proof. Equation (66), part 2.

Let nn be even i.e. n=2q+1n=2q+1 where q+q\in\mathbb{N}_{+}. Thus, let us insert to (49) as nn the value 2q+12q+1 and 𝒳(2q+1)k\mathcal{X}(2q+1)-k, where this time 1kq21\leq k\leq q^{2} (see 63). After a series of elementary transformations applied to (48) we obtain:

(n,𝒳(n)k)=\displaystyle\mathcal{F}(n,\mathcal{X}(n)-k)= 13((4k+2q+1)2(kq2)2q+1+4q22(q2k)2q+1\displaystyle\frac{1}{3}\left((4k+2q+1)\left\lceil\frac{2\left(k-q^{2}\right)}{2q+1}\right\rceil+4q^{2}\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor-\right.
(2q+1)2(q2k)2q+12+(2q1)(3kq2+q))\displaystyle\left.(2q+1)\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor^{2}+(2q-1)\left(3k-q^{2}+q\right)\right) (99)

Since for every xx\in\mathbb{R} it holds 888A quick reference is https://en.wikipedia.org/wiki/Floor_and_ceiling_functions [16] that x=x-\left\lceil x\right\rceil=\left\lfloor-x\right\rfloor, and 𝒳(n)=𝒳(2q+1)=q2\mathcal{X}(n)=\mathcal{X}(2q+1)=q^{2} then

(2q+1,q2k)=\displaystyle\mathcal{F}(2q+1,q^{2}-k)= 13((4k+2q+1)2(q2k)2q+1+4q22(q2k)2q+1\displaystyle\frac{1}{3}\left(-(4k+2q+1)\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor+4q^{2}\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor-\right.
(2q+1)2(q2k)2q+12+(2q1)(3kq2+q))\displaystyle\left.(2q+1)\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor^{2}+(2q-1)\left(3k-q^{2}+q\right)\right) (100)

It is easy to observe the relationship between 2(q2k)2q+1\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor and kk is:

2(q2k)2q+1=0\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor=0 if and only if 02(q2k)<2q+10\leq 2\left(q^{2}-k\right)<2q+1, in other words, we require that q2q12k<q2q^{2}-q-\frac{1}{2}\leq k<q^{2}

2(q2k)2q+1=1\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor=1 if and only if 2q+12(q2k)<2(2q+1)2q+1\leq 2\left(q^{2}-k\right)<2(2q+1) which translates to the interval: 12(2q22(2q+1))k<12(2q21(2q+1))\frac{1}{2}\left(2q^{2}-2\left(2q+1\right)\right)\leq k<\frac{1}{2}\left(2q^{2}-1\left(2q+1\right)\right)

2(q2k)2q+1=2\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor=2 if and only if 2(2q+1)2(q2k)<3(2q+1)2(2q+1)\leq 2\left(q^{2}-k\right)<3(2q+1), hence 12(2q23(2q+1))k<12(2q22(2q+1))\frac{1}{2}\left(2q^{2}-3\left(2q+1\right)\right)\leq k<\frac{1}{2}\left(2q^{2}-2\left(2q+1\right)\right)

and in general, r=df2(q2k)2q+1r\overset{\textit{df}}{=}\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor if and only if (r1)(2q+1)2(q2k)<r(2q+1)(r-1)(2q+1)\leq 2\left(q^{2}-k\right)<r(2q+1), which translates to the interval for kk: 12(2q2r(2q+1))k<12(2q2(r1)(2q+1))\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right)\leq k<\frac{1}{2}\left(2q^{2}-(r-1)\left(2q+1\right)\right).

Thus, instead of analyzing \mathcal{F} with respect to kk over the whole domain i.e. 1kq21\leq k\leq q^{2} and q2q\geq 2 we can analyze it in the subsequent intervals, in which the value 2(q2k)2q+1\left\lfloor\frac{2\left(q^{2}-k\right)}{2q+1}\right\rfloor is known and fixed.

Let us introduce the auxiliary function h:h:

h(q,k,r)=df(2q+1,q2k)h(q,k,r)\overset{\textit{df}}{=}\mathcal{F}(2q+1,q^{2}-k) (101)

defined for kk such that 12(2q2r(2q+1))k<12(2q2(r1)(2q+1))\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right)\leq k<\frac{1}{2}\left(2q^{2}-(r-1)\left(2q+1\right)\right). Hence,

h(q,k,r)=13((4k+2q+1)r+4q2r(2q+1)r2+(2q1)(3kq2+q))h(q,k,r)=\frac{1}{3}\left(-(4k+2q+1)r+4q^{2}r-(2q+1)r^{2}+(2q-1)\left(3k-q^{2}+q\right)\right) (102)

Moreover, rr is the highest when kk is 11. Thus, due to (89) it holds that 2(q21)2q+12q22q+1=q1\left\lfloor\frac{2\left(q^{2}-1\right)}{2q+1}\right\rfloor\leq\left\lfloor\frac{2q^{2}}{2q+1}\right\rfloor=q-1. Therefore, we know that rq1r\leq q-1. Hence, instead of showing that (2q+1,q2k)>1\mathcal{F}(2q+1,q^{2}-k)>1 for every 0kq20\leq k\leq q^{2}, we prove that h(q,k,r)>1h(q,k,r)>1 when 12(2q2r(2q+1))k<12(2q2(r1)(2q+1))\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right)\leq k<\frac{1}{2}\left(2q^{2}-(r-1)\left(2q+1\right)\right) for every 0rq10\leq r\leq q-1.

Let us observe that h(q,k,r)h(q,k,r) is a decreasing function with respect to kk. That is because

h(q,k,r)h(q,k1,r)=2q4r3+1h(q,k,r)-h(q,k-1,r)=2q-\frac{4r}{3}+1 (103)

where rq1r\leq q-1. In particular, it is easy to verify that always 2q+1>4r32q+1>\frac{4r}{3} for rq1r\leq q-1.

The above equalities justify the following estimation:

h(q,k,r)>h(q,k1,r)>>h(q,12(2q2r(2q+1)),r)h(q,k,r)>h(q,k-1,r)>\ldots>h(q,\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right),r) (104)

Thus, to prove that h(q,k,r)>0h(q,k,r)>0 for all admissible values of q,k,rq,k,r we need to check whether h(q,12(2q2r(2q+1)),r)>0h(q,\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right),r)>0 for 0rq10\leq r\leq q-1.

So, applying the lower bound for kk, i.e. k=12(2q2r(2q+1))k=\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right) to (102) we obtain

h(q,12(2q2r(2q+1)),r)=16(2q+1)(4q26qr2q+2r2+r)h(q,\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right),r)=\frac{1}{6}(2q+1)\left(4q^{2}-6qr-2q+2r^{2}+r\right) (105)

Let us denote h2(q,r)=dfh(q,12(2q2r(2q+1)),r)h_{2}(q,r)\overset{\textit{df}}{=}h(q,\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right),r). It is easy to observe that h2h_{2} is a parabola with respect to rr. Since 2h2r2=23(2q+1)\frac{\partial^{2}h_{2}}{\partial r^{2}}=\frac{2}{3}(2q+1) is greater than 0 for q2q\geq 2, thus h2(q,r)h_{2}(q,r) has the minimum with respect to rr when h2r=0\frac{\partial h_{2}}{\partial r}=0. I.e.

h2r=16(2q+1)(6q4r1)=0\frac{\partial h_{2}}{\partial r}=-\frac{1}{6}(2q+1)(6q-4r-1)=0 (106)

i.e., when

r=14(6q1)r=\frac{1}{4}(6q-1) (107)

In other words, h2h_{2} decreases for r=1,2,r=1,2,\ldots, then reaches the minimum999In fact, due to the diophantic nature of h2h_{2}, its minimum is either at 14(6q1)\left\lfloor\frac{1}{4}(6q-1)\right\rfloor or 14(6q1)\left\lceil\frac{1}{4}(6q-1)\right\rceil. at r=14(6q1)r=\frac{1}{4}(6q-1), next starts to increase for r14(6q1)r\geq\left\lceil\frac{1}{4}(6q-1)\right\rceil. However, h,h2h,h_{2} are defined for rq1r\leq q-1. Thus, it is clear that within the interval 0rq10\leq r\leq q-1 the function h2h_{2} is strictly decreasing with respect to rr. Moreover, it is easy to verify that q1<14(6q1)q-1<\left\lfloor\frac{1}{4}(6q-1)\right\rfloor. Thus, to determine the minimal value of h2h_{2} it is enough to check their value for r=q1r=q-1.

Thus h2h_{2}:

h2(q,q1)=16(2q2+3q+1)h_{2}(q,q-1)=\frac{1}{6}\left(2q^{2}+3q+1\right) (108)

Since, q2q\geq 2 then it is easy to verify that h2(q,q1)>0h_{2}(q,q-1)>0. This implies that h(q,k,r)>0h(q,k,r)>0 for every 0rq10\leq r\leq q-1 and kk such that 12(2q2r(2q+1))k<12(2q2(r1)(2q+1))\frac{1}{2}\left(2q^{2}-r\left(2q+1\right)\right)\leq k<\frac{1}{2}\left(2q^{2}-(r-1)\left(2q+1\right)\right). Hence, also (n,𝒳(n)k)>0\mathcal{F}(n,\mathcal{X}(n)-k)>0 for n=2q+1n=2q+1 where 1kq21\leq k\leq q^{2}, which completes the proof of (66).

Proof. Equation (67), part 1.

Let nn be even i.e. n=2qn=2q where q+q\in\mathbb{N}_{+}. Since (4) to prove that (n,𝒳(n)+k)\mathcal{F}(n,\mathcal{X}(n)+k) is smaller than 0 it is enough to show that for every integer q,kq,k such that q2q\geq 2 and 1k(n2)𝒳(n)1\leq k\leq\binom{n}{2}-\mathcal{X}(n) where (n2)𝒳(n)=(2q2)q(q1)=q2\binom{n}{2}-\mathcal{X}(n)=\binom{2q}{2}-q(q-1)=q^{2} it holds that (2q,q(q1)+k)0\mathcal{F}(2q,q(q-1)+k)\leq 0. After a series of elementary transformations applied to (48) we obtain that:

(2q,q(q1)+k)=23(qkq2+(q2k)kq+k(q1))\mathcal{F}(2q,q(q-1)+k)=-\frac{2}{3}\left(q\left\lfloor\frac{k}{q}\right\rfloor^{2}+(q-2k)\left\lfloor\frac{k}{q}\right\rfloor+k(q-1)\right) (109)

Let us consider the relationship between kk and kq\left\lfloor\frac{k}{q}\right\rfloor. When 1k<q1\leq k<q it holds that kq=0\left\lfloor\frac{k}{q}\right\rfloor=0, when qk<2qq\leq k<2q it holds that kq=1\left\lfloor\frac{k}{q}\right\rfloor=1 and similarly, 2qk<3q2q\leq k<3q then it holds that kq=2\left\lfloor\frac{k}{q}\right\rfloor=2. In general, when rqk<(r+1)qrq\leq k<(r+1)q then kq=r\left\lfloor\frac{k}{q}\right\rfloor=r. Of course, since kq2k\leq q^{2} then rqr\leq q. Hence, instead of considering the function \mathcal{F} at once, we may analyze it in the intervals in which kq\left\lfloor\frac{k}{q}\right\rfloor is known and constant. Let us define:

f(q,k,r)=dfqr2+(q2k)r+k(q1)f(q,k,r)\overset{\textit{df}}{=}qr^{2}+(q-2k)r+k(q-1) (110)

It is easy to see that f(q,k,r)=32(2q,q(q1)+k)f(q,k,r)=-\frac{3}{2}\cdot\mathcal{F}(2q,q(q-1)+k) if rqk<(r+1)qrq\leq k<(r+1)q for r=0,,q1r=0,\ldots,q-1. Hence, instead of analyzing \mathcal{F} we will focus on the auxiliary function f.f.

The first observation is that ff is linear with respect to kk providing that qq and rr are known and fixed. Thus, the minimal value of ff with respect to kk within the interval rqk<(r+1)qrq\leq k<(r+1)q is min{f(q,rq,r),f(q,(r+1)q,r)}\min\{f(q,rq,r),f(q,(r+1)q,r)\}. In other words, it is enough to check that ff is greater than 0 at both edges of the interval for kk. Let us consider ff at the lower bound, i.e. for k=rqk=rq.

f(q,rq,r)=qr(qr)f(q,rq,r)=qr(q-r) (111)

It is easy to verify that for every 0<r<q0<r<q and q2q\geq 2 the value f(q,rq,r)>0f(q,rq,r)>0. The function f(q,rq,r)f(q,rq,r) reaches 0 when r=0r=0. Thus, f(q,rq,r)0f(q,rq,r)\geq 0 for every rr such that 0rq0\leq r\leq q.

Let us consider ff at the other end of interval, i.e. for k=(r+1)q1k=(r+1)q-1.

f(q,(r+1)q1,r)=q2(r+1)q(r2+2r+2)+2r+1f(q,(r+1)q-1,r)=q^{2}(r+1)-q\left(r^{2}+2r+2\right)+2r+1 (112)

Similarly as above, we would like to show that for every admissible rr the function f(q,(r+1)q1,r)0f(q,(r+1)q-1,r)\geq 0. Hence, let us rewrite ff with respect to rr.

f(q,(r+1)q1,r)=qr2+r(q22q+2)+(q22q+1)f(q,(r+1)q-1,r)=-qr^{2}+r\left(q^{2}-2q+2\right)+\left(q^{2}-2q+1\right) (113)

When considering ff as a polynomial with respect to rr one may notice that the coefficient at r2r^{2} is negative (q<0-q<0) which means that ff is concave.

Let us denote f2(q,r)=dff(q,(r+1)q1,r)f_{2}(q,r)\stackrel{{\scriptstyle\textit{df}}}{{=}}f(q,(r+1)q-1,r). It is easy to compute that f2r=0\frac{\partial f_{2}}{\partial r}=0 when r=q22q+22qr=\frac{q^{2}-2q+2}{2q}. Since 2f2r2=2q>0\frac{\partial^{2}f_{2}}{\partial r^{2}}=-2q>0, thus f2f_{2} reaches the maximum101010In fact, due to the diophantine nature of ff it reaches the maximum for r=q22q+22qr=\left\lfloor\frac{q^{2}-2q+2}{2q}\right\rfloor or r=q22q+22qr=\left\lceil\frac{q^{2}-2q+2}{2q}\right\rceil. for r=q22q+22qr=\frac{q^{2}-2q+2}{2q}. Since the interval of rr is 0r<q0\leq r<q and also 0q22q+22q<q0\leq\frac{q^{2}-2q+2}{2q}<q therefore the minimum of f2f_{2} for 0r<q0\leq r<q is the smaller of the two f2(q,0)f_{2}(q,0) and f2(q,q1)f_{2}(q,q-1).

Hence

f2(q,0)=q22q+1,f2(q,q1)=q1f_{2}(q,0)=q^{2}-2q+1,\,\,\,\,f_{2}(q,q-1)=q-1 (114)

Since for every q2q\geq 2 it holds that min{f2(q,0),f2(q,q1)}0\min\{f_{2}(q,0),f_{2}(q,q-1)\}\geq 0 then f2(q,r)0f_{2}(q,r)\geq 0 for every fixed q2q\geq 2 and 0r<q0\leq r<q, which implies that also for k=(r+1)q1k=(r+1)q-1, f(q,k,r)0f(q,k,r)\geq 0. Therefore f(q,k,r)0f(q,k,r)\geq 0 for every rqk<(r+1)qrq\leq k<(r+1)q for r=0,,qr=0,\ldots,q.

As f(q,k,r)=32(2q,q(q1)+k)f(q,k,r)=-\frac{3}{2}\cdot\mathcal{F}(2q,q(q-1)+k) when rqk<(r+1)qrq\leq k<(r+1)q, then due to the arbitrary choice of rr it holds that (n,𝒳(n)+k)0\mathcal{F}(n,\mathcal{X}(n)+k)\leq 0 for n=2qn=2q and 0k<q20\leq k<q^{2}. As one may observe, the above reasoning does not cover k=q2k=q^{2}. This is the last “point interval” that needs to be considered. For k=q2k=q^{2} we have

(2q,q(q1)+q2)=13(2)q(2q2+(14q)2q+2(2q1)q)\mathcal{F}(2q,q(q-1)+q^{2})=\frac{1}{3}(-2)q\left(\lfloor 2q\rfloor^{2}+(1-4q)\lfloor 2q\rfloor+2(2q-1)q\right) (115)

Since q+q\in\mathbb{N}_{+} then 2q=2q\lfloor 2q\rfloor=2q. Hence it is easy to verify that

(2q,q(q1)+q2)=0\mathcal{F}(2q,q(q-1)+q^{2})=0 (116)

Which completes the first part of the proof of (67).

Proof. Equation (67), part 2.

Let nn be odd i.e. n=2q+1n=2q+1 where q+q\in\mathbb{N}_{+}. Since (4) to prove that (n,𝒳(n)+k)\mathcal{F}(n,\mathcal{X}(n)+k) is smaller than 0 it is enough to show that for every integer q,kq,k such that q2q\geq 2 and 1k(2q2)q21=q2q11\leq k\leq\binom{2q}{2}-q^{2}-1=q^{2}-q-1 it holds that (2q+1,q2+k)0\mathcal{F}(2q+1,q^{2}+k)\leq 0. After a series of elementary transformations applied to (48) we obtain:

(2q+1,q2+k)=\displaystyle\mathcal{F}(2q+1,q^{2}+k)= 13((2q+1)2(q2+k)2q+12\displaystyle-\frac{1}{3}\left((2q+1)\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor^{2}\right.
(4k+4q22q1)2(q2+k)2q+1\displaystyle-\left(4k+4q^{2}-2q-1\right)\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor (117)
+(2q1)(3k+(q1)q))\displaystyle\left.\begin{array}[]{c}\\ \\ \\ \end{array}+(2q-1)(3k+(q-1)q)\right) (121)

Since 1kq2q11\leq k\leq q^{2}-q-1 we may estimate the upper and the lower bound for 2(q2+k)2q+1\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor as

q12q22q+1+2k2q+12(q2+k)2q+1q-1\leq\left\lfloor\frac{2q^{2}}{2q+1}\right\rfloor+\left\lfloor\frac{2k}{2q+1}\right\rfloor\leq\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor (122)

and

2(q2+k)2q+1\displaystyle\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor\leq 2(q2+q2q1)2q+14q22q2q+22q+1=\displaystyle\left\lfloor\frac{2\left(q^{2}+q^{2}-q-1\right)}{2q+1}\right\rfloor\leq\left\lfloor\frac{4q^{2}}{2q}-\frac{2q+2}{2q+1}\right\rfloor= (123)
2q2q+22q+1=2q2=2q2\displaystyle\left\lfloor 2q-\frac{2q+2}{2q+1}\right\rfloor=\left\lfloor 2q-2\right\rfloor=2q-2

Let us denote r=df2(q2+k)2q+1r\overset{\textit{df}}{=}\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor. Thus, q1r2q2q-1\leq r\leq 2q-2. Let us consider the relationship between kk and rr. It holds that 2(q2+k)2q+1=r\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor=r wherever r2(q2+k)2q+1<r+1r\leq\frac{2\left(q^{2}+k\right)}{2q+1}<r+1. Thus it is easy to determine that 2(q2+k)2q+1=r\left\lfloor\frac{2\left(q^{2}+k\right)}{2q+1}\right\rfloor=r wherever 12(2qr+r2q2)k<12((r+1)(2q+1)2q2)\frac{1}{2}\left(2qr+r-2q^{2}\right)\leq k<\frac{1}{2}\left(\left(r+1\right)\left(2q+1\right)-2q^{2}\right).

Let us consider the function (2q+1,q2+k)\mathcal{F}(2q+1,q^{2}+k) for k+k\in\mathbb{N}_{+} such that 12(2qr+r2q2)k<12((r+1)(2q+1)2q2)\frac{1}{2}\left(2qr+r-2q^{2}\right)\leq k<\frac{1}{2}\left(\left(r+1\right)\left(2q+1\right)-2q^{2}\right). For this purpose, let us define ff

f(q,k,r)=df(2q+1)r2r(4k+4q22q1)+(2q1)(3k+(q1)q)f(q,k,r)\overset{\textit{df}}{=}(2q+1)r^{2}-r\left(4k+4q^{2}-2q-1\right)+(2q-1)(3k+(q-1)q) (124)

It is easy to verify that

(2q+1,q2+k)=13f(q,k,r)\mathcal{F}(2q+1,q^{2}+k)=-\frac{1}{3}f(q,k,r) (125)

providing that q,r+q,r\in\mathbb{N}_{+}, 12(2qr+r2q2)k<12((r+1)(2q+1)2q2)\frac{1}{2}\left(2qr+r-2q^{2}\right)\leq k<\frac{1}{2}\left(\left(r+1\right)\left(2q+1\right)-2q^{2}\right), q1r2q2q-1\leq r\leq 2q-2 and q2q\geq 2. Hence, wherever f(q,k,r)0f(q,k,r)\geq 0 then (2q+1,q2+k)0\mathcal{F}(2q+1,q^{2}+k)\leq 0. Let us observe that ff is linear with respect to kk. Therefore it is enough to check the value of f(q,k,r)f(q,k,r) at the edges of the admissible interval for kk, and prove that those values are above 0 in any possible interval determined by rr. For this purpose let us define

f2(q,r)=dff(q,12(2qr+r2q2),r)f_{2}(q,r)\stackrel{{\scriptstyle\textit{df}}}{{=}}f(q,\frac{1}{2}\left(2qr+r-2q^{2}\right),r) (126)

for the lower bound, and

f3(q,r)=dff(q,12((r+1)(2q+1)2q2)1,r)f_{3}(q,r)\stackrel{{\scriptstyle\textit{df}}}{{=}}f(q,\frac{1}{2}\left(\left(r+1\right)\left(2q+1\right)-2q^{2}\right)-1,r) (127)

for the upper bound. Hence

f2(q,r)=12(2q+1)(4q26qr2q+2r2+r)f_{2}(q,r)=-\frac{1}{2}(2q+1)\left(4q^{2}-6qr-2q+2r^{2}+r\right) (128)
f3(q,r)=4q3+6q2(r+1)q(2r2+2r+5)+12(2r2+3r+3)f_{3}(q,r)=-4q^{3}+6q^{2}(r+1)-q\left(2r^{2}+2r+5\right)+\frac{1}{2}\left(-2r^{2}+3r+3\right) (129)

Let us reorganize the above equations with respect to rr:

f2(q,r)=(2q+1)r2+(2q+6q212)r4q3+qf_{2}(q,r)=-\left(2q+1\right)r^{2}+\left(2q+6q^{2}-\frac{1}{2}\right)r-4q^{3}+q (130)
f3(q,r)=(2q+1)r2+(6q22q+32)r4q3+6q25q+32f_{3}(q,r)=-\left(2q+1\right)r^{2}+\left(6q^{2}-2q+\frac{3}{2}\right)r-4q^{3}+6q^{2}-5q+\frac{3}{2} (131)

Since both f2f_{2} and f3f_{3} have second degree polynomials with respect to rr, and the coefficients nearby r2r^{2} are negative, then f2f_{2} and f3f_{3} are concave parabolas. Therefore f2f_{2} and f3f_{3} are not smaller than 0 within the interval q1r2q2q-1\leq r\leq 2q-2 if they are not negative at both ends of the interval i.e. q1q-1 and 2q22q-2. As the estimation (122) is not perfect, let us assume for a moment that rr is in qr2q2q\leq r\leq 2q-2, whilst the case r=q1r=q-1 we handle separately.

Let us examine (130).

f2(q,r)=q2+q2whenr=qf_{2}(q,r)=q^{2}+\frac{q}{2}\,\,\,\textit{when}\,\,\,\,r=q (132)

and

f2(q,r)=(2q3)(2q+1)whenr=2q2f_{2}(q,r)=(2q-3)(2q+1)\,\,\,\textit{when}\,\,\,r=2q-2 (133)

Since q2q\geq 2 both of the above equations are greater than 0. For (131) it is enough to assume that q1r2q2q-1\leq r\leq 2q-2. Thus,

f3(q,r)=q23q21whenr=q1f_{3}(q,r)=q^{2}-\frac{3q}{2}-1\,\,\,\textit{when}\,\,\,r=q-1 (134)

and

f3(q,r)=2q2+2q112whenr=2q2f_{3}(q,r)=2q^{2}+2q-\frac{11}{2}\,\,\,\textit{when}\,\,\,r=2q-2 (135)

Similarly, it is easy to verify that both of the above expressions are non negative as q2q\geq 2.

At the end, let us explicitly calculate

f(q,k,q1)=2kq+kf(q,k,q-1)=2kq+k (136)

As kk is always non negative, then also in this case ff is non negative 0. Thereby for every 1kq2q11\leq k\leq q^{2}-q-1 it holds that (2q+1,q2+k)0\mathcal{F}(2q+1,q^{2}+k)\leq 0 which completes the proof of the Lemma 7 \square

Appendix B Proof of the Lemma 8

Thesis.

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 the function 𝒞\mathcal{C}:

  1. 1.

    is constant and equals 𝒞(n,m)=0\mathcal{C}(n,m)=0 for every mm such that 0m<n0\leq m<n

  2. 2.

    is strictly increasing for every m+m\in\mathbb{N}_{+} such that nm(n2)n\leq m\leq\binom{n}{2}, i.e.

𝒞(n,m+1)𝒞(n,m)>0\mathcal{C}(n,m+1)-\mathcal{C}(n,m)>0

Proof. Claim 1.

The first claim that 𝒞(n,m)=0\mathcal{C}(n,m)=0 for every mm such that 0m<n0\leq m<n is a direct consequence of the equation (25). It is enough to note that the right side of expression (25) is the product where the first part is 12mn\frac{1}{2}\left\lfloor\frac{m}{n}\right\rfloor. Hence, wherever m<nm<n the product often equals 0.

Proof. Claim 2.

Due to (Theorem 4) it holds that

𝒞(n,m+1)𝒞(n,m)=\displaystyle\mathcal{C}(n,m+1)-\mathcal{C}(n,m)= 12(mn(nmn2m+n)\displaystyle\frac{1}{2}\left(\left\lfloor\frac{m}{n}\right\rfloor\left(n\left\lfloor\frac{m}{n}\right\rfloor-2m+n\right)-\right.
m+1n(nm+1n2m+n2))\displaystyle\left.\left\lfloor\frac{m+1}{n}\right\rfloor\left(n\left\lfloor\frac{m+1}{n}\right\rfloor-2m+n-2\right)\right) (136)

It is easy to observe that for some positive integer p=1,2,p=1,2,\ldots when m=np1m=np-1 then mn=p1,m+1n=p\left\lfloor\frac{m}{n}\right\rfloor=p-1,\left\lfloor\frac{m+1}{n}\right\rfloor=p. Next, by increasing mm by one we get m=npm=np and mn=p,m+1n=p\left\lfloor\frac{m}{n}\right\rfloor=p,\left\lfloor\frac{m+1}{n}\right\rfloor=p. Then, for m=n(p+1)1m=n(p+1)-1 the values of our floored expressions change to mn=p,m+1n=p+1\left\lfloor\frac{m}{n}\right\rfloor=p,\left\lfloor\frac{m+1}{n}\right\rfloor=p+1, and then by increasing mm by one we get mn=p+1,m+1n=p+1\left\lfloor\frac{m}{n}\right\rfloor=p+1,\left\lfloor\frac{m+1}{n}\right\rfloor=p+1. Hence, there are two different intervals with respect to the values mn\left\lfloor\frac{m}{n}\right\rfloor and m+1n\left\lfloor\frac{m+1}{n}\right\rfloor. The first one in which both expressions have the same value, and the other one (composed of one point) in which their values differ by one. In general, we may observe that:

wherever m=np1m=np-1 then mn=p1,m+1n=p\left\lfloor\frac{m}{n}\right\rfloor=p-1,\left\lfloor\frac{m+1}{n}\right\rfloor=p, and wherever npm<n(p+1)1np\leq m<n(p+1)-1 then mn=p,m+1n=p\left\lfloor\frac{m}{n}\right\rfloor=p,\left\lfloor\frac{m+1}{n}\right\rfloor=p.

Let us define the auxiliary function hh by replacing in (136) mn\left\lfloor\frac{m}{n}\right\rfloor by rr and m+1n\left\lfloor\frac{m+1}{n}\right\rfloor by tt:

h(n,m,r,t)=df12(r(nr2m+n)t(nt2m+n2))h(n,m,r,t)\overset{\textit{df}}{=}\frac{1}{2}\left(r\left(nr-2m+n\right)-t\left(nt-2m+n-2\right)\right) (137)

The function hh can be rewritten with respect to mm, so

h(n,m,r,t)=12nr2+m(tr)+12nr12nt212nt+th(n,m,r,t)=\frac{1}{2}nr^{2}+m\left(t-r\right)+\frac{1}{2}nr-\frac{1}{2}nt^{2}-\frac{1}{2}nt+t (138)

It is easy to observe that

𝒞(n,m+1)𝒞(n,m)=h(n,m,r,t)\mathcal{C}(n,m+1)-\mathcal{C}(n,m)=h(n,m,r,t) (139)

where r=mnr=\left\lfloor\frac{m}{n}\right\rfloor and t=m+1nt=\left\lfloor\frac{m+1}{n}\right\rfloor. Thus, instead of analyzing h(n,m,r,t)h(n,m,r,t) for mm such that nm(n2)n\leq m\leq\binom{n}{2} we analyze h(n,m,r,t)h(n,m,r,t) in two intervals m=np1m=np-1 and npm<n(p+1)1np\leq m<n(p+1)-1. This, due to the arbitrary choice of pp, would apply to 𝒞(n,m+1)𝒞(n,m)\mathcal{C}(n,m+1)-\mathcal{C}(n,m) over the whole interval nm(n2)n\leq m\leq\binom{n}{2}.

Let us observe that hh is linear with respect to mm. Thus to prove that h(n,m,r,t)>0h(n,m,r,t)>0 when n,r,tn,r,t are constant, one needs only to verify the value of hh at the ends of both intervals to which mm may belong. Thus, let us consider the first “point” interval m=np1m=np-1. In this interval mn=p1,m+1n=p\left\lfloor\frac{m}{n}\right\rfloor=p-1,\left\lfloor\frac{m+1}{n}\right\rfloor=p, thus:

h(n,np1,p1,p)=p1h(n,np-1,p-1,p)=p-1 (140)

As mnm\geq n, and m=np1m=np-1 thus p2p\geq 2. Hence,

h(n,np1,p1,p)21=1h(n,np-1,p-1,p)\geq 2-1=1 (141)

This supports the thesis of the theorem, i.e. npm<n(p+1)1np\leq m<n(p+1)-1, where mn=p,m+1n=p\left\lfloor\frac{m}{n}\right\rfloor=p,\left\lfloor\frac{m+1}{n}\right\rfloor=p. For both its ends we have:

h(n,np,p,p)=ph(n,np,p,p)=p (142)
h(n,n(p+1)1,p,p)=ph(n,n(p+1)-1,p,p)=p (143)

As mnm\geq n and npmnp\leq m then p1p\geq 1. Thus in both cases hh is strictly greater than 0. Hence, for every np1mn(p+1)1np-1\leq m\leq n(p+1)-1 it holds that

𝒞(n,m+1)𝒞(n,m)>0\mathcal{C}(n,m+1)-\mathcal{C}(n,m)>0 (144)

Due to the arbitrary choice of pp this statement completes the proof of the theorem. \square

Appendix C Proof of the Lemma 9

Thesis.

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 it holds that

(n3)𝒞(n,𝒳(n))=𝒴(n)\binom{n}{3}-\mathcal{C}(n,\mathcal{X}(n))=\mathcal{Y}(n)

Proof. Part 1.

Let n=4qn=4q (nn is even, and n2=n2=2q\left\lfloor\frac{n}{2}\right\rfloor=\left\lceil\frac{n}{2}\right\rceil=2q is even), n4n\geq 4, hence q1q\geq 1 and 𝒳(4q)=2q(2q1)\mathcal{X}(4q)=2q(2q-1). Thus to prove (69) for even numbers we show that

(4q3)𝒞(4q,2q(2q1))𝒴(4q)=0\binom{4q}{3}-\mathcal{C}(4q,2q(2q-1))-\mathcal{Y}(4q)=0 (144)

Since (64) reduces to:

𝒴(4q)=\displaystyle\mathcal{Y}(4q)= (4q3)((2q3)q(q21)3)\displaystyle\binom{4q}{3}-\left(\binom{2q}{3}-\frac{q\left(q^{2}-1\right)}{3}\right)
((2q3)q(q21)3)\displaystyle-\left(\binom{2q}{3}-\frac{q\left(q^{2}-1\right)}{3}\right) (145)

by elementary transformations one may show that (144) is equivalent to

2q(12q+q1)2=02q\left(\left\lceil\frac{1}{2}-q\right\rceil+q-1\right)^{2}=0 (146)

The above is true as 12q=1q\left\lceil\frac{1}{2}-q\right\rceil=1-q for every q+q\in\mathbb{N}_{+}.

Proof. Part 2.

Let n=4q+1n=4q+1 (nn is odd, n2=2q\left\lfloor\frac{n}{2}\right\rfloor=2q is even, and n2=2q+1\left\lceil\frac{n}{2}\right\rceil=2q+1 is odd), n4n\geq 4, hence q1q\geq 1 and 𝒳(4q+1)=(n22)+(n22)=(2q2)+(2q+12)=4q2\mathcal{X}(4q+1)=\binom{\left\lfloor\frac{n}{2}\right\rfloor}{2}+\binom{\left\lceil\frac{n}{2}\right\rceil}{2}=\binom{2q}{2}+\binom{2q+1}{2}=4q^{2}. Thus to prove (69) for n=4q+1n=4q+1 we show that

(4q+13)𝒞(4q+1,4q2)𝒴(4q+1)=0\binom{4q+1}{3}-\mathcal{C}(4q+1,4q^{2})-\mathcal{Y}(4q+1)=0 (147)

Since (64) reduces to:

𝒴(4q+1)=\displaystyle\mathcal{Y}(4q+1)= (4q+13)((2q3)q(q21)3)\displaystyle\binom{4q+1}{3}-\left(\binom{2q}{3}-\frac{q\left(q^{2}-1\right)}{3}\right)
((2q+13)q(2q2+3q+1)6)\displaystyle-\left(\binom{2q+1}{3}-\frac{q\left(2q^{2}+3q+1\right)}{6}\right) (148)

by elementary transformations one may show that (147) is equivalent to

12((4q+1)4q24q+12+\displaystyle\frac{1}{2}\left((4q+1)\left\lfloor\frac{4q^{2}}{4q+1}\right\rfloor^{2}+\right.
(8q2+4q+1)4q24q+1+q(4q25q+1))\displaystyle\left.\left(-8q^{2}+4q+1\right)\left\lfloor\frac{4q^{2}}{4q+1}\right\rfloor+q\left(4q^{2}-5q+1\right)\right) =0\displaystyle=0 (149)

Let us note that for every q1q\geq 1 it holds111111compare with (89). that 4q24q+1=q1\left\lfloor\frac{4q^{2}}{4q+1}\right\rfloor=q-1. Thus, the above equation can be written in the form

12((8q2+4q+1)(q1)+(4q25q+1)q+(4q+1)(q1)2)=0\frac{1}{2}\left(\left(-8q^{2}+4q+1\right)(q-1)+\left(4q^{2}-5q+1\right)q+(4q+1)(q-1)^{2}\right)=0 (150)

which can be easily verified as true.

Proof. Part 3.

Let n=4q+2n=4q+2 (nn is even, n2=2q+1\left\lfloor\frac{n}{2}\right\rfloor=2q+1 is odd, and n2=2q+1\left\lceil\frac{n}{2}\right\rceil=2q+1 is odd) and 𝒳(4q+2)=(n22)+(n22)=(2q+12)+(2q+12)=2q(2q+1)\mathcal{X}(4q+2)=\binom{\left\lfloor\frac{n}{2}\right\rfloor}{2}+\binom{\left\lceil\frac{n}{2}\right\rceil}{2}=\binom{2q+1}{2}+\binom{2q+1}{2}=2q(2q+1) Thus, to prove (69) for n=4q+2n=4q+2 we show that

(4q+23)𝒞(4q+2,2q(2q+1))𝒴(4q+2)=0\binom{4q+2}{3}-\mathcal{C}(4q+2,2q(2q+1))-\mathcal{Y}(4q+2)=0 (151)

Since (64) reduces to:

𝒴(4q+2)=(4q+23)2((2q+13)q(2q2+3q+1)6)\mathcal{Y}(4q+2)=\binom{4q+2}{3}-2\left(\binom{2q+1}{3}-\frac{q\left(2q^{2}+3q+1\right)}{6}\right) (152)

by elementary transformations one may show that (151) is equivalent to

(2q+1)(q2+(12q)q+(q1)q)=0(2q+1)\left(\lfloor q\rfloor^{2}+(1-2q)\lfloor q\rfloor+(q-1)q\right)=0 (153)

As qq is an integer it is easy to show that (153) is true.

Proof. Part 4.

Let n=4q+3n=4q+3 (nn is odd n2=2q+1\left\lfloor\frac{n}{2}\right\rfloor=2q+1 is odd, and n2=2q+2\left\lceil\frac{n}{2}\right\rceil=2q+2 is even) and 𝒳(4q+3)=(n22)+(n22)=(2q+12)+(2q+22)=(2q+1)2\mathcal{X}(4q+3)=\binom{\left\lfloor\frac{n}{2}\right\rfloor}{2}+\binom{\left\lceil\frac{n}{2}\right\rceil}{2}=\binom{2q+1}{2}+\binom{2q+2}{2}=(2q+1)^{2}. Thus, to prove (69) for n=4q+3n=4q+3 we show that

(4q+33)𝒞(4q+3,(2q+1)2)𝒴(4q+3)=0\binom{4q+3}{3}-\mathcal{C}(4q+3,(2q+1)^{2})-\mathcal{Y}(4q+3)=0 (154)

by elementary transformations one may show that (154) is equivalent to:

12((8q24q+1)(2q+1)24q+3+\displaystyle\frac{1}{2}\left(\left(-8q^{2}-4q+1\right)\left\lfloor\frac{(2q+1)^{2}}{4q+3}\right\rfloor\right.+
(4q+3)(2q+1)24q+32+(4q2+q1)q)\displaystyle\left.(4q+3)\left\lfloor\frac{(2q+1)^{2}}{4q+3}\right\rfloor^{2}+\left(4q^{2}+q-1\right)q\right) =0\displaystyle=0 (155)

Since121212Let us notice that (2q+1)24q+3=4q2+4q+14q+3==q+q+14q+3\left\lfloor\frac{(2q+1)^{2}}{4q+3}\right\rfloor=\left\lfloor\frac{4q^{2}+4q+1}{4q+3}\right\rfloor=\ldots=\left\lfloor q+\frac{q+1}{4q+3}\right\rfloor. The fact that for q=0,1,q=0,1,\ldots the expression q+14q+3\frac{q+1}{4q+3} is always smaller than 11, implies that (2q+1)24q+3=q\left\lfloor\frac{(2q+1)^{2}}{4q+3}\right\rfloor=\left\lfloor q\right\rfloor. (2q+1)24q+3=q=q\left\lfloor\frac{(2q+1)^{2}}{4q+3}\right\rfloor=\left\lfloor q\right\rfloor=q then the above expression can be written as:

12((4q+3)q2+(4q2+q1)q+(8q24q+1)q)=0\frac{1}{2}\left((4q+3)q^{2}+\left(4q^{2}+q-1\right)q+\left(-8q^{2}-4q+1\right)q\right)=0 (156)

which can easily be verified as true. This also completes the proof of the Lemma 9.

\square

Appendix D Proof of the Lemma 10

Thesis.

For every n+,n3n\in\mathbb{N}_{+},n\geq 3 the function 𝒢\mathcal{G} is strictly decreasing for every m+m\in\mathbb{N}_{+} such that 1m𝒳(n)1\leq m\leq\mathcal{X}(n), i.e.

𝒢(n,m)𝒢(n,m+1)>0where   1m<𝒳(n)\mathcal{G}(n,m)-\mathcal{G}(n,m+1)>0\,\,\,\textit{where}\,\,\,1\leq m<\mathcal{X}(n)

Proof of (70), part 1 (for even numbers)

Let n=2qn=2q (even), n3n\geq 3, hence q2q\geq 2, and m,m+1𝒳(2q)=q(q1)m,m+1\leq\mathcal{X}(2q)=q(q-1). Note that, in particular, the last assumption implies that mq(q1)1m\leq q(q-1)-1. Hence (70) can be written as:

3(𝒢(n,m)𝒢(n,m+1))=\displaystyle 3\left(\mathcal{G}(n,m)-\mathcal{G}(n,m+1)\right)= 2qmq2+(4m2q)mq+2qm+1q2\displaystyle-2q\left\lfloor\frac{m}{q}\right\rfloor^{2}+(4m-2q)\left\lfloor\frac{m}{q}\right\rfloor+2q\left\lfloor\frac{m+1}{q}\right\rfloor^{2}
3qm2q2+3qm+12q24mm+1q\displaystyle-3q\left\lfloor\frac{m}{2q}\right\rfloor^{2}+3q\left\lfloor\frac{m+1}{2q}\right\rfloor^{2}-4m\left\lfloor\frac{m+1}{q}\right\rfloor
+2qm+1q4m+1q+3(mq)m2q\displaystyle+2q\left\lfloor\frac{m+1}{q}\right\rfloor-4\left\lfloor\frac{m+1}{q}\right\rfloor+3(m-q)\left\lfloor\frac{m}{2q}\right\rfloor
3mm+12q+3qm+12q3m+12q+6q6\displaystyle-3m\left\lfloor\frac{m+1}{2q}\right\rfloor+3q\left\lfloor\frac{m+1}{2q}\right\rfloor-3\left\lfloor\frac{m+1}{2q}\right\rfloor+6q-6 (156)

Let us denote r1=mq,r2=m2q,r3=m+1q,r4=m+12qr_{1}=\left\lfloor\frac{m}{q}\right\rfloor,r_{2}=\left\lfloor\frac{m}{2q}\right\rfloor,r_{3}=\left\lfloor\frac{m+1}{q}\right\rfloor,r_{4}=\left\lfloor\frac{m+1}{2q}\right\rfloor. This allows us to denote

3(𝒢(n,m)𝒢(n,m+1))=\displaystyle 3\left(\mathcal{G}(n,m)-\mathcal{G}(n,m+1)\right)= 2qr12+(4m2q)r1+2qr323qr22\displaystyle-2qr_{1}^{2}+(4m-2q)r_{1}+2qr_{3}^{2}-3qr_{2}^{2}
+3qr424mr3+2qr34r3+3(mq)r2\displaystyle+3qr_{4}^{2}-4mr_{3}+2qr_{3}-4r_{3}+3(m-q)r_{2}
3mr4+3qr43r4+6q6\displaystyle-3mr_{4}+3qr_{4}-3r_{4}+6q-6 (157)

Let us introduce the auxiliary function hh such that

h(q,m,r1,r2,r3,r4)=df\displaystyle h(q,m,r_{1},r_{2},r_{3},r_{4})\overset{\textit{df}}{=} r1(4m2q)+3r2(mq)4mr33mr4\displaystyle r_{1}(4m-2q)+3r_{2}(m-q)-4mr_{3}-3mr_{4}
2qr123qr22+2qr32+3qr42+2qr3\displaystyle-2qr_{1}^{2}-3qr_{2}^{2}+2qr_{3}^{2}+3qr_{4}^{2}+2qr_{3}
+3qr4+6q4r33r46\displaystyle+3qr_{4}+6q-4r_{3}-3r_{4}-6 (158)

It is easy to verify that

3(𝒢(n,m)𝒢(n,m+1))=h(q,m,r1,r2,r3,r4)3\left(\mathcal{G}(n,m)-\mathcal{G}(n,m+1)\right)=h(q,m,r_{1},r_{2},r_{3},r_{4}) (159)

Let us try to investigate changes in the values r1,r2,r3r_{1},r_{2},r_{3} and r4r_{4}. To do so, let us create the following table:

interval of mm mq\left\lfloor\frac{m}{q}\right\rfloor m2q\left\lfloor\frac{m}{2q}\right\rfloor m+1q\left\lfloor\frac{m+1}{q}\right\rfloor m+12q\left\lfloor\frac{m+1}{2q}\right\rfloor
0qm<1q10q\leq m<1q-1 0 0 0 0
1q1=m1q-1=m 0 0 11 0
1qm<2q11q\leq m<2q-1 11 0 11 0
2q1=m2q-1=m 11 0 22 11
2qm<3q12q\leq m<3q-1 22 11 22 11
3q1=m3q-1=m 22 11 33 11
3qm<4q13q\leq m<4q-1 33 11 33 11
4q1=m4q-1=m 33 11 44 22
4qm<5q14q\leq m<5q-1 44 22 44 22

As we can see, there are four kinds of interval (hereinafter referred to as cases) that need to be considered with respect to mm. Every analyzed interval is parametrized by the auxiliary variable s{0}s\in\mathbb{N}\cup\{0\}. By choosing arbitrarily s=0,1,2,3,s=0,1,2,3,\ldots we are able to analyze the function hh, and as follows 𝒢(n,m)𝒢(n,m+1)\mathcal{G}(n,m)-\mathcal{G}(n,m+1), for every interesting mm. The cases we need to consider are:

Case interval of mm mq\left\lfloor\frac{m}{q}\right\rfloor m2q\left\lfloor\frac{m}{2q}\right\rfloor m+1q\left\lfloor\frac{m+1}{q}\right\rfloor m+12q\left\lfloor\frac{m+1}{2q}\right\rfloor
1a 2sqm<(2s+1)q12sq\leq m<(2s+1)q-1 2s2s ss 2s2s ss
2a (2s+1)q1=m(2s+1)q-1=m 2s2s ss 2s+12s+1 s+1s+1
3a (2s+1)qm<(2s+2)q1\left(2s+1\right)q\leq m<(2s+2)q-1 2s+12s+1 ss 2s+12s+1 ss
4a (2s+1)q1=m(2s+1)q-1=m 2s2s ss 2s+12s+1 ss

Case 1a

Let 2sqm<(2s+1)q12sq\leq m<(2s+1)q-1. As mq(q1)1m\leq q(q-1)-1, then the candidate for the highest value of ss is the smallest integer for which q(q1)1<(2s+1)q1q(q-1)-1<(2s+1)q-1, hence q22<s\frac{q-2}{2}<s. This means that q22+1=s\left\lfloor\frac{q-2}{2}\right\rfloor+1=s, hence q22+1s\frac{q-2}{2}+1\geq s. On the other hand, as 2sqm2sq\leq m and mq(q1)1m\leq q(q-1)-1 then sq(q1)12qs\leq\frac{q(q-1)-1}{2q}. Since the second condition is more restrictive131313Note that (q22+1)q(q1)12q=1+q2q\left(\frac{q-2}{2}+1\right)-\frac{q(q-1)-1}{2q}=\frac{1+q}{2q} we assume that sq(q1)12qs\leq\frac{q(q-1)-1}{2q}. Let us denote

h(q,m,r1,r2,r3,r4)=h(q,m,2s,s,2s,s)h(q,m,r_{1},r_{2},r_{3},r_{4})=h(q,m,2s,s,2s,s) (160)

Hence,

h(q,m,2s,s,2s,s)=6q11s6h(q,m,2s,s,2s,s)=6q-11s-6 (161)

The highest possible value of ss is q(q1)12q\frac{q(q-1)-1}{2q}, hence the minimal value of hh providing this constraint is 6(q1)11q(q1)12q6(q-1)-11\frac{q(q-1)-1}{2q} i.e.

h(q,m,2s,s,2s,s)6(q1)11q(q1)12qh(q,m,2s,s,2s,s)\geq 6(q-1)-11\frac{q(q-1)-1}{2q} (162)

Which is equivalent to

h(q,m,2s,s,2s,s)q2q+112qh(q,m,2s,s,2s,s)\geq\frac{q^{2}-q+11}{2q} (163)

Hence, it is clear that for q2q\geq 2 the right side of the above equation is always greater than 0.

Case 2a

Let (2s+1)q1=m(2s+1)q-1=m. Since mq(q1)1m\leq q(q-1)-1 then ss cannot be higher than the maximal integer which meets the inequality (2s+1)q1q(q1)1(2s+1)q-1\leq q(q-1)-1, i.e. sq22s\leq\frac{q-2}{2}. Let us calculate hh, for m=(2s+1)q1m=(2s+1)q-1, r1=2s,r2=s,r3=2s+1r_{1}=2s,r_{2}=s,r_{3}=2s+1 and r4=s+1r_{4}=s+1.

h(q,m,r1,r2,r3,r4)=9q11s6h(q,m,r_{1},r_{2},r_{3},r_{4})=9q-11s-6 (164)

As the maximal s=q22s=\frac{q-2}{2} then

h(q,m,r1,r2,r3,r4)9q11q226h(q,m,r_{1},r_{2},r_{3},r_{4})\geq 9q-11\frac{q-2}{2}-6 (165)

which is equivalent to

h(q,m,r1,r2,r3,r4)7q2+5h(q,m,r_{1},r_{2},r_{3},r_{4})\geq\frac{7q}{2}+5 (166)

It is clear that for q2q\geq 2 the right side of the above equation is always greater than 0.

Case 3a

Let (2s+1)qm<(2s+2)q1\left(2s+1\right)q\leq m<(2s+2)q-1

Since mq(q1)1m\leq q(q-1)-1 then ss is not higher than the maximal integer which meets the inequality q(q1)1<(2s+2)q1q(q-1)-1<(2s+2)q-1, i.e. q32<s\frac{q-3}{2}<s. Thus, s=q32+1s=\left\lfloor\frac{q-3}{2}\right\rfloor+1, hence sq32+1s\leq\frac{q-3}{2}+1. On the other hand, also (2s+1)qm\left(2s+1\right)q\leq m and mq(q1)1m\leq q(q-1)-1. Thus ss should meet (2s+1)qq(q1)1\left(2s+1\right)q\leq q(q-1)-1, i.e. s12(q(q1)1q1)s\leq\frac{1}{2}\left(\frac{q(q-1)-1}{q}-1\right). The second condition is more restrictive141414as (q32+1)12(q(q1)1q1)=q+12q\left(\frac{q-3}{2}+1\right)-\frac{1}{2}\left(\frac{q(q-1)-1}{q}-1\right)=\frac{q+1}{2q} hence we assume that s12(q(q1)1q1)s\leq\frac{1}{2}\left(\frac{q(q-1)-1}{q}-1\right). Let us calculate hh assuming r1=2s+1,r2=s,r3=2s+1r_{1}=2s+1,r_{2}=s,r_{3}=2s+1, and r4=sr_{4}=s. So,

h(q,m,r1,r2,r3,r4)=h(q,m,2s+1,s,2s+1,s)h(q,m,r_{1},r_{2},r_{3},r_{4})=h(q,m,2s+1,s,2s+1,s) (167)

and thus,

h(q,m,2s+1,s,2s+1,s)=6q11s10h(q,m,2s+1,s,2s+1,s)=6q-11s-10 (168)

The highest allowed value of ss is 12(q(q1)1q1)\frac{1}{2}\left(\frac{q(q-1)-1}{q}-1\right), thus it is true that

h(q,m,2s+1,s,2s+1,s)6q112(q(q1)1q1)10h(q,m,2s+1,s,2s+1,s)\geq 6q-\frac{11}{2}\left(\frac{q(q-1)-1}{q}-1\right)-10 (169)

which is equivalent to

h(q,m,2s+1,s,2s+1,s)12(q+11q+2)h(q,m,2s+1,s,2s+1,s)\geq\frac{1}{2}\left(q+\frac{11}{q}+2\right) (170)

It is clear that for q2q\geq 2 the above equation is always greater than 0.

Case 4a

Let (2s+1)q1=m(2s+1)q-1=m

Since mq(q1)1m\leq q(q-1)-1 then ss cannot be higher than the maximal integer which meets the inequality (2s+1)q1q(q1)1(2s+1)q-1\leq q(q-1)-1, i.e. sq22s\leq\frac{q-2}{2}. Let us calculate hh, by the assumptions that m=(2s+1)q1m=(2s+1)q-1, r1=2s,r2=s,r3=2s+1r_{1}=2s,r_{2}=s,r_{3}=2s+1 and r4=sr_{4}=s.

h(q,m,r1,r2,r3,r4)=6q11s6h(q,m,r_{1},r_{2},r_{3},r_{4})=6q-11s-6 (171)

Since the maximal ss is q22\frac{q-2}{2} then

h(q,m,r1,r2,r3,r4)6q11(q22)6h(q,m,r_{1},r_{2},r_{3},r_{4})\geq 6q-11\left(\frac{q-2}{2}\right)-6 (172)

which is equivalent to

h(q,m,r1,r2,r3,r4)q2+5h(q,m,r_{1},r_{2},r_{3},r_{4})\geq\frac{q}{2}+5 (173)

It is clear that for q2q\geq 2 the above equation is always greater than 0. This remark completes the proof for n=2qn=2q.

Proof of (70), part 2 (for odd numbers)

Let n=2q+1n=2q+1 (odd), n3n\geq 3, hence q1q\geq 1 and 0m,m+1𝒳(2q+1)=q20\leq m,m+1\leq\mathcal{X}(2q+1)=q^{2}. In particular, the last assumption implies that 0mq210\leq m\leq q^{2}-1. When n=2q+1n=2q+1 it holds that:

6(𝒢(n,m)𝒢(n,m+1))=\displaystyle 6\left(\mathcal{G}(n,m)-\mathcal{G}(n,m+1)\right)= 6+12q+(6m6q3)m2q+1\displaystyle-6+12q+(6m-6q-3)\left\lfloor\frac{m}{2q+1}\right\rfloor
3(2q+1)m2q+12+(8m4q2)2m2q+1\displaystyle-3(2q+1)\left\lfloor\frac{m}{2q+1}\right\rfloor^{2}+(8m-4q-2)\left\lfloor\frac{2m}{2q+1}\right\rfloor
2(2q+1)2m2q+123m+12q+1\displaystyle-2(2q+1)\left\lfloor\frac{2m}{2q+1}\right\rfloor^{2}-3\left\lfloor\frac{m+1}{2q+1}\right\rfloor
6mm+12q+1+6qm+12q+1+3m+12q+12\displaystyle-6m\left\lfloor\frac{m+1}{2q+1}\right\rfloor+6q\left\lfloor\frac{m+1}{2q+1}\right\rfloor+3\left\lfloor\frac{m+1}{2q+1}\right\rfloor^{2}
+6qm+12q+1262(m+1)2q+14q2(m+1)2q+12\displaystyle+6q\left\lfloor\frac{m+1}{2q+1}\right\rfloor^{2}-6\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor-4q\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor^{2}
+22(m+1)2q+12+8m2(m+1)2q+1+4q2(m+1)2q+1\displaystyle+2\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor^{2}+8m\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor+4q\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor (174)

Let us denote r1=2m2q+1r_{1}=\left\lfloor\frac{2m}{2q+1}\right\rfloor, r2=m2q+1r_{2}=\left\lfloor\frac{m}{2q+1}\right\rfloor, r3=2(m+1)2q+1r_{3}=\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor and r4=m+12q+1r_{4}=\left\lfloor\frac{m+1}{2q+1}\right\rfloor. This allows us to simplify the above equation to

6(𝒢(n,m)𝒢(n,m+1))=\displaystyle 6\left(\mathcal{G}(n,m)-\mathcal{G}(n,m+1)\right)= 6+12q+r1(8m4q2)\displaystyle-6+12q+r_{1}(8m-4q-2)
2(2q+1)r12+r2(6m6q3)\displaystyle-2(2q+1)r_{1}^{2}+r_{2}(6m-6q-3)
8mr36mr4+3(2q+1)r22+4qr32+6qr42\displaystyle-8mr_{3}-6mr_{4}+3(2q+1)r_{2}^{2}+4qr_{3}^{2}+6qr_{4}^{2}
+4qr3+6qr4+2r32+3r426r33r4\displaystyle+4qr_{3}+6qr_{4}+2r_{3}^{2}+3r_{4}^{2}-6r_{3}-3r_{4} (175)

Let us define:

h(q,m,r1,r2,r3,r4)=\displaystyle h(q,m,r_{1},r_{2},r_{3},r_{4})= 6+12q+r1(8m4q2)2(2q+1)r12\displaystyle-6+12q+r_{1}(8m-4q-2)-2(2q+1)r_{1}^{2}
+r2(6m6q3)8mr36mr4\displaystyle+r_{2}(6m-6q-3)-8mr_{3}-6mr_{4}
+3(2q+1)r22+4qr32+6qr42\displaystyle+3(2q+1)r_{2}^{2}+4qr_{3}^{2}+6qr_{4}^{2}
+4qr3+6qr4+2r32+3r426r33r4\displaystyle+4qr_{3}+6qr_{4}+2r_{3}^{2}+3r_{4}^{2}-6r_{3}-3r_{4} (176)

It is clear that

6(𝒢(n,m)𝒢(n,m+1))>0h(q,m,r1,r2,r3,r4)>06\left(\mathcal{G}(n,m)-\mathcal{G}(n,m+1)\right)>0\Leftrightarrow h(q,m,r_{1},r_{2},r_{3},r_{4})>0 (177)

Let us try to investigate changes in the values r1,r2,r3r_{1},r_{2},r_{3} and r4r_{4}. To do so, let us write down a few cases of each in the form of a table:

interval 2m2q+1\left\lfloor\frac{2m}{2q+1}\right\rfloor
0m<12(2q+1)0\leq m<\text{$\frac{1}{2}$}(2q+1) 0
12(2q+1)m<22(2q+1)\text{$\frac{1}{2}$}(2q+1)\leq m<\text{$\frac{2}{2}$}(2q+1) 11
22(2q+1)m<32(2q+1)\frac{2}{2}(2q+1)\leq m<\text{$\frac{3}{2}$}(2q+1) 22
32(2q+1)m<42(2q+1)\frac{3}{2}(2q+1)\leq m<\text{$\frac{4}{2}$}(2q+1) 33
42(2q+1)m<52(2q+1)\frac{4}{2}(2q+1)\leq m<\text{$\frac{5}{2}$}(2q+1) 44
interval m2q+1\left\lfloor\frac{m}{2q+1}\right\rfloor
0m<2q+10\leq m<2q+1 0
2q+1m<2(2q+1)2q+1\leq m<2(2q+1) 11
2(2q+1)m<3(2q+1)2(2q+1)\leq m<3(2q+1) 22
3(2q+1)m<4(2q+1)3(2q+1)\leq m<4(2q+1) 33
4(2q+1)m<5(2q+1)4(2q+1)\leq m<5(2q+1) 44
interval 2(m+1)2q+1\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor
0m<12(2q+1)10\leq m<\text{$\frac{1}{2}$}(2q+1)-1 0
12(2q+1)1m<22(2q+1)1\text{$\frac{1}{2}$}(2q+1)-1\leq m<\text{$\frac{2}{2}$}(2q+1)-1 11
22(2q+1)1m<32(2q+1)1\frac{2}{2}(2q+1)-1\leq m<\text{$\frac{3}{2}$}(2q+1)-1 22
32(2q+1)1m<42(2q+1)1\frac{3}{2}(2q+1)-1\leq m<\text{$\frac{4}{2}$}(2q+1)-1 33
42(2q+1)1m<52(2q+1)1\frac{4}{2}(2q+1)-1\leq m<\text{$\frac{5}{2}$}(2q+1)-1 44
interval m+12q+1\left\lfloor\frac{m+1}{2q+1}\right\rfloor
0m<(2q+1)10\leq m<(2q+1)-1 0
(2q+1)1m<2(2q+1)1(2q+1)-1\leq m<2(2q+1)-1 11
2(2q+1)1m<3(2q+1)12(2q+1)-1\leq m<3(2q+1)-1 22
3(2q+1)1m<4(2q+1)13(2q+1)-1\leq m<4(2q+1)-1 33
4(2q+1)1m<5(2q+1)14(2q+1)-1\leq m<5(2q+1)-1 44

As we can see, there are four kinds of interval (hereinafter referred to as cases) that need to be considered with respect to mm. Every analyzed interval is parametrized by the auxiliary variable s{0}s\in\mathbb{N}\cup\{0\}. By choosing arbitrarily s=0,1,2,3,s=0,1,2,3,\ldots we are able to analyze the function hh, and as follows 𝒢(n,m)𝒢(n,m+1)\mathcal{G}(n,m)-\mathcal{G}(n,m+1), for every interesting mm. The cases we need to consider are:

Case interval of mm 2m2q+1\left\lfloor\frac{2m}{2q+1}\right\rfloor m2q+1\left\lfloor\frac{m}{2q+1}\right\rfloor 2(m+1)2q+1\left\lfloor\frac{2(m+1)}{2q+1}\right\rfloor m+12q+1\left\lfloor\frac{m+1}{2q+1}\right\rfloor
11b 2s2(2q+1)m<2s+12(2q+1)1\frac{2s}{2}(2q+1)\leq m<\frac{2s+1}{2}(2q+1)-1 2s2s ss 2s2s ss
22b m=2s+12(2q+1)1m=\frac{2s+1}{2}(2q+1)-1 2s2s ss 2s+12s+1 ss
33b 2s+12(2q+1)m<2s+22(2q+1)1\frac{2s+1}{2}(2q+1)\leq m<\frac{2s+2}{2}(2q+1)-1 2s+12s+1 ss 2s+12s+1 ss
44b m=2s+22(2q+1)1m=\frac{2s+2}{2}(2q+1)-1 2s+12s+1 ss 2s+22s+2 s+1s+1

Case 1b

Let 2s2(2q+1)m<2s+12(2q+1)1\frac{2s}{2}(2q+1)\leq m<\frac{2s+1}{2}(2q+1)-1.

In general 0mq210\leq m\leq q^{2}-1, thus 02s2(2q+1)0\leq\frac{2s}{2}(2q+1) and q21<2s+12(2q+1)1q^{2}-1<\frac{2s+1}{2}(2q+1)-1 which implies (providing that s{0}s\in\mathbb{N}\cup\{0\}) that 0s0\leq s and ss should not be greater than the smallest integer that meets the inequality s>q22q+11s>\frac{q^{2}}{2q+1}-1. This implies that s=q22q+11+1s=\left\lfloor\frac{q^{2}}{2q+1}-1\right\rfloor+1, thus sq22q+1s\leq\frac{q^{2}}{2q+1}. On the other hand, 2s2(2q+1)m\frac{2s}{2}(2q+1)\leq m and mq21m\leq q^{2}-1. This suggests that 2s2(2q+1)q21\frac{2s}{2}(2q+1)\leq q^{2}-1, i.e. sq212q+1s\leq\frac{q^{2}-1}{2q+1}. Since the second constraint is more restrictive151515as q22q+1q212q+1=12q+1\frac{q^{2}}{2q+1}-\frac{q^{2}-1}{2q+1}=\frac{1}{2q+1} then we adopt sq212q+1s\leq\frac{q^{2}-1}{2q+1}

Thus, let us consider h(q,m,r1,r2,r3.r4)h(q,m,r_{1},r_{2},r_{3}.r_{4}) where, following the assumptions of case 1, r1=2sr_{1}=2s, r2=sr_{2}=s, r3=2sr_{3}=2s and r4=sr_{4}=s. It is easy to calculate that

h(q,m,2s,s,2s,s)=12q22s6h(q,m,2s,s,2s,s)=12q-22s-6 (178)

The highest possible ss is q212q+1\frac{q^{2}-1}{2q+1}, hence it holds that

h(q,m,2s,s,2s,s)6(2q1)22(q212q+1)h(q,m,2s,s,2s,s)\geq 6(2q-1)-22\left(\frac{q^{2}-1}{2q+1}\right) (179)

which is true if and only if

h(q,m,2s,s,2s,s)2(q2+8)2q+1h(q,m,2s,s,2s,s)\geq\frac{2\left(q^{2}+8\right)}{2q+1} (180)

It is clear that the above expression is strictly higher than 0 for q1q\geq 1.

Case 2b

Let m=2s+12(2q+1)1m=\frac{2s+1}{2}(2q+1)-1

The highest possible value of mm is q21q^{2}-1 thus m=2s+12(2q+1)1q21m=\frac{2s+1}{2}(2q+1)-1\leq q^{2}-1, hence, s12(2q22q+11)s\leq\frac{1}{2}\left(\frac{2q^{2}}{2q+1}-1\right).

Let us consider h(q,m,r1,r2,r3.r4)h(q,m,r_{1},r_{2},r_{3}.r_{4}) where (see case 2) r1=2sr_{1}=2s, r2=sr_{2}=s, r3=2s+1r_{3}=2s+1, r4=sr_{4}=s and denote:

h^(q,m,r1,r2,r3.r4)=dfh(q,2s+12(2q+1)1,2s,s,2s+1,s)\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})\stackrel{{\scriptstyle\textit{df}}}{{=}}h(q,\frac{2s+1}{2}(2q+1)-1,2s,s,2s+1,s) (181)

Thus, we may calculate that

h^(q,m,r1,r2,r3.r4)=12q22s6\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})=12q-22s-6 (182)

Adopting the upper bound of s=12(2q22q+11)s=\frac{1}{2}\left(\frac{2q^{2}}{2q+1}-1\right) we obtain

h^(q,m,r1,r2,r3.r4)12q22(12(2q22q+11))6\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})\geq 12q-22\left(\frac{1}{2}\left(\frac{2q^{2}}{2q+1}-1\right)\right)-6 (183)

which is equivalent to

h^(q,m,r1,r2,r3.r4)2q2+22q+52q+1\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})\geq\frac{2q^{2}+22q+5}{2q+1} (184)

It is clear that the right side of the above expression is strictly higher than 0 for q1q\geq 1.

Case 3b

Let 2s+12(2q+1)m<2s+22(2q+1)1\frac{2s+1}{2}(2q+1)\leq m<\frac{2s+2}{2}(2q+1)-1. The highest possible value of mm is q21q^{2}-1, thus the highest possible value of ss cannot be greater than the smallest positive integer for which q21<2s+22(2q+1)1q^{2}-1<\frac{2s+2}{2}(2q+1)-1. Hence q22q+12<s\frac{q^{2}}{2q+1}-2<s, which implies that q22q+12+1=s\left\lfloor\frac{q^{2}}{2q+1}-2\right\rfloor+1=s. Therefore q22q+11s\frac{q^{2}}{2q+1}-1\geq s. On the other hand, 2s+12(2q+1)m\frac{2s+1}{2}(2q+1)\leq m and mq21m\leq q^{2}-1. This suggests that 12(2(q21)2q+11)s\frac{1}{2}\left(\frac{2\left(q^{2}-1\right)}{2q+1}-1\right)\geq s. Since the first condition is more restrictive161616as 12(2(q21)2q+11)(q22q+11)=2q14q+2\frac{1}{2}\left(\frac{2\left(q^{2}-1\right)}{2q+1}-1\right)-\left(\frac{q^{2}}{2q+1}-1\right)=\frac{2q-1}{4q+2} then we assume that q22q+11s\frac{q^{2}}{2q+1}-1\geq s.

Let us consider h(q,m,r1,r2,r3.r4)h(q,m,r_{1},r_{2},r_{3}.r_{4}) where (following case 2) r1=2s+1r_{1}=2s+1, r2=sr_{2}=s, r3=2s+1r_{3}=2s+1 and r4=sr_{4}=s. It is easy to calculate that

h(q,m,2s+1,s,2s+1,s)=2(6q11s7)h(q,m,2s+1,s,2s+1,s)=2(6q-11s-7) (185)

The upper bound for ss is q22q+11\frac{q^{2}}{2q+1}-1, thus

h(q,m,2s+1,s,2s+1,s)2(6q11(q22q+11)7)h(q,m,2s+1,s,2s+1,s)\geq 2\left(6q-11\left(\frac{q^{2}}{2q+1}-1\right)-7\right) (186)

which is equivalent to

h(q,m,2s+1,s,2s+1,s)2(q2+14q+4)2q+1h(q,m,2s+1,s,2s+1,s)\geq\frac{2\left(q^{2}+14q+4\right)}{2q+1} (187)

It is clear that the above expression is strictly higher than 0 for q1q\geq 1.

Case 4b

Let m=2s+22(2q+1)1m=\frac{2s+2}{2}(2q+1)-1. The highest possible value of mm is q21q^{2}-1. Thus m=2s+22(2q+1)1q21m=\frac{2s+2}{2}(2q+1)-1\leq q^{2}-1, which is equivalent to s12(q22q+11)s\leq\frac{1}{2}\left(\frac{q^{2}}{2q+1}-1\right).

Let us consider h(q,m,r1,r2,r3.r4)h(q,m,r_{1},r_{2},r_{3}.r_{4}) where (see case 4) r1=2s+1r_{1}=2s+1, r2=sr_{2}=s, r3=2s+2r_{3}=2s+2, r4=s+1r_{4}=s+1 and denote:

h^(q,m,r1,r2,r3.r4)=dfh(q,2s+22(2q+1)1,2s+1,s,2s+2,s+1)\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})\stackrel{{\scriptstyle\textit{df}}}{{=}}h(q,\frac{2s+2}{2}(2q+1)-1,2s+1,s,2s+2,s+1) (188)

It is easy to calculate that

h^(q,m,r1,r2,r3.r4)=2(6q11s7)\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})=2(6q-11s-7) (189)

As the highest possible value of ss is 12(q22q+11)\frac{1}{2}\left(\frac{q^{2}}{2q+1}-1\right) then

h^(q,m,r1,r2,r3.r4)2(6q11(12(q22q+11))7)\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})\geq 2\left(6q-11\left(\frac{1}{2}\left(\frac{q^{2}}{2q+1}-1\right)\right)-7\right) (190)

Which is equivalent to

h^(q,m,r1,r2,r3.r4)13q2+6q32q+1\widehat{h}(q,m,r_{1},r_{2},r_{3}.r_{4})\geq\frac{13q^{2}+6q-3}{2q+1} (191)

It is easy to verify that the above expression is strictly greater than 0 for q1q\geq 1. The last observation completes the proof of the lemma. \square