This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Dynamics of node influence in
network growth modelsthanks: This document does not contain technology or technical data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations.

Shravika Mittal\star, Tanmoy Chakraborty\star Siddharth Pal\dagger
\starDept. of CSE, IIIT-Delhi, India \daggerRaytheon BBN Technologies, USA
{shravika16093, tanmoy}@iiitd.ac.in, siddharth.pal@raytheon.com
Abstract

Many classes of network growth models have been proposed in the literature for capturing real-world complex networks. Existing research primarily focuses on global characteristics of these models, e.g., degree distribution. We aim to shift the focus towards studying the network growth dynamics from the perspective of individual nodes. In this paper, we study how a metric for node influence in network growth models behaves over time as the network evolves. This metric, which we call node visibility, captures the probability of the node to form new connections. First, we conduct an investigation on three popular network growth models – preferential attachment, additive, and multiplicative fitness models; and primarily look into the “influential nodes” or “leaders” to understand how their visibility evolves over time. Subsequently, we consider a generic fitness model and observe that the multiplicative model strikes a balance between allowing influential nodes to maintain their visibility, while at the same time making it possible for new nodes to gain visibility in the network. Finally, we observe that a spatial growth model with multiplicative fitness can curtail the global reach of influential nodes, thereby allowing the emergence of a multiplicity of “local leaders” in the network.

Index Terms:
Network growth models, node dynamics, Barabasi-Albert graphs, fitness based models, spatial models.

1 Introduction

Over the past two decades, complex networks have been used to model real-world systems across different domains ranging from social, biological, information, and technological systems [26, 25]. Investigating the behavior of influential entities or leaders in these real-world networks would help us understand how they are able to gather and maintain prominence over time. For instance, influential papers in citation networks continue to acquire new citations every year [31, 11]. Likewise, celebrities in online social networks keep increasing their follower count over time. These influential entities act as potential spreaders of information in networks. Therefore, keeping a track of their characteristics in an evolving network could have significance in applications ranging from viral marketing and target advertisement to rumor and epidemic control, and protection from spam attacks [13], [18], [20]. To understand and model the dynamics of how a leader node maintains its influence over time, one needs to study the temporal behavior of nodes in a network. In this paper, we introduce a notion, called visibility of a node which is defined as the probability of the node to form new connections in a growing network. For instance, in a preferential attachment model [4, 5], the visibility of a node is proportional to its degree, and inversely proportional to the number of edges in the network. An essential aspect of the study is to investigate the visibility profile of a node which characterizes the temporal evolution of the node’s influence as the network grows. We argue that studying the visibility profile of nodes leads to a better understanding of network evolution due to attachment dynamics, which might not be possible to obtain by simply analysing global network properties such as degree distribution or local node-centric properties such as degree, clustering coefficient, etc. Similar to node persistence over time studied in [28], our approach allows to make headway into this understanding by characterizing the visibility behavior of leaders in the network. While the framework is applicable to arbitrary nodes as well, it is more interesting to first understand the leaders’ behavior. For example, Chakraborty et al. [11] argued that the growth of the degree of a node (its visibility) in a citation network follows one of the five patterns – early rise, late rise, frequent rise, steady rise and steady drop. In a subsequent study [12], they also concluded that highly-cited papers and authors (leaders) follow steady rising pattern. However, it was not clear whether existing network growth models are able to describe such patterns particularly for leader nodes [24]. This motivates us to study the temporal evolution of the visibility of nodes, in particular leaders in networks generated by different network growth models.

We study the visibility of influential nodes111We use “leaders” and “influential nodes” interchangeably to denote high-degree nodes that can keep attracting new edges over long periods of time. in the graphs simulated by the following network growth models. Barabási-Albert (BA) model [5], aka the preferential attachment model, was able to explain power law behavior in real-world networks using the idea of network growth and the “rich-get-richer” phenomenon. However, the BA model could only capture the “old-get-rich” phenomenon or the “first-mover-advantage” whereby older nodes increase their connectivity and become dominant at the expense of younger nodes in the network. It does not take into account the competitive characteristics of a node that help them flourish in a very short period of time [1, 21]; for instance, in citation networks a few research papers are able to gain lot of citations within a short span of time [3]. Using this as a motivation, Bianconi and Barabási [7, 8] introduced a new class of network growth models in which the incoming nodes form connections based on inherent characteristics of nodes such as novelty, usefulness, etc., captured through a fitness value [27, 10, 16]. This was inspired by the “fit-get-richer” phenomenon observed in real-world networks [7]. Following this, Ergun and Rodgers [14] analysed the degree distribution of network growth models with an attachment mechanism combining the degree and fitness information of nodes in an additive and multiplicative manner.

There are several real-world networks in which the aspect of space plays an important role to understand dynamics of network evolution. For instance, in the biological domain the regions in brain that are spatially closer have a higher probability of being connected as compared to the far-off regions [9]. Similar significance of space can also be observed in online social networks capturing spatial features [2, 22], transportation networks [17], and communication networks. To model networks incorporating spatial features, the class of spatial network models has been proposed. A basic spatial model incorporating notions of preferential and spatial attachment was proposed by Yook et al. [32] to capture underlying mechanisms driving the evolution of the Internet topology. Subsequently, Kaiser et al. [19] analysed a spatial growth model where the edge connection probability decreases with node distance either in an exponential or a power-law manner to explain multiple, interconnected clusters that emerge in real-world networks. See Barthélemy [6] for a comprehensive study on spatial networks. Recently, we proposed a new spatial growth model [24] that was better able to capture the five growth patterns presented in [11], compared to the preferential attachment model and its variants (i.e., additive and multiplicative fitness models [10, 30]).

In this paper, we extend our previous studies [29, 24] and address the following problem statement: Given an influential node with high visibility at a certain point in time, how would its visibility evolve over time? We study this phenomenon across three popular and diverse network growth models – Barabási-Albert (BA) model, (additive, multiplicative, and general) fitness based model and spatial models. One of the primary theoretical findings that we continue to build on from our previous work [29] is that leaders are able to gain more visibility in the multiplicative fitness model setting as compared to the BA and additive fitness models. Along with this, in the multiplicative fitness model, the influential nodes are able to increase their visibility over time, given that their fitness value remains high in comparison to the rest of the network. On the other hand, the visibility values always decrease over time for the BA and additive fitness models (see Section 3 for details). In Section 3.1, we study a general framework of fitness models and observe that a non-linear attachment rule based on degree and fitness would lead to highly dominant nodes and make it exceedingly difficult for new nodes to gain influence in the network. Experimental analysis provided in Section 4 supports our theoretical analysis that suggest multiplicative fitness models best explain the prolonged influence of leader nodes in certain real-world networks, while at the same time allowing new high fitness nodes to gain influence over time. However, we observe that multiplicative fitness models do not allow multiple influential nodes to exist at the same time. In Section 5, we give theoretical insights on how spatial growth models can allow a multiplicity of leaders to coexist, while Section 6 provides experimental justifications for the same.

Reproducibility: To encourage reproducible research, the codes are publicly available at https://github.com/mittalshravika/Network-Growth-Models.

2 Network growth models

Following our previous work [29], here we set up the notations and problem definition. Consider the following sequence of graphs {𝔾t,t=0,1,}\{{\mathbb{G}}_{t},\ t=0,1,\ldots\}, where 𝔾t=(Vt,𝔼t){\mathbb{G}}_{t}=(V_{t},{\mathbb{E}}_{t}), with VtV_{t} and 𝔼t{\mathbb{E}}_{t} being the set of nodes and edges in 𝔾t{\mathbb{G}}_{t} respectively. In a network growth model, we have VtVt+1V_{t}\subset V_{t+1} and 𝔼t𝔼t+1{\mathbb{E}}_{t}\subseteq{\mathbb{E}}_{t+1} for every t=0,1,t=0,1,\ldots. In other words, new nodes arrive at every time step tt, and form connections with existing nodes, thus adding to the edge set of the previous graph 𝔾t1{\mathbb{G}}_{t-1}. For purposes of simplicity, here we consider the basic model where a single node enters at any time step tt, and forms a connection with one node in the existing graph 𝔾t1{\mathbb{G}}_{t-1}. Therefore, we can label the incoming node by the time index of its entry to the network, which leads to Vt={0,1,,t}V_{t}=\{0,1,\ldots,t\} for t=0,1,t=0,1,\ldots. Note that all our results can be easily extended to more general scenarios where multiple nodes can enter the network and incoming nodes can form multiple connections. At time tt, let the degree of the node ii in VtV_{t} be denoted by Dt(i)D_{t}(i). Also, let the rv St+1S_{t+1} denote the node with which an incoming node t+1t+1 connects.

Barabási-Albert (BA) model: In the preferential attachment mechanism [5], new nodes connect preferentially to existing nodes with higher degree. Let 𝐩BA(t+1)=(piBA(t+1),iVt){\bf p}^{BA}(t+1)=(p_{i}^{BA}(t+1),\ i\in V_{t}) be the pmf with which the new node indexed as t+1t+1 connects with the existing graph 𝔾t{\mathbb{G}}_{t}, i.e., piBA(t+1)p_{i}^{BA}(t+1) is the probability with which node t+1t+1 connects with an existing node ii. This is given by:

piBA(t+1)=𝐏[St+1=i|𝔾t]=Dt(i)jVtDt(j),iVt.p_{i}^{BA}(t+1)={{\bf P}}\left[{S_{t+1}=i\ |\ {\mathbb{G}}_{t}}\right]=\frac{D_{t}(i)}{\sum_{j\in V_{t}}D_{t}(j)},\ i\in V_{t}. (1)

Note that we term node ii’s visibility in the graph 𝔾t{\mathbb{G}}_{t} by piBA(t+1)p_{i}^{BA}(t+1).

Fitness based attachment rules: In fitness based models [7, 14, 23], every node is assumed to have a fitness value independently drawn from a distribution, and new nodes connect preferentially on the basis of the fitness and degree values of the existing nodes. We describe multiple ways in which such an attachment could occur.

Assume a sequence of i.i.d. fitness rvs (ξ,ξi,i=0,1,)(\xi,\xi_{i},\ i=0,1,\ldots) with ξi\xi_{i} denoting the fitness value of node ii. A generic fitness model can be described by the following attachment rule

piGF(t+1)=g(ξi,Dt(i))jVtg(ξj,Dt(j)),iVt.p_{i}^{GF}(t+1)=\frac{g(\xi_{i},D_{t}(i))}{\sum_{j\in V_{t}}g(\xi_{j},D_{t}(j))},\ i\in V_{t}. (2)

for an attachment function g:×g:\mathbb{R}\times\mathbb{R}\to\mathbb{R} which determines the relative importance of the fitness and degree values. In the additive fitness model, new nodes connect preferentially to existing nodes having a higher sum of degree and fitness value. For t=0,1,t=0,1,\ldots, let the pmf delineating formation of new connections at time t+1t+1 be given by 𝐩AF(t+1){\bf p}^{AF}(t+1), where

piAF(t+1)=ξi+Dt(i)jVt(ξj+Dt(j)),iVt.p_{i}^{AF}(t+1)=\frac{\xi_{i}+D_{t}(i)}{\sum_{j\in V_{t}}(\xi_{j}+D_{t}(j))},\ i\in V_{t}. (3)

Similarly, the attachment rule for multiplicative fitness (MF) model is given by

piMF(t+1)=ξiDt(i)jVtξjDt(j),iVt.p_{i}^{MF}(t+1)=\frac{\xi_{i}\cdot D_{t}(i)}{\sum_{j\in V_{t}}\xi_{j}\cdot D_{t}(j)},\ i\in V_{t}. (4)

Therefore, the visibilities of node ii in graph 𝔾t{\mathbb{G}}_{t} are given by piAF(t+1)p_{i}^{AF}(t+1) and piMF(t+1)p_{i}^{MF}(t+1) for the additive and multiplicative fitness models, respectively. Note that the influential nodes as described in Section 1 relate to nodes having high visibility as defined for the particular network growth model in question.

Spatial attachment rules: In spatial models [32, 19, 15], every node is assumed to have a location vector drawn from a distribution over a location space AA. Assume a sequence of i.i.d. location rvs (χ,χi,i=0,1,)(\chi,\ \chi_{i},\ i=0,1,\ldots) and i.i.d. fitness rvs (ξ,ξi,i=0,1,)(\xi,\xi_{i},\ i=0,1,\ldots) with χi\chi_{i} and ξi\xi_{i} denoting the location and fitness values of node ii respectively. A generic spatial attachment model can be given by the following attachment rule

piAT(t+1)=h(χi,χt+1;ξi,Di(t))jVth(χj,χt+1;ξj,Dj(t)).p_{i}^{AT}(t+1)=\frac{h(\chi_{i},\chi_{t+1};\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi_{t+1};\xi_{j},D_{j}(t))}. (5)

The attachment probability now depends on the location vector of the new node χt\chi_{t}. This is different from the models described previously. Therefore, we could have multiple definitions of visibility. A global variant of visibility is given below

piglobal(t+1)=𝔼χ[h(χi,χ;ξi,Di(t))jVth(χj,χ;ξj,Dj(t))],p_{i}^{\mbox{global}}(t+1)=\mathbb{E}_{\chi}\left[\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}\right],

while a local version is given as

pilocal(t+1)=h(χi,χi;ξi,Di(t))jVth(χj,χi;ξj,Dj(t)),p_{i}^{\mbox{local}}(t+1)=\frac{h(\chi_{i},\chi_{i};\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{j}(t))},

where h:A×A××h:A\times A\times\mathbb{R}\times\mathbb{N}\to\mathbb{R}. We find it useful to consider the following separable form of attachment function

h(χ1,χ2;ξ1,D1)=α(χ1,χ2)β(ξ1,D1),χ1,χ2A,h(\chi_{1},\chi_{2};\xi_{1},D_{1})=\alpha(\chi_{1},\chi_{2})\beta(\xi_{1},D_{1}),\ \chi_{1},\chi_{2}\in A, (6)

where α:A×A\alpha:A\times A\to\mathbb{R} and β:×\beta:\mathbb{R}\times\mathbb{N}\to\mathbb{R}. While the notion of global visibility models the overall attractivity of a node in the entire attribute space, the local visibility considers only its attractivity from the local neighborhood in the attribute space of a node. This model allows for nodes whose global attractivity is low, with their local attractivity being high.

3 Analytical results on node visibility – BA and Fitness models

In this section, we study and compare the evolution of visibility of a node over time for the BA model and two fitness models, namely the additive and multiplicative fitness models. The following lemma describes the change in visibility with time for the three growth models.

First, we introduce some notation: Define Ξt=iVtξi\Xi_{t}=\sum_{i\in V_{t}}\xi_{i} and ψt=iVtξiDt(i)\psi_{t}=\sum_{i\in V_{t}}\xi_{i}D_{t}(i), for t=0,1,t=0,1,\ldots.

Lemma 3.1

For every t=0,1,,t=0,1,\ldots, and ii in Vt1V_{t-1}: Let 𝔾t1{\mathbb{G}}_{t-1} be the graph at time t1t-1, we have

  • (i)
    𝔼[piBA(t+1)piBA(t)|𝔾t1]=Dt1(i)4t(t1),{\mathbb{E}}\left[{p_{i}^{BA}(t+1)-p_{i}^{BA}(t)\ |\ {\mathbb{G}}_{t-1}}\right]=-\frac{D_{t-1}(i)}{4t(t-1)}, (7)
  • (ii)
    𝔼[piAF(t+1)piAF(t)|𝔾t1,ξt]\displaystyle{\mathbb{E}}\left[{p_{i}^{AF}(t+1)-p_{i}^{AF}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
    =(ξi+Dt1(i))(ξt+1)(Ξt1+2(t1))(Ξt+2t),\displaystyle\hskip 5.69054pt=-\frac{\left(\xi_{i}+D_{t-1}(i)\right)\left(\xi_{t}+1\right)}{\left(\Xi_{t-1}+2(t-1)\right)\left(\Xi_{t}+2t\right)}, (8)
  • (iii)
    𝔼[piMF(t+1)piMF(t)|𝔾t1,ξt]\displaystyle{\mathbb{E}}\left[{p_{i}^{MF}(t+1)-p_{i}^{MF}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
    ξiDt1(i)jiξjDt(j)[ξiξtξj]ψt12(ψt1+ξi+ξt).\displaystyle\hskip 5.69054pt\gtrsim\xi_{i}D_{t-1}(i)\frac{\sum_{j\neq i}\xi_{j}D_{t}(j)\left[\xi_{i}-\xi_{t}-\xi_{j}\right]}{\psi_{t-1}^{2}(\psi_{t-1}+\xi_{i}+\xi_{t})}. (9)

Proof.  Fix t=0,1,t=0,1,\ldots, and ii in VtV_{t}.

Preferential Attachment model: The difference in the visibility of node ii in the BA model between time t+1t+1 and tt is given as

piBA(t+1)piBA(t)=Dt(i)2tDt1(i)2(t1)\displaystyle p_{i}^{BA}(t+1)-p_{i}^{BA}(t)=\frac{D_{t}(i)}{2t}-\frac{D_{t-1}(i)}{2(t-1)}
=Dt1(i)+𝟏[St=i]2tDt1(i)2(t1)\displaystyle\hskip 5.69054pt=\frac{D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right]}{2t}-\frac{D_{t-1}(i)}{2(t-1)} (10)

The above follows by noting that the sum of degree rvs Dt(i)D_{t}(i) for all the nodes in the vertex set VtV_{t} equals 2t2t. Furthermore, by noting that when looking at the expected difference in visibility conditioned on the graph at time t1t-1, StS_{t} is the only random variable in (10), we obtain

𝔼[piBA(t+1)piBA(t)|𝔾t1]\displaystyle{\mathbb{E}}\left[{p_{i}^{BA}(t+1)-p_{i}^{BA}(t)\ |\ {\mathbb{G}}_{t-1}}\right]
=Dt1(i)+𝐏[St=i|𝔾t1]2tDt1(i)2(t1)\displaystyle=\frac{D_{t-1}(i)+{{\bf P}}\left[{S_{t}=i\ |\ {\mathbb{G}}_{t-1}}\right]}{2t}-\frac{D_{t-1}(i)}{2(t-1)} (11)

and (7) follows.

Additive Fitness model: Similarly in the additive fitness model, the difference in the visibility of node ii can be written as

piAF(t+1)piAF(t)\displaystyle p_{i}^{AF}(t+1)-p_{i}^{AF}(t)
=ξi+Dt(i)jVtξj+Dt(j)ξi+Dt1(i)jVt1ξj+Dt1(j)\displaystyle=\frac{\xi_{i}+D_{t}(i)}{\sum_{j\in V_{t}}\xi_{j}+D_{t}(j)}-\frac{\xi_{i}+D_{t-1}(i)}{\sum_{j\in V_{t-1}}\xi_{j}+D_{t-1}(j)}
=ξi+Dt1(i)+𝟏[St=i]Ξt1+ξt+2tξi+Dt1(i)Ξt1+2(t1).\displaystyle=\frac{\xi_{i}+D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right]}{\Xi_{t-1}+\xi_{t}+2t}-\frac{\xi_{i}+D_{t-1}(i)}{\Xi_{t-1}+2(t-1)}. (12)

Taking expectation on both sides conditioned on 𝔾t1{\mathbb{G}}_{t-1} and ξt\xi_{t} leads to (8).

Multiplicative Fitness model: The difference in the visibility of node ii can be written for the multiplicative model as follows

piMF(t+1)piMF(t)\displaystyle p_{i}^{MF}(t+1)-p_{i}^{MF}(t)
=ξiDt(i)jVtξjDt(j)ξiDt1(i)jVt1ξjDt1(j)\displaystyle=\frac{\xi_{i}D_{t}(i)}{\sum_{j\in V_{t}}\xi_{j}D_{t}(j)}-\frac{\xi_{i}D_{t-1}(i)}{\sum_{j\in V_{t-1}}\xi_{j}D_{t-1}(j)}
=ξi[Dt1(i)+𝟏[St=i]]ψt1+ξSt+ξtξiDt1(i)ψt1\displaystyle=\frac{\xi_{i}\left[D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right]\right]}{\psi_{t-1}+\xi_{S_{t}}+\xi_{t}}-\frac{\xi_{i}D_{t-1}(i)}{\psi_{t-1}} (13)

Furthermore, we lower bound the expected change in visibility as follows

𝔼[piMF(t+1)piMF(t)|𝔾t1,ξt]\displaystyle{\mathbb{E}}\left[{p_{i}^{MF}(t+1)-p_{i}^{MF}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
=ξi[Dt1(i)+1ψt1+ξi+ξtDt1(i)ψt1]𝐏[St=i|𝔾t1,ξt]\displaystyle=\xi_{i}\left[\frac{D_{t-1}(i)+1}{\psi_{t-1}+\xi_{i}+\xi_{t}}-\frac{D_{t-1}(i)}{\psi_{t-1}}\right]{{\bf P}}\left[{S_{t}=i\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
+ξii[Dt1(i)ψt1+ξ+ξtDt1(i)ψt1]𝐏[St=|𝔾t1,ξt]\displaystyle\hskip 5.69054pt+\xi_{i}\sum_{\ell\neq i}\left[\frac{D_{t-1}(i)}{\psi_{t-1}+\xi_{\ell}+\xi_{t}}-\frac{D_{t-1}(i)}{\psi_{t-1}}\right]{{\bf P}}\left[{S_{t}=\ell\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
ξi[ψt1𝐏[St=i|𝔾t1,ξt]ψt1(ψt1+ξi+ξt)Dt1(i)ψt1\displaystyle\approx\xi_{i}\Bigg{[}\frac{\psi_{t-1}{{\bf P}}\left[{S_{t}=i\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]}{\psi_{t-1}\left(\psi_{t-1}+\xi_{i}+\xi_{t}\right)}-\frac{D_{t-1}(i)}{\psi_{t-1}}
×[i𝐏[St=|𝔾t1,ξt]ξ+ξtψt1+ξ+ξt]]\displaystyle\hskip 11.38109pt\times\Bigg{[}\sum_{\ell\neq i}{{\bf P}}\left[{S_{t}=\ell\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\cdot\frac{\xi_{\ell}+\xi_{t}}{\psi_{t-1}+\xi_{\ell}+\xi_{t}}\Bigg{]}\Bigg{]}
ξi[ξiDt1(i)ψt1(ψt1+ξi+ξt)\displaystyle\geq\xi_{i}\Bigg{[}\frac{\xi_{i}D_{t-1}(i)}{\psi_{t-1}\left(\psi_{t-1}+\xi_{i}+\xi_{t}\right)}
Dt1(i)ψt1iξDt1()(ξ+ξt)ψt12]\displaystyle\hskip 11.38109pt-\frac{D_{t-1}(i)}{\psi_{t-1}}\cdot\frac{\sum_{\ell\neq i}\xi_{\ell}D_{t-1}(\ell)(\xi_{\ell}+\xi_{t})}{\psi_{t-1}^{2}}\Bigg{]}
ξiDt1(i)[ξiψt1iξDt1()(ξ+ξt)ψt12(ψt1+ξi+ξt)]\displaystyle\simeq\xi_{i}D_{t-1}(i)\left[\frac{\xi_{i}\psi_{t-1}-\sum_{\ell\neq i}\xi_{\ell}D_{t-1}(\ell)(\xi_{\ell}+\xi_{t})}{\psi_{t-1}^{2}(\psi_{t-1}+\xi_{i}+\xi_{t})}\right]

and the result follows.  

We observe from (10) that a node’s visibility increases if it forms a new edge connection in the BA model. However, it is also evident from Lemma 3.1 that the visibility of the node decreases in expectation, in a manner that is directly proportional to its degree Dt1(i)D_{t-1}(i). This can be understood from the fact that higher degree nodes in the network have higher visibility values as a result of which, their decrease in visibility would be more as compared to nodes that have lower visibility values. In the additive fitness model, we can infer from (12) that a node’s visibility increases when it forms a new edge, provided Ξt1+2(t1)>(ξt+2)[ξi+Dt1(i)]\Xi_{t-1}+2(t-1)>(\xi_{t}+2)[\xi_{i}+D_{t-1}(i)]. This condition is expected to hold for large values of tt, unless the fitness value ξt\xi_{t}, or ξi\xi_{i}, or both, are very large. Similar to the BA model, node visibility decreases in expectation, with the magnitude of decrease being directly proportional to the sum of degree and fitness values, and the fitness value of the new incoming node ξt\xi_{t}.

In contrast with the above, we can see from (9) that in expectation, the nodes are able to increase their visibility over time, given that their fitness value remains large with respect to the network. In addition to this, the expected change in visibility directly depends on the product of fitness and the present degree of the node, boosting the visibility of a leader much more as compared to the BA and additive fitness models. Note that we derive results for change in visibility values over a single time step. The results can be easily generalized to any fixed number of time steps.

3.1 Node visibility over time – General fitness model

We study the change in node visibility over time for a general fitness model described in (2). For ease of notation, we define Γt=iVtg(ξi,Dt(i))\Gamma_{t}=\sum_{i\in V_{t}}g(\xi_{i},D_{t}(i)) and Δt,ig=g(ξi,Dt1(i)+1)g(ξi,Dt1(i))\Delta_{t,i}^{g}=g(\xi_{i},D_{t-1}(i)+1)-g(\xi_{i},D_{t-1}(i)) for ii in VtV_{t} and t=1,2,t=1,2,\ldots.

Lemma 3.2

For every t=0,1,,t=0,1,\ldots, and ii in Vt1V_{t-1}: Let 𝔾t1{\mathbb{G}}_{t-1} be the graph at time t1t-1, we have

𝔼[piGF(t+1)piGF(t)|𝔾t1,ξt]\displaystyle{\mathbb{E}}\left[{p_{i}^{GF}(t+1)-p_{i}^{GF}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
(g(ξi,Dt1(i))Γt12)[g(ξi,Dt1(i))Γt1(Δt,igg(ξt,1))\displaystyle\simeq\left(\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}^{2}}\right)\Bigg{[}\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}}\cdot\left(\Delta_{t,i}^{g}-g(\xi_{t},1)\right)
+ki(g(ξk,Dt1(k))Γt1)(Δt,igΔt,kgg(ξt,1))].\displaystyle\hskip 5.69054pt+\sum_{k\neq i}\left(\frac{g(\xi_{k},D_{t-1}(k))}{\Gamma_{t-1}}\right)\left(\Delta_{t,i}^{g}-\Delta_{t,k}^{g}-g(\xi_{t},1)\right)\Bigg{]}. (14)

Proof.

The difference in the visibility of node ii can be written as follows

piGF(t+1)piGF(t)\displaystyle p_{i}^{GF}(t+1)-p_{i}^{GF}(t)
=𝔼[g(ξi,Dt1(i)+𝟏[St=i])jVtg(ξj,Dt1(j)+𝟏[St=j])\displaystyle=\mathbb{E}\Bigg{[}\frac{g(\xi_{i},D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right])}{\sum_{j\in V_{t}}g(\xi_{j},D_{t-1}(j)+{\bf 1}\left[S_{t}=j\right])}
g(ξi,Dt1(i))jVt1g(ξj,Dt1(j))]\displaystyle\hskip 14.22636pt-\frac{g(\xi_{i},D_{t-1}(i))}{\sum_{j\in V_{t-1}}g(\xi_{j},D_{t-1}(j))}\Bigg{]} (15)

We further introduce the following notation for kk in VtV_{t}, Ωt,k=𝟏[St=k]\Omega_{t,k}={\bf 1}\left[S_{t}=k\right].

piGF(t+1)piGF(t)=\displaystyle p_{i}^{GF}(t+1)-p_{i}^{GF}(t)=
=Ωt,i[g(ξi,Dt1(i)+1)Γt1+Δt,ig+g(ξt,1)g(ξi,Dt1(i))Γt1]\displaystyle=\Omega_{t,i}\left[\frac{g(\xi_{i},D_{t-1}(i)+1)}{\Gamma_{t-1}+\Delta_{t,i}^{g}+g(\xi_{t},1)}-\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}}\right]
+kiΩt,k[g(ξi,Dt1(i))Γt1+Δt,kg+g(ξt,1)g(ξi,Dt1(i))Γt1]\displaystyle+\sum_{k\neq i}\Omega_{t,k}\left[\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}+\Delta_{t,k}^{g}+g(\xi_{t},1)}-\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}}\right]

Furthermore, we introduce the following shorthand, g^t,i=g(ξi,Dt1(i))\hat{g}_{t,i}=g(\xi_{i},D_{t-1}(i)) and continue from above.

piGF(t+1)piGF(t)\displaystyle p_{i}^{GF}(t+1)-p_{i}^{GF}(t)
=Ωt,i[jijVt1g^t,jΔt,igg(ξt,1)g^t,iΓt1(Γt1+Δt,ig+g(ξt,1))]\displaystyle=\Omega_{t,i}\ \left[\frac{\sum_{\begin{subarray}{c}j\neq i\\ j\in V_{t-1}\end{subarray}}\hat{g}_{t,j}\Delta_{t,i}^{g}-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}\left(\Gamma_{t-1}+\Delta_{t,i}^{g}+g(\xi_{t},1)\right)}\right]
+kiΩt,k[g^t,iΔt,kgg(ξt,1)g^t,iΓt1(Γt1+Δt,kg+g(ξt,1))]\displaystyle+\sum_{k\neq i}\Omega_{t,k}\left[\frac{-\hat{g}_{t,i}\Delta_{t,k}^{g}-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}\left(\Gamma_{t-1}+\Delta_{t,k}^{g}+g(\xi_{t},1)\right)}\right] (16)

Using expression (16), we approximate the expected change in visibility for sufficiently large values of tt, as follows

𝔼[piGF(t+1)piGF(t)|𝔾t1,ξt]\displaystyle{\mathbb{E}}\left[{p_{i}^{GF}(t+1)-p_{i}^{GF}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]
𝐏[St=i|𝔾t1,ξt][(jijVt1g^t,jΔt,ig)g(ξt,1)g^t,iΓt12]\displaystyle\simeq{{\bf P}}\left[{S_{t}=i\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\left[\frac{\left(\sum_{\begin{subarray}{c}j\neq i\\ j\in V_{t-1}\end{subarray}}\hat{g}_{t,j}\Delta_{t,i}^{g}\right)-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma^{2}_{t-1}}\right]
ki𝐏[St=k|𝔾t1,ξt][g^t,iΔt,kgΓt12+g(ξt,1)g^t,iΓt12]\displaystyle-\sum_{k\neq i}{{\bf P}}\left[{S_{t}=k\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\left[\frac{\hat{g}_{t,i}\Delta_{t,k}^{g}}{\Gamma^{2}_{t-1}}+\frac{g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right] (17)

on substituting the expressions for {𝐏[St=|𝔾t1,ξt],Vt1}\left\{{{\bf P}}\left[{S_{t}=\ell\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right],\ \ell\in V_{t-1}\right\}, we obtain

g^t,iΔt,igΓt12g^t,ikig^t,kΓt1Δt,kgΓt12g(ξt,1)g^t,iΓt12\displaystyle\simeq\frac{\hat{g}_{t,i}\Delta^{g}_{t,i}}{\Gamma_{t-1}^{2}}-\hat{g}_{t,i}\sum_{k\neq i}\frac{\hat{g}_{t,k}}{\Gamma_{t-1}}\frac{\Delta^{g}_{t,k}}{\Gamma_{t-1}^{2}}-\frac{g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}
=(g^t,iΓt12)[Δt,igkig^t,kΓt1Δt,kgg(ξt,1)]\displaystyle=\left(\frac{\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right)\left[\Delta_{t,i}^{g}-\sum_{k\neq i}\frac{\hat{g}_{t,k}}{\Gamma_{t-1}}\Delta_{t,k}^{g}-g(\xi_{t},1)\right]
=(g^t,iΓt12)[(g^t,iΓt1)(Δt,igg(ξt,1))\displaystyle=\left(\frac{\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right)\Bigg{[}\left(\frac{\hat{g}_{t,i}}{\Gamma_{t-1}}\right)\left(\Delta_{t,i}^{g}-g(\xi_{t},1)\right)
+ki(g^t,kΓt1)(Δt,igΔt,kgg(ξt,1))].\displaystyle\hskip 5.69054pt+\sum_{k\neq i}\left(\frac{\hat{g}_{t,k}}{\Gamma_{t-1}}\right)\left(\Delta_{t,i}^{g}-\Delta_{t,k}^{g}-g(\xi_{t},1)\right)\Bigg{]}.

 
The expected change in visibility will be positive if

Δt,igΔt,kg,ki,kVt1\Delta_{t,i}^{g}\geq\Delta_{t,k}^{g},\ k\neq i,k\in V_{t-1}

and Δt,igg(ξt,1)\Delta_{t,i}^{g}\geq g(\xi_{t},1). While this is a sufficient condition, the expected change in visibility will be positive if node ii has significantly large visibility in the network. Observe that for the BA and additive fitness models, Δt,ig=1\Delta_{t,i}^{g}=1. While for the BA model, the approximation is too crude, we obtain a decrease in expected visibility for the additive model from (14). For the MF model,

Δt,ig=ξi(Dt1(i)+1)ξiDt1(i)=ξi.\Delta_{t,i}^{g}=\xi_{i}(D_{t-1}(i)+1)-\xi_{i}D_{t-1}(i)=\xi_{i}.

Therefore, Δt,ig\Delta_{t,i}^{g} will be greater for nodes with higher fitness values. However, an incoming node with a high fitness value can lead to a decrease in the expected visibility of an influential node. This agrees with our findings in Lemma 3.1. For attachment functions which combine the fitness and degree information in a nonlinear fashion, influential nodes can retain greater visibility for longer periods of time while being protected from decrease in visibility due to new nodes with high fitness values. For example, for nonlinear attachment rules like g(ξi,Dt(i))=(ξiDt(i))2g(\xi_{i},D_{t}(i))=\left(\xi_{i}D_{t}(i)\right)^{2}, Δt,ig=ξi2(2Dt(i)+1)\Delta^{g}_{t,i}=\xi_{i}^{2}\left(2D_{t}(i)+1\right), i.e., it is a function of both the fitness and degree values. In this scenario, new nodes with high fitness values cannot lead to a decrease in expected visibility for influential nodes because of their low initial degree. However, nodes with low degree and high fitness values could experience a decrease in their visibility because only nodes with significantly large fitness and degree values can increase their visibility in an expected sense, which is not the case for the multiplicative model. This would lead to a great difficulty for new nodes with high fitness values to attain visibility in the network. Furthermore, it will become progressively more difficult for new nodes to become visible in the network.

4 Experimental results on node visibility – Fitness models

In order to illustrate Lemmas 3.1 and 3.2 and depict the change in visibility of influential nodes, we carry out two sets of simulation experiments. We compare how the visibility of leaders or influential nodes change over time in the Barabasi-Albert (BA), additive fitness (AF), multiplicative fitness (MF) and general fitness (GF) models. Throughout, the fitness variable ξ\xi is taken to be Pareto distributed with parameter αp\alpha_{p}.

For each given growth model and parameter value of the Pareto distribution, we generate a graph 𝔾T0X{\mathbb{G}}_{T_{0}}^{X}, where X{BA,AF,MF,GF}X\in\{BA,AF,MF,GF\} and T0=10000T_{0}=10000. We define p(k)(T0;𝔾tX)p_{(k)}(T_{0};{\mathbb{G}}_{t}^{X}) to be the visibility of the node in graph 𝔾tX{\mathbb{G}}_{t}^{X} which had the kk-highest visibility in graph 𝔾T0X{\mathbb{G}}_{T_{0}}^{X}. For each growth model, starting from 𝔾T0X{\mathbb{G}}_{T_{0}}^{X}, we generate RR realizations 𝔾TX,(1),𝔾TX,(2),𝔾TX,(R){\mathbb{G}}_{T}^{X,(1)},{\mathbb{G}}_{T}^{X,(2)},\ldots{\mathbb{G}}_{T}^{X,(R)} with T=100000T=100000, which are mutually independent conditioned on 𝔾T0X{\mathbb{G}}_{T_{0}}^{X}. Since we are interested in the evolution of the visibility of influential nodes, for the purpose of this experiment we track the visibility of the top 50 nodes starting from t=10000t=10000 to t=100000t=100000. However, conditioned on the graph at time T0T_{0}, the visibility values are random variables; therefore, we average the visibility values across all the realizations at any given time. We define p¯(k)(T0;t;X)=1Rr=1Rp(k)(T0;𝔾tX,(r))\bar{p}_{(k)}(T_{0};t;X)=\frac{1}{R}\sum_{r=1}^{R}p_{(k)}(T_{0};{\mathbb{G}}_{t}^{X,(r)}) as the averaged visibility of the node at time tt (t>T0t>T_{0}) which had the k-highest visibility at time T0T_{0} for growth model XX.

In other words, we track p(k)(T0;𝔾tX,(r))p_{(k)}\left(T_{0};{\mathbb{G}}_{t}^{X,(r)}\right) for k=1,2,,50k=1,2,\ldots,50, r=1,2,,Rr=1,2,\ldots,R, t=10000,10001,,100000t=10000,10001,\ldots,100000, and X{BA,AF,MF,GF}X\in\{BA,AF,MF,GF\}. For large enough independent runs RR, we expect p¯(k)(T0;t;X)\bar{p}_{(k)}(T_{0};t;X) to be a reasonable approximation of 𝔼[p(k)(T0;𝔾tX,(1))|𝔾T0X]{\mathbb{E}}\left[{p_{(k)}\left(T_{0};{\mathbb{G}}_{t}^{X,(1)}\right)\big{|}{\mathbb{G}}_{T_{0}}^{X}}\right], which is the expected value of visibility at time tt for the node which had the kk-highest visibility value at time T0T_{0} conditioned on the graph at time T0T_{0}.

Figures 1 and 2 show the change in visibility of top 50 nodes at t=T0=10000t=T_{0}=10000 when the graph is allowed to grow for 90000 iterations until t=100000t=100000. Visibility values averaged over R=50R=50 independent runs from T0=10000T_{0}=10000 are shown. We observe from the box plots in the two figures that the highest visibility nodes in the BA and AF slowly reduce in their visibility values over time as was predicted by Lemma 3.1. We also observe that lot more nodes in BA and AF models have significantly higher visibility values. However, for the multiplicative fitness model only 2 nodes exhibit high visibility values (for αp=1,2\alpha_{p}=1,2), with one node dominating the entire network at any point of time for most of the duration. In the MF model, for αp=1,2\alpha_{p}=1,2, we observe that a node with lower visibility replaces one with higher visibility between t=T0=10000t=T_{0}=10000 and t=100000t=100000, because the lower visibility node joined the network later but with a significantly higher fitness value. For αp=3\alpha_{p}=3, we do not see this behavior because it is less likely that a node with a significantly high fitness value will enter the network.

While we observe that in both the MF and GF models, nodes with high visibility are able to maintain their influence (or, visibility) over time, we next investigate how easy it is for new nodes with high fitness to gain influence over time. For this purpose, we conduct an experiment where we introduce a node at time t=T0+1=10001t=T_{0}+1=10001, with fitness value ξT0+1=2maxtVT0ξt\xi_{T_{0}+1}=2\max_{t\in V_{T_{0}}}\xi_{t}, i.e., twice the fitness value of the maximum fitness value among all nodes until time T0T_{0}. For growth model X{AF,MF,GF}X\in\{AF,MF,GF\}, we generate R=50R=50 realizations beyond time T0T_{0} by setting the fitness value of node T0+1T_{0}+1 as mentioned above. We define pi(𝔾tX)p_{i}\left({\mathbb{G}}_{t}^{X}\right) to be the visibility of the node ii in graph 𝔾tX{\mathbb{G}}_{t}^{X}. We average the visibility values across RR realizations, 𝔾TX,(1),𝔾TX,(2),,𝔾TX,(R){\mathbb{G}}_{T}^{X,(1)},{\mathbb{G}}_{T}^{X,(2)},\ldots,{\mathbb{G}}_{T}^{X,(R)}, and compute the averaged visibility of node T0+1T_{0}+1 defined as p¯T0+1(t)=1Rr=1R(𝔾tX,(r))\bar{p}_{T_{0}+1}(t)=\frac{1}{R}\sum_{r=1}^{R}\left({\mathbb{G}}_{t}^{X,(r)}\right), t>T0t>T_{0}. Subsequently, we track the visibility of this newly introduced node in the three fitness models and present the results in Figure 3. We observe that in the AF and GF models, the visibility of the newly introduced node decreases with time. This concurs with Lemma 3.1 where we showed that the visibility of nodes in AF models decreases with time; and with Lemma 3.2 where we argue that in the general fitness model with a nonlinear attachment rule, it becomes progressively difficult for new nodes to get visible in the network. For the MF model with αp=1\alpha_{p}=1, we observe that p¯T0+1(t)\bar{p}_{T_{0}+1}(t) increases slightly but decays beyond t=30000t=30000. This is because nodes with even higher fitness values enter the network after node T0+1T_{0}+1. However, for αp=2,3\alpha_{p}=2,3 the visibility of node T0+1T_{0}+1 keeps increasing until t=100000t=100000. Note that for αp=3\alpha_{p}=3, node T0+1T_{0}+1 becomes dominant in the network very quickly because as the node increases its degree, the degree-fitness product becomes large compared to that of other nodes in the network because having a large fitness node is a rarer event for larger value of αp\alpha_{p}.

From the experiments we reaffirm that multiplicative models allow high visibility nodes to maintain their influence in the network for a longer period of time, while at the same time allowing high fitness nodes that are introduced later in the network to become influential. However, we observe that only a few number of nodes can be influential in the network at any given moment of time. This leads us to consider spatial attachment rules in conjunction with the multiplicative model to enforce regions of influence for each individual node such that multiple influential nodes can coexist in a network at the same time.

Refer to caption
Figure 1: Visibility of nodes (averaged over 50 independent runs) over time (after T=100000T=100000 iterations) in the BA model.
Refer to caption
Figure 2: Visibility of nodes (averaged over 50 independent runs) over time in three growth models. The columns represent different parameters of the Pareto distribution: αp=1,2,3\alpha_{p}=1,2,3. Each row shows visibility results of the 50 most influential nodes after T=100000T=100000 iterations for the three growth models – additive, multiplicative and general fitness model.
Refer to caption
Figure 3: Visibility of new node (averaged over 50 independent runs) added with a high fitness value over time in the three growth models – additive, multiplicative and general fitness model. This represents results for different parameters of the Pareto distribution: αp=1,2,3\alpha_{p}=1,2,3.

5 Analytical results on node visibility – Spatial models

In the previous sections, we investigated the node visibility dynamics of the fitness models. A major takeaway was that multiplicative fitness models allow influential nodes to maintain their visibility while still permitting newly introduced nodes with high fitness to gain influence over time. However, a shortcoming of the MF model was that it could not support a multiplicity of influential nodes. We investigate whether a configuration of spatial models exists that retains the positive aspects of the MF model while addressing this shortcoming.

5.1 Preliminaries - Results on various notions of visibility in spatial models

Having defined global and local visibilities for a node in Section 2, we find it helpful to define a notion of maximum visibility

pimax(t+1)=maxχh(χi,χ;ξi,Di(t))jVth(χj,χ;ξj,Dj(t)),p_{i}^{\mbox{max}}(t+1)=\max_{\chi}\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}, (18)

with χt,i\chi_{t,i}^{*} being the location vector for which the maximum is attained. For rest of the analysis, we assume that the attachment function hh is separable into a product form of α(χ1,χ2)β(ξ,D)\alpha(\chi_{1},\chi_{2})\cdot\beta(\xi,D) as shown in (6). For analytical purposes, we set α(χ1,χ2)=eγd(χ1,χ2)\alpha(\chi_{1},\chi_{2})=e^{-\gamma d(\chi_{1},\chi_{2})}, where d:A×Ad:A\times A\to\mathbb{R} is a metric on the location space AA.

Lemma 5.1

For α(χ1,χ2)=eγd(χ1,χ2)\alpha(\chi_{1},\chi_{2})=e^{-\gamma d(\chi_{1},\chi_{2})} and every t=1,2,,t=1,2,\ldots,

limγpilocal(t+1)=1=limγpimax(t+1)\lim_{\gamma\to\infty}p_{i}^{\mbox{local}}(t+1)=1=\lim_{\gamma\to\infty}p_{i}^{\mbox{max}}(t+1) (19)

and

limγχt,i=χi.\lim_{\gamma\to\infty}\chi_{t,i}^{*}=\chi_{i}. (20)

In other words, for all δ>0\delta>0 and for all t=1,2,t=1,2,\ldots, there exists γt,δ\gamma_{t,\delta} such that for all γγt,δ\gamma\geq\gamma_{t,\delta}

|pilocal(t+1)pimax(t+1)|<δ\left|p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{max}}(t+1)\right|<\delta

and

|χiχt,i|<δ.\left|\chi_{i}-\chi_{t,i}^{*}\right|<\delta.

Proof.  For ii in VtV_{t} and t=1,2,3,t=1,2,3,\ldots

pilocal(t+1)\displaystyle p_{i}^{\mbox{local}}(t+1)
=β(ξi,Di(t))β(ξi,Di(t))+jieγd(χi,χj)β(ξj,Dj(t))\displaystyle=\frac{\beta(\xi_{i},D_{i}(t))}{\beta(\xi_{i},D_{i}(t))+\sum_{j\neq i}e^{-\gamma d(\chi_{i},\chi_{j})}\beta(\xi_{j},D_{j}(t))}
β(ξi,Di(t))β(ξi,Di(t))+(tmaxkVtξk)jietϵd(χi,χj)\displaystyle\geq\frac{\beta(\xi_{i},D_{i}(t))}{\beta(\xi_{i},D_{i}(t))+\left(t\max_{k\in V_{t}}\xi_{k}\right)\sum_{j\neq i}e^{-t^{\epsilon}d(\chi_{i},\chi_{j})}}
γ=tϵ1\displaystyle\xrightarrow{\gamma=t^{\epsilon}\to\infty}1

and we obtain first part of (19). Note that the scaling γ=tϵ\gamma=t^{\epsilon} implies that for larger graphs, a smaller γ\gamma would be necessary. Second part is obtained by noting that pimax(t+1)pilocal(t+1)p_{i}^{\mbox{max}}(t+1)\geq p_{i}^{\mbox{local}}(t+1) for every node ii and time tt. Equation (20) also follows similarly.  

Lemma 5.1 suggests that pimaxp_{i}^{\mbox{max}} is a good approximation for pilocalp_{i}^{\mbox{local}} when γ\gamma is sufficiently large. Next, we derive relationships between the global and local notions of visibility.

Lemma 5.2

For ϵ>0\epsilon>0, we have

e2γϵ𝐏[d(χ,χi)<ϵ]pilocal(t+1)piglobal(t+1)\displaystyle e^{-2\gamma\epsilon}{{\bf P}}\left[{d(\chi,\chi_{i})<\epsilon}\right]p_{i}^{\mbox{local}}(t+1)\leq p_{i}^{\mbox{global}}(t+1)
pimax(t+1)pilocal(t+1)\displaystyle\leq p_{i}^{\mbox{max}}(t+1)\approx p_{i}^{\mbox{local}}(t+1) (21)

Proof.  For convenience, we use the shorthand notation, βi(t)=β(ξi,Di(t))\beta_{i}(t)=\beta(\xi_{i},D_{i}(t)) for ii in VtV_{t} and t=1,2,t=1,2,\ldots. The upper bound on piglobal(t+1)p_{i}^{\mbox{global}}(t+1) follows from the definition of pimax(t+1)p_{i}^{\mbox{max}}(t+1). To obtain the lower bound, for a fixed ϵ>0\epsilon>0 we condition on the event 𝟏[d(χi,χ)<ϵ]{\bf 1}\left[d(\chi_{i},\chi)<\epsilon\right]

piglobal(t+1)\displaystyle p_{i}^{\mbox{global}}(t+1)
𝐏[d(χi,χ)<ϵ]\displaystyle\geq{{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]
×𝔼χ[h(χi,χ;ξi,Di(t))jVth(χj,χ;ξj,Dj(t))𝟏[d(χi,χ)<ϵ]]\displaystyle\hskip 5.69054pt\times\mathbb{E}_{\chi}\left[\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}{\bf 1}\left[d(\chi_{i},\chi)<\epsilon\right]\right]
𝐏[d(χi,χ)<ϵ]minχ:d(χ,χi)<ϵh(χi,χ;ξi,Di(t))jVth(χj,χ;ξj,Dj(t))\displaystyle\geq{{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]\cdot\min_{\chi:d(\chi,\chi_{i})<\epsilon}\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}
=𝐏[d(χi,χ)<ϵ]\displaystyle={{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]
×minχ:d(χ,χi)<ϵeγd(χ,χi)βi(t)eγd(χ,χi)βi(t)+jieγd(χ,χj)βj(t)\displaystyle\times\min_{\chi:d(\chi,\chi_{i})<\epsilon}\frac{e^{-\gamma d(\chi,\chi_{i})}\beta_{i}(t)}{e^{-\gamma d(\chi,\chi_{i})}\beta_{i}(t)+\sum_{j\neq i}e^{-\gamma d(\chi,\chi_{j})}\beta_{j}(t)}
𝐏[d(χi,χ)<ϵ]e2γϵβi(t)βi(t)+jieγd(χi,χj)βj(t)\displaystyle\geq{{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]\cdot\frac{e^{-2\gamma\epsilon}\beta_{i}(t)}{\beta_{i}(t)+\sum_{j\neq i}e^{-\gamma d(\chi_{i},\chi_{j})}\beta_{j}(t)}
=e2γϵ𝐏[d(χ,χi)<ϵ]pilocal(t+1)\displaystyle=e^{-2\gamma\epsilon}{{\bf P}}\left[{d(\chi,\chi_{i})<\epsilon}\right]p_{i}^{\mbox{local}}(t+1) (22)

where the penultimate step follows from triangle inequality.   

Lemma 5.2 gives a lower and upper bound for the global visibility in terms of the local visibility. As argued previously, since the model is more concerned with local attractivity of nodes, we will present analysis for pilocalp_{i}^{\mbox{local}}. Lemma 5.2 suggests that changes in the local visibility should also be aptly reflected in the global visibility.

5.2 Node visibility over time

For convenience, we define some shorthand notation: For ii in VtV_{t}, βi(t)=β(ξi,Di(t))\beta_{i}(t)=\beta(\xi_{i},D_{i}(t)). For i,ji,j in VtV_{t}, h^t,ij=h(χi,χj;ξi,Dt1(i))\hat{h}_{t,i\to j}=h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i)), h^t,ij+=h(χi,χj;ξi,Dt1(i)+1)\hat{h}_{t,i\to j}^{+}=h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i)+1) represent the attachment function for node ii at the location of node jj, and Δt,ijh=h(χi,χj;ξi,Dt1(i)+1)h(χi,χj;ξi,Dt1(i))\Delta^{h}_{t,i\to j}=h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i)+1)-h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i)). For kk in VtV_{t}, Ωt,k=𝟏[St=k]\Omega_{t,k}={\bf 1}\left[S_{t}=k\right] and Γt,k=iVth^t,ik\Gamma_{t,k}=\sum_{i\in V_{t}}\hat{h}_{t,i\to k}.

Lemma 5.3

For every t=0,1,,t=0,1,\ldots, and ii in Vt1V_{t-1}: Let 𝔾t1{\mathbb{G}}_{t-1} be the graph at time t1t-1, we have

𝔼[pilocal(t+1)pilocal(t)|𝔾t1,ξt,χt]\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t},\chi_{t}}\right]
C1kVtβk(t1)eγd(χi,χk){ξieγd(χi,χk)ξkξt}\displaystyle\lesssim C_{1}\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}\big{\{}\xi_{i}-e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-\xi_{t}\big{\}} (23)

and

𝔼[pilocal(t+1)pilocal(t)|𝔾t1,ξt,χt]\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ |\ {\mathbb{G}}_{t-1},\xi_{t},\chi_{t}}\right]
C2kVtβk(t1)eγd(χi,χk)\displaystyle\gtrsim C_{2}\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}
×{ξie2ϵγeγd(χi,χk)ξke3ϵγξt}\displaystyle\hskip 56.9055pt\times\big{\{}\xi_{i}-e^{2\epsilon\gamma}e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-e^{3\epsilon\gamma}\xi_{t}\big{\}} (24)

with

C1=1(jVth(χj,χi;ξj,Dt1(j))3)βi(t1)C_{1}=\frac{1}{\left(\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j))^{3}\right)}\beta_{i}(t-1)

and

C2\displaystyle C_{2} =𝐏[d(χt,χ)<ϵ]eϵγ\displaystyle={{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]e^{-\epsilon\gamma}
×1(jVth(χj,χi;ξj,Dt1(j))3)βi(t1).\displaystyle\times\frac{1}{\left(\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j))^{3}\right)}\beta_{i}(t-1).

Proof.  The difference in the local visibility of node ii can be written as follows

pilocal(t+1)pilocal(t)\displaystyle p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)
=h(χi,χi;ξi,Dt1(i)+𝟏[St=i])jVth(χj,χi;ξj,Dt1(j)+𝟏[St=j])\displaystyle=\frac{h(\chi_{i},\chi_{i};\xi_{i},D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right])}{\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j)+{\bf 1}\left[S_{t}=j\right])}
h(χi,χi;ξi,Dt1(i))jVt1h(χj,χi;ξj,Dt1(j))\displaystyle\hskip 5.69054pt-\frac{h(\chi_{i},\chi_{i};\xi_{i},D_{t-1}(i))}{\sum_{j\in V_{t-1}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j))}
=Ωt,i[h^t,ii+Δt,iih+Γt,i+h(χt,χi;ξt,1)h^t,iiΓt,i]\displaystyle=\Omega_{t,i}\left[\frac{\hat{h}^{+}_{t,i\to i}}{\Delta^{h}_{t,i\to i}+\Gamma_{t,i}+h(\chi_{t},\chi_{i};\xi_{t},1)}-\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\right]
+kiΩt,k[h^t,iiΔt,kih+Γt,i+h(χt,χi;ξt,1)h^t,iiΓt,i]\displaystyle+\sum_{k\neq i}\Omega_{t,k}\left[\frac{\hat{h}_{t,i\to i}}{\Delta^{h}_{t,k\to i}+\Gamma_{t,i}+h(\chi_{t},\chi_{i};\xi_{t},1)}-\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\right]
Ωt,i[Δt,iihΓt,i]\displaystyle\approx\Omega_{t,i}\left[\frac{\Delta^{h}_{t,i\to i}}{\Gamma_{t,i}}\right]
kiΩt,k[h^t,iiΔt,kihΓt,i2+h(χt,χi;ξt,1)h^t,iiΓt,i2]\displaystyle\hskip 5.69054pt-\sum_{k\neq i}\Omega_{t,k}\Bigg{[}\frac{\hat{h}_{t,i\to i}\Delta^{h}_{t,k\to i}}{\Gamma_{t,i}^{2}}+\frac{h(\chi_{t},\chi_{i};\xi_{t},1)\hat{h}_{t,i\to i}}{\Gamma_{t,i}^{2}}\Bigg{]} (25)

Using (25), we upper bound the expected change in local visibility by noting the fact that the expected increase in visibility is the most when the new node has the highest probability to form connection with node ii, which occurs when the attribute of the new node is close to χi\chi_{i}

𝔼[pilocal(t+1)pilocal(t)|𝔾t,ξt,χt]\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ |{\mathbb{G}}_{t},\xi_{t},\chi_{t}}\right]
𝔼[pilocal(t+1)pilocal(t)|𝔾t,ξt,χt=χi]\displaystyle\leq{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ |{\mathbb{G}}_{t},\xi_{t},\chi_{t}=\chi_{i}}\right]
1Γt,i[h^t,iiΓt,iΔt,iihki(h^t,kiΓt,i)(h^t,iiΓt,iΔt,kih\displaystyle\leq\frac{1}{\Gamma_{t,i}}\Bigg{[}\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\Delta^{h}_{t,i\to i}-\sum_{k\neq i}\left(\frac{\hat{h}_{t,k\to i}}{\Gamma_{t,i}}\right)\Bigg{(}\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\Delta^{h}_{t,k\to i}
h(χt,χi;ξt,1)h^t,iiΓt,i)]\displaystyle\hskip 5.69054pt-\frac{h(\chi_{t},\chi_{i};\xi_{t},1)\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\Bigg{)}\Bigg{]}
1Γt,i3kVt{h^t,iih^t,kiΔt,iihh^t,iih^t,kiΔt,kih\displaystyle\approx\frac{1}{\Gamma_{t,i}^{3}}\sum_{k\in V_{t}}\Big{\{}\hat{h}_{t,i\to i}\hat{h}_{t,k\to i}\Delta_{t,i\to i}^{h}-\hat{h}_{t,i\to i}\hat{h}_{t,k\to i}\Delta_{t,k\to i}^{h}
h^t,kih^t,iih(χi,χi;ξt,1)}\displaystyle\hskip 5.69054pt-\hat{h}_{t,k\to i}\hat{h}_{t,i\to i}h(\chi_{i},\chi_{i};\xi_{t},1)\Big{\}}
1Γt,i3β(ξi,Dt1(i))kVtβ(ξk,Dt1(k))eγd(χi,χk)×\displaystyle\approx\frac{1}{\Gamma_{t,i}^{3}}\beta(\xi_{i},D_{t-1}(i))\sum_{k\in V_{t}}\beta(\xi_{k},D_{t-1}(k))e^{-\gamma d(\chi_{i},\chi_{k})}\times
{Δt,iihΔt,kihβ(ξt,1)}\displaystyle\hskip 56.9055pt\big{\{}\Delta^{h}_{t,i\to i}-\Delta^{h}_{t,k\to i}-\beta(\xi_{t},1)\big{\}} (26)

Observe that Δt,iih=β(ξi,Dt1(i)+1)β(ξi,Dt1(i))\Delta^{h}_{t,i\to i}=\beta(\xi_{i},D_{t-1}(i)+1)-\beta(\xi_{i},D_{t-1}(i)) which equals ξi\xi_{i} in a multiplicative α\alpha model. Similarly, Δt,kih\Delta^{h}_{t,k\to i} equals eγd(χi,χk)ξke^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}. Therefore, in a multiplicative model, (26) reduces to

𝔼[pilocal(t+1)pilocal(t)|𝔾t,ξt,χt]1Γt,i3βi(t1)\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ |\ {\mathbb{G}}_{t},\xi_{t},\chi_{t}}\right]\lesssim\frac{1}{\Gamma_{t,i}^{3}}\beta_{i}(t-1)
×kVtβk(t1)eγd(χi,χk){ξieγd(χi,χk)ξkξt}\displaystyle\hskip 5.69054pt\times\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}\big{\{}\xi_{i}-e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-\xi_{t}\big{\}} (27)

which gives the upper bound (24). To lower bound the same, we define the following notation – For ϵ>0\epsilon>0,

χi,minϵ=argminχ:d(χ,χi)<ϵh(χi,χ;ξi,Di(t))jVth(χj,χ;ξj,Dj(t)).\chi_{i,\min}^{\epsilon}=\arg\min_{\chi:d(\chi,\chi_{i})<\epsilon}\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}.

In other words, χi,minϵ\chi_{i,\min}^{\epsilon} is the attribute vector on the ϵ\epsilon-ball around χi\chi_{i} where the attachment probability to ii is the lowest.

Using (25), we lower bound the expected change in local visibility by noting that conditioned on the event that the new node has an attribute vector within an ϵ\epsilon-ball around χi\chi_{i}, it will be minimum when it is equal to χi,minϵ\chi_{i,\min}^{\epsilon}. Accordingly, we define the following notation: h^t,kimin=h(χk,χi,minϵ;ξk,Dt1(k))\hat{h}^{\min}_{t,k\to i}=h(\chi_{k},\chi_{i,min}^{\epsilon};\xi_{k},D_{t-1}(k)). We lower bound using conditioning arguments

𝔼[pilocal(t+1)pilocal(t)|𝔾t,ξt,χt]\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ |{\mathbb{G}}_{t},\xi_{t},\chi_{t}}\right]
𝔼[pilocal(t+1)pilocal(t)|𝔾t,ξt,d(χt,χ)<ϵ]\displaystyle\geq{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \big{|}\ {\mathbb{G}}_{t},\xi_{t},d(\chi_{t},\chi)<\epsilon}\right]
×𝐏[d(χt,χ)<ϵ]\displaystyle\hskip 5.69054pt\times{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]
𝔼[pilocal(t+1)pilocal(t)|𝔾t,ξt,χt=χminϵ]\displaystyle\geq{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \big{|}\ {\mathbb{G}}_{t},\xi_{t},\chi_{t}=\chi_{\min}^{\epsilon}}\right]
×𝐏[d(χt,χ)<ϵ]\displaystyle\hskip 5.69054pt\times{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]
𝐏[d(χt,χ)<ϵ]1Γt,i3kVt{h^t,iiminh^t,kiΔt,iih\displaystyle\gtrsim{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]\cdot\frac{1}{\Gamma_{t,i}^{3}}\sum_{k\in V_{t}}\Big{\{}\hat{h}^{\min}_{t,i\to i}\hat{h}_{t,k\to i}\Delta^{h}_{t,i\to i}
h^t,iih^t,kiminΔt,kihh^t,kiminh^t,iih(χi,χi,minϵ;ξt,1)}\displaystyle-\hat{h}_{t,i\to i}\hat{h}^{\min}_{t,k\to i}\Delta^{h}_{t,k\to i}-\hat{h}^{\min}_{t,k\to i}\hat{h}_{t,i\to i}h(\chi_{i},\chi_{i,\min}^{\epsilon};\xi_{t},1)\Big{\}}
𝐏[d(χt,χ)<ϵ]1Γt,i3β(ξi,Dt1(i))\displaystyle\gtrsim{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]\frac{1}{\Gamma_{t,i}^{3}}\beta(\xi_{i},D_{t-1}(i))
×kVtβk(t1)eγd(χi,χk)\displaystyle\times\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}
×{eϵγξieϵγeγd(χi,χk)ξke2ϵγξt}\displaystyle\times\big{\{}e^{-\epsilon\gamma}\xi_{i}-e^{\epsilon\gamma}e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-e^{2\epsilon\gamma}\xi_{t}\big{\}} (28)

 
where the final step follows by applying triangle inequality. From the lower bound (23) it is clear that the visibility will decrease when nodes close to ii have high fitness values or the new node has a very high fitness value. The upper bound (24) indicates that if the fitness of node ii is sufficiently high compared to other nodes in its local neighborhood, then its visibility should increase. This shows how the local neighborhood of particular node impacts its visibility. Also, given the local nature of the behavior of visibility, more nodes end up being visible in the spatial model compared to the multiplicative fitness model.

Refer to caption
Figure 4: Visibility of nodes (averaged over 50 independent runs) over time in the spatial growth model. The columns represent different parameters of the Pareto distribution: α=1,2,3\alpha=1,2,3. Each row shows visibility results for different values of γ\gamma parameter, 5, 10 and 50 (from top to bottom).

6 Experimental results on node visibilities – Spatial models

To illustrate the findings of Section 5.2, we perform experiments discussed in this section. In the previous section, we theoretically argued how the spatial (S) model with multiplicative β\beta would lead to multiple leaders coexisting in the network. Here we present experimental results that show the multiplicity of leaders that can coexist in the network and how this varies with the decay parameter γ\gamma. Throughout, the fitness variable ξ\xi is taken to be Pareto distributed with parameter αp\alpha_{p}.

For each αp=1,2,3\alpha_{p}=1,2,3 and γ=5,10,50\gamma=5,10,50, we generate a graph 𝔾T0S{\mathbb{G}}_{T_{0}}^{S} as an instantiation of the spatial model, with T0=10000T_{0}=10000. We denote p(k)local(T0;𝔾tS)p^{\mbox{local}}_{(k)}(T_{0};{\mathbb{G}}_{t}^{S}) to be the local visibility of the node in graph 𝔾tS{\mathbb{G}}_{t}^{S} which had the kk-highest local visibility in graph 𝔾T0S{\mathbb{G}}_{T_{0}}^{S}. As previously, we generate RR realizations 𝔾TS,(1),𝔾TS,(2),𝔾TS,(R){\mathbb{G}}_{T}^{S,(1)},{\mathbb{G}}_{T}^{S,(2)},\ldots{\mathbb{G}}_{T}^{S,(R)} with T=100000T=100000, which are mutually independent conditioned on 𝔾T0S{\mathbb{G}}_{T_{0}}^{S}. We average the local visibility values across all the realizations at any given time and denote p¯(k)local(T0;t;S)=1Rr=1Rp(k)local(T0;𝔾tS,(r))\bar{p}^{\mbox{local}}_{(k)}(T_{0};t;S)=\frac{1}{R}\sum_{r=1}^{R}p^{\mbox{local}}_{(k)}(T_{0};{\mathbb{G}}_{t}^{S,(r)}) as the averaged local visibility of the node at time tt which had the k-highest visibility at time T0T_{0} for the spatial model.

In other words, we track p(k)local(T0;𝔾tS,(r))p^{\mbox{local}}_{(k)}\left(T_{0};{\mathbb{G}}_{t}^{S,(r)}\right) for nodes k=1,5,10,30,50,100,200k=1,5,10,30,50,100,200, runs r=1,2,,Rr=1,2,\ldots,R, and decay parameters γ=5,10,50\gamma=5,10,50, with 10000t10000010000\leq t\leq 100000.

Figure 4 shows the change in local visibility of top kthk^{\text{th}} nodes at t=T0=10000t=T_{0}=10000 when the graph is allowed to grow for 90000 iterations until t=100000t=100000. Visibility values averaged over R=50R=50 independent runs from T0=10000T_{0}=10000 are shown. We observe that with increasing γ\gamma, the number of nodes with high local visibility increases in the network. This corroborates the insight that with increasing γ\gamma, the region of influence of nodes decreases leading to the potential of larger number of influential nodes in the network. For γ=5\gamma=5 and αp=1\alpha_{p}=1, we see that the k=5thk=5^{\text{th}} node increases slightly in visibility beyond t=10000t=10000 but then decays beyond a certain point; and with αp=2\alpha_{p}=2, the same node maintains its visibility until t=100000t=100000, while for αp=3\alpha_{p}=3 the node increases its local visibility and dominates its region eventually. However for k=10,30,k=10,30,..., the corresponding nodes have very low values of local visibility. This changes for the γ=10\gamma=10 case. Here, the k=10thk=10^{\text{th}} node also shows a high value of local visibility due to the reduced region of influence of nearby influential nodes. This becomes even more pronounced in γ=50\gamma=50 where we observe that even the k=200thk=200^{\text{th}} node has non-trivial local visibility values that are maintained over some period of time for αp=2,3\alpha_{p}=2,3. This shows that with increasing the decay parameter of the spatial model we can significantly increase the number of leaders in the network, many of whom can maintain their influence in their region.

7 Conclusion

In this paper, we studied the visibility profile of nodes in different classes of network growth models. Firstly, we observed that in the multiplicative fitness model, nodes with high fitness values can successfully maintain visibility in the network to a greater extent when compared with the additive fitness and BA models. A general fitness model that has a non-linear attachment rule, e.g., that combines the degree and fitness values in a non-linear (quadratic) fashion, would also allow influential nodes to maintain visibility. However, unlike in multiplicative models, in these general fitness models with non-linear attachment rules we showed that it becomes progressively more difficult for new nodes with high fitness values to become influential in the entire network. We demonstrated through experimental results that in multiplicative models only a few number of nodes can be influential in the network at any given moment of time. This leads us to investigate spatial models that allows a multiplicity of influential nodes to exist in the network. We also show how the decay parameter in these spatial models can be used to control the number of leaders in the network.

8 Acknowledgements

The authors would like to thank Dr. Ralucca Gera at Naval Postgraduate School, and Dr. Soham De at DeepMind, for insightful discussions and collaboration on previous works that led to this paper.

References

  • [1] L. A. Adamic and B. A. Huberman. Power-law distribution of the world wide web. Science, 287:2115, 2000.
  • [2] M. Allamanis, S. Scellato, and C. Mascolo. Evolution of a location-based online social network: Analysis and models. In Proceedings of the Internet Measurement Conference, page 145–158, 2012.
  • [3] D. R. Amancio, O. N. Oliveira, and L. da Fontoura Costa. Three-feature model to reproduce the topology of citation networks and the effects from authors’ visibility on their h-index. Journal of Informetrics, 6(3):427 – 434, 2012.
  • [4] A.-L. Barabási. Scale-free networks: a decade and beyond. Science, 325(5939):412–413, 2009.
  • [5] A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509–512, 1999.
  • [6] M. Barthélemy. Spatial networks. Physics Reports, 499(1-3):1–101, Feb 2011.
  • [7] G. Bianconi and A.-L. Barabási. Bose-einstein condensation in complex networks. Physical Review Letters, 86(24):5632, 2001.
  • [8] G. Bianconi and A.-L. Barabási. Competition and multiscaling in evolving networks. Europhysics Letters, 54(4):436–442, may 2001.
  • [9] E. Bullmore and O. Sporns. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10(3):186–198, Mar 2009.
  • [10] G. Caldarelli, A. Capocci, P. De Los Rios, and M. A. Muñoz. Scale-free networks from varying vertex intrinsic fitness. Physical review letters, 89:258702, Dec 2002.
  • [11] T. Chakraborty, S. Kumar, P. Goyal, N. Ganguly, and A. Mukherjee. On the categorization of scientific citation profiles in computer science. Communications of the ACM, 58(9):82–90, 2015.
  • [12] T. Chakraborty and S. Nandi. Universal trajectories of scientific success. Knowledge and Information Systems, 54(2):487–509, February 2018.
  • [13] D. Chen, L. Lü, M.-S. Shang, Y.-C. Zhang, and T. Zhou. Identifying influential nodes in complex networks. Physica A: Statistical Mechanics and its Applications, 391(4):1777 – 1787, 2012.
  • [14] G. Ergün and G. J. Rodgers. Growing random networks with fitness. Physica A: Statistical Mechanics and its Applications, 303(1):261–272, 2002.
  • [15] L. Ferretti and M. Cortelezzi. Preferential attachment in growing spatial networks. Physical Review E, 84(1), Jul 2011.
  • [16] D. Garlaschelli and M. I. Loffredo. Fitness-dependent topological properties of the world trade web. Physical Review Letters, 93:188701, Oct 2004.
  • [17] R. Guimerá and L. A. N. Amaral. Modeling the world-wide airport network. The European Physical Journal B, 38(2):381–385, Mar 2004.
  • [18] C. Guo, L. Yang, X. Chen, D. Chen, H. Gao, and J. Ma. Influential nodes identification in complex networks via information entropy. Entropy, 22(2):242, Feb 2020.
  • [19] M. Kaiser and C. C. Hilgetag. Spatial growth of real-world networks. Physical Review E, 69(3), Mar 2004.
  • [20] M. Kimura, K. Saito, and R. Nakano. Extracting influential nodes for information diffusion on a social network. In Proceedings of the 22nd National Conference on Artificial Intelligence - Volume 2, AAAI’07, page 1371–1376. AAAI Press, 2007.
  • [21] J. S. Kong, N. Sarshar, and V. P. Roychowdhury. Experience versus talent shapes the structure of the web. Proceedings of the National Academy of Sciences, 105(37):13724–13729, Sep 2008.
  • [22] D. Liu, V. Fodor, and L. K. Rasmussen. Will scale-free popularity develop scale-free geo-social networks? IEEE Transactions on Network Science and Engineering, 6(3):587–598, 2019.
  • [23] M. Mitzenmacher. A brief history of generative models for power law and lognormal distributions. Internet mathematics, 1(2):226–251, 2004.
  • [24] D. Mohapatra, S. Pal, S. De, P. Kumaraguru, and T. Chakraborty. Modeling citation trajectories of scientific papers. In Advances in Knowledge Discovery and Data Mining - PAKDD’20, pages 620–632, 2020.
  • [25] M. Newman. The structure and function of complex networks. SIAM review, 45(2):167–256, 2003.
  • [26] M. Newman. Networks: an introduction, 2010.
  • [27] K. Nguyen and D. A. Tran. Fitness-Based Generative Models for Power-Law Networks, pages 39–53. Springer US, Boston, MA, 2012.
  • [28] A. Noulas, B. Shaw, R. Lambiotte, and C. Mascolo. Topological properties and temporal dynamics of place networks in urban environments. In Proceedings of the 24th International Conference on World Wide Web, pages 431–441, 2015.
  • [29] S. Pal, S. De, T. Chakraborty, and R. Gera. Visibility of nodes in network growth models. In 3rd International Winter School and Conference on Network Science, pages 35–45, 2017.
  • [30] V. D. P. Servedio, G. Caldarelli, and P. Buttà. Vertex intrinsic fitness: How to produce arbitrary scale-free networks. Physical Review E, 70(5):056126, 2004.
  • [31] D. Wang, C. Song, and A.-L. Barabási. Quantifying long-term scientific impact. Science, 342(6154):127–132, 2013.
  • [32] S.-H. Yook, H. Jeong, and A.-L. Barabási. Modeling the internet’s large-scale topology. Proceedings of the National Academy of Sciences, 99(21):13382–13386, 2002.
[Uncaptioned image] Shravika Mittal is a senior undergraduate student in Computer Science and Engineering at IIIT-Delhi. Her research interests include Network Science, Natural Language Processing and Machine Learning. She has received the Dean’s list for Excellence in Academics, and Innovation in Research and Development.
[Uncaptioned image] Tanmoy Chakraborty is an Assistant Professor and a Ramanujan Fellow at the Dept. of CSE, IIIT-Delhi, India, where he leads a research group, called LCS2 (http://lcs2.iiitd.edu.in/). His primary research interests include Social Network Analysis, Data Mining, and Natural Language Processing. He has received several awards including Faculty Awards from Google, IBM and Accenture; Early Career Research Award, DAAD Faculty Fellowship. He is a member of ACM and IEEE. More details at http://faculty.iiitd.ac.in/~tanmoy/.
[Uncaptioned image] Siddharth Pal received his Bachelors degree in Electronics and Telecommunication Engineering from Jadavpur University, India, in 2011, and his Masters and Ph.D degrees both in Electrical Engineering from University of Maryland College Park, USA, in 2014 and 2015 respectively. Since then he has been working as a research scientist at Raytheon BBN Technologies. His research interests include network science and analysis, game theory, machine learning with emphasis on neural network based approaches, and stochastic systems.