Dynamics of node influence in
network growth models^†^†thanks: This document does not contain technology or technical data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations.

Shravika Mittal^$\star$, Tanmoy Chakraborty^$\star$ Siddharth Pal^$\dagger$
^$\star$Dept. of CSE, IIIT-Delhi, India ^$\dagger$Raytheon BBN Technologies, USA
{shravika16093, tanmoy}@iiitd.ac.in, siddharth.pal@raytheon.com

Abstract

Many classes of network growth models have been proposed in the literature for capturing real-world complex networks. Existing research primarily focuses on global characteristics of these models, e.g., degree distribution. We aim to shift the focus towards studying the network growth dynamics from the perspective of individual nodes. In this paper, we study how a metric for node influence in network growth models behaves over time as the network evolves. This metric, which we call node visibility, captures the probability of the node to form new connections. First, we conduct an investigation on three popular network growth models – preferential attachment, additive, and multiplicative fitness models; and primarily look into the “influential nodes” or “leaders” to understand how their visibility evolves over time. Subsequently, we consider a generic fitness model and observe that the multiplicative model strikes a balance between allowing influential nodes to maintain their visibility, while at the same time making it possible for new nodes to gain visibility in the network. Finally, we observe that a spatial growth model with multiplicative fitness can curtail the global reach of influential nodes, thereby allowing the emergence of a multiplicity of “local leaders” in the network.

Index Terms:

Network growth models, node dynamics, Barabasi-Albert graphs, fitness based models, spatial models.

1 Introduction

Over the past two decades, complex networks have been used to model real-world systems across different domains ranging from social, biological, information, and technological systems [26, 25]. Investigating the behavior of influential entities or leaders in these real-world networks would help us understand how they are able to gather and maintain prominence over time. For instance, influential papers in citation networks continue to acquire new citations every year [31, 11]. Likewise, celebrities in online social networks keep increasing their follower count over time. These influential entities act as potential spreaders of information in networks. Therefore, keeping a track of their characteristics in an evolving network could have significance in applications ranging from viral marketing and target advertisement to rumor and epidemic control, and protection from spam attacks [13], [18], [20]. To understand and model the dynamics of how a leader node maintains its influence over time, one needs to study the temporal behavior of nodes in a network. In this paper, we introduce a notion, called visibility of a node which is defined as the probability of the node to form new connections in a growing network. For instance, in a preferential attachment model [4, 5], the visibility of a node is proportional to its degree, and inversely proportional to the number of edges in the network. An essential aspect of the study is to investigate the visibility profile of a node which characterizes the temporal evolution of the node’s influence as the network grows. We argue that studying the visibility profile of nodes leads to a better understanding of network evolution due to attachment dynamics, which might not be possible to obtain by simply analysing global network properties such as degree distribution or local node-centric properties such as degree, clustering coefficient, etc. Similar to node persistence over time studied in [28], our approach allows to make headway into this understanding by characterizing the visibility behavior of leaders in the network. While the framework is applicable to arbitrary nodes as well, it is more interesting to first understand the leaders’ behavior. For example, Chakraborty et al. [11] argued that the growth of the degree of a node (its visibility) in a citation network follows one of the five patterns – early rise, late rise, frequent rise, steady rise and steady drop. In a subsequent study [12], they also concluded that highly-cited papers and authors (leaders) follow steady rising pattern. However, it was not clear whether existing network growth models are able to describe such patterns particularly for leader nodes [24]. This motivates us to study the temporal evolution of the visibility of nodes, in particular leaders in networks generated by different network growth models.

We study the visibility of influential nodes¹¹1We use “leaders” and “influential nodes” interchangeably to denote high-degree nodes that can keep attracting new edges over long periods of time. in the graphs simulated by the following network growth models. Barabási-Albert (BA) model [5], aka the preferential attachment model, was able to explain power law behavior in real-world networks using the idea of network growth and the “rich-get-richer” phenomenon. However, the BA model could only capture the “old-get-rich” phenomenon or the “first-mover-advantage” whereby older nodes increase their connectivity and become dominant at the expense of younger nodes in the network. It does not take into account the competitive characteristics of a node that help them flourish in a very short period of time [1, 21]; for instance, in citation networks a few research papers are able to gain lot of citations within a short span of time [3]. Using this as a motivation, Bianconi and Barabási [7, 8] introduced a new class of network growth models in which the incoming nodes form connections based on inherent characteristics of nodes such as novelty, usefulness, etc., captured through a fitness value [27, 10, 16]. This was inspired by the “fit-get-richer” phenomenon observed in real-world networks [7]. Following this, Ergun and Rodgers [14] analysed the degree distribution of network growth models with an attachment mechanism combining the degree and fitness information of nodes in an additive and multiplicative manner.

There are several real-world networks in which the aspect of space plays an important role to understand dynamics of network evolution. For instance, in the biological domain the regions in brain that are spatially closer have a higher probability of being connected as compared to the far-off regions [9]. Similar significance of space can also be observed in online social networks capturing spatial features [2, 22], transportation networks [17], and communication networks. To model networks incorporating spatial features, the class of spatial network models has been proposed. A basic spatial model incorporating notions of preferential and spatial attachment was proposed by Yook et al. [32] to capture underlying mechanisms driving the evolution of the Internet topology. Subsequently, Kaiser et al. [19] analysed a spatial growth model where the edge connection probability decreases with node distance either in an exponential or a power-law manner to explain multiple, interconnected clusters that emerge in real-world networks. See Barthélemy [6] for a comprehensive study on spatial networks. Recently, we proposed a new spatial growth model [24] that was better able to capture the five growth patterns presented in [11], compared to the preferential attachment model and its variants (i.e., additive and multiplicative fitness models [10, 30]).

In this paper, we extend our previous studies [29, 24] and address the following problem statement: Given an influential node with high visibility at a certain point in time, how would its visibility evolve over time? We study this phenomenon across three popular and diverse network growth models – Barabási-Albert (BA) model, (additive, multiplicative, and general) fitness based model and spatial models. One of the primary theoretical findings that we continue to build on from our previous work [29] is that leaders are able to gain more visibility in the multiplicative fitness model setting as compared to the BA and additive fitness models. Along with this, in the multiplicative fitness model, the influential nodes are able to increase their visibility over time, given that their fitness value remains high in comparison to the rest of the network. On the other hand, the visibility values always decrease over time for the BA and additive fitness models (see Section 3 for details). In Section 3.1, we study a general framework of fitness models and observe that a non-linear attachment rule based on degree and fitness would lead to highly dominant nodes and make it exceedingly difficult for new nodes to gain influence in the network. Experimental analysis provided in Section 4 supports our theoretical analysis that suggest multiplicative fitness models best explain the prolonged influence of leader nodes in certain real-world networks, while at the same time allowing new high fitness nodes to gain influence over time. However, we observe that multiplicative fitness models do not allow multiple influential nodes to exist at the same time. In Section 5, we give theoretical insights on how spatial growth models can allow a multiplicity of leaders to coexist, while Section 6 provides experimental justifications for the same.

Reproducibility: To encourage reproducible research, the codes are publicly available at https://github.com/mittalshravika/Network-Growth-Models.

2 Network growth models

Following our previous work [29], here we set up the notations and problem definition. Consider the following sequence of graphs $\{{\mathbb{G}}_{t},\ t=0,1,\ldots\}$ , where ${\mathbb{G}}_{t}=(V_{t},{\mathbb{E}}_{t})$ , with $V_{t}$ and ${\mathbb{E}}_{t}$ being the set of nodes and edges in ${\mathbb{G}}_{t}$ respectively. In a network growth model, we have $V_{t}\subset V_{t+1}$ and ${\mathbb{E}}_{t}\subseteq{\mathbb{E}}_{t+1}$ for every $t=0,1,\ldots$ . In other words, new nodes arrive at every time step $t$ , and form connections with existing nodes, thus adding to the edge set of the previous graph ${\mathbb{G}}_{t-1}$ . For purposes of simplicity, here we consider the basic model where a single node enters at any time step $t$ , and forms a connection with one node in the existing graph ${\mathbb{G}}_{t-1}$ . Therefore, we can label the incoming node by the time index of its entry to the network, which leads to $V_{t}=\{0,1,\ldots,t\}$ for $t=0,1,\ldots$ . Note that all our results can be easily extended to more general scenarios where multiple nodes can enter the network and incoming nodes can form multiple connections. At time $t$ , let the degree of the node $i$ in $V_{t}$ be denoted by $D_{t}(i)$ . Also, let the rv $S_{t+1}$ denote the node with which an incoming node $t+1$ connects.

Barabási-Albert (BA) model: In the preferential attachment mechanism [5], new nodes connect preferentially to existing nodes with higher degree. Let ${\bf p}^{BA}(t+1)=(p_{i}^{BA}(t+1),\ i\in V_{t})$ be the pmf with which the new node indexed as $t+1$ connects with the existing graph ${\mathbb{G}}_{t}$ , i.e., $p_{i}^{BA}(t+1)$ is the probability with which node $t+1$ connects with an existing node $i$ . This is given by:

p_{i}^{BA}(t+1)={{\bf P}}\left[{S_{t+1}=i\ |\ {\mathbb{G}}_{t}}\right]=\frac{D_{t}(i)}{\sum_{j\in V_{t}}D_{t}(j)},\ i\in V_{t}.

(1)

Note that we term node $i$ ’s visibility in the graph ${\mathbb{G}}_{t}$ by $p_{i}^{BA}(t+1)$ .

Fitness based attachment rules: In fitness based models [7, 14, 23], every node is assumed to have a fitness value independently drawn from a distribution, and new nodes connect preferentially on the basis of the fitness and degree values of the existing nodes. We describe multiple ways in which such an attachment could occur.

Assume a sequence of i.i.d. fitness rvs $(\xi,\xi_{i},\ i=0,1,\ldots)$ with $\xi_{i}$ denoting the fitness value of node $i$ . A generic fitness model can be described by the following attachment rule

p_{i}^{GF}(t+1)=\frac{g(\xi_{i},D_{t}(i))}{\sum_{j\in V_{t}}g(\xi_{j},D_{t}(j))},\ i\in V_{t}.

(2)

for an attachment function $g:\mathbb{R}\times\mathbb{R}\to\mathbb{R}$ which determines the relative importance of the fitness and degree values. In the additive fitness model, new nodes connect preferentially to existing nodes having a higher sum of degree and fitness value. For $t=0,1,\ldots$ , let the pmf delineating formation of new connections at time $t+1$ be given by ${\bf p}^{AF}(t+1)$ , where

p_{i}^{AF}(t+1)=\frac{\xi_{i}+D_{t}(i)}{\sum_{j\in V_{t}}(\xi_{j}+D_{t}(j))},\ i\in V_{t}.

(3)

Similarly, the attachment rule for multiplicative fitness (MF) model is given by

p_{i}^{MF}(t+1)=\frac{\xi_{i}\cdot D_{t}(i)}{\sum_{j\in V_{t}}\xi_{j}\cdot D_{t}(j)},\ i\in V_{t}.

(4)

Therefore, the visibilities of node $i$ in graph ${\mathbb{G}}_{t}$ are given by $p_{i}^{AF}(t+1)$ and $p_{i}^{MF}(t+1)$ for the additive and multiplicative fitness models, respectively. Note that the influential nodes as described in Section 1 relate to nodes having high visibility as defined for the particular network growth model in question.

Spatial attachment rules: In spatial models [32, 19, 15], every node is assumed to have a location vector drawn from a distribution over a location space $A$ . Assume a sequence of i.i.d. location rvs $(\chi,\ \chi_{i},\ i=0,1,\ldots)$ and i.i.d. fitness rvs $(\xi,\xi_{i},\ i=0,1,\ldots)$ with $\chi_{i}$ and $\xi_{i}$ denoting the location and fitness values of node $i$ respectively. A generic spatial attachment model can be given by the following attachment rule

p_{i}^{AT}(t+1)=\frac{h(\chi_{i},\chi_{t+1};\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi_{t+1};\xi_{j},D_{j}(t))}.

(5)

The attachment probability now depends on the location vector of the new node $\chi_{t}$ . This is different from the models described previously. Therefore, we could have multiple definitions of visibility. A global variant of visibility is given below

p_{i}^{\mbox{global}}(t+1)=\mathbb{E}_{\chi}\left[\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}\right],

while a local version is given as

p_{i}^{\mbox{local}}(t+1)=\frac{h(\chi_{i},\chi_{i};\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{j}(t))},

where $h:A\times A\times\mathbb{R}\times\mathbb{N}\to\mathbb{R}$ . We find it useful to consider the following separable form of attachment function

h(\chi_{1},\chi_{2};\xi_{1},D_{1})=\alpha(\chi_{1},\chi_{2})\beta(\xi_{1},D_{1}),\ \chi_{1},\chi_{2}\in A,

(6)

where $\alpha:A\times A\to\mathbb{R}$ and $\beta:\mathbb{R}\times\mathbb{N}\to\mathbb{R}$ . While the notion of global visibility models the overall attractivity of a node in the entire attribute space, the local visibility considers only its attractivity from the local neighborhood in the attribute space of a node. This model allows for nodes whose global attractivity is low, with their local attractivity being high.

3 Analytical results on node visibility – BA and Fitness models

In this section, we study and compare the evolution of visibility of a node over time for the BA model and two fitness models, namely the additive and multiplicative fitness models. The following lemma describes the change in visibility with time for the three growth models.

First, we introduce some notation: Define $\Xi_{t}=\sum_{i\in V_{t}}\xi_{i}$ and $\psi_{t}=\sum_{i\in V_{t}}\xi_{i}D_{t}(i)$ , for $t=0,1,\ldots$ .

Lemma 3.1

For every $t=0,1,\ldots,$ and $i$ in $V_{t-1}$ : Let ${\mathbb{G}}_{t-1}$ be the graph at time $t-1$ , we have

(i)

{\mathbb{E}}\left[{p_{i}^{BA}(t+1)-p_{i}^{BA}(t)\ |\ {\mathbb{G}}_{t-1}}\right]=-\frac{D_{t-1}(i)}{4t(t-1)},

(7)

(ii)

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{AF}(t+1)-p_{i}^{AF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\hskip 5.69054pt=-\frac{\left(\xi_{i}+D_{t-1}(i)\right)\left(\xi_{t}+1\right)}{\left(\Xi_{t-1}+2(t-1)\right)\left(\Xi_{t}+2t\right)},$		(8)

(iii)

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{MF}(t+1)-p_{i}^{MF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\hskip 5.69054pt\gtrsim\xi_{i}D_{t-1}(i)\frac{\sum_{j\neq i}\xi_{j}D_{t}(j)\left[\xi_{i}-\xi_{t}-\xi_{j}\right]}{\psi_{t-1}^{2}(\psi_{t-1}+\xi_{i}+\xi_{t})}.$		(9)

Proof. Fix $t=0,1,\ldots$ , and $i$ in $V_{t}$ .

Preferential Attachment model: The difference in the visibility of node $i$ in the BA model between time $t+1$ and $t$ is given as

	$\displaystyle p_{i}^{BA}(t+1)-p_{i}^{BA}(t)=\frac{D_{t}(i)}{2t}-\frac{D_{t-1}(i)}{2(t-1)}$
	$\displaystyle\hskip 5.69054pt=\frac{D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right]}{2t}-\frac{D_{t-1}(i)}{2(t-1)}$		(10)

The above follows by noting that the sum of degree rvs $D_{t}(i)$ for all the nodes in the vertex set $V_{t}$ equals $2t$ . Furthermore, by noting that when looking at the expected difference in visibility conditioned on the graph at time $t-1$ , $S_{t}$ is the only random variable in (10), we obtain

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{BA}(t+1)-p_{i}^{BA}(t)\ \|\ {\mathbb{G}}_{t-1}}\right]$
	$\displaystyle=\frac{D_{t-1}(i)+{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1}}\right]}{2t}-\frac{D_{t-1}(i)}{2(t-1)}$		(11)

and (7) follows.

Additive Fitness model: Similarly in the additive fitness model, the difference in the visibility of node $i$ can be written as

	$\displaystyle p_{i}^{AF}(t+1)-p_{i}^{AF}(t)$
	$\displaystyle=\frac{\xi_{i}+D_{t}(i)}{\sum_{j\in V_{t}}\xi_{j}+D_{t}(j)}-\frac{\xi_{i}+D_{t-1}(i)}{\sum_{j\in V_{t-1}}\xi_{j}+D_{t-1}(j)}$
	$\displaystyle=\frac{\xi_{i}+D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right]}{\Xi_{t-1}+\xi_{t}+2t}-\frac{\xi_{i}+D_{t-1}(i)}{\Xi_{t-1}+2(t-1)}.$		(12)

Taking expectation on both sides conditioned on ${\mathbb{G}}_{t-1}$ and $\xi_{t}$ leads to (8).

Multiplicative Fitness model: The difference in the visibility of node $i$ can be written for the multiplicative model as follows

	$\displaystyle p_{i}^{MF}(t+1)-p_{i}^{MF}(t)$
	$\displaystyle=\frac{\xi_{i}D_{t}(i)}{\sum_{j\in V_{t}}\xi_{j}D_{t}(j)}-\frac{\xi_{i}D_{t-1}(i)}{\sum_{j\in V_{t-1}}\xi_{j}D_{t-1}(j)}$
	$\displaystyle=\frac{\xi_{i}\left[D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right]\right]}{\psi_{t-1}+\xi_{S_{t}}+\xi_{t}}-\frac{\xi_{i}D_{t-1}(i)}{\psi_{t-1}}$		(13)

Furthermore, we lower bound the expected change in visibility as follows

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{MF}(t+1)-p_{i}^{MF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle=\xi_{i}\left[\frac{D_{t-1}(i)+1}{\psi_{t-1}+\xi_{i}+\xi_{t}}-\frac{D_{t-1}(i)}{\psi_{t-1}}\right]{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\hskip 5.69054pt+\xi_{i}\sum_{\ell\neq i}\left[\frac{D_{t-1}(i)}{\psi_{t-1}+\xi_{\ell}+\xi_{t}}-\frac{D_{t-1}(i)}{\psi_{t-1}}\right]{{\bf P}}\left[{S_{t}=\ell\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\approx\xi_{i}\Bigg{[}\frac{\psi_{t-1}{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]}{\psi_{t-1}\left(\psi_{t-1}+\xi_{i}+\xi_{t}\right)}-\frac{D_{t-1}(i)}{\psi_{t-1}}$
	$\displaystyle\hskip 11.38109pt\times\Bigg{[}\sum_{\ell\neq i}{{\bf P}}\left[{S_{t}=\ell\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\cdot\frac{\xi_{\ell}+\xi_{t}}{\psi_{t-1}+\xi_{\ell}+\xi_{t}}\Bigg{]}\Bigg{]}$
	$\displaystyle\geq\xi_{i}\Bigg{[}\frac{\xi_{i}D_{t-1}(i)}{\psi_{t-1}\left(\psi_{t-1}+\xi_{i}+\xi_{t}\right)}$
	$\displaystyle\hskip 11.38109pt-\frac{D_{t-1}(i)}{\psi_{t-1}}\cdot\frac{\sum_{\ell\neq i}\xi_{\ell}D_{t-1}(\ell)(\xi_{\ell}+\xi_{t})}{\psi_{t-1}^{2}}\Bigg{]}$
	$\displaystyle\simeq\xi_{i}D_{t-1}(i)\left[\frac{\xi_{i}\psi_{t-1}-\sum_{\ell\neq i}\xi_{\ell}D_{t-1}(\ell)(\xi_{\ell}+\xi_{t})}{\psi_{t-1}^{2}(\psi_{t-1}+\xi_{i}+\xi_{t})}\right]$

and the result follows.

We observe from (10) that a node’s visibility increases if it forms a new edge connection in the BA model. However, it is also evident from Lemma 3.1 that the visibility of the node decreases in expectation, in a manner that is directly proportional to its degree $D_{t-1}(i)$ . This can be understood from the fact that higher degree nodes in the network have higher visibility values as a result of which, their decrease in visibility would be more as compared to nodes that have lower visibility values. In the additive fitness model, we can infer from (12) that a node’s visibility increases when it forms a new edge, provided $\Xi_{t-1}+2(t-1)>(\xi_{t}+2)[\xi_{i}+D_{t-1}(i)]$ . This condition is expected to hold for large values of $t$ , unless the fitness value $\xi_{t}$ , or $\xi_{i}$ , or both, are very large. Similar to the BA model, node visibility decreases in expectation, with the magnitude of decrease being directly proportional to the sum of degree and fitness values, and the fitness value of the new incoming node $\xi_{t}$ .

In contrast with the above, we can see from (9) that in expectation, the nodes are able to increase their visibility over time, given that their fitness value remains large with respect to the network. In addition to this, the expected change in visibility directly depends on the product of fitness and the present degree of the node, boosting the visibility of a leader much more as compared to the BA and additive fitness models. Note that we derive results for change in visibility values over a single time step. The results can be easily generalized to any fixed number of time steps.

3.1 Node visibility over time – General fitness model

We study the change in node visibility over time for a general fitness model described in (2). For ease of notation, we define $\Gamma_{t}=\sum_{i\in V_{t}}g(\xi_{i},D_{t}(i))$ and $\Delta_{t,i}^{g}=g(\xi_{i},D_{t-1}(i)+1)-g(\xi_{i},D_{t-1}(i))$ for $i$ in $V_{t}$ and $t=1,2,\ldots$ .

Lemma 3.2

For every $t=0,1,\ldots,$ and $i$ in $V_{t-1}$ : Let ${\mathbb{G}}_{t-1}$ be the graph at time $t-1$ , we have

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{GF}(t+1)-p_{i}^{GF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\simeq\left(\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}^{2}}\right)\Bigg{[}\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}}\cdot\left(\Delta_{t,i}^{g}-g(\xi_{t},1)\right)$
	$\displaystyle\hskip 5.69054pt+\sum_{k\neq i}\left(\frac{g(\xi_{k},D_{t-1}(k))}{\Gamma_{t-1}}\right)\left(\Delta_{t,i}^{g}-\Delta_{t,k}^{g}-g(\xi_{t},1)\right)\Bigg{]}.$		(14)

Proof.

The difference in the visibility of node $i$ can be written as follows

	$\displaystyle p_{i}^{GF}(t+1)-p_{i}^{GF}(t)$
	$\displaystyle=\mathbb{E}\Bigg{[}\frac{g(\xi_{i},D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right])}{\sum_{j\in V_{t}}g(\xi_{j},D_{t-1}(j)+{\bf 1}\left[S_{t}=j\right])}$
	$\displaystyle\hskip 14.22636pt-\frac{g(\xi_{i},D_{t-1}(i))}{\sum_{j\in V_{t-1}}g(\xi_{j},D_{t-1}(j))}\Bigg{]}$		(15)

We further introduce the following notation for $k$ in $V_{t}$ , $\Omega_{t,k}={\bf 1}\left[S_{t}=k\right]$ .

	$\displaystyle p_{i}^{GF}(t+1)-p_{i}^{GF}(t)=$
	$\displaystyle=\Omega_{t,i}\left[\frac{g(\xi_{i},D_{t-1}(i)+1)}{\Gamma_{t-1}+\Delta_{t,i}^{g}+g(\xi_{t},1)}-\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}}\right]$
	$\displaystyle+\sum_{k\neq i}\Omega_{t,k}\left[\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}+\Delta_{t,k}^{g}+g(\xi_{t},1)}-\frac{g(\xi_{i},D_{t-1}(i))}{\Gamma_{t-1}}\right]$

Furthermore, we introduce the following shorthand, $\hat{g}_{t,i}=g(\xi_{i},D_{t-1}(i))$ and continue from above.

	$\displaystyle p_{i}^{GF}(t+1)-p_{i}^{GF}(t)$
	$\displaystyle=\Omega_{t,i}\ \left[\frac{\sum_{\begin{subarray}{c}j\neq i\\ j\in V_{t-1}\end{subarray}}\hat{g}_{t,j}\Delta_{t,i}^{g}-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}\left(\Gamma_{t-1}+\Delta_{t,i}^{g}+g(\xi_{t},1)\right)}\right]$
	$\displaystyle+\sum_{k\neq i}\Omega_{t,k}\left[\frac{-\hat{g}_{t,i}\Delta_{t,k}^{g}-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}\left(\Gamma_{t-1}+\Delta_{t,k}^{g}+g(\xi_{t},1)\right)}\right]$		(16)

Using expression (16), we approximate the expected change in visibility for sufficiently large values of $t$ , as follows

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{GF}(t+1)-p_{i}^{GF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\simeq{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\left[\frac{\left(\sum_{\begin{subarray}{c}j\neq i\\ j\in V_{t-1}\end{subarray}}\hat{g}_{t,j}\Delta_{t,i}^{g}\right)-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma^{2}_{t-1}}\right]$
	$\displaystyle-\sum_{k\neq i}{{\bf P}}\left[{S_{t}=k\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\left[\frac{\hat{g}_{t,i}\Delta_{t,k}^{g}}{\Gamma^{2}_{t-1}}+\frac{g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right]$		(17)

on substituting the expressions for $\left\{{{\bf P}}\left[{S_{t}=\ell\ |\ {\mathbb{G}}_{t-1},\xi_{t}}\right],\ \ell\in V_{t-1}\right\}$ , we obtain

	$\displaystyle\simeq\frac{\hat{g}_{t,i}\Delta^{g}_{t,i}}{\Gamma_{t-1}^{2}}-\hat{g}_{t,i}\sum_{k\neq i}\frac{\hat{g}_{t,k}}{\Gamma_{t-1}}\frac{\Delta^{g}_{t,k}}{\Gamma_{t-1}^{2}}-\frac{g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}$
	$\displaystyle=\left(\frac{\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right)\left[\Delta_{t,i}^{g}-\sum_{k\neq i}\frac{\hat{g}_{t,k}}{\Gamma_{t-1}}\Delta_{t,k}^{g}-g(\xi_{t},1)\right]$
	$\displaystyle=\left(\frac{\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right)\Bigg{[}\left(\frac{\hat{g}_{t,i}}{\Gamma_{t-1}}\right)\left(\Delta_{t,i}^{g}-g(\xi_{t},1)\right)$
	$\displaystyle\hskip 5.69054pt+\sum_{k\neq i}\left(\frac{\hat{g}_{t,k}}{\Gamma_{t-1}}\right)\left(\Delta_{t,i}^{g}-\Delta_{t,k}^{g}-g(\xi_{t},1)\right)\Bigg{]}.$

The expected change in visibility will be positive if

\Delta_{t,i}^{g}\geq\Delta_{t,k}^{g},\ k\neq i,k\in V_{t-1}

and $\Delta_{t,i}^{g}\geq g(\xi_{t},1)$ . While this is a sufficient condition, the expected change in visibility will be positive if node $i$ has significantly large visibility in the network. Observe that for the BA and additive fitness models, $\Delta_{t,i}^{g}=1$ . While for the BA model, the approximation is too crude, we obtain a decrease in expected visibility for the additive model from (14). For the MF model,

\Delta_{t,i}^{g}=\xi_{i}(D_{t-1}(i)+1)-\xi_{i}D_{t-1}(i)=\xi_{i}.

Therefore, $\Delta_{t,i}^{g}$ will be greater for nodes with higher fitness values. However, an incoming node with a high fitness value can lead to a decrease in the expected visibility of an influential node. This agrees with our findings in Lemma 3.1. For attachment functions which combine the fitness and degree information in a nonlinear fashion, influential nodes can retain greater visibility for longer periods of time while being protected from decrease in visibility due to new nodes with high fitness values. For example, for nonlinear attachment rules like $g(\xi_{i},D_{t}(i))=\left(\xi_{i}D_{t}(i)\right)^{2}$ , $\Delta^{g}_{t,i}=\xi_{i}^{2}\left(2D_{t}(i)+1\right)$ , i.e., it is a function of both the fitness and degree values. In this scenario, new nodes with high fitness values cannot lead to a decrease in expected visibility for influential nodes because of their low initial degree. However, nodes with low degree and high fitness values could experience a decrease in their visibility because only nodes with significantly large fitness and degree values can increase their visibility in an expected sense, which is not the case for the multiplicative model. This would lead to a great difficulty for new nodes with high fitness values to attain visibility in the network. Furthermore, it will become progressively more difficult for new nodes to become visible in the network.

4 Experimental results on node visibility – Fitness models

In order to illustrate Lemmas 3.1 and 3.2 and depict the change in visibility of influential nodes, we carry out two sets of simulation experiments. We compare how the visibility of leaders or influential nodes change over time in the Barabasi-Albert (BA), additive fitness (AF), multiplicative fitness (MF) and general fitness (GF) models. Throughout, the fitness variable $\xi$ is taken to be Pareto distributed with parameter $\alpha_{p}$ .

For each given growth model and parameter value of the Pareto distribution, we generate a graph ${\mathbb{G}}_{T_{0}}^{X}$ , where $X\in\{BA,AF,MF,GF\}$ and $T_{0}=10000$ . We define $p_{(k)}(T_{0};{\mathbb{G}}_{t}^{X})$ to be the visibility of the node in graph ${\mathbb{G}}_{t}^{X}$ which had the $k$ -highest visibility in graph ${\mathbb{G}}_{T_{0}}^{X}$ . For each growth model, starting from ${\mathbb{G}}_{T_{0}}^{X}$ , we generate $R$ realizations ${\mathbb{G}}_{T}^{X,(1)},{\mathbb{G}}_{T}^{X,(2)},\ldots{\mathbb{G}}_{T}^{X,(R)}$ with $T=100000$ , which are mutually independent conditioned on ${\mathbb{G}}_{T_{0}}^{X}$ . Since we are interested in the evolution of the visibility of influential nodes, for the purpose of this experiment we track the visibility of the top 50 nodes starting from $t=10000$ to $t=100000$ . However, conditioned on the graph at time $T_{0}$ , the visibility values are random variables; therefore, we average the visibility values across all the realizations at any given time. We define $\bar{p}_{(k)}(T_{0};t;X)=\frac{1}{R}\sum_{r=1}^{R}p_{(k)}(T_{0};{\mathbb{G}}_{t}^{X,(r)})$ as the averaged visibility of the node at time $t$ ( $t>T_{0}$ ) which had the k-highest visibility at time $T_{0}$ for growth model $X$ .

In other words, we track $p_{(k)}\left(T_{0};{\mathbb{G}}_{t}^{X,(r)}\right)$ for $k=1,2,\ldots,50$ , $r=1,2,\ldots,R$ , $t=10000,10001,\ldots,100000$ , and $X\in\{BA,AF,MF,GF\}$ . For large enough independent runs $R$ , we expect $\bar{p}_{(k)}(T_{0};t;X)$ to be a reasonable approximation of ${\mathbb{E}}\left[{p_{(k)}\left(T_{0};{\mathbb{G}}_{t}^{X,(1)}\right)\big{|}{\mathbb{G}}_{T_{0}}^{X}}\right]$ , which is the expected value of visibility at time $t$ for the node which had the $k$ -highest visibility value at time $T_{0}$ conditioned on the graph at time $T_{0}$ .

Figures 1 and 2 show the change in visibility of top 50 nodes at $t=T_{0}=10000$ when the graph is allowed to grow for 90000 iterations until $t=100000$ . Visibility values averaged over $R=50$ independent runs from $T_{0}=10000$ are shown. We observe from the box plots in the two figures that the highest visibility nodes in the BA and AF slowly reduce in their visibility values over time as was predicted by Lemma 3.1. We also observe that lot more nodes in BA and AF models have significantly higher visibility values. However, for the multiplicative fitness model only 2 nodes exhibit high visibility values (for $\alpha_{p}=1,2$ ), with one node dominating the entire network at any point of time for most of the duration. In the MF model, for $\alpha_{p}=1,2$ , we observe that a node with lower visibility replaces one with higher visibility between $t=T_{0}=10000$ and $t=100000$ , because the lower visibility node joined the network later but with a significantly higher fitness value. For $\alpha_{p}=3$ , we do not see this behavior because it is less likely that a node with a significantly high fitness value will enter the network.

While we observe that in both the MF and GF models, nodes with high visibility are able to maintain their influence (or, visibility) over time, we next investigate how easy it is for new nodes with high fitness to gain influence over time. For this purpose, we conduct an experiment where we introduce a node at time $t=T_{0}+1=10001$ , with fitness value $\xi_{T_{0}+1}=2\max_{t\in V_{T_{0}}}\xi_{t}$ , i.e., twice the fitness value of the maximum fitness value among all nodes until time $T_{0}$ . For growth model $X\in\{AF,MF,GF\}$ , we generate $R=50$ realizations beyond time $T_{0}$ by setting the fitness value of node $T_{0}+1$ as mentioned above. We define $p_{i}\left({\mathbb{G}}_{t}^{X}\right)$ to be the visibility of the node $i$ in graph ${\mathbb{G}}_{t}^{X}$ . We average the visibility values across $R$ realizations, ${\mathbb{G}}_{T}^{X,(1)},{\mathbb{G}}_{T}^{X,(2)},\ldots,{\mathbb{G}}_{T}^{X,(R)}$ , and compute the averaged visibility of node $T_{0}+1$ defined as $\bar{p}_{T_{0}+1}(t)=\frac{1}{R}\sum_{r=1}^{R}\left({\mathbb{G}}_{t}^{X,(r)}\right)$ , $t>T_{0}$ . Subsequently, we track the visibility of this newly introduced node in the three fitness models and present the results in Figure 3. We observe that in the AF and GF models, the visibility of the newly introduced node decreases with time. This concurs with Lemma 3.1 where we showed that the visibility of nodes in AF models decreases with time; and with Lemma 3.2 where we argue that in the general fitness model with a nonlinear attachment rule, it becomes progressively difficult for new nodes to get visible in the network. For the MF model with $\alpha_{p}=1$ , we observe that $\bar{p}_{T_{0}+1}(t)$ increases slightly but decays beyond $t=30000$ . This is because nodes with even higher fitness values enter the network after node $T_{0}+1$ . However, for $\alpha_{p}=2,3$ the visibility of node $T_{0}+1$ keeps increasing until $t=100000$ . Note that for $\alpha_{p}=3$ , node $T_{0}+1$ becomes dominant in the network very quickly because as the node increases its degree, the degree-fitness product becomes large compared to that of other nodes in the network because having a large fitness node is a rarer event for larger value of $\alpha_{p}$ .

From the experiments we reaffirm that multiplicative models allow high visibility nodes to maintain their influence in the network for a longer period of time, while at the same time allowing high fitness nodes that are introduced later in the network to become influential. However, we observe that only a few number of nodes can be influential in the network at any given moment of time. This leads us to consider spatial attachment rules in conjunction with the multiplicative model to enforce regions of influence for each individual node such that multiple influential nodes can coexist in a network at the same time.

Refer to caption — Figure 1: Visibility of nodes (averaged over 50 independent runs) over time (after $T=100000$ iterations) in the BA model.

5 Analytical results on node visibility – Spatial models

In the previous sections, we investigated the node visibility dynamics of the fitness models. A major takeaway was that multiplicative fitness models allow influential nodes to maintain their visibility while still permitting newly introduced nodes with high fitness to gain influence over time. However, a shortcoming of the MF model was that it could not support a multiplicity of influential nodes. We investigate whether a configuration of spatial models exists that retains the positive aspects of the MF model while addressing this shortcoming.

5.1 Preliminaries - Results on various notions of visibility in spatial models

Having defined global and local visibilities for a node in Section 2, we find it helpful to define a notion of maximum visibility

p_{i}^{\mbox{max}}(t+1)=\max_{\chi}\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))},

(18)

with $\chi_{t,i}^{*}$ being the location vector for which the maximum is attained. For rest of the analysis, we assume that the attachment function $h$ is separable into a product form of $\alpha(\chi_{1},\chi_{2})\cdot\beta(\xi,D)$ as shown in (6). For analytical purposes, we set $\alpha(\chi_{1},\chi_{2})=e^{-\gamma d(\chi_{1},\chi_{2})}$ , where $d:A\times A\to\mathbb{R}$ is a metric on the location space $A$ .

Lemma 5.1

For $\alpha(\chi_{1},\chi_{2})=e^{-\gamma d(\chi_{1},\chi_{2})}$ and every $t=1,2,\ldots,$

\lim_{\gamma\to\infty}p_{i}^{\mbox{local}}(t+1)=1=\lim_{\gamma\to\infty}p_{i}^{\mbox{max}}(t+1)

(19)

and

\lim_{\gamma\to\infty}\chi_{t,i}^{*}=\chi_{i}.

(20)

In other words, for all $\delta>0$ and for all $t=1,2,\ldots$ , there exists $\gamma_{t,\delta}$ such that for all $\gamma\geq\gamma_{t,\delta}$

\left|p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{max}}(t+1)\right|<\delta

and

\left|\chi_{i}-\chi_{t,i}^{*}\right|<\delta.

Proof. For $i$ in $V_{t}$ and $t=1,2,3,\ldots$

	$\displaystyle p_{i}^{\mbox{local}}(t+1)$
	$\displaystyle=\frac{\beta(\xi_{i},D_{i}(t))}{\beta(\xi_{i},D_{i}(t))+\sum_{j\neq i}e^{-\gamma d(\chi_{i},\chi_{j})}\beta(\xi_{j},D_{j}(t))}$
	$\displaystyle\geq\frac{\beta(\xi_{i},D_{i}(t))}{\beta(\xi_{i},D_{i}(t))+\left(t\max_{k\in V_{t}}\xi_{k}\right)\sum_{j\neq i}e^{-t^{\epsilon}d(\chi_{i},\chi_{j})}}$
	$\displaystyle\xrightarrow{\gamma=t^{\epsilon}\to\infty}1$

and we obtain first part of (19). Note that the scaling $\gamma=t^{\epsilon}$ implies that for larger graphs, a smaller $\gamma$ would be necessary. Second part is obtained by noting that $p_{i}^{\mbox{max}}(t+1)\geq p_{i}^{\mbox{local}}(t+1)$ for every node $i$ and time $t$ . Equation (20) also follows similarly.

Lemma 5.1 suggests that $p_{i}^{\mbox{max}}$ is a good approximation for $p_{i}^{\mbox{local}}$ when $\gamma$ is sufficiently large. Next, we derive relationships between the global and local notions of visibility.

Lemma 5.2

For $\epsilon>0$ , we have

	$\displaystyle e^{-2\gamma\epsilon}{{\bf P}}\left[{d(\chi,\chi_{i})<\epsilon}\right]p_{i}^{\mbox{local}}(t+1)\leq p_{i}^{\mbox{global}}(t+1)$
	$\displaystyle\leq p_{i}^{\mbox{max}}(t+1)\approx p_{i}^{\mbox{local}}(t+1)$		(21)

Proof. For convenience, we use the shorthand notation, $\beta_{i}(t)=\beta(\xi_{i},D_{i}(t))$ for $i$ in $V_{t}$ and $t=1,2,\ldots$ . The upper bound on $p_{i}^{\mbox{global}}(t+1)$ follows from the definition of $p_{i}^{\mbox{max}}(t+1)$ . To obtain the lower bound, for a fixed $\epsilon>0$ we condition on the event ${\bf 1}\left[d(\chi_{i},\chi)<\epsilon\right]$

	$\displaystyle p_{i}^{\mbox{global}}(t+1)$
	$\displaystyle\geq{{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]$
	$\displaystyle\hskip 5.69054pt\times\mathbb{E}_{\chi}\left[\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}{\bf 1}\left[d(\chi_{i},\chi)<\epsilon\right]\right]$
	$\displaystyle\geq{{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]\cdot\min_{\chi:d(\chi,\chi_{i})<\epsilon}\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}$
	$\displaystyle={{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]$
	$\displaystyle\times\min_{\chi:d(\chi,\chi_{i})<\epsilon}\frac{e^{-\gamma d(\chi,\chi_{i})}\beta_{i}(t)}{e^{-\gamma d(\chi,\chi_{i})}\beta_{i}(t)+\sum_{j\neq i}e^{-\gamma d(\chi,\chi_{j})}\beta_{j}(t)}$
	$\displaystyle\geq{{\bf P}}\left[{d(\chi_{i},\chi)<\epsilon}\right]\cdot\frac{e^{-2\gamma\epsilon}\beta_{i}(t)}{\beta_{i}(t)+\sum_{j\neq i}e^{-\gamma d(\chi_{i},\chi_{j})}\beta_{j}(t)}$
	$\displaystyle=e^{-2\gamma\epsilon}{{\bf P}}\left[{d(\chi,\chi_{i})<\epsilon}\right]p_{i}^{\mbox{local}}(t+1)$		(22)

where the penultimate step follows from triangle inequality.

Lemma 5.2 gives a lower and upper bound for the global visibility in terms of the local visibility. As argued previously, since the model is more concerned with local attractivity of nodes, we will present analysis for $p_{i}^{\mbox{local}}$ . Lemma 5.2 suggests that changes in the local visibility should also be aptly reflected in the global visibility.

5.2 Node visibility over time

For convenience, we define some shorthand notation: For $i$ in $V_{t}$ , $\beta_{i}(t)=\beta(\xi_{i},D_{i}(t))$ . For $i,j$ in $V_{t}$ , $\hat{h}_{t,i\to j}=h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i))$ , $\hat{h}_{t,i\to j}^{+}=h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i)+1)$ represent the attachment function for node $i$ at the location of node $j$ , and $\Delta^{h}_{t,i\to j}=h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i)+1)-h(\chi_{i},\chi_{j};\xi_{i},D_{t-1}(i))$ . For $k$ in $V_{t}$ , $\Omega_{t,k}={\bf 1}\left[S_{t}=k\right]$ and $\Gamma_{t,k}=\sum_{i\in V_{t}}\hat{h}_{t,i\to k}$ .

Lemma 5.3

For every $t=0,1,\ldots,$ and $i$ in $V_{t-1}$ : Let ${\mathbb{G}}_{t-1}$ be the graph at time $t-1$ , we have

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t},\chi_{t}}\right]$
	$\displaystyle\lesssim C_{1}\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}\big{\{}\xi_{i}-e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-\xi_{t}\big{\}}$		(23)

and

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t},\chi_{t}}\right]$
	$\displaystyle\gtrsim C_{2}\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}$
	$\displaystyle\hskip 56.9055pt\times\big{\{}\xi_{i}-e^{2\epsilon\gamma}e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-e^{3\epsilon\gamma}\xi_{t}\big{\}}$		(24)

with

C_{1}=\frac{1}{\left(\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j))^{3}\right)}\beta_{i}(t-1)

and

	$\displaystyle C_{2}$	$\displaystyle={{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]e^{-\epsilon\gamma}$
		$\displaystyle\times\frac{1}{\left(\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j))^{3}\right)}\beta_{i}(t-1).$

Proof. The difference in the local visibility of node $i$ can be written as follows

	$\displaystyle p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)$
	$\displaystyle=\frac{h(\chi_{i},\chi_{i};\xi_{i},D_{t-1}(i)+{\bf 1}\left[S_{t}=i\right])}{\sum_{j\in V_{t}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j)+{\bf 1}\left[S_{t}=j\right])}$
	$\displaystyle\hskip 5.69054pt-\frac{h(\chi_{i},\chi_{i};\xi_{i},D_{t-1}(i))}{\sum_{j\in V_{t-1}}h(\chi_{j},\chi_{i};\xi_{j},D_{t-1}(j))}$
	$\displaystyle=\Omega_{t,i}\left[\frac{\hat{h}^{+}_{t,i\to i}}{\Delta^{h}_{t,i\to i}+\Gamma_{t,i}+h(\chi_{t},\chi_{i};\xi_{t},1)}-\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\right]$
	$\displaystyle+\sum_{k\neq i}\Omega_{t,k}\left[\frac{\hat{h}_{t,i\to i}}{\Delta^{h}_{t,k\to i}+\Gamma_{t,i}+h(\chi_{t},\chi_{i};\xi_{t},1)}-\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\right]$
	$\displaystyle\approx\Omega_{t,i}\left[\frac{\Delta^{h}_{t,i\to i}}{\Gamma_{t,i}}\right]$
	$\displaystyle\hskip 5.69054pt-\sum_{k\neq i}\Omega_{t,k}\Bigg{[}\frac{\hat{h}_{t,i\to i}\Delta^{h}_{t,k\to i}}{\Gamma_{t,i}^{2}}+\frac{h(\chi_{t},\chi_{i};\xi_{t},1)\hat{h}_{t,i\to i}}{\Gamma_{t,i}^{2}}\Bigg{]}$		(25)

Using (25), we upper bound the expected change in local visibility by noting the fact that the expected increase in visibility is the most when the new node has the highest probability to form connection with node $i$ , which occurs when the attribute of the new node is close to $\chi_{i}$

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \|{\mathbb{G}}_{t},\xi_{t},\chi_{t}}\right]$
	$\displaystyle\leq{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \|{\mathbb{G}}_{t},\xi_{t},\chi_{t}=\chi_{i}}\right]$
	$\displaystyle\leq\frac{1}{\Gamma_{t,i}}\Bigg{[}\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\Delta^{h}_{t,i\to i}-\sum_{k\neq i}\left(\frac{\hat{h}_{t,k\to i}}{\Gamma_{t,i}}\right)\Bigg{(}\frac{\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\Delta^{h}_{t,k\to i}$
	$\displaystyle\hskip 5.69054pt-\frac{h(\chi_{t},\chi_{i};\xi_{t},1)\hat{h}_{t,i\to i}}{\Gamma_{t,i}}\Bigg{)}\Bigg{]}$
	$\displaystyle\approx\frac{1}{\Gamma_{t,i}^{3}}\sum_{k\in V_{t}}\Big{\{}\hat{h}_{t,i\to i}\hat{h}_{t,k\to i}\Delta_{t,i\to i}^{h}-\hat{h}_{t,i\to i}\hat{h}_{t,k\to i}\Delta_{t,k\to i}^{h}$
	$\displaystyle\hskip 5.69054pt-\hat{h}_{t,k\to i}\hat{h}_{t,i\to i}h(\chi_{i},\chi_{i};\xi_{t},1)\Big{\}}$
	$\displaystyle\approx\frac{1}{\Gamma_{t,i}^{3}}\beta(\xi_{i},D_{t-1}(i))\sum_{k\in V_{t}}\beta(\xi_{k},D_{t-1}(k))e^{-\gamma d(\chi_{i},\chi_{k})}\times$
	$\displaystyle\hskip 56.9055pt\big{\{}\Delta^{h}_{t,i\to i}-\Delta^{h}_{t,k\to i}-\beta(\xi_{t},1)\big{\}}$		(26)

Observe that $\Delta^{h}_{t,i\to i}=\beta(\xi_{i},D_{t-1}(i)+1)-\beta(\xi_{i},D_{t-1}(i))$ which equals $\xi_{i}$ in a multiplicative $\alpha$ model. Similarly, $\Delta^{h}_{t,k\to i}$ equals $e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}$ . Therefore, in a multiplicative model, (26) reduces to

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \|\ {\mathbb{G}}_{t},\xi_{t},\chi_{t}}\right]\lesssim\frac{1}{\Gamma_{t,i}^{3}}\beta_{i}(t-1)$
	$\displaystyle\hskip 5.69054pt\times\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}\big{\{}\xi_{i}-e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-\xi_{t}\big{\}}$		(27)

which gives the upper bound (24). To lower bound the same, we define the following notation – For $\epsilon>0$ ,

\chi_{i,\min}^{\epsilon}=\arg\min_{\chi:d(\chi,\chi_{i})<\epsilon}\frac{h(\chi_{i},\chi;\xi_{i},D_{i}(t))}{\sum_{j\in V_{t}}h(\chi_{j},\chi;\xi_{j},D_{j}(t))}.

In other words, $\chi_{i,\min}^{\epsilon}$ is the attribute vector on the $\epsilon-$ ball around $\chi_{i}$ where the attachment probability to $i$ is the lowest.

Using (25), we lower bound the expected change in local visibility by noting that conditioned on the event that the new node has an attribute vector within an $\epsilon-$ ball around $\chi_{i}$ , it will be minimum when it is equal to $\chi_{i,\min}^{\epsilon}$ . Accordingly, we define the following notation: $\hat{h}^{\min}_{t,k\to i}=h(\chi_{k},\chi_{i,min}^{\epsilon};\xi_{k},D_{t-1}(k))$ . We lower bound using conditioning arguments

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \|{\mathbb{G}}_{t},\xi_{t},\chi_{t}}\right]$
	$\displaystyle\geq{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \big{\|}\ {\mathbb{G}}_{t},\xi_{t},d(\chi_{t},\chi)<\epsilon}\right]$
	$\displaystyle\hskip 5.69054pt\times{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]$
	$\displaystyle\geq{\mathbb{E}}\left[{p_{i}^{\mbox{local}}(t+1)-p_{i}^{\mbox{local}}(t)\ \big{\|}\ {\mathbb{G}}_{t},\xi_{t},\chi_{t}=\chi_{\min}^{\epsilon}}\right]$
	$\displaystyle\hskip 5.69054pt\times{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]$
	$\displaystyle\gtrsim{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]\cdot\frac{1}{\Gamma_{t,i}^{3}}\sum_{k\in V_{t}}\Big{\{}\hat{h}^{\min}_{t,i\to i}\hat{h}_{t,k\to i}\Delta^{h}_{t,i\to i}$
	$\displaystyle-\hat{h}_{t,i\to i}\hat{h}^{\min}_{t,k\to i}\Delta^{h}_{t,k\to i}-\hat{h}^{\min}_{t,k\to i}\hat{h}_{t,i\to i}h(\chi_{i},\chi_{i,\min}^{\epsilon};\xi_{t},1)\Big{\}}$
	$\displaystyle\gtrsim{{\bf P}}\left[{d(\chi_{t},\chi)<\epsilon}\right]\frac{1}{\Gamma_{t,i}^{3}}\beta(\xi_{i},D_{t-1}(i))$
	$\displaystyle\times\sum_{k\in V_{t}}\beta_{k}(t-1)e^{-\gamma d(\chi_{i},\chi_{k})}$
	$\displaystyle\times\big{\{}e^{-\epsilon\gamma}\xi_{i}-e^{\epsilon\gamma}e^{-\gamma d(\chi_{i},\chi_{k})}\xi_{k}-e^{2\epsilon\gamma}\xi_{t}\big{\}}$		(28)

where the final step follows by applying triangle inequality. From the lower bound (23) it is clear that the visibility will decrease when nodes close to $i$ have high fitness values or the new node has a very high fitness value. The upper bound (24) indicates that if the fitness of node $i$ is sufficiently high compared to other nodes in its local neighborhood, then its visibility should increase. This shows how the local neighborhood of particular node impacts its visibility. Also, given the local nature of the behavior of visibility, more nodes end up being visible in the spatial model compared to the multiplicative fitness model.

6 Experimental results on node visibilities – Spatial models

To illustrate the findings of Section 5.2, we perform experiments discussed in this section. In the previous section, we theoretically argued how the spatial (S) model with multiplicative $\beta$ would lead to multiple leaders coexisting in the network. Here we present experimental results that show the multiplicity of leaders that can coexist in the network and how this varies with the decay parameter $\gamma$ . Throughout, the fitness variable $\xi$ is taken to be Pareto distributed with parameter $\alpha_{p}$ .

For each $\alpha_{p}=1,2,3$ and $\gamma=5,10,50$ , we generate a graph ${\mathbb{G}}_{T_{0}}^{S}$ as an instantiation of the spatial model, with $T_{0}=10000$ . We denote $p^{\mbox{local}}_{(k)}(T_{0};{\mathbb{G}}_{t}^{S})$ to be the local visibility of the node in graph ${\mathbb{G}}_{t}^{S}$ which had the $k$ -highest local visibility in graph ${\mathbb{G}}_{T_{0}}^{S}$ . As previously, we generate $R$ realizations ${\mathbb{G}}_{T}^{S,(1)},{\mathbb{G}}_{T}^{S,(2)},\ldots{\mathbb{G}}_{T}^{S,(R)}$ with $T=100000$ , which are mutually independent conditioned on ${\mathbb{G}}_{T_{0}}^{S}$ . We average the local visibility values across all the realizations at any given time and denote $\bar{p}^{\mbox{local}}_{(k)}(T_{0};t;S)=\frac{1}{R}\sum_{r=1}^{R}p^{\mbox{local}}_{(k)}(T_{0};{\mathbb{G}}_{t}^{S,(r)})$ as the averaged local visibility of the node at time $t$ which had the k-highest visibility at time $T_{0}$ for the spatial model.

In other words, we track $p^{\mbox{local}}_{(k)}\left(T_{0};{\mathbb{G}}_{t}^{S,(r)}\right)$ for nodes $k=1,5,10,30,50,100,200$ , runs $r=1,2,\ldots,R$ , and decay parameters $\gamma=5,10,50$ , with $10000\leq t\leq 100000$ .

Figure 4 shows the change in local visibility of top $k^{\text{th}}$ nodes at $t=T_{0}=10000$ when the graph is allowed to grow for 90000 iterations until $t=100000$ . Visibility values averaged over $R=50$ independent runs from $T_{0}=10000$ are shown. We observe that with increasing $\gamma$ , the number of nodes with high local visibility increases in the network. This corroborates the insight that with increasing $\gamma$ , the region of influence of nodes decreases leading to the potential of larger number of influential nodes in the network. For $\gamma=5$ and $\alpha_{p}=1$ , we see that the $k=5^{\text{th}}$ node increases slightly in visibility beyond $t=10000$ but then decays beyond a certain point; and with $\alpha_{p}=2$ , the same node maintains its visibility until $t=100000$ , while for $\alpha_{p}=3$ the node increases its local visibility and dominates its region eventually. However for $k=10,30,...$ , the corresponding nodes have very low values of local visibility. This changes for the $\gamma=10$ case. Here, the $k=10^{\text{th}}$ node also shows a high value of local visibility due to the reduced region of influence of nearby influential nodes. This becomes even more pronounced in $\gamma=50$ where we observe that even the $k=200^{\text{th}}$ node has non-trivial local visibility values that are maintained over some period of time for $\alpha_{p}=2,3$ . This shows that with increasing the decay parameter of the spatial model we can significantly increase the number of leaders in the network, many of whom can maintain their influence in their region.

7 Conclusion

In this paper, we studied the visibility profile of nodes in different classes of network growth models. Firstly, we observed that in the multiplicative fitness model, nodes with high fitness values can successfully maintain visibility in the network to a greater extent when compared with the additive fitness and BA models. A general fitness model that has a non-linear attachment rule, e.g., that combines the degree and fitness values in a non-linear (quadratic) fashion, would also allow influential nodes to maintain visibility. However, unlike in multiplicative models, in these general fitness models with non-linear attachment rules we showed that it becomes progressively more difficult for new nodes with high fitness values to become influential in the entire network. We demonstrated through experimental results that in multiplicative models only a few number of nodes can be influential in the network at any given moment of time. This leads us to investigate spatial models that allows a multiplicity of influential nodes to exist in the network. We also show how the decay parameter in these spatial models can be used to control the number of leaders in the network.

8 Acknowledgements

The authors would like to thank Dr. Ralucca Gera at Naval Postgraduate School, and Dr. Soham De at DeepMind, for insightful discussions and collaboration on previous works that led to this paper.

References

[1] L. A. Adamic and B. A. Huberman. Power-law distribution of the world wide web. Science, 287:2115, 2000.
[2] M. Allamanis, S. Scellato, and C. Mascolo. Evolution of a location-based online social network: Analysis and models. In Proceedings of the Internet Measurement Conference, page 145–158, 2012.
[3] D. R. Amancio, O. N. Oliveira, and L. da Fontoura Costa. Three-feature model to reproduce the topology of citation networks and the effects from authors’ visibility on their h-index. Journal of Informetrics, 6(3):427 – 434, 2012.
[4] A.-L. Barabási. Scale-free networks: a decade and beyond. Science, 325(5939):412–413, 2009.
[5] A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509–512, 1999.
[6] M. Barthélemy. Spatial networks. Physics Reports, 499(1-3):1–101, Feb 2011.
[7] G. Bianconi and A.-L. Barabási. Bose-einstein condensation in complex networks. Physical Review Letters, 86(24):5632, 2001.
[8] G. Bianconi and A.-L. Barabási. Competition and multiscaling in evolving networks. Europhysics Letters, 54(4):436–442, may 2001.
[9] E. Bullmore and O. Sporns. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10(3):186–198, Mar 2009.
[10] G. Caldarelli, A. Capocci, P. De Los Rios, and M. A. Muñoz. Scale-free networks from varying vertex intrinsic fitness. Physical review letters, 89:258702, Dec 2002.
[11] T. Chakraborty, S. Kumar, P. Goyal, N. Ganguly, and A. Mukherjee. On the categorization of scientific citation profiles in computer science. Communications of the ACM, 58(9):82–90, 2015.
[12] T. Chakraborty and S. Nandi. Universal trajectories of scientific success. Knowledge and Information Systems, 54(2):487–509, February 2018.
[13] D. Chen, L. Lü, M.-S. Shang, Y.-C. Zhang, and T. Zhou. Identifying influential nodes in complex networks. Physica A: Statistical Mechanics and its Applications, 391(4):1777 – 1787, 2012.
[14] G. Ergün and G. J. Rodgers. Growing random networks with fitness. Physica A: Statistical Mechanics and its Applications, 303(1):261–272, 2002.
[15] L. Ferretti and M. Cortelezzi. Preferential attachment in growing spatial networks. Physical Review E, 84(1), Jul 2011.
[16] D. Garlaschelli and M. I. Loffredo. Fitness-dependent topological properties of the world trade web. Physical Review Letters, 93:188701, Oct 2004.
[17] R. Guimerá and L. A. N. Amaral. Modeling the world-wide airport network. The European Physical Journal B, 38(2):381–385, Mar 2004.
[18] C. Guo, L. Yang, X. Chen, D. Chen, H. Gao, and J. Ma. Influential nodes identification in complex networks via information entropy. Entropy, 22(2):242, Feb 2020.
[19] M. Kaiser and C. C. Hilgetag. Spatial growth of real-world networks. Physical Review E, 69(3), Mar 2004.
[20] M. Kimura, K. Saito, and R. Nakano. Extracting influential nodes for information diffusion on a social network. In Proceedings of the 22nd National Conference on Artificial Intelligence - Volume 2, AAAI’07, page 1371–1376. AAAI Press, 2007.
[21] J. S. Kong, N. Sarshar, and V. P. Roychowdhury. Experience versus talent shapes the structure of the web. Proceedings of the National Academy of Sciences, 105(37):13724–13729, Sep 2008.
[22] D. Liu, V. Fodor, and L. K. Rasmussen. Will scale-free popularity develop scale-free geo-social networks? IEEE Transactions on Network Science and Engineering, 6(3):587–598, 2019.
[23] M. Mitzenmacher. A brief history of generative models for power law and lognormal distributions. Internet mathematics, 1(2):226–251, 2004.
[24] D. Mohapatra, S. Pal, S. De, P. Kumaraguru, and T. Chakraborty. Modeling citation trajectories of scientific papers. In Advances in Knowledge Discovery and Data Mining - PAKDD’20, pages 620–632, 2020.
[25] M. Newman. The structure and function of complex networks. SIAM review, 45(2):167–256, 2003.
[26] M. Newman. Networks: an introduction, 2010.
[27] K. Nguyen and D. A. Tran. Fitness-Based Generative Models for Power-Law Networks, pages 39–53. Springer US, Boston, MA, 2012.
[28] A. Noulas, B. Shaw, R. Lambiotte, and C. Mascolo. Topological properties and temporal dynamics of place networks in urban environments. In Proceedings of the 24th International Conference on World Wide Web, pages 431–441, 2015.
[29] S. Pal, S. De, T. Chakraborty, and R. Gera. Visibility of nodes in network growth models. In 3rd International Winter School and Conference on Network Science, pages 35–45, 2017.
[30] V. D. P. Servedio, G. Caldarelli, and P. Buttà. Vertex intrinsic fitness: How to produce arbitrary scale-free networks. Physical Review E, 70(5):056126, 2004.
[31] D. Wang, C. Song, and A.-L. Barabási. Quantifying long-term scientific impact. Science, 342(6154):127–132, 2013.
[32] S.-H. Yook, H. Jeong, and A.-L. Barabási. Modeling the internet’s large-scale topology. Proceedings of the National Academy of Sciences, 99(21):13382–13386, 2002.

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{MF}(t+1)-p_{i}^{MF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle=\xi_{i}\left[\frac{D_{t-1}(i)+1}{\psi_{t-1}+\xi_{i}+\xi_{t}}-\frac{D_{t-1}(i)}{\psi_{t-1}}\right]{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\hskip 5.69054pt+\xi_{i}\sum_{\ell\neq i}\left[\frac{D_{t-1}(i)}{\psi_{t-1}+\xi_{\ell}+\xi_{t}}-\frac{D_{t-1}(i)}{\psi_{t-1}}\right]{{\bf P}}\left[{S_{t}=\ell\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\approx\xi_{i}\Bigg{[}\frac{\psi_{t-1}{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]}{\psi_{t-1}\left(\psi_{t-1}+\xi_{i}+\xi_{t}\right)}-\frac{D_{t-1}(i)}{\psi_{t-1}}$
	$\displaystyle\hskip 11.38109pt\times\Bigg{[}\sum_{\ell\neq i}{{\bf P}}\left[{S_{t}=\ell\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\cdot\frac{\xi_{\ell}+\xi_{t}}{\psi_{t-1}+\xi_{\ell}+\xi_{t}}\Bigg{]}\Bigg{]}$
	$\displaystyle\geq\xi_{i}\Bigg{[}\frac{\xi_{i}D_{t-1}(i)}{\psi_{t-1}\left(\psi_{t-1}+\xi_{i}+\xi_{t}\right)}$
	$\displaystyle\hskip 11.38109pt-\frac{D_{t-1}(i)}{\psi_{t-1}}\cdot\frac{\sum_{\ell\neq i}\xi_{\ell}D_{t-1}(\ell)(\xi_{\ell}+\xi_{t})}{\psi_{t-1}^{2}}\Bigg{]}$
	$\displaystyle\simeq\xi_{i}D_{t-1}(i)\left[\frac{\xi_{i}\psi_{t-1}-\sum_{\ell\neq i}\xi_{\ell}D_{t-1}(\ell)(\xi_{\ell}+\xi_{t})}{\psi_{t-1}^{2}(\psi_{t-1}+\xi_{i}+\xi_{t})}\right]$

	$\displaystyle{\mathbb{E}}\left[{p_{i}^{GF}(t+1)-p_{i}^{GF}(t)\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]$
	$\displaystyle\simeq{{\bf P}}\left[{S_{t}=i\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\left[\frac{\left(\sum_{\begin{subarray}{c}j\neq i\\ j\in V_{t-1}\end{subarray}}\hat{g}_{t,j}\Delta_{t,i}^{g}\right)-g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma^{2}_{t-1}}\right]$
	$\displaystyle-\sum_{k\neq i}{{\bf P}}\left[{S_{t}=k\ \|\ {\mathbb{G}}_{t-1},\xi_{t}}\right]\left[\frac{\hat{g}_{t,i}\Delta_{t,k}^{g}}{\Gamma^{2}_{t-1}}+\frac{g(\xi_{t},1)\hat{g}_{t,i}}{\Gamma_{t-1}^{2}}\right]$		(17)

Dynamics of node influence in network growth models††thanks: This document does not contain technology or technical data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations.

Abstract

Index Terms:

1 Introduction

2 Network growth models

3 Analytical results on node visibility – BA and Fitness models

Lemma 3.1

3.1 Node visibility over time – General fitness model

Lemma 3.2

4 Experimental results on node visibility – Fitness models

5 Analytical results on node visibility – Spatial models

5.1 Preliminaries - Results on various notions of visibility in spatial models

Lemma 5.1

Lemma 5.2

5.2 Node visibility over time

Lemma 5.3

6 Experimental results on node visibilities – Spatial models

7 Conclusion

8 Acknowledgements

References

Dynamics of node influence in
network growth models^†^†thanks: This document does not contain technology or technical data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations.