\fnm

Josh \surJohnston

Scale Free Projections Arise from Bipartite Random Networks

jjohnston@u.boisestate.edu \fnmTim \surAndersen tandersen@boisestate.edu \orgdivDepartment of Computer Science, \orgnameBoise State University, \orgaddress\street777 W Main St, \cityBoise, \postcode83702, \stateID, \countryUSA

Abstract

The degree distribution of a real world network — the number of links per node — often follows a power law, with some hubs having many more links than traditional graph generation methods predict. For years, preferential attachment and growth have been the proposed mechanisms that lead to these scale free networks. However, the two sides of bipartite graphs like collaboration networks are usually not scale free, and are therefore not well-explained by these processes. Here we develop a bipartite extension to the Randomly Stopped Linking Model and show that mixtures of geometric distributions lead to power laws according to a Central Limit Theorem for distributions with high variance. The two halves of the actor-movie network are not scale free and can be represented by just $5$ geometric distributions, but they combine to form a scale free actor-actor unipartite projection without preferential attachment or growth. This result supports our claim that scale free networks are the natural result of many Bernoulli trials with high variance of which preferential attachment and growth are only one example.

keywords:

scale free Networks, Bipartite Graphs, Central Limit Theorem, Power Laws, Preferential Attachment

1 Introduction and Background

For two decades, the Barabási-Albert (BA) Model has explained why power laws and other heavy-tailed distributions often emerge in what are known as scale free networks [6]. They propose that degree distribution in a real network — the number of links per node — tends toward a power law due to preferential attachment [2]. Recent work showed that preferential attachment and growth are not required to generate scale free networks [11]. Linking processes behaving as Bernoulii trials with high variance also result in power law degree distributions. This is predicted by a Central Limit Theorem (CLT) that says mixtures of geometric distributions with high variance will follow a power law [11]. The critical element of scale free networks are high variance linking probabilities, not preferential attachment per se. We can build synthetic networks with the Randomly Stopped Linking Model [12], which uses mixtures of geometric distributions to model Bernoulli processes with high variance and then links nodes together with a reparameterization of the Configuration Model [8].

In this paper, we extend the Randomly Stopped Linking Model to bipartite graphs. These graphs have two types of nodes where links are only made from one type to the other [10]. Analyzing the actor-movie network provides insight to how projections of bipartite graphs can have power law degree distributions even when the distribution of each half of the network does not follow a power law. We use a reparameterized Bipartite Configuration Model to reconfigure the links between actors and movies, demonstrating that preferential attachment is not needed to result in a projected actor-actor network similar to the real world version. We further show that a synthetic network created with geometric (non-heavy-tailed) distributions produces a power law degree distribution in the projected actor-actor network.

1.1 The Actor-Actor Network

The network that links Hollywood actors who have appeared in movies together is one example of a scale free network [18], [19]. Like other co-occurrence graphs such as identity networks and scientific collaboration networks, the actor-actor network is actually a projection from a bipartite network to a unipartite network. In the bipartite newtork, a link connects from an actor node to a movie node to show an actor has appeared in a movie [3]. The version used to support the BA model is a projection where links connect actors directly to each other if they have been in a movie together [5]. This paper uses the terminology “actor-movie network” for the full bipartite network and “actor-actor network” for the projected view.

1.2 Fitting a Power Law to Degree Distribution

To consider a network scale free, we expect a power law to be a better fit than other distributions across at least 2–3 orders of magnitude in the degree distribution. We use a method [4] comparing the fit of a power law to a stretched exponential, using maximum-likelihood estimation (MLE) to determine whether a power law fit is best according to the Bayesian Information Criterion (BIC). Even in scale free degree distributions, power laws are rarely the best distribution to fit all of the data, and we are most interested in the tail where power laws tend to become visible [9]. So as long as the MLE does not support a power law fit, the method iteratively increases $k_{min}$ and tests the fit for data greater than this degree threshold.

As seen in Figure 1, the degree distribution of the projected actor-actor network follows a power law more closely than either of the actor or movie degree distributions in the bipartite network. As we show below, the number of actors per movie is in fact fit well by a geometric distribution and is not heavy-tailed, much less scale free. It has been previously noted that the two halves of collaboration networks, including the actor-movie network, have degree distributions that are usually not as heavy-tailed as the projected unipartite network [10].

Refer to caption — Figure 1: Three views of the Hollywood actor-movie network used for this paper, with data from [1]. (a) shows the degree of actor nodes when the bipartite graph is projected into an actor-actor view, with links connecting actors who appeared in the same movie together. The degree of the movie nodes (b) and actor nodes (c) from the underlying bipartite graph are also the number of actors in each movie (b) and the number of movies each actor appears in (c). $\gamma$ is the exponent of the fitted power law and $k_{min}$ is the minimum value for which the power law is the best fit to the data.

1.3 How the Actor-Movie Network is Formed

The BA Model assumes new actors preferentially appear in movies with high-degree actors because “a new actor is most likely to be cast in a supporting role with more established and better-known actors” [5]. This is an unproven assertion with plausible counterclaims, ie ‘new actors are more likely to appear in movies with other unknown actors than with a big star’. The bipartite network is a more complete representation of the relevant relationship dynamics than the projected view [13]. Actor-actor links do not typically form as organic collaborations of actors. Instead, movies are distinct entities, each with a fixed number of roles. Actors compete for these roles. While it makes sense that landing a well-respected actor will influence further casting, this potential network effect is likely small relative to the prior fitness of each actor competing for the role.

In any event, our model ignores any network effects and assumes actor-actor degree is dominated by the number of movies an actor appears in and the number of actors per movie, without regard to preferential attachment. Therefore, we explore two questions separately:

1.

How many actors are in each movie?
2.

How many movies does each actor appear in?

The answers to these questions become the degree distributions for each type of node. First, we will combine the two halves from the original degree distributions using a Bipartite Configuration Model, showing that preferential attachment is not required in the node linking step to produce a scale free network. Later, we will show that the degree distributions of each half of the bipartite model can be parameterized as geometric distributions and still result in a scale free projected actor-actor network.

2 Relinking the actor-actor network using the Bipartite Configuration Model

In the Bipartite Configuration Model, the two halves of the network (movie and actor) are initialized as link stubs attached to each node. At this point, the stubs do not yet connect to other nodes to form actual links. The number of stubs for each node could be drawn from a distribution, but in the first example we will use the original degree distribution of each half of the real actor-movie network. Conceptually, we break every link and will relink new pairs of nodes without regard to how pairs were linked in the real network.

The stubs are combined to form new links using a technique to build a random bipartite network from prescribed degree distributions[10]]. We create the graph by randomly selecting pairs of unlinked stubs — a movie stub and an actor stub — then linking them together. This step repeats until there are no more stubs [15].

After creating the bipartite actor-movie network, we extract the actor-actor projection and compare its degree distribution with that of the real network. Our model emphasizes two distinct, independent processes rather than being driven by preferential attachment or other network effects: movie writers create a number of roles according to one process and actors are cast in a number of movies according to another process. Then they are linked randomly (Figure 2).

3 From relinking to a fully synthetic network

We have shown that a scale free actor-actor network emerges from the two halves of a bipartite network when relinked randomly, without preferential attachment or other network effects. Since we used the original degree distributions for actor nodes and movie nodes, however, there may be network effects responsible for generating those distributions that explain the scale free nature of the relinked and projected network. In this section, we use geometric distributions to parameterize the two halves of the bipartite network, again without preferential attachment or network effects. This shows that Bernoulli processes can lead to scale free bipartite networks as predicted by the CLT for high variance distributions.

3.1 How many actors are in each movie?

In the bipartite actor-movie network, the number of actors in each movie is the degree of movie nodes (Figure 1). Our model recognizes that each role added to a movie is a discrete decision made in series. After the first role, there is a chance the writer will add another role. If not, the process ends. If the writer adds a second role, there is now a chance to add a third role, and so on. In the simplest approximation, we consider the marginal probability of adding each role to be the same. That process is described by a geometric distribution (Equation 1), the number $k$ Bernoulli trial failures before the first success. Equation 2 finds $\mu$ , the mean of the distribution, in terms of the parameter $p$ , which is then rearranged as Equation 3. In the real network, $\mu=11.5$ , so according to Equation 3, $p=0.087$ . The result of this fit is presented as Figure 3.

f(k)=(1-p)^{k}p

(1)

\mu=\frac{1-p}{p}

(2)

p=\frac{1}{1+\mu}

(3)

3.2 How many movies does each actor appear in?

Each actor in the network appears in some number of movies; this is the degree of the actor nodes in the actor-movie network. Unlike movie degree, a single geometric distribution does not fit the actor node degree distribution very closely. Instead, we use a mixture of geometric distributions, following the insight provided by the Randomly Stopped Linking Model [12]. Expected outcomes for actors have high variance; not everyone starts with the same chance of making it big. As a heterogeneous and constant property of nodes, a value of fitness can represent the competitive strength of each actor for roles [7] This follows from, and is justified by, the observation that some actors have a priori advantages and therefore higher fitness for being cast in movies than others.

Adapting the geometric distribution mixing function from[12], we fit four geometric distributions to the actor node degree distribution. This fit uses $8$ parameters, with each distribution having a value for $p$ in (Equation 3) as well as a coefficient weight $a$ , and is performed with the Trust Region Reflective technique implemented by the SciPy $curve\_fit$ function[16] [17]. The values of $p$ and $a$ that best fit the real network are presented in Table 1 and the result of this mixture is shown as Figure 4.

We can interpret the values of $p$ as the chance a member of a cohort of actors has not been cast in another movie. The corresponding value of $a$ is a description of the size of that cohort. The fit tells us that a priori, many more actors are expected to be in a small number of movies than are expected to make it big.

Table 1: Parameters for the best fit of four geometric distributions to the number of movies per actor in the actor-movie network.

p	a
0.046	0.094
0.184	0.178
0.528	0.311
0.940	0.562

3.3 Generating Synthetic Networks with the Bipartite Configuration Model

We have characterized the degree distribution of each half of the bipartite actor-movie network using parameterized distributions: geometric for the movie node degree and a mixture of four geometrics for the actor node degree.

To get from these PMFs to a network, we create the same number of movies and actors as contained in the real network. Each generated movie is a node with a number of stub actor links pulled from a geometric distribution with the parameter fit earlier (Figure 3). Separately, each actor is created as a node and assigned a number of movie roles by pulling from the PMF generated as a collection of geometric distributions (Figure 4). At this point, we have separately established the degree distributions for actors and movies, so each node has a certain number of stubs. The modeled network is formed by randomly selecting a stub from an actor node and a stub from a movie node, then replacing those stubs with a link [15]. As in the earlier relinking, there is no preferential attachment — links connect without regard to how many actors each movie is already connected to, and vice versa. The process repeats until all stubs are connected, creating the bipartite structure of the actor-movie network. Finally, the actor-actor network is projected and compared to the real network (Figure 5). This synthetic network is the bipartite adaptation of the Randomly Stopped Linking Model from [12].

The degree of our modeled actor-actor network is similar to that of the real network, but visual inspection on a log-log plot is insufficient to determine whether a power law is a better fit than other heavy-tailed distributions such as the log-normal [14]. We use the two-sided Kolmogorov-Smirnov (KS) test to quantify goodness of fit [9], then compare with a power law fit to the tail of the distribution above a $k_{min}$ [16] [17]. A low KS statistic indicates a close fit between two distributions.

The KS statistic for the Randomly Stopped Linking network compared to a power law fit is $0.027$ , which is near to, but worse than, the real network vs. power law KS stastistic of $0.022$ . However, the $k_{min}$ of the Randomly Stopped Linking network is $29$ compared to the real network’s $48$ . That means $46.6\%$ of the synthetic network’s data is best described by a power law, compared to only $38.3\%$ of the real network. This result shows that while the real network is different from the synthetic one built with geometric distributions, they are both scale free networks. Additional statistics characterizing the real network, the geometric fit of the actor and movies nodes, and the synthetic Randomly Stopped Linking network are shown as Table 2.

Table 2: Comparison of the real network and the Randomly Stopped Linking network, including the two halves of the bipartite network and the projected actor-actor view.

Power Law Fit

Distribution Statistics

Network

\gamma

k_{min}

Data Fraction

Variance

\sigma^{2}

Mean

\mu

VMR

\sigma^{2}/\mu

Real Network: Actors per Movie

6.5

88

0.2\%

138.2

11.5

12.0

Geometric Fit: Actors per Movie

4.5

36

4.2\%

120.8

11.5

10.5

Real Network: Movies per Actor

5.3

170

0.0\%

108.7

3.8

28.4

Mixture of Geometric Fit: Movies per Actor

2.8

31

2.0\%

72.5

3.7

19.8

Real Network: Actor-Actor Degree

2.1

48

38.3\%

44514.9

86.6

513.8

Randomly Stopped Linking Network: Actor-Actor Degree

2.0

29

46.6\%

30059.0

73.8

407.2

4 Discussion

4.1 Heavy-Tailed Projections Emerge from Bipartite Networks without Heavy Tails

Both the bipartite Randomly Stopped Linking network and the real network actor-actor degree distributions are well-characterized by power laws for several orders of magnitude, and can thus be considered scale free. The results in Table 2, however, show that neither the actor nor movie components of the bipartite networks are scale free. The $k_{min}$ of $88$ for the Actors per Movie and $170$ for Movie per Actor means that a power law best describes only a trivial range of the degree distribution. This is likely why most analyses showing collaboration networks to be scale free focus on the projection of the network. Figure 5 shows that the power law fit covers about $3.5$ orders of magnitude in this projected actor-actor network, which should therefore be considered scale free.

The result of projection is also visible in statistics from each network. Both the variance and variance-to-mean ratio (VMR) are much higher after projection than in either half of the original bipartite graph. This shows that a projection of a collaboration network can be scale free even when both halves and the original bipartite network are not. The two halves of the bipartite network have quite different degree distribution means and the variance increases substantially even when linked randomly without preferential attachment. This is another example of high variance Bernoulli processes leading to scale free networks.

4.2 Mixtures of High Variance Geometric Distributions Lead to Scale Free Networks

Table 2 also compares the real actor collaboration networks with an example generated from the modeling with geometric distributions described earlier in this paper. The degree distributions are indistinguishable between the generated and real networks. The number of actors per movie is drawn from a single geometric distribution and the number of movies per actor is pulled from a mixture of four geometric distributions. Therefore, the generated version of the projected actor-actor network is formed from only five geometric distributions, then linked according to the Configuration Model with no growth or preferential attachment.

We have previously shown that the Randomly Stopped Linking Model creates scale free networks from Bernoulli trials with high variance [11]. This paper extends the result by showing even a small number of geometric distributions can result in high enough variance to create a scale free network, especially when two halves of a bipartite network have widely separated means that increase the variance when assembled.

5 Conclusions

Since the discovery of power law degree distributions in real networks, preferential attachment has stood as the generally-assumed mechanism of their formation. We demonstrate experimentally that independent Bernoulli Processes — implemented by a small number of geometric distributions — accurately model the actor-actor network’s degree distribution without growth or preferential attachment. Our technique has several advantages over the BA model, including:

[1.]
1.

estimating the entire distribution rather than the minority of points in the right hand tail best fit by a power law
2.

recovering the full bipartite actor-movie graph rather than only the projected actor-actor view
3.

explaining the real network’s distribution with a general theory of fitness variance rather than switching from a power law to a crossover distribution — such as the stretched exponential — in the presence of a nonlinear preferential attachment regime

Abstracting from this specific actor-actor network, a generalized CLT provides a theoretical justification that heavy-tailed degree distributions are expected when the fitness of nodes has high variance. Applying these insights to other scale free networks may explain the ubiquity of heavy-tailed degree distributions as the predictable result of Bernoulli Processes with high variance node fitness, particularly in bipartite graphs like collaboration networks.

References

\bibcommenthead
paj [2004] (2004) Pajek data: Notre dame self-organized networks database. Tech. rep., Notre Dame University, South Bend, IN, http://vlado.fmf.uni-lj.si/pub/networks/data/nd/ndnets.htm. (Accessed: 27th July 2019)
Albert et al [1999] Albert R, Jeong H, Barabási AL (1999) Diameter of the world-wide web. nature 401(6749):130–131
Albert-Lászlo and Márton [2017] Albert-Lászlo B, Márton P (2017) Network science. Cambridge University Press
Alstott et al [2014] Alstott J, Bullmore E, Plenz D (2014) powerlaw: A python package for analysis of heavy-tailed distributions. PLOS ONE 9(1):1–11. 10.1371/journal.pone.0085777, URL https://doi.org/10.1371/journal.pone.0085777
Barabási and Albert [1999] Barabási AL, Albert R (1999) Emergence of scaling in random networks. Science 286(5439):509–512. 10.1126/science.286.5439.509, URL https://science.sciencemag.org/content/286/5439/509, https://science.sciencemag.org/content/286/5439/509.full.pdf
Barabási [2009] Barabási AL (2009) Scale-free networks: A decade and beyond. Science 325(5939):412–413. 10.1126/science.1173299, URL https://www.science.org/doi/abs/10.1126/science.1173299, https://www.science.org/doi/pdf/10.1126/science.1173299
Bianconi and Barabási [2001] Bianconi G, Barabási AL (2001) Competition and multiscaling in evolving networks. Europhysics Letters (EPL) 54(4):436–442. 10.1209/epl/i2001-00260-6
Bollobás [1980] Bollobás B (1980) A probabilistic proof of an asymptotic formula for the number of labelled regular graphs. European Journal of Combinatorics 1(4):311–316. https://doi.org/10.1016/S0195-6698(80)80030-8, URL https://www.sciencedirect.com/science/article/pii/S0195669880800308
Clauset et al [2009] Clauset A, Shalizi CR, Newman MEJ (2009) Power-law distributions in empirical data. SIAM Review 51(4):661–703. 10.1137/070710111
Guillaume and Latapy [2006] Guillaume JL, Latapy M (2006) Bipartite graphs as models of complex networks. Physica A: Statistical Mechanics and its Applications 371(2):795–813. https://doi.org/10.1016/j.physa.2006.04.047, URL https://www.sciencedirect.com/science/article/pii/S0378437106004638
Johnston and Andersen [2020] Johnston J, Andersen T (2020) Randomly stopped linking generates scale free networks. In: SIAM 2020, Workshop on Network Science (NS20)
Johnston and Andersen [2022] Johnston J, Andersen T (2022) Random processes with high variance produce scale free networks. Physica A: Statistical Mechanics and its Applications 604:127588. https://doi.org/10.1016/j.physa.2022.127588, URL https://www.sciencedirect.com/science/article/pii/S0378437122004058
Newman et al [2002] Newman MEJ, Watts DJ, Strogatz SH (2002) Random graph models of social networks. Proceedings of the National Academy of Sciences 99(suppl_1):2566–2572. 10.1073/pnas.012582999, URL https://www.pnas.org/doi/abs/10.1073/pnas.012582999, https://www.pnas.org/doi/pdf/10.1073/pnas.012582999
Stumpf and Porter [2012] Stumpf MP, Porter MA (2012) Critical truths about power laws. Science 335(6069):665–666
Tian et al [2012] Tian L, He Y, Liu H, et al (2012) A general evolving model for growing bipartite networks. Physics Letters A 376(23):1827–1832. https://doi.org/10.1016/j.physleta.2012.04.020, URL https://www.sciencedirect.com/science/article/pii/S0375960112004653
Virtanen et al [2020] Virtanen P, Gommers R, Oliphant TE, et al (2020) SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods 17:261–272. 10.1038/s41592-019-0686-2
Vugrin et al [2007] Vugrin KW, Swiler LP, Roberts RM, et al (2007) Confidence region estimation techniques for nonlinear regression in groundwater flow: Three case studies. Water Resources Research 43(3). https://doi.org/10.1029/2005WR004804, URL https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2005WR004804, https://agupubs.onlinelibrary.wiley.com/doi/pdf/10.1029/2005WR004804
Wasserman and Faust [1994] Wasserman S, Faust K (1994) Social Network Analysis: Methods and Applications. Structural Analysis in the Social Sciences, Cambridge University Press
Watts and Strogatz [1998] Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393(6684):440–442. 10.1038/30918