The first eigenvector of a Distance Matrix
is nearly constant
Abstract.
Let be points in a metric space and define the distance matrix by . The Perron-Frobenius Theorem implies that there is an eigenvector with non-negative entries associated to the largest eigenvalue. We prove that this eigenvector is nearly constant in the sense that the inner product with the constant vector is large
and that each entry satisfies . Both inequalities are sharp.
Key words and phrases:
Distance Matrix, Perron-Frobenius, Graph Distance Matrix.2010 Mathematics Subject Classification:
51K05, 30L15, 31E05.1. Introduction
This paper is the result of trying to understand a curious phenomenon concerning the graph distance matrix (Problem 1, introduced below). We give a relatively simple argument (Proposition 1) which explains, in a particular setting, why one might expect to observe such a phenomenon. Proposition 1 is a conditional result and it is not a priori clear why or how often (meaning for ‘how many’ graphs) the condition should be satisfied. A surprisingly large number of times it is indeed satisfied for reasons that are not presently understood and we will refer to this as Problem 2. The main contribution of the paper, besides introducing these two problems, is to show that there is indeed a general result (see §1.2) in metric spaces which is of intrinsic interest and explains the second phenomenon ‘up to constants’. For many graphs that constant factor seems to rather close to (the largest it could be) and this is currently not understood: we refer to §1.3.
1.1. The first problem.
We start by explaining the first problem which first arose implicitly in [20]: let be a finite, connected graph on vertices and let denote the graph distance matrix defined by where is the graph distance between the vertices .
Problem 1. The linear system of equations
tends to have a solution for ‘most’ graphs. Why is that?
It is certainly the case that for ‘many’ graphs, the matrix simply happens to be invertible and this case is not particularly interesting. However, when the graph exhibits additional structure and/or symmetries, then this is often reflected in the non-invertibility of , however, even then the linear system seems to have a solution for a constant right-hand side. We can make this quantitative: the database of graphs in Mathematica 12 contains
7642 connected graphs with vertices. Among those, the matrix is not invertible for 3155 of these graphs (this large percentage is due to the fact that the database contains more ‘structured’ than ‘generic’ graphs). However, has a solution in all but 5 cases.
Using the Monte Carlo
method, we estimate that for Erdős-Renyi graphs the likelihood of a linear system associated to not having a solution is (uniformly in ) less than and
decays rapidly for larger . The phenomenon arose in [20] where solutions of are used to construct a notion of curvature on graphs. Naturally, the effectiveness of such a notion hinges on whether a solution of typically exists.
Our starting point is a spectral existence result.
Proposition 1.
Suppose has eigenvalues and eigenvector . If
then has a solution.
The right-hand side is positive but need not be large. Note that vanishes on the diagonal and thus the sum of the eigenvalues is 0. Given our assumption about the sign of the eigenvalues, this implies . We would thus expect that, generically, which makes the right-hand side rather small. Hence, the applicability of Proposition 1 depends on whether is close to 1, whether the first eigenvector is ‘nearly constant’.
1.2. The Result
If is a matrix with nonnegative entries, then the Perron-Frobenius theorem guarantees the existence of an eigenvector with nonnegative entries associated to the largest eigenvector.
Theorem.
Let be points in a metric space (not all identical) and denote the matrix given by . Its Perron Frobenius eigenvector , normalized to unit length , satisfies
We note that, by Cauchy-Schwarz, all of unit length satisfy
with equality if and only if all the entries of are .
The Theorem can be interpreted as saying that the Cauchy-Schwarz inequality is close to being attained (which happens if the vectors and are linearly dependent, meaning that is close to a constant vector). The two inequalities in the Theorem are sharp: if we consider the star graph, then the minimum of the largest eigenvector is attained in the center and behaves asymptotically as described in the Theorem. For the second inequality we consider a metric space where points are distance (for suitable depending on ) from each other with an additional point at distance from the first points. We refer to §2.3 for details.
1.3. The second problem.
Returning to our original motivation, the Theorem by itself does not quite explain the phenomenon introduced above. The constant , while sharp, is not particularly close to 1. However, when returning to the graph setting, it does seem as if is usually quite close to 1 and often strikingly so. Among the 7642 connected graphs having vertices that are implemented in the Mathematica database, the average value of is 0.996. Path graphs have a comparatively small constant that can be computed exactly (we refer to prior work of Ruzieh & Powers [17]). It is a nice fact (see [17]) that the eigenvector behaves likes a hyperbolic cosine.
Proposition 2.
Consider the path graph on vertices. Then
where is the positive number satisfying .
A search in the Mathematica database identifies the ‘sun’ graphs (also sometimes known as trampoline graph, see [9]) as having the smallest constant within the library. A computation shows that the limiting behavior of the constant for sun graphs is given by .
Motivated by the extremal metric space (see Fig. 1), it is natural to consider so-called broom graphs or comets [14, 23]. They are obtained by taking a star and attaching a path to the central vertex. Numerics suggest that they might lead to a relatively small inner product (certainly quite a bit smaller than the sun graphs and possibly even close to ), however, their leading eigenvector does not seem to be known. We conclude by formally stating the second problem.
Problem 2. If is an unweighted, connected graph on vertices and is the leading eigenvector of its distance matrix , then it seem for most graphs with a very small value of . Can this be made precise and/or quantified?
1.4. Related results
Let be a metric space and . The largest eigenvector of the distance matrix then describes the global maximum of the energy functional given by
This is naturally related to the study of energy integrals of the form
where is a probability measure. The study of such measures is classical, we refer to the 1958 paper of Björck [6] as well as Alexander [1, 2], Alexander & Stolarsky [3], Gross [13], Wolf [24, 25] and references therein. Both in the continuous [6] as well as in the discrete case [21] the optimal measure tends to be localized in a rather small set contained in the boundary. We observe that by switching from probability measures (essentially ) to , the geometry of the problem changes substantially: having mass localized is actually rather expensive and it is beneficial to spread things out. Our main result makes this precise. The distance matrix for graphs was introduced by Graham & Pollak [12]. Papendieck & Recht [16] studied the largest possible entry of the principal eigenvector. Das [10, 11] considered the problem of bounding the extremal entries of the principal eigenvectors of the distance matrix. The survey of Aouchiche & Hansen [4] lists several more results about the principal eigenvalue and principal eigenvector, we also refer to the books of Stanić [19], Stevanović [22]. Distance matrices in the Euclidean setting were introduced by Schoenberg [18] (see also Young & Householder [26]) and studied in [5, 7, 8, 26]. We caution that the classical Euclidean Distance Matrix (Menger [15]) is different and has .
2. Proofs
2.1. Proof of Proposition 1
Proof.
We consider the problem of maximizing the quadratic form
subject to the restriction
Since has at least two positive entries, we conclude via that
We will now argue that the maximum is attained. Using the symmetry of and the Spectral Theorem, we can decompose any vector into eigenvectors of
Recalling that , we have
We can decompose the leading eigenvector (corresponding to ) into its projection on and a remaining vector
The Pythagorean theorem implies that has mean value 0 and thus
Then
and since the sum of entries of is , this implies
and hence, for restricted to the hyperplane ,
Note that the first term is merely a constant, independent of , and thus
Using the bound from above and the Cauchy-Schwarz inequality,
Therefore, if
then there exists a global maximum that is attained because the quadratic form tends to in all directions of the hyperplane . Recalling that
we arrive at the desired condition. Let now denote the location of such a maximum. It maximizes a function () subject to a constraint () and therefore, for some Lagrange multiplier ,
We note that since , we have and thus . After rescaling, this implies the existence of a solution of . ∎
2.2. Proof of the Theorem
Proof.
We start by noting that since not all the points are identical, there exists at least a pair which has positive distance and thus the matrix has at least two positive entries (this is to ensure that we do not argue about the eigenvalues of the zero matrix). In particular, . Let denote the normalized Perron-Frobenius eigenvector all of whose entries are nonnegative. Then, for an arbitrary point , we have
Using the th row of the equation , we note that
Altogether, we conclude
and thus, since ,
Then, using the Cauchy-Schwarz inequality,
This implies, in particular, . However, one can do a little bit better: by noting that was arbitrary, we also obtain by summation over that
implying as desired. ∎
Remark. The proof suggests that the only way for the argument to be close to sharp is if the triangle inequality is close to being an equation for most cases. It seems interesting, albeit nontrivial, to see whether one can obtain quantitative improvements in settings where this is not the case.
2.3. Examples.
We will now discuss three relevant examples.
Star Graph. The star graph is comprised of a central vertex which has an edge to other vertices. The symmetry of the graph is reflected in its leading eigenvector which we assume to have values in the center and in all other vertices.
This leads to the problem of maximizing
A computation shows that the optimal choice for is
This is not too surprising: for the star graph, our argument is essentially optimal.
A metric space.
As for the second inequality, let us suppose we have elements of which are all distance from each other with one element at distance . The eigenvalue problem then consists of maximizing the expression
where is the value in points that are distance from each other and is the value in the single point that is far away. For , this is maximized for and . Basic stability theory implies that this persists for sufficiently small. Thus
Path Graphs: Proof of Proposition 2.
We use an exact expression for the first eigenvector of the distance matrix of due to Ruzieh & Powers [17]: they show that the entries for can be taken as
where is the positive solution of
A Taylor series expansion shows that for around 0
and thus, as , we see that . Making the ansatz for some unspecified constant , we see that
Therefore, as , we have that the implicit constant in the ansatz converges to the solution of the equation . This solution is approximately . The limiting profile of the eigenvector is given by for which, after some computation, implies the desired statement. ∎
References
- [1] R. Alexander, Generalized sums of distances. Pacific J. Math. 56 (1975), no. 2, 297–304.
- [2] R. Alexander, Two notes on metric geometry. Proc. Amer. Math. Soc. 64 (1977), p. 317–320.
- [3] R. Alexander and K. Stolarsky, Extremal problems of distance geometry related to energy integrals. Trans. Amer. Math. Soc. 193 (1974), 1–31.
- [4] M. Aouchiche and P. Hansen, Distance spectra of graphs: a survey. Linear Algebra Appl. 458 (2014), 301–386.
- [5] K. Ball, Eigenvalues of Euclidean distance matrices. J. Approx. Theory 68 (1992), 74–82.
- [6] G. Björck, Distributions of positive mass, which maximize a certain generalized energy integral, Ark. Mat., 3 (1958), 255–269.
- [7] E. Bogomolny, O. Bohigas and C. Schmit, Distance matrices and isometric embeddings. Zh. Mat. Fiz. Anal. Geom. 4 (2008), no. 1, 7–23, 202.
- [8] E. Bogomolny, O. Bohigas and C. Schmit, Spectral properties of distance matrices. Random matrix theory. J. Phys. A 36 (2003), no. 12, 3595–3616.
- [9] A. Brandstädt, A. V. B. Le and J. P. Spinrad, Graph Classes: A Survey. Philadelphia, PA: SIAM, 1987.
- [10] K. Das, Maximal and minimal entry in the principal eigenvector for the distance matrix of a graph. Discrete Math. 311 (2011), no. 22, 2593–2600.
- [11] K. Das, A sharp upper bound on the maximal entry in the principal eigenvector of symmetric nonnegative matrix Linear Algebra Appl., 431 (2009), pp. 1340–1350
- [12] R.L. Graham and H.O. Pollak. On the addressing problem for loop switching. Bell Syst. Tech. J. 50 (1971), 2495–2519.
- [13] O. Gross, The rendezvous value of metric space. 1964 Advances in game theory pp. 49–53 Princeton Univ. Press, Princeton, N.J.
- [14] A. Ilic, On the extremal properties of the average eccentricity. Computers & Mathematics with Applications, 64 (2012), 2877-2885.
- [15] K. Menger, Untersuchungen uber allgemeine Metrik, Math. Annalen 100, p. 75 – 163 (1928)
- [16] B. Papendieck and P. Recht, On maximal entries in the principal eigenvector of graphs. Linear Algebra Appl. 310 (2000), no. 1-3, 129–138.
- [17] S. Ruzieh and D.L. Powers, The distance spectrum of the path and the first distance eigenvector of connected graphs, Linear Multilinear Algebra 28 (1990) 75–81.
- [18] I. Schoenberg, Remarks to Maurice Frechet’s article Sur la definition axiomatique d’une classe d’espace distancies vectoriellement applicable sur l’espace de Hilbert, Ann. of Math. 36 (1935), 724-732.
- [19] Z. Stanić, Inequalities for graph eigenvalues. Vol. 423. Cambridge University Press, 2015.
- [20] S. Steinerberger, Curvature on Graphs via Equilibrium Measures, arXiv:2202.01658
- [21] S. Steinerberger, Sums of Distances on Graphs and Embeddings into Euclidean Space, arXiv:2204.13278
- [22] D. Stevanovic, Spectral radius of graphs. Academic Press, 2014.
- [23] D. Stevanovic and A. Ilic, Distance spectral radius of trees with fixed maximum degree. The Electronic Journal of Linear Algebra 20 (2010), p. 168-179.
- [24] R. Wolf, On the average distance property and certain energy integrals. Ark. Mat. 35 (1997), no. 2, 387–400.
- [25] R. Wolf, Averaging distances in real quasihypermetric Banach spaces of finite dimension. Israel J. Math. 110 (1999), 125–151.
- [26] G. Young and A. Householder, Discussion of a set of points in terms of their mutual distances, Psychometrika 3 (1938) 19–22.