This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Dynamic analysis of influential stocks based on conserved networks

Xin-Jian Xu1, Min Qin1, Xiao-Ying Song2 and Li-Jie Zhang3 1Department of Mathematics, Shanghai University, Shanghai 200444, People’s Republic of China
2School of Economics, Shanghai University, Shanghai 200444, People’s Republic of China
3Department of Physics, Shanghai University, Shanghai 200444, People’s Republic of China
lijzhang@shu.edu.cn
Abstract

Characterizing temporal evolution of stock markets is a fundamental and challenging problem. The literature on analyzing the dynamics of the markets has focused so far on macro measures with less predictive power. This paper addresses this issue from a micro point of view. Given an investigating period, a series of stock networks are constructed first by the moving-window method and the significance test of stock correlations. Then, several conserved networks are generated to extract different backbones of the market under different states. Finally, influential stocks and corresponding sectors are identified from each conserved network, based on which the longitudinal analysis is performed to describe the evolution of the market. The application of the above procedure to stocks belonging to Standard & Pool’s 500 Index from January 2006 to April 2010 recovers the 2008 financial crisis from the evolutionary perspective.

Keywords: stock networks, conserved networks, influential stocks

1 Introduction

Stock markets are well-defined complex systems consisting of multi heterogeneous stocks with complex relationships among them [1]. The prices of the stocks evolve as a consequence of their internal and external interactions, and different assets present turbulent financial time series making the market behaviors even more difficult to be examined. Therefore, it is important to mine essential information from the market and build an efficient model to characterize its dynamic properties, which not only provides a fundamental understanding of financial systems but also provides practical insights for policymakers and practitioners [2].

One seminal approach is the random matrix theory [3, 4, 5] which characterizes the eigenvalue distribution of the correlation coefficient matrix of time series of stocks and have unveiled many stylized facts of stock markets [6, 7, 8, 9]. For instance, a stock market containing many business sectors (groups of stocks sharing common economic properties) with hierarchial organization [10]. However, the random matrix could not draw interactions well among these sectors. To understand the market in a more exact way, the complex network theory [11] was adopted instead. Examples include the minimal spanning tree [12], the asset graph [13], the planar maximally filtered graph [14] and the threshold network [15]. Based on these models, many topological characteristics have been observed for the markets such as New York Stock Exchange [16, 17, 18, 19], German Stock Exchange [20], Tokyo Stock Exchange [21], Hong Kong Stock Market [22] and Shanghai Stock Market [24].

In general, a stock market evolves with economic states, resulting in sequential changes of stock prices from one state to another [23]. To explain the development of the economic state from the perspective of the market, there is an increasing interest in characterizing temporal evolution of stock networks. Yet, most studies focused on the evolution of global topologies, such as the edge density, the average clustering coefficient and the average shortest path length, which only provides the macro topological information of the market corresponding to different states [19, 22]. To get a deeper understanding of the dynamics of the market, it is essential to study the evolution of the market from a micro point of view [25, 26]. Specially, how to identify influential stocks and evaluate their roles in the diffusion of microfinance is of great importance [27].

There are two major approaches to identify influential stocks in the literature. The first approach is from the dynamical point of view. For instance, Wang et al. [28] and Benzaquen et al. [29] suggested the cross-response (impact) function to characterize the influence of a stock. That is, the stock with strong cross-response has large influence on other stocks, resulting in the rank of stocks. The second approach is from the structural point of view. Specially, the concept of centrality has been widely used to rank nodal influence. For example, Roy and Sarkar [30] employed the degree centrality to rank stocks. They compared top 1010 influential stocks corresponding to pre- and post-crisis and observed that the change in ranks of top 33 influential stocks are relatively low compared to those ranked lower. Nevertheless, a subtle analysis remains demanding.

The goal of this paper is to identify most influential stocks of a stock market from the evolutionary perspective such that it can recover a financial crisis efficiently. To this end, we first construct a series of threshold networks for stocks in an investigating period. Then, we consider different stages of the crisis and build a conserved network for each stage. Finally, we identify influential stocks and corresponding sectors to describe crisis propagation. To test its efficacy, we apply our framework to stocks belonging to Standard & Pool’s (S&P) 500 Index from January 2006 to April 2010 and recovers the 2008 financial crisis in an evolutionary way.

2 Methodology

Section 2.1 introduces the Pearson correlation coefficient, the P-threshold method with multiple hypothesis testing and the moving-window method to construct the dynamic sequence of stock networks. Section 2.2 presents conserved networks associated with different stages of a financial crisis. Section 2.3 introduces four typical centralities to measure nodal influence. Section 2.4 presents the order statistic to synthesize centralities of nodes to rank their influence.

2.1 Stock networks

Let pi(τ)p_{i}(\tau) (i=1,2,,N;τ=1,2,,M)(i=1,2,\cdots,N;\tau=1,2,\cdots,M) be the daily closing price of stock ii at time τ\tau, one obtains the logarithmic return of ii over a time interval Δτ\Delta\tau by

ri(τ)=lnpi(τ)lnpi(τΔτ).r_{i}(\tau)=\ln p_{i}(\tau)-\ln p_{i}(\tau-\Delta\tau). (1)

In this paper we set Δτ=1\Delta\tau=1, so ri(τ)r_{i}(\tau) represents the daily return of stock ii at time τ\tau. Then, the correlation coefficient between stocks ii and jj is defined by

wij=rirjrirjσiσj,w_{ij}=\frac{\langle r_{i}r_{j}\rangle-\langle r_{i}\rangle\langle r_{j}\rangle}{\sigma_{i}\sigma_{j}}, (2)

where ri=τ=1Mri(τ)/M\langle r_{i}\rangle=\sum_{\tau=1}^{M}r_{i}(\tau)/M is the mean and σi=τ=1M[riri]2/M\sigma_{i}=\sqrt{\sum_{\tau=1}^{M}\left[r_{i}-\langle r_{i}\rangle\right]^{2}/M} is the standard deviation. The ensemble of wijw_{ij} forms the correlation matrix 𝑾\bm{W} of a stock market in a window of width MM.

To filter wijw_{ij}, we use the P-threshold method and set the following hypothesis test [22],

H0:\displaystyle H_{0}: wij=0,\displaystyle w_{ij}=0, (3)
H1:\displaystyle H_{1}: wij0.\displaystyle w_{ij}\neq 0. (4)

The corresponding test statistic is

Tij=wijn21wij2tn2,T_{ij}=w_{ij}\sqrt{\frac{n-2}{1-w_{ij}^{2}}}\sim t_{n-2}, (5)

where nn is the sample size and n2n-2 is the degree of freedom. Given a significance level α\alpha, one should reject H0H_{0} if the absolute value of the test statistic exceeds the cut-off value tα/2(n2)t_{\alpha/2}(n-2), namely, if

|Tij|>tα/2(n2).|T_{ij}|>t_{\alpha/2}(n-2). (6)

To maximize the number of discoveries while controlling the fraction of false discoveries, we perform the multiple hypothesis test based on the Bonferroni correction. Specially, for the significance level of α=0.01\alpha=0.01, we include any interactions between stocks if |Tij|>tα/N(N1)(n2)|T_{ij}|>t_{\alpha/N(N-1)}(n-2). Since the Bonferroni correction assumes complete independence between the tested p-values, one may consider further the False Discovery Rate (FDR) approach [31] to relax the assumption of independence. As a consequence, more edges among stocks will be maintained. For smoothing purpose, we adopt the moving-window method [32]. Assuming the width of each window is MM and the sliding interval is ΔM\Delta M, one can obtain a series of windows overlap with each other for any oberving period with proper choices of MM and ΔM\Delta M. Inside each window, an edge is created between a pair of stocks ii and jj if wij0w_{ij}\neq 0. This process is repeated throughout all the elements of the correlation matrix and finally a stock network is generated. Specially, we assume the network G(V,E)G(V,E) is unweighted and undirected, which can be described by an adjacency matrix 𝑨=(auv)N×N\bm{A}=(a_{uv})_{N\times N} with elements

auv={1,if u and v are connected,0,otherwise.a_{uv}=\biggl{\{}\begin{array}[]{ll}1,&\quad\mbox{if $u$ and $v$ are connected,}\\ 0,&\quad\mbox{otherwise.}\end{array} (7)

2.2 Conserved networks

A typical stock market usually experiences various financial situations, including bull and bear runs, business as usual and financial crises. Of great importance is to delve into reliable indicators of the crisis from the market. However, the 2008 financial crisis has highlighted the main limitations of standard models, as they cannot detect the crisis even by using posterior data [33]. Here, we address this issue by means of conserved networks. According to different states associated with a crisis, we divide the whole investigating period into 55 stages: the normal stage before the crisis, the stage of the transition from the normal state to the crisis, the stage during the crisis, the stage of the transition from the crisis to the normal state and the stage after the crisis. Inside any stage, there are a number KK of consecutive windows overlap with each other, based on which KK stock networks are constructed. Furthermore, we assume that the interactions between significant stocks will persist while the interactions between insignificant stocks will vary with time, which leads to the idea of conserved networks: for any pair of stocks, an edge between them in the corresponding conserved network exists if and only if all these KK networks within the stage have this edge. As a consequence, we obtain 55 conserved networks, which characterize dynamic characteristics of the investigating period.

2.3 Centrality measures

The most influential stocks may help us understand risk propagation in a stock market and design corresponding control measures. To represent nodal influence in each conserved network, we adopt the concept of centrality. In this paper, we consider four typical measures: degree centrality (DC), eigenvector centrality (EC), closeness centrality (CC) and betweenness centrality (BC).

Given a stock network G(V,E)G(V,E), the DC of node uVu\in V is define by [34]

DC(u)=ku,\texttt{DC}(u)=k_{u}, (8)

where ku=v=1|V|auvk_{u}=\sum_{v=1}^{|V|}a_{uv} is the degree of node uu and |V||V| is the number of nodes. Although the DC is the simplest centrality measure, it can be illuminating. In stock markets, it seems reasonable to suppose that stocks with connections to many others might have more access to information than those with fewer connections. A natural extension of the DC is EC, defined as [35]

EC(u)=𝝊umax,\texttt{EC}(u)=\bm{\upsilon}_{u}^{\texttt{max}}, (9)

where 𝝊max\bm{\upsilon}^{\texttt{max}} is the eigenvector corresponding to the largest eigenvalue of the adjacency matrix 𝑨\bm{A} and 𝝊umax\bm{\upsilon}_{u}^{\texttt{max}} is the uuth element of 𝝊max\bm{\upsilon}^{\texttt{max}} corresponding to stock uu. As a consequence, the stock uu with larger EC(u)\texttt{EC}(u) can be important because it has many neighbors (even though those stocks may not be important themselves) or because it has important neighbors of high degrees.

Both the DC and EC only consider local information of a network. Regarding global information, some entirely different measures of centrality have been suggested incorporating shortest path lengths. One is CC, defined as [34]

CC(u)=|V|1vVl(u,v),\texttt{CC}(u)=\frac{|V|-1}{\sum_{v\in V}l(u,v)}, (10)

where l(u,v)l(u,v) is the shortest path length from node uu to node vv. According to Eq. (10), the smaller average distance from the stock uu to others, the larger value of CC(u)\texttt{CC}(u) it has. BC is another different concept of centrality, initially proposed by Bavelas [36] and generalized by Freeman [37],

BC(u)=η(s,u,t)sutη(s,t),\texttt{BC}(u)=\frac{\eta(s,u,t)}{\sum_{s\neq u\neq t}\eta(s,t)}, (11)

where η(s,u,t)\eta(s,u,t) is the number of those shortest paths passing through uu and η(s,t)\eta(s,t) is the total number of the shortest paths from node ss to node tt. In contrast to the CC, the BC measures the extent to which a stock lies on paths between other stocks.

2.4 Order statistic

Different centrality measures yield different ranks of nodal influence. To synthesize multiple centralities, we regard each rank as an order statistic [38] and obtain a Q-statistic from the joint cumulative distribution of the nn-dimensional order statistic:

Q(γ1,γ2,,γn)=n!0γ1q1γ2qn1γndqndqn1dq1,Q(\gamma_{1},\gamma_{2},\cdots,\gamma_{n})=n!\int_{0}^{\gamma_{1}}\int_{q_{1}}^{\gamma_{2}}\cdots\int_{q_{n-1}}^{\gamma_{n}}\texttt{d}q_{n}\texttt{d}q_{n-1}\cdots\texttt{d}q_{1}, (12)

where γi\gamma_{i} is the rank ratio for centrality ii and qiq_{i} is the lower bound of the (i+1)(i+1)th order statistic. In present work, we use 44 centrality measures, hence n=4n=4. In fact, the above integration can be calculated in a fast way

Q(γ1,γ2,,γn)=n!QnQ(\gamma_{1},\gamma_{2},\cdots,\gamma_{n})=n!Q_{n} (13)

with Qk=i=1k(1)i1Qkiγki+1i/i!Q_{k}=\sum_{i=1}^{k}(-1)^{i-1}Q_{k-i}\gamma_{k-i+1}^{i}/i!. The larger value of QQ, the greater influence the node has.

3 Application to S&P 500 stocks

To validate the above framework, we apply it to stocks belonging to S&P 500 Index. The data are daily records and the investigating period ranges from January 2006 to April 2010, yielding 10891089 observations of 422422 stocks. During the period, there was an economic crisis: the 2008 financial crisis.

Refer to caption
Figure 1: (Color online) Temporal evolution of the normalized DC, EC, BC and CC. The average correlation coefficient w\langle w\rangle is presented for comparison.

3.1 Dynamic networks

First of all, we divide the investigated period through moving windows. By setting the width of each window as M=125M=125 (about half a year) and the moving step as ΔM=5\Delta M=5 (about a week), we obtain 193193 windows. For each window, we fix the significance level at α=0.01\alpha=0.01, based on which a correlation network is constructed. As a result, we obtain 193193 consecutive networks for the whole period. Then, we calculate the centrality of each node, the average of which is used to stand for the characteristics of the network. For each network, we consider the DC, EC, CC and BC, respectively. Figure 1 shows the evolution of four centralities in comparison to the evolution of the average correlation coefficient w\langle w\rangle. For comparison, we normalize each plot by the corresponding maximum of the investigating period. However, it is not essential. The dark gray interval corresponds to the middle stage of the crisis and the two light gray intervals respectively correspond to the early transition from the normal state to the crisis and the late transition from the crisis to the normal state. Remarkably, the DC, EC and CC display the same trend with w\langle w\rangle, while the BC evolves in the opposite way. According to Eq. (8), one has DC=uVku/|V|=2|E|/|V|=(|V|1)e\langle\texttt{DC}\rangle=\sum_{u\in V}k_{u}/|V|=2|E|/|V|=(|V|-1)e, where |E||E| is the number of edges and e=2|E|/|V|/(|V|1)e=2|E|/|V|/(|V|-1) is the density of edges. Moreover, the edge density is proportional to w\langle w\rangle. Therefore, both the DC and EC exhibit the same trend as w\langle w\rangle. As to the CC and BC (see Eqs. (10) and (11)), the higher density of edges, the smaller distance that a node reach all the others, hence the larger value of the CC. On the contrary, the number of connected pairs of nodes increases, resulting in the decrease of the BC. Overall, the four measures can serve as good indicators of the market evolution from the macro perspective.

Table 1: Basic statistics of 55 conserved networks based on the Bonferroni correction.
|V||V| |E||E| k\langle k\rangle c\langle c\rangle l\langle l\rangle
Conserved Network 1 422 544 2.5782 0.1820 412.4460
Conserved Network 2 422 7556 35.8104 0.4562 176.1770
Conserved Network 3 422 9493 44.9905 0.5363 139.9614
Conserved Network 4 422 24658 116.8626 0.6864 52.0883
Conserved Network 5 422 7756 36.7583 0.4839 186.8250

3.2 Conserved networks

Considering the 2008 financial crisis, we divide the period from January 2006 to April 2010 into 55 stages: the normal stage before the crisis, the stage of the transition from the normal state to the crisis, the stage during the crisis, the stage of the transition from the crisis to the normal state and the stage after the crisis. For each stage, we generate a conserved network. Table 1 shows the basic characteristics of the 55 conserved networks, including the number of stocks NN, the number of edges EE, the average degree k\langle k\rangle, the average clustering coefficient c\langle c\rangle and the average shortest path length l\langle l\rangle. One notices apparent differences between these networks. For instance, the conserved network in the crisis is relatively dense because of the higher systemic risk. As a result, k\langle k\rangle and c\langle c\rangle are larger while l\langle l\rangle is smaller. But these topologies only provide macro information of the market development.

Table 2: Top 1010 influential stocks of the conserved network corresponding to the normal stage before the 2008 crisis.
Stock Sector QQ Cumulative return
KIM Real Estate 1 1.3755
RJF Financials 0.9862 1.2366
ESS Real Estate 0.9625 1.2327
REG Real Estate 0.9625 1.2188
BXP Real Estate 0.9625 1.3574
UDR Real Estate 0.9625 1.0163
FRT Real Estate 0.9534 1.4069
LNC Financials 0.9449 1.2018
DHR Health Care 0.9324 1.4872
PH Industrials 0.9241 1.6564

3.3 Influential stocks

The 2008 financial crisis is due to the U.S. financial problem of subprime mortgages. Mortgage is a loan taken from bank to buy a house, which is an agreement between homebuyers and banks. In general, people with low credit and low income can not get a loan from retail banks. But since year 2000’s, a third party got involved, namely investment banks. These investment banks started buying mortgage agreements from the retail banks. In this way, the retail banks sold the loans to investment banks to have zero liability and the mortgages bought by the investment banks were used to form Mortgage Backed Security. Although the Mortgage Backed Security is a special category of subprime mortgages with higher risk, it yields higher return at the same time. As more people were becoming eligible for the mortgage, the demand for homes started increasing. More people had money (borrowed) to buy a new home. The price of residential properties only went up, hence creating the bubble. Table 2 lists 1010 most influential stocks identified from the conserved network corresponding to the normal stage before the crisis (from June 30th 2006 to September 25th 2007). As is expected, 66 stocks are from the real estate sector.

Table 3: Top 1010 influential stocks of the conserved network corresponding to the early transition from the normal state to the crisis.
Stock Sector QQ Cumulative return
AMP Financials 1 0.8696
GL Financials 1 0.9815
XOM Energy 0.9945 0.9380
GE Industrials 0.9812 0.9034
AXP Financials 0.9719 0.7710
MET Financials 0.9718 0.8784
BEN Financials 0.9626 0.7921
DD Materials 0.9534 0.8589
WY Real Estate 0.9443 0.9217
TROW Financials 0.9442 0.9508

Investment banks made the loans were issued to people who had little capability to payback the loan. Inevitably, some of them eventually could not afford the monthly payments and their property went for foreclosure. At a point in 2007-2008, there were more houses on sale than there were buyers for it. This triggered a steady price fall. The housing bubble burst. When property prices started going down, people who had bought the property with the sole purpose of “buying low and selling high”stopped paying the mortgage. This led to more loan defaults and more foreclosures. As a result, the share price of the Mortgage Backed Security started to fall continuously and eventually started to affect on big investors that could not cover this urgency. So the crisis came into being. Table 3 lists 1010 most influential stocks identified from the conserved network corresponding to the early transition from the normal state to the crisis (from October 2nd 2007 to Marc 26th 2008), among which 66 stocks are financials, indicating the first shock of the crisis.

Table 4: Top 1010 influential stocks of the conserved network corresponding to the stage during the crisis.
Stock Sector QQ Cumulative return
BEN Financials 1 0.5093
TROW Financials 1 0.5309
ITW Industrials 0.9900 0.6188
MMM Industrials 0.9835 0.6185
AXP Financials 0.9812 0.2824
PCAR Industrials 0.9719 0.5449
ARE Real Estate 0.9718 0.3619
BXP Real Estate 0.9623 0.3377
DD Materials 0.9603 0.2195
AMP Financials 0.9534 0.3576

After the burst of the bubble in the housing market, many investment banks had more liabilities than assets and faced a big trouble of liquidity. For example, the New Century Financial Corporation filed for Chapter 11 bankruptcy protection in April 2008 because of repurchase agreements and the Lehman Brothers announced bankrupt in September 2008 due to asset deterioration. In addition to investment banks, many insurance companies and financial institutions have also been greatly impacted. A conspicuous example is the American International Group, which lost 250250 billion dollars in the second quarter of 2008 and was taken over by the U.S. government finally. Soon after the earthquake in the financial market, the real economy was also shocked. On the one hand, the depression of the housing market caused correlated companies to fold. One the other hand, the rising unemployment and the shrink of personal wealth decreased consuming intention for industrial products. Table 4 lists 1010 most influential stocks identified from the conserved network corresponding to the stage during the crisis (from April 2nd 2008 to March 30th 2009). We find that not only financial institutions but also correlated industrial companies were involved with enormous losses.

Table 5: Top 1010 influential stocks of the conserved network corresponding to the late transition from the crisis to the normal state.
Stock Sector QQ Cumulative return
ETN Industrials 1 1.6504
PCAR Industrials 1 1.5776
DOV Industrials 0.9812 1.5289
EMR Industrials 0.9719 1.4723
SWK Industrials 0.9626 1.4621
RTX Industrials 0.9618 1.4746
MMM Industrials 0.9534 1.5338
CMI Industrials 0.9532 1.8777
TROW Financials 0.9442 1.6997
WMB Energy 0.9442 1.6594

To contain the crisis, the Troubled Assets Relief Program was carried out in October 2008, which authorized the United States Treasury to spend up to 700700 billion dollars to purchase trouble assets both domestically and internationally. The act was widely credited with restoring stability and liquidity to the financial sector, unfreezing the markets for credit and capital and lowering borrowing costs for households and businesses. This, in turn, helped restore confidence in the financial system and restart economic growth. Another dose of fiscal stimulus is monetary easing. The lower interest rate spurred businesses to make new investments, spurred industrials to invest in renovations and spurred purchases of major durable goods like cars. Table 5 lists 1010 most influential stocks identified from the conserved network corresponding to the late transition from the crisis to the normal state (from April 6th 2009 to September 18th 2009), among which 88 stocks are industrials, implying the initial recovery.

Table 6: Top 1010 influential stocks of the conserved network corresponding to the normal state after the crisis.
Stock Sector QQ Cumulative return
L Financials 1 1.1066
EMR Industrials 0.9994 1.2794
ALB Materials 0.9906 1.2975
EMN Materials 0.9899 1.2640
DOV Industrials 0.9812 1.3617
VNO Real Estate 0.9811 1.2271
WMB Energy 0.9718 1.3315
UNM Financials 0.9534 1.1416
AFL Financials 0.9533 1.2802
HON Industrials 0.9438 1.2016

In fact, the speed of the recovery from the 2008 financial crisis has been unusually slow. Nevertheless, under the percolation of the stimulating policy of quantitative easing, the U.S. economy began to recover since the middle of 2009. With the reduction of the systematic risk and the rising opportunity for business, the stock market boomed again. Table 6 lists 1010 most influential stocks identified from the conserved network corresponding to the stage after the crisis (from September 25th 2009 to April 26th 2010), suggesting that the market is active across various sectors, including financials, industrials, materials, real estate and energy.

4 Discussion

We have synthesized the DC, EC, CC and BC by means of the Q-statistic. Although it does rank stocks in each stage, the values of QQ are relative close. Therefore, is is nature to ask do top 1010 influential stocks vary and how much is the variance if the procedure to construct the network is changed?

To answer this question, we first consider the change in the P-value. We perform simulations for α=0.02\alpha=0.02 and shown the corresponding results in Appendix. Comparing Tables 1 and 9, we find that all the basic characteristics of the generated networks are of the same order, indicating similar structure. We also compare top 1010 influential stocks in each stage as α\alpha increases from 0.010.01 to 0.020.02. Specially, the stock AVB belonging to Real Estate (Table 10) replaces the stock PH belonging to Industrials (Table 2) in Stage 1; the stock GPC belonging to Consumer Discretionary (Table 11) replaces the stock WY belonging to Real Estate (Table 3) in Stage 2; the stock EFX belonging to Industrials (Table 12) replaces the stock DD belonging to Materials (Table 4) in Stage 3; the stock OKE belonging to Energy (Table 13) replaces the stock TROW belonging to Financials (Table 5) in Stage 4; and two stocks MRO and FTI belonging to Energy (Table 14) replace stocks UNM and AFL belonging to Financials (Table 6) in Stage 5. Overall, we notice tiny change in top 1010 influential stocks in each stage, hence the robustness of our framework.

Table 7: Basic statistics of 55 conserved networks based on the FDR correction.
|V||V| |E||E| k\langle k\rangle c\langle c\rangle l\langle l\rangle
Conserved Network 1 422 5545 26.2796 0.4589 106.4866
Conserved Network 2 422 49960 236.7773 0.7955 9.3906
Conserved Network 3 422 47412 224.7014 0.8092 3.4824
Conserved Network 4 422 68310 323.7441 0.8863 1.2318
Conserved Network 5 422 48849 231.5118 0.7994 3.4502

Then, we adopt the FDR for the multiple hypothesis test to relax the assumption of independence of the Bonferroni correction. As shown in Table 7, the number of edges of the 55 conserved networks are 33 to 1010 times of those in Table 1. As a consequence, the networks are much dense. For example, conserved network 4, corresponding to the late transition from the crisis to the normal state, takes the value 1.23181.2318 of the average shortest path length. Intuitively, it is not the case of reality. In the stock market, the correlation matrix 𝑾\bm{W} always contain much noise, and therefore a restrictive procedure may perform better.

Table 8: Top 1010 influential stocks identified by the PageRank algorithm from the conserved network corresponding to the normal stage before the 2008 crisis.
Stock Sector PRPR Cumulative return
PH Industrials 0.0178 1.6564
RJF Financials 0.0123 1.2366
TFC Financials 0.0119 0.9615
HIG Financials 0.0103 1.0541
BEN Financials 0.0092 1.3221
WEC Utilities 0.0090 1.1311
GS Financials 0.0088 1.6365
TER Information Technology 0.0083 0.9465
SO Utilities 0.0082 1.0510
MTB Financials 0.0078 0.9397

Finally, we employ the Google’s PageRank algorithm [39] to identify influential stocks. As a paradigmatic example of centrality-based ranking algorithm, PageRank has been found application in a vast range of real systems, although it was devised originally to rank web pages. In Table 8, we show the results via the PageRank algorithm for the conserved network corresponding to the normal stage before the 2008 crisis. As aforesaid, stocks belonging to Real Estate should be more influential in this stage. However, none of the top 1010 influential stocks resulting from PageRank fall within that sector. We also observe contradictory results in other stages (not shown here).

5 Conclusion

The study of structure and dynamics of stock markets has attracted much attention from economists, mathematicians and physicists. An increasing interest is constructing reliable stock networks and analyzing their evolution [40, 41]. Most approaches, however, have focused on the macro characteristics of the networks with less predictive power.

In this paper, we have addressed this issue from the micro point of view by characterizing spreading influence of each stock and identifying most influential stocks in a dynamic way. For this purpose, we first divided the investigating period of a stock market into a large number of windows through the moving-window method. Then, we constructed a stock network for each window by the significance test of stock correlations. Furthermore, we generated several conserved networks to extract various backbones of the market under different stages. Finally, we used order statistics to rank nodal influence and identified most influential stocks for each conserved network. To illustrate its efficacy, we have applied this procedure to stocks belonging to S&P 500 Index from January 2006 to April 2010 and constructed 55 conserved networks, based on which we identified various influential stocks under different stages and recovered the 2008 financial crisis from the evolutionary perspective.

We note, however, the stock market is too complex to be predicted. The present framework could be generalized by incorporating more physical and structural properties of the market. The comprehensive investigation of the market dynamics from both global and local aspects will be subjected to future research.

Acknowledgments

We are grateful to referees for their valuable comments. This work was supported by the Natural Science Foundation of China under Grant Nos. 12071281 and 11771277.

Appendix. Results for α=0.02\alpha=0.02

Table 9: Basic statistics of 55 conserved networks for α=0.02\alpha=0.02.
|V||V| |E||E| k\langle k\rangle c\langle c\rangle l\langle l\rangle
Conserved Network 1 422 586 2.7773 0.1930 410.3142
Conserved Network 2 422 8611 40.8104 0.4771 154.3755
Conserved Network 3 422 10616 50.3128 0.5549 130.0294
Conserved Network 4 422 26478 125.4882 0.7035 46.4160
Conserved Network 5 422 8698 41.2228 0.4928 177.7238
Table 10: Top 1010 influential stocks of the conserved network corresponding to the normal stage before the 2008 crisis.
Stock Sector QQ Cumulative return
RJF Financials 0.9903 1.2366
ESS Real Estate 0.9811 1.2327
FRT Real Estate 0.9811 1.4069
KIM Real Estate 0.9811 1.3755
BEN Financials 0.9531 1.3221
REG Real Estate 0.9531 1.2188
BXP Real Estate 0.9531 1.3574
UDR Real Estate 0.9531 1.0163
DHR Health Care 0.9468 1.4872
AVB Real Estate 0.9442 1.2967
Table 11: Top 1010 influential stocks of the conserved network corresponding to the early transition from the normal state to the crisis.
Stock Sector QQ Cumulative return
AMP Financials 1 0.8696
GL Financials 1 0.9815
XOM Energy 0.9967 0.9380
GE Industrials 0.9812 0.9034
BEN Financials 0.9719 0.7921
MET Financials 0.9717 0.8784
DD Materials 0.9626 0.8589
AXP Financials 0.9625 0.7710
TROW Financials 0.9443 0.9508
GPC Consumer Discretionary 0.9263 0.7987
Table 12: Top 1010 influential stocks of the conserved network corresponding to the stage during the crisis.
Stock Sector QQ Cumulative return
BEN Financials 1 0.5093
TROW Financials 1 0.5309
PCAR Industrials 0.9905 0.5449
AXP Financials 0.9812 0.2824
ARE Real Estate 0.9719 0.3619
ITW Industrials 0.9625 0.6188
AMP Financials 0.9625 0.3576
MMM Industrials 0.9572 0.6185
EFX Industrials 0.9533 0.6871
BXP Real Estate 0.9440 0.3377
Table 13: Top 1010 influential stocks of the conserved network corresponding to the late transition from the crisis to the normal state.
Stock Sector QQ Cumulative return
ETN Industrials 1 1.6504
PCAR Industrials 0.9998 1.5776
EMR Industrials 0.9906 1.4723
DOV Industrials 0.9719 1.5289
MMM Industrials 0.9717 1.5338
RTX Industrials 0.9698 1.4746
SWK Industrials 0.9626 1.4621
CMI Industrials 0.9534 1.8777
WMB Energy 0.9440 1.6594
OKE Energy 0.9404 1.6499
Table 14: Top 1010 influential stocks of the conserved network corresponding to the normal state after the crisis.
Stock Sector QQ Cumulative return
L Financials 1 1.1066
ALB Materials 0.9906 1.2975
DOV Industrials 0.9810 1.3617
EMN Materials 0.9810 1.2640
WMB Energy 0.9809 1.3315
MRO Energy 0.9716 0.9854
HON Industrials 0.9715 1.2016
VNO Real Estate 0.9713 1.2271
EMR Industrials 0.9618 1.2794
FTI Energy 0.9534 1.2919

References

References

  • [1] Mantegna R N and Stanley H E 2000 An Introduction to Econophysics: Correlations and Complexity in Finance (Cambridge: Cambridge University Press)
  • [2] Campbell J, Lo A W and MacKinlay A C 1997 The Econometrics of Financial Markets (Princeton: Princeton University Press)
  • [3] Laloux L, Cizeau P, Bouchaud J P and Potters M 1999 Phys. Rev. Lett. 83 1467
  • [4] Plerou V, Gopikrishnan P, Rosenow B, Amaral L A N and Stanley H E 1999 Phys. Rev. Lett. 83 1471
  • [5] Potters M and Bouchaud J P 2021 A First Course in Random Matrix Theory (Cambridge: Cambridge University Press)
  • [6] Eom C, Oh G, Jung W S, Joeng H and Kim S 2009 Physica A 388 900
  • [7] Song D M, Tumminello M, Zhou W X and Mantegna R N 2011 Phys. Rev. E 84 026108
  • [8] Jiang X F, Chen T T and Zheng B 2014 Sci. Rep. 4 5321
  • [9] Wang D, Zhang X, Horvatic D, Podobnik B and Stanley H E 2017 Chaos 27 023104
  • [10] Djauhari M A and Gan S L 2016 J. Stat. Mech. 093401
  • [11] Newman M E J 2003 SIAM Rev. 45 167
  • [12] Bonanno G, Caldarelli G, Lillo F and Mantegna R N 2003 Phys. Rev. E 68 046130
  • [13] Onnela J P, Kaski K and Kertesz J 2004 Eur. Phys. J. B 38 353
  • [14] Tumminello M, Aste T, Di Matteo T and Mantegna R N 2005 Proc. Natl. Acad. Sci. USA 102 10421
  • [15] Boginski V, Butenko S and Pardalos P M 2005 Comput. Statist. Data Anal. 48 431
  • [16] Tumminello M, Di Matteo T, Aste T and Mantegna R N 2007 Eur. Phys. J. B 55 209
  • [17] Liu J, Tse C K and He K 2011 Quant. Finance 11 817
  • [18] Wang G J, Xie C and Chen S 2017 J. Econ. Interact. Coord. 12 561
  • [19] Xu X J, Wang K, Zhu L and Zhang L J 2018 Physica A 509 1080
  • [20] Wilinski M, Sienkiewicz A, Gubiec T, Kutner R and Struzik Z R 2013 Physica A 392 5963
  • [21] Wilinski M, Ikeda Y and Aoyama H 2018 J. Stat. Mech 023405
  • [22] Xu R, Wong W K, Chen G and Huang S 2017 Sci. Rep. 7 41379
  • [23] Musciotto F, Marotta L, Piilo J and Mantegna R N 2018 Palgrave Commun. 4 92
  • [24] Yang C, Shen Y and Xia B Y 2013 Mod. Phys. Lett. B 27 1350022
  • [25] Bardoscia M, Livan G and Marsili M 2012 J. Stat. Mech. P08017
  • [26] Bongiorno C and Challet D 2021 EPL 133 48001
  • [27] Abhijit B, Chandrasekhar A G, Esther D and Jackson M O 2013 Science 341 1236498
  • [28] Wang S, Schäfer R and Guhr T 2016 Eur. Phys. J. B 89 105
  • [29] Benzaquen M, Mastromatteo I, Eisler Z and Bouchaud J P 2017 J. Stat. Mech. P023406
  • [30] Roy R B and Sarkar U K 2011 Proc. of International Conference on Advances in Social Networks Analysis and Mining (Kaohsiung, Taiwan) pp 567
  • [31] Benjamini Y and Hochberg Y 1995 J. Roy. Stat. Soc. B 57 449
  • [32] Onnela J P, Chakraborti A, Kaski K, Kertész J and Kanto A 2003 Phys. Rev. E 68 056110
  • [33] Stiglitz J E 2016 Towards a General Theory of Deep Downturns (Cambridge: Palgrave Macmillan)
  • [34] Freeman L C 1978-1979 Soc. Netw. 1 215
  • [35] Bonacich P F 1987 Am. J. Soc. 92 1170
  • [36] Bavelas A 1948 Hum. Organ. 7 16
  • [37] Freeman L C 1977 Sociometry 40 35
  • [38] Dekking F M, Kraaikamp C, Lopuhaä H P and Meester L E 2005 A Modern Introduction to Probability and Statistics (London: Springer)
  • [39] Langville A and Meyer C 2006 Google’s PageRank and Beyond: The Science of Search Engine Rankings (Princeton: Princeton University Press)
  • [40] Mastromatteo I, Zarinelli E and Marsili M 2012 J. Stat. Mech. P03011
  • [41] Squartini T, Caldarelli G, Cimini G, Gabrielli A and Garlaschelli D 2018 Phys. Rep. 757 1