This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: BK21 Physics Research Division and Department of Physics, Sungkyunkwan University, Suwon 440-746, Korea, 22institutetext: Division of Business Administration, Chosun University, 501-759 Gwangju, Korea

Fractality of profit landscapes and validation of time series models for stock prices

Il Gu Yi 11    Gabjin Oh Email: phecogjoh@chosun.ac.kr22    and Beom Jun Kim Email: beomjun@skku.edu11
(Received: data / Revised version: date)
Abstract

We apply a simple trading strategy for various time series of real and artificial stock prices to understand the origin of fractality observed in the resulting profit landscapes. The strategy contains only two parameters pp and qq, and the sell (buy) decision is made when the log return is larger (smaller) than pp (q-q). We discretize the unit square (p,q)[0,1]×[0,1](p,q)\in[0,1]\times[0,1] into the N×NN\times N square grid and the profit Π(p,q)\Pi(p,q) is calculated at the center of each cell. We confirm the previous finding that local maxima in profit landscapes are scattered in a fractal-like fashion: The number MM of local maxima follows the power-law form MNaM\sim N^{a}, but the scaling exponent aa is found to differ for different time series. From comparisons of real and artificial stock prices, we find that the fat-tailed return distribution is closely related to the exponent a1.6a\approx 1.6 observed for real stock markets. We suggest that the fractality of profit landscape characterized by a1.6a\approx 1.6 can be a useful measure to validate time series model for stock prices.

pacs:
89.65.Gheconophysics, financial markets and 89.75.-kcomplex systems and 05.45.Dffractals

1 Introduction

More and more physicists have been drawn to the research fields of economics and finance in the last few decades Stanleybook ; Bouchardbook ; Sinhabook . Those econophysicists brought new insights into such interdisciplinary area and developed useful analytical and computational tools. Many stylized facts in stock markets have been repeatedly discovered by researchers: the heavy tail in the return distribution, rapidly decaying autocorrelation of returns, the clustering and long-term memory of volatility, to list a few.

Recently, a geometric interpretation of the profit in stock markets has been proposed, in which the profit landscape of a stock has been constructed based on a simple virtual trading strategy Gronlund . Although the used trading strategy is too simple to mimic behaviors of real market traders, the simplicity has its own benefit: It allows us to construct the profit landscape in low dimensions and thus making the geometric analysis straightforward. More specifically, we use the two-parameter strategies to compute the long-term profit of individual stock in the real stock market. Our study in this paper is an extension of Ref. Gronlund : We first show that the observed fractality in Ref. Gronlund is a generic property of real stock markets, by examining stock markets in two different countries (Unites States and South Korea). We then pursue the origin of this observed fractality by applying the same methodology to various time series, real and artificial. It is revealed that the fractal structure in the profit landscapes in real markets is closely related with the existence of fat tails in the return distribution.

The present paper is organized as follows: In Sec. 2, we present the datasets we use and also describe briefly how we generate four different artificial time series to compare with real stock price movements. Our two-parameter trading strategy is also explained. Section 3 is devoted for the presentation of our results, which is followed by Sec. 4 for conclusion.

2 Methods

2.1 Time Series

In the present study, we employ a simple long-short trading strategy Gronlund and apply it to the historical daily real stock price time series as well as to artificially generated time series. For real historic stock prices, we use 526 stocks in South Korean (SK) for 10 years (Jan. 4, 1999 to Oct. 19, 2010), and 95 stocks in United States (US) for 21 years (Jan. 2, 1983 to Dec. 30, 2004). For SK dataset, the total number of trading days is T=2918T=2918, while T=5301T=5301 for US. For artificially generated time series data, we use four different models that are widely used in fiance: the geometric Brownian motion (GBM), the fractional Brownian motion (FBM), the symmetric Lévy α\alpha-stable process (LP), and the Markov-Switching Multifractal model (MSM). For each stochastic process, we generate 100 different time series for the same time period T=2918T=2918 as for the SK dataset. In Table 1, we list the properties of different time series used in the present study.

GBM FBM LP MSM Real Markets
long-term memory ×\times \bigcirc ×\times \bigcirc \bigcirc
heavy-tail distribution ×\times ×\times \bigcirc \bigcirc \bigcirc
Table 1: Stock price data from real markets have the long-term memory and the heavy-tailed return distribution. Artificial financial time series produced by the geometric Brownian motion (GBM), the fractional Brownian motion (FBM), the symmetric Lévy α\alpha-stable process (LP), and the Markov-Switching Multifractal model (MSM) have different properties. While FBM shares the long-term memory property with real markets, LP and MSM exhibit heavy-tails in return distribution like real markets. See text for details.
Refer to caption
Figure 1: The price movements (left column) and their corresponding log-returns (right column) for (a) a real stock, (b) geometric Brownian motion (GBM), (c) the fractional Brownian motion (FBM) with the Hurst exponent H=0.7H=0.7, (d) the symmetric Lévy α\alpha-stable process (LP) with α=1.7\alpha=1.7, and (e) the Markov-Switching Multifractal model (MSM) with m0=1.4m_{0}=1.4. For comparisons of behaviors, see Table 1.

The GBM Black ; Hull is based on the stochastic differential equation

dSt=St(μdt+σdWt),dS_{t}=S_{t}(\mu dt+\sigma dW_{t}), (1)

where StS_{t} is the price of stock and WtW_{t} is the standard Brownian motion. For the drift μ\mu and the volatility σ\sigma in Eq. (1), we use the values obtained from the SK dataset.

To mimic the existence of a long-term memory in real stock prices the continuous time stochastic process FBM has been introduced Mandelbrot , in which the covariance between the two stochastic processes Bt(H)B_{t}^{(H)} and Bs(H)B_{s}^{(H)} at time tt and ss satisfies

Cov(Bt(H),Bs(H))=12[|t|2H+|s|2H|ts|2H].\textrm{Cov}(B_{t}^{(H)},B_{s}^{(H)})=\frac{1}{2}\left[|t|^{2H}+|s|^{2H}-|t-s|^{2H}\right]. (2)

Here, the Hurst exponent H(0,1]H\in(0,1] plays an important role: For H=1/2H=1/2, FBM is the same as the standard Brownian motion, while for H>1/2H>1/2 (H<1/2H<1/2), the increments of its stochastic process are positively (negatively) correlated. Accordingly, the long-term positive memory exists for H>1/2H>1/2. The long-term memory properties in various financial time series, such as stock indices, foreign exchange rates, futures, and commodity, have been found Ding ; Liu ; Cont ; Oh . In this work, we generate the process by using

dSt=St(μdt+σdBt(H)),dS_{t}=S_{t}(\mu dt+\sigma dB_{t}^{(H)}), (3)

where StS_{t} is the stochastic variable for the stock price and Bt(H)B_{t}^{(H)} is the fractional Brownian motion [compare with Eq. (1)]. We use the package dvfBm in R-project to get the sample paths of FBM and generate time series for different values of the Hurst exponent HH.

We also generate the stochastic time series using LP Sinhabook ; Mantegna ; Pantaleo ; Schoutensbook ; Fusaibook , to imitate the heavy-tail distribution of return in real market (see Table 1). We first produce the stochastic process XtX_{t} by LP and then generate the stock price StS_{t} Schoutensbook ; Applebaumbook via

St=S0exp(Xt).S_{t}=S_{0}\exp(X_{t}). (4)

LP contains the tunable parameter α(0,2]\alpha\in(0,2] called the stability index which characterizes the shape of distribution. The distribution Pα(x)P_{\alpha}(x) for the return xx converges to the Gaussian from as α2\alpha\rightarrow 2, while for α<2\alpha<2 it asymptotically exhibits the power-law behavior

Pα(x)|x|(1+α),P_{\alpha}(x)\sim|x|^{-(1+\alpha)},\ (5)

for |x||x|\rightarrow\infty. Consequently, we can adjust the thickness of the tail part of the distribution by changing α\alpha. We generate time series using LP for different values of α\alpha.

MSM has been introduced to mimic the behaviors of real markets such as the heavy-tailed return distribution, the clustering of volatility, and so on MFbook ; Calvet2001 ; Calvet2004 (see Table 1). It has become the one of the most well accepted models in the area of finance and econometrics. To generate artificial stock prices StS_{t} based on MSM, we use

St=St1exp(Rt),S_{t}=S_{t-1}\exp(R_{t}), (6)

where RtR_{t} is the log-return written as Rtσ(Mt)ϵtR_{t}\equiv\sigma(M_{t})\epsilon_{t}. The Gaussian random variable ϵt\epsilon_{t} has zero mean and unit variance, and the volatility σ(Mt\sigma(M_{t}) is constructed from the time series MtM_{t} in MSM MFbook ; Calvet2001 ; Calvet2004 . In this work, we use the binomial MSM and use the parameter m0(1,2)m_{0}\in(1,2) to generate different time series.

We generate artificial time series by using the above described four different models (GBM, FBM, LP, and MSM) and compare them with real market behavior in Fig. 1, which displays the stock price and the log-return time series in the left and the right columns, respectively. Qualitative difference between artificial time series [(b)-(e)] and the real price (a) is not easily seen. In contrast, the log-return shown in the right column of Fig. 1 exhibits a clear difference. In Fig 1(a) for the time series of the log-return for a real company in SK, one can recognize well-known stylized facts: Fluctuation of log-return around zero, called the volatility, has a broad range of sizes which implies the heavy-tailed return distribution. It is also seen that the volatility is clustered in time so that there is an active period of time when return fluctuates heavily and there is a calm period when return does not deviate much from zero. On the other hand, the two model time series GBM and FBM in Fig 1 (b) and (c) show somewhat different behavior: The size of fluctuation does not vary much, indicating the absence of the heavy tail in return distribution. In contrast, LP and MSM in Fig. 1 (d) and (e) exhibit the larger volatility fluctuation like the real market behavior in (a), indicating the heavy tail in return distribution. The difference between LP and real market can be seen in terms of the volatility clustering: In LP, the large volatility is not necessarily accompanied by other large volatility [Fig. 1(d)], whereas for MSM [Fig. 1(e)], and real market [Fig. 1(a)], volatility is often clustered in time.

Refer to caption
Figure 2: (Color online) Profit landscapes for Samsung Electronics [(a) and (b)] and Hyundai Motors [(c)] in SK. For (a) and (c) the strategy S(1)S^{(1)} is used, and for (b) S(2)S^{(2)} is used. Although the landscapes look different from each other, they show qualitative similarity: all are rough and have lots of local peaks and dips. We have used fb=0.5f_{b}=0.5, fs=0.5f_{s}=0.5 and d=10d=10. See text for details.
Refer to caption
Figure 3: (Color online) The number MM of local maxima, averaged over 526 stocks in SK, scales with the resolution parameter NN following the power-law form MNaM\sim N^{a}. In the large NN regime, both strategies S(1)S^{(1)} [(a) and (c)] and S(2)S^{(2)} [(b)] have a1.6a\approx 1.6 irrespective of the values of time lag dd [(a) and (b)] and the values of fbf_{b} and fsf_{s} [(c)]. For (a) and (b), fb=fs=0.5f_{b}=f_{s}=0.5 is used for S(1)S^{(1)} and S(2)S^{(2)}, respectively, and for (c) d=1d=1 is used. The exponent a1.6a\approx 1.6, also observed for US market in the previous study Gronlund , is found to be a robust feature of real stocks in SK and does not depend on details of trading strategy.

2.2 Trading Strategy

In our simple trading strategy (see Ref. Gronlund for more details), the trade decision (buy or sell) is made when the log return R(t,t)log[St/St]R(t,t^{\prime})\equiv\log[S_{t}/S_{t}^{\prime}] with t>tt>t^{\prime} passes upper and lower boundaries: Sell for R(t,t)>pR(t,t^{\prime})>p, buy for R(t,t)<qR(t,t^{\prime})<-q, and no trading if the log return remains within boundaries q<R(t,t)<p-q<R(t,t^{\prime})<p. As in Ref. Gronlund , we further parameterize the strategy by introducing the time lag dd defined by dttd\equiv t-t^{\prime}, the fraction fsf_{s} of holding shares of stock when sell decision is made, and the fraction fbf_{b} of cash in possession when buy.

We use the same notations as in Ref. Gronlund and call the above simple strategy as S(1)S^{(1)}. In the inverse strategy S(2)S^{(2)} Gronlund , buy (sell) decision is made for R(t,t)>pR(t,t^{\prime})>p [R(t,t)<q][R(t,t^{\prime})<-q]. It is interesting to see that S(1)S^{(1)} is similar to the contrarian strategy (or trend-opposing strategy) and that S(2)S^{(2)} to the momentum strategy (or trend-following strategy) Gronlund . However, we emphasize that the similarity is only superficial: The existence of trend has been strongly debated and in the area of econophysics the well-known absence of temporal correlation of the return is clearly against the existence of any trend in time scales longer than a couple of minutes Gopikrishnan .

The virtual trading strategy in this study is applied as follows:

  1. 1.

    Start with one billion Korean Won (m(1)=109m(1)=10^{9}) at t=1t=1 as initial amount of investment. 111The exchange rate between US Dollar and Korean Won (KRW) is about 1:10001:1000.

  2. 2.

    Pick a company ii.

  3. 3.

    For a given parameter pair (p,qp,q), keep applying the trading strategy, S(1)S^{(1)} or S(2)S^{(2)}, until the last trading day TT. For each trading, the amount of cash m(t)m(t) and the number of stocks n(t)n(t) change.

At the end of the trading period TT, we assess the total value of our virtual portfolio as the sum of cash and the stock holdings: m(T)+n(T)STm(T)+n(T)S_{T}. The performance of the strategy is evaluated by the ratio between the net profit and the initial investment, i.e.,

Π(p,q)m(T)+n(T)STm(1)m(1).\Pi(p,q)\equiv\frac{m(T)+n(T)S_{T}-m(1)}{m(1)}. (7)

We constrain that the lower bound of cash is zero, i.e., m(t)0m(t)\geq 0, and that the number of shares is non-negative, i.e, n(t)0n(t)\geq 0. For the sake of simplicity, we also assume that the market price of stock is the same as the daily closing price in the historic data, and the transaction commission fee 0.1%0.1\% is taken into account for every trading.

3 Results

3.1 Real time series

We calculate the profit Πi(p,q)\Pi^{i}(p,q) of each strategy for the ii-th company in order to study the fractality of profit landscape in stock markets. The unit square [0,1]×[0,1][0,1]\times[0,1] in pp-qq plane is discretized by the resolution parameter NN so that it contains N2N^{2} small squares of the size (1/N)×(1/N)(1/N)\times(1/N). The shape of the profit landscape not only differs from company to company but also depends on the details of strategy such as the values of fbf_{b}, fsf_{s} and time lags dd as seen in Fig. 2 for two companies in SK: Samsung Electronics and Hyundai Motors. We also find that the shape of the profit landscape and the locations of maxima and minima totally change for different time interval t[1,T/2]t\in[1,T/2], t[T/2+1,T]t\in[T/2+1,T], and t[1,T]t\in[1,T] Gronlund . Even if the shape of the profit landscape looks quite different for various parameter and different time interval, it is revealed that the nature of profit landscape does not change through Ref. Gronlund .

Once we get Πi(p,q)\Pi^{i}(p,q) for a given resolution NN, we count the number MM of local maxima, where the profit is larger than profits for its von Neumann neighborhood (the four nearest cells surrounding the central cell). Apparently, MM must be an increasing function of NN, but how it depends on NN can be an interesting question to pursue. As is already observed for US stocks Gronlund , we again confirm that MNaM\sim N^{a} with a1.6a\approx 1.6 as shown in Fig. 3, where MM is obtained from the average over 526 stocks in SK. Our finding of the same exponent a1.6a\approx 1.6 both for US and SK indicates that the fractality of the profit landscape can be of a universal feature of different stock markets. We also check the robustness of the exponent a1.6a\approx 1.6 for different values of the parameters. Figure 3 (a) and (b) show that the same scaling relation holds regardless of the specific strategy and different time lags dd. Figure 3 (c) confirms a1.6a\approx 1.6 again for various parameter combinations of (fbf_{b}, fsf_{s}). Although not shown here, a1.6a\approx 1.6 is again obtained when we use the Moore neighborhood (the eight cells surrounding the central cell) instead of the von Neumann neighbors, and when the size of parameter set is expanded to (p,q)[0,2]×[0,2](p,q)\in[0,2]\times[0,2]. These results seem to suggest that the fractal nature of the profit landscape is a genuine feature of the real stock markets.

Although the trading strategies employed in this work might be too simple to be used in reality, we believe that a complicated trading strategy can be decomposed into a time-varying sophisticated mixture of simple strategies like S(1)S^{(1)} and S(2)S^{(2)}. Our observation of the robustness of the fractality suggests that more realistic strategies beyond those used here must also have complicated profit landscape, which is expected to exhibit a fractal characteristic.

3.2 Artificial time series

We repeat the above procedure to construct profit landscapes for four different artificial time series models GBM, FBM, LP, and MSM (see Sec. 2). Figure 4 shows MM versus NN for (a) GBM and (b) FBM. Interestingly, we again find the scaling form MNaM\sim N^{a} but with a2.0a\approx 2.0 for both GBM and FBM, which is significantly different from a1.6a\approx 1.6 observed in real markets. It is to be noted that the value a2.0a\approx 2.0 indicates that the local maxima in the profit landscapes for GBM and FBM are scattered like two-dimensional objects. Also observed is that the scaling exponent aa does not change much as the Hurst exponent HH is varied in FBM as shown in Fig. 4(b). This is a particularly interesting result since FBM with H>1/2H>1/2 is known to have the long-term memory effect like real markets. The difference of aa between real markets and FBM implies that the nontrivial value a1.6a\approx 1.6 in real markets may not originate from the long-term memory effect.

Refer to caption
Figure 4: (Color online) The scaling relation (MM versus NN) yields a2.0a\approx 2.0 both for (a) GBM and (b) FBM. In (b), the Hurst exponent HH characterizes the existence of the long-term memory, which hardly changes the scaling exponent aa in the large NN regime. We have used S(1)S^{(1)} with fb=fs=0.5f_{b}=f_{s}=0.5.
Refer to caption
Figure 5: (Color online) The scaling relation MNaM\sim N^{a} for (a) LP has the exponent aa which varies from 1.0 to 2.0 as the stability index α\alpha in LP changes. As α\alpha is decreased from 2.0 (the Gaussian limit without heavy tail in return distribution), the scaling exponent aa also decreases, indicating the close relation between the fractality of the profit landscape and the tail thickness of the return distribution. (b) MSM yields a1.6a\approx 1.6, reflecting the heavy-tailed return distribution. For both (a) and (b) we have used S(1)S^{(1)} with fb=fs=0.5f_{b}=f_{s}=0.5.

We next construct profit landscapes for LP and MSM and measure the scaling exponents in Fig. 5(a) and (b), respectively. For LP, the stability index α\alpha controls the shape of the return distribution as described in Sec. 2. As α\alpha is decreased from the Gaussian limit α=2\alpha=2, the tail part of the return distribution becomes thicker. In Fig. 5(a), it is clearly shown that the scaling exponent aa which characterizes the fractality of the distribution of the local maxima in the profit landscapes changes in a systematic way with α\alpha. It starts from the Gaussian limit value a2a\approx 2 [compare with Fig. 4(a) for GBM] when α=2.0\alpha=2.0 and systematically decreases toward a1a\approx 1, as displayed in Fig. 6. Note also that S(1)S^{(1)} and S(2)S^{(2)} do not display any significant difference in Fig. 6. Consequently, it is tempting to conclude that the shape of the tail part of the return distribution (controlled by α\alpha in LP) determines the scaling exponent aa in the profit landscape. When α1.85\alpha\approx 1.85, we get a1.6a\approx 1.6 as for the real markets. In Table 1, MSM is also shown to have the heavy tail in return distribution. From the study of LP, one can expect that MSM should also lead to a1.6a\approx 1.6 as for LP. Figure 5(b) displays that a proper choice of the parameter in MSM gives us a1.6a\approx 1.6. Another interesting observation one can make from our MSM study is that a1.6a\approx 1.6 appears to be an maximum value one can have, and the reason for this behavior needs to be sought in a future study.

Refer to caption
Figure 6: (Color online) The scaling exponent aa versus the stability index α\alpha for LP. It is shown that aa systematically decreases from the Gaussian value (a2)(a\approx 2) toward a1a\approx 1 as α\alpha is decreased. The exponent aa hardly changes for different strategies S(1)S^{(1)} and S(2)S^{(2)}.

In order to isolate the effects of the fat-tailed return distribution from the long-term memory effects, we generate the time series of returns and fully shuffle them in time to produce a new stock price time series. In this shuffling process, all existing long-time correlations are destroyed, but the probability distribution of returns remains intact. We apply this method for real stocks and the MSM time series, and compute M(N)M(N). Although not shown here, we find that the scaling exponent a1.6a\approx 1.6 remains the same, which again indicates that the fractal nature of the profit landscapes hardly depends on the existence of long-term memory, as was already found for the LP.

4 Conclusion

In summary, we have investigated the origin of the fractality of the profit landscape in two different real markets in comparison with four different models for generating artificial time series. The number MM of local maxima has been shown to scale with the resolution parameter NN following the form MNaM\sim N^{a}. Although the real markets exhibit a1.6a\approx 1.6, time series models with the short-tailed return distributions (GBM and FBM) fail to produce this scaling exponent. In contrast, the two different model time series (LP and MSM) have been shown to give us the scaling exponent consistent with market value a1.6a\approx 1.6. In LP, a systematic tuning of the thickness of the tail part of the return distribution has led to the main conclusion of the present work: The origin of fractality in the profit landscape is the heavy-tail of the return distribution. We suggest that the observed fractality of the profit landscape with the exponent a1.6a\approx 1.6 can play an important role in validation procedure of artificial time series models for stock prices. In particular, one can tune the model parameters (α\alpha in LP and m0m_{0} in MSM, for example) to make the time series compatible with the observed fractality in real markets.

Acknowledgements.
B. J. K. (Grant No. 2011-0015731) and G. O. (Grant No. 2012-0359) acknowledge the support from the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST).

References

  • (1) R. N. Mantegna and H. E. Stanley, An Introduction to Econophysics (Cambridge University Press, Cambridge, 2000).
  • (2) J. P. Bouchard and M. Potters, Theory of Financial Risk and Derivative Pricing (Cambridge University Press, Cambridge, 2009).
  • (3) S. Sinha, A. Chatterjee, A. Chakraborti, B. K. Chakrabarti, Econophysics: An Introduction (Wiley-VCH, Weinheim, 2010).
  • (4) A. Grönlund, I. G. Yi, and B. J. Kim, PLoS ONE 7, e33960 (2012).
  • (5) F. Black, The Journal of Political Economy 81, 637 (1973).
  • (6) J. C. Hull, Options, Futures, and Other Derivatives (Pearson Education Ltd., London, 2009).
  • (7) B. b. Mandelbrot and J. W. Van Ness, SIAM review 10, 422 (1968).
  • (8) Z. Ding, C. W. J. Granger, and R. F. Engle, Journal of Empirical Finance 1, 83 (1993).
  • (9) Y. Liu, P. Gopikrishnan, P. Cizeau, M. Meyer, -C. -K. Peng, and H. E. Stanley, Phys. Rev. E 60, 1390 (1999).
  • (10) R. Cont, Quantitative Finance 1, 223 (2001).
  • (11) G. Oh and S. Kim, J. Korean Phys. Soc. 48, S197 (2006).
  • (12) R. N. Mantegna, Phys. Rev. E 49, 4677 (1994).
  • (13) E. Pantaleo, P. Facchi, and S. Pascazio, Physica Scripta T135, 014036 (2009).
  • (14) W. Schoutens, Lévy process in Finance (John Wiley & Sons Ltd., New York, 2003).
  • (15) G. Fusai and A. Roncoroni, Implementing Models in Quantitative Finance: Methods and Cases (Springer-Verlag, Berlin, 2008).
  • (16) D. Applebaum, Lévy Processes and Stochastic Calculus (Cambridge University Press, Cambridge, 2009).
  • (17) L. E. Calvet and A. J. Fisher, Multifractal Volatility (Academic Press, Burlington, 2008).
  • (18) L. E. Calvet and A. J. Fisher, Journal of Econometrics 105, 27 (2001).
  • (19) L. E. Calvet and A. J. Fisher, Journal of Financial Econometrics 2, 49 (2004).
  • (20) P. Gopikrishnan, V. Plerou, L. A. N. Amaral, M. Meyer, and H. E. Stanley, Phys. Rev. E 60, 5305 (1999).