Density power divergence for general integer-valued time series with exogenous covariates
Mamadou Lamine DIOP 111Supported by the MME-DII center of excellence (ANR-11-LABEX-0023-01) and William KENGNE 222Developed within the ANR BREAKRISK: ANR-17-CE26-0001-01 and the CY Initiative of Excellence (grant ”Investissements d’Avenir” ANR-16-IDEX-0008), Project ”EcoDep” PSI-AAP2020-0000000013
THEMA, CY Cergy Paris Université, 33 Boulevard du Port, 95011 Cergy-Pontoise Cedex, France.
E-mail: mamadou-lamine.diop@u-cergy.fr ; william.kengne@u-cergy.fr
Abstract:
In this article, we study a robust estimation method for a general class of integer-valued time series models.
The conditional distribution of the process belongs to a broad class of distribution and unlike classical autoregressive framework, the conditional mean of the process also depends on some exogenous covariates.
We derive a robust inference procedure based on the minimum density power divergence.
Under certain regularity conditions, we establish that the proposed estimator is consistent and asymptotically normal.
In the case where the conditional distribution belongs to the exponential family, we provide sufficient conditions for the existence of a stationary and ergodic -weakly dependent solution.
Simulation experiments are conducted to illustrate the empirical performances of the estimator. An application to the number of transactions per minute for the stock
Ericsson B is also provided.
Keywords: Robust estimation, minimum density power divergence, integer-valued time series models, exogenous covariates.
1 Introduction
The analysis of time series of counts has attracted much interest in the literature during the last two decades, given the large number of papers written in this direction. This is due among others to the applications in various fields: epidemiological surveillance (number of new infections), in finance (number of transactions), in industrial quality control (number of defects), and traffic accidents (number of road casualties), etc. To describe time series of count data, several questions have been addressed with various modeling approaches which are generally classified into two categories: observation-driven models and parameter-driven models (see Cox (1981)). One of the first important results on this topic were obtained independently by McKenzie (1985) and Al-Osh and Alzaid (1987); the INAR model has been introduced by using the binomial thinning operator. Due to its limitations, numerous extensions have been proposed; see, e.g., Al-Osh and Alzaid (1990). Later, new models with various marginal distributions and dependence structures have been studied by several authors; see among others, Fokianos et al. (2009), Doukhan et al. (2012, 2013), Doukhan and Kengne (2015), Davis and Liu (2016), Ahmad and Francq (2016), Douc et al. (2017), Fokianos et al. (2020) for some recent progress.
For most developments in the literature, the parametric inference is commonly based on the conditional maximum likelihood estimator (MLE) which provides a full asymptotic efficiency among regular estimators. However, it has been recognized that the MLE is very sensitive to small perturbations caused by outliers in the underlying model. Beran (1977) addressed this issue and was one of the first references in the literature to use the density-based minimum divergence methods. Numerous others works have been devoted to this topic; see, among others, Tamura and Boos (1986), Simpson (1987), Basu and Lindsay (1994), and Basu et al. (1998). In the context of modelling time series of counts, this question has already been investigated. For instance, Fokianos and Fried (2010, 2012) have studied the problem of intervention effects (that generating various types of outliers) in linear and log-linear Poisson autoregressive models. Fried et al. (2015) have proposed a Bayesian approach for handling additive outliers in INGARCH processes; they have used Metropolis-Hastings algorithm to estimate the parameters of the model. Recently, Kim and Lee (2017, 2019) adopted the approach of Basu et al. (1998) to construct a robust estimator based on the minimum density power divergence for zero-inflated Poisson autoregressive models, and a general integer-valued time series whose conditional distribution belongs to the one-parameter exponential family. See also Kang and Lee (2014) for application of such procedure to Poisson autoregressive models.
On the other hand, most of the models proposed for analysing time series of counts do not provide a framework within which we are able to analyze the possible dependence of the observations on a relevant exogenous covariates. In this vein, Davis and Wu (2009) have studied generalized linear models for time series of counts, where conditional on covariates, the observed process is modelled by a negative binomial distribution. Agosto et al. (2016) have developed a class of linear Poisson autoregressive models with exogenous covariates (PARX), where the parametric inference is based on the maximum likelihood method. Later, Pedersen and Rahbek (2018) proposed a theory for testing the significance of covariates in a class of PARX models. See also the recent work of Fokianos and Truquet (2019) which considered a class of categorical time series models with covariates and addressed stationarity and ergodicity question. The R package ”tscount” Liboschik et al. (2017) provides tools for analysis and modeling count time series models where the conditional mean of the process can take into account covariate effects.
In this new contribution, we consider a quite general class of observation-driven models for time series of counts with exogenous covariates. The mean of the discrete conditional distribution of depends on the whole past observations, some relevant covariates and a finite dimensional parameter . We study a robust estimator of by using the minimum density power divergence estimator (MDPDE) proposed by Basu et al. (1998). Compared to that of Kim and Lee (2019), the framework considered here is more general: (i) the class of models has the ability to handle the dependence of the observations on exogenous covariates, (ii) the dependence through the past is of infinite order (which enables a large dependence structure and to consider INGARCH()-type models) and (iii) even if in many applications the conditional distribution belongs to the exponential family, the class of the conditional distribution considered here is beyond the exponential family.
Recently, Aknouche A. and Francq (2020) consider a class of count time series with the conditional distributions that are not restricted to belong to the one-parameter exponential family and where the conditional mean depend on exogenous covariates. They provided conditions for stationarity and ergodicity. For both the linear and nonlinear conditional means considered by these authors, the feedback effects of the covariate is linear and it is an additive term in the conditional mean. We do not restrict to such setting and consider a larger dependence structure between the conditional means and the covariate. In the case where the conditional distribution belongs to the exponential family, we provide sufficient conditions that ensure the existence of a stationary and ergodic -weakly dependent solution. In this sense of a large dependence to the covariate, our result is more general.
The paper is structured as follows. Section 2 contains the model specification and the construction of the robust estimator as well as the main results. Section 3 is devoted to the application of the general results to some examples of dynamic models. Some simulation results are displayed in Section 4 whereas Section 5 focus on applications on a real data example. The proofs of the main results are provided in Section 6.
2 Model specification and estimation
2.1 Model formulation
Suppose that is a time series of counts and that represents a vector of covariates with . Denote by the -field generated by the whole past at time . Consider the general dynamic model where follows a discrete distribution whose mean satisfying:
(2.1) |
where is a measurable non-negative function defined on (with ) and assumed to be known up to the parameter which belongs in a compact subset (). Let us note that when (a constant), then the model (2.1) reduces to the classical integer-valued autoregressive model that has already been considered in the literature (see, e.g., Ahmad and Francq (2016)). In the sequel, we assume that the random variables have the same distribution and denote by the distribution of ; let be the probability density function of this distribution, where is the natural parameter of given by . Assume that the density is known up to the parameter ; and this density has a support set which is independent of .
Throughout the sequel, the following norms will be used:
-
•
, for any , ;
-
•
for any function , where denotes the set of matrices of dimension with coefficients in , for ;
-
•
, if is a random vector with finite order moments, for .
We set the following classical Lipschitz-type condition on the function .
Assumption A (): For any , the function is times continuously differentiable on with ; and there exists two sequences of non-negative real numbers and satisfying (or for ) and for ; such that for any ,
where denotes any vector or matrix norm.
2.2 Minimum power divergence estimator
In this subsection, we briefly describe the use of the density power divergence to obtain an estimation of the parameters of the model (2.1). The asymptotic behavior of the estimated parameter is also studied. Assume that the observations are generated from (2.1) according to the true parameter which is unknown; i.e., is the true conditional density of . Let be the parametric family of density functions indexed by . To estimate , Basu et al. (1998) have proposed a method which consists to choose the ”best approximating distribution” of in the family by minimizing a divergence between density functions and . The density power divergence between two density functions and is defined by (in the discrete set-up)
So, the empirical objective function (based on the divergence between conditional density functions) up to some terms which are independent of is where
with and . Since are not observed, is approximated by , where
with and . Therefore, the MDPDE of is defined by (cf. Basu et al. (1998) and Kim and Lee (2019))
Let us recall that when , the MDPDE corresponds to the MLE.
We need the following regularity assumptions to study the consistency and the asymptotic normality of the MDPDE.
-
(A0):
for all , ; moreover, such that for all ;
-
(A1):
is an interior point in the compact parameter space ;
-
(A2):
There exists a constant such that , for all ;
-
(A3):
for all , the function is twice continuously differentiable on and for some , ;
-
(A4):
the mapping is well definite on and for all , there exists a constant such that ;
-
(A5):
for all , the mapping is well definite on and there exists a constant such that ;
-
(A6):
the mapping (defined in ) satisfying the Lipschitz condition: there exists a constant such that, for all , ; moreover, .
-
(A7):
there exists two non-negative constants and such that the mappings:
(2.3) and (2.4) satisfy and , for all ;
-
(A8):
for all , a.s , where T denotes the transpose.
Assumption (A0) is an identifiability condition. The conditions (A3)-(A7) allow us to unify the theory and the treatment for a class of distributions that belongs or not to the exponential family. In the class of the exponential family distribution, these conditions can be written in a simpler form (see the Subsection 3.1). The other conditions are standard in this framework and can also be found in many studies; see for instance, Kim and Lee (2019). As detailed in Section 3, all these conditions are satisfied for many classical models.
The following theorem gives the consistency of the estimator .
Theorem 2.1
The following theorem gives the asymptotic normality of .
The trade-off between the robustness and the efficiency is controlled by the tuning parameter . As pointed out in numerous works (see for instance, Basu et al. (1998)), it is found that the estimators with large have strong robustness properties while small value of is suitable when the efficiency is preferred. So, the procedure is typically less efficient when increases. In the empirical studies, we will consider the value of between zero and one, and models with condition distribution belonging to the one-parameter exponential family.
3 Examples
In this section, we give some particular cases of class of integer-valued time series defined in (2.1). We show that the regularity conditions required for the asymptotic results of the previous section are satisfied for these models. Throughout the sequel, we consider that () represents a vector of covariates, belongs to a compact set () and denotes a positive constant whom value may differ from one inequality to another. For any , we will use the notation (with ) for a given element between and .
3.1 A general model with the exponential family distribution
As a first example, we consider a process satisfying:
(3.1) |
where is a discrete distribution that belongs to the one-parameter exponential family; that is,
where is the natural parameter (i.e., is the natural parameter of the distribution of ), , are known functions and is a non-negative function defined on , assumed to be know up to . Let us set the derivative of (which is assumed to exist a well as the second order derivative); it is known that . Therefore, the model (3.1) can be seen as a particular case of (2.1), where . Similar models without covariates have been studied by Davis and Liu (2016) (with the MLE) and Kim and Lee (2019) (with the MDPDE) where the conditional mean of depends only on . Cui and Zheng (2017) carried out the model (3.1) without covariates and proved the existence of a stationary, ergodic and -weakly dependent solution; they also considered inference based on the conditional maximum likelihood estimator. The following proposition establishes the existence of a unique solution for the class of model (3.1) with covariates. Let us impose a autoregressive-type structure on the covariates:
(3.2) |
where () is a sequence of independent and identically distributed (i.i.d) random variables and a function with values in , satisfying
(3.3) |
for some non-negative sequence such that .
Proposition 3.1
For the model (3.1), let us provide some sufficient conditions for the assumptions (A2)-(A7).
-
•
According to Assumption A, for all , we have
(3.4) Thus,
Hence, (A2) is satisfied.
-
•
Clearly, (A3) is satisfied.
-
•
According to the above notations, for any , , we have
Since the function is strictly increasing (because Var), we deduce that
(3.5) -
•
To verify (A6), remark that for the model (3.1), . Moreover, since is strictly increasing in , is also strictly increasing in . Then, from (A0), , for some , which implies that is bounded. Thus, the function satisfies the Lipschitz condition, which shows that (A6) holds. Clearly, the second part of (A6) is satisfied.
-
•
Now, let us show that (A7) is satisfied. From (A0), for all , we have
Therefore,
(3.8) By applying the Hölder’s inequality to the second term of the right hand side of (3.8 ), we get
To complete the verification of (A7), we need to impose the following regularity condition on the function (see also Kim and Lee (2019)):
(B0): , for some .
3.2 A particular case of linear models: INGARCH-X
As second example, consider the Poisson-INGARCH-X model defined by
(3.9) |
where , and is a non-negative function defined on . This class of models has already been studied within the maximum-likelihood framework; see for instance Agosto et al. (2016) and Pedersen and Rahbek (2018). Similarly, one can define the NB-INGARCH-X and BIN-INGARCH-X models with the negative binomial distribution and the Bernoulli distribution, respectively. Without loss of generality, we assume that the components of the exogenous covariate vector are non-negative and that is a linear function in . More precisely, there exists such that , for any . The true parameter of the model is ; therefore, is a compact subset of . If , then we can find two sequences and such that
which implies that , for any . As pointed out in Aknouche and Francq (2020), by using the arguments of Theorem 1 in Francq and Thieu (2019), a sufficient condition on the model (3.9) to be identifiable is that is not a measurable function of . Let us impose here a Markov-structure on the set of the covariates. Assume that for some function with values in and where () is a sequence of i.i.d random variables. To ensure the stability of the exogenous covariates, we make the following assumption (see also Agosto et al. (2016)).
(B1): and , for some and for all .
To check the conditions of Theorems 2.1 and 2.2, define the compact set as follows:
(3.10) |
for some .
-
•
Under the assumption (B1), if (see the definition in 3.10), A () holds; and Proposition 3.1 establishes the existence of a -weakly dependent stationary and ergodic solution of the model (3.9) as well as the NB-INGARCH-X and BIN-INGARCH-X models, satisfying . In the case of the Poisson-INGARCH-X model, we refer to the second part of the proof of Theorem 1 in Agosto et al. (2016) for the existence of the order moment with . Thus, according to (B1) (with ), the condition (2.2) (with ) holds, and consequently (A2) holds for the Poisson-INGARCH-X model (see subsection 3.1, and for sufficient conditions in a large class of models, including NB-INGARCH-X and BIN-INGARCH-X). In addition, for all and , which shows that the second part of (A0) holds. Since the one-parameter exponential family includes the Poisson distribution, by using (2.2), (A0) and A, we can go along similar lines as in the general model (3.1) to show that the assumptions (A3), (A4), (A5) and (A6) hold.
- •
4 Simulation and results
This section presents some simulation study to assess the efficiency and the robustness of the MDPDE. We compare the performances of the MDPDE with those of the MLE () for some dynamic models satisfying (2.1). To this end, the stability of the estimators under contaminated data will be studied. For each model considered, the results are based on replications of Monte Carlo simulations of sample sizes . The sample mean and the mean square error (MSE) of the estimators will be applied as evaluation criteria.
4.1 Poisson-INGARCH-X process
Consider the Poisson-INGARCH-X defined by
(4.1) |
where is the true parameter and follows an ARCH process given by
For this model, we set . This scenario is related and close to the real data example (see below).
We first consider the case where the data are not contaminated by outliers. We generate a trajectory of model (4.1) with .
For different values of , the parameter estimates and their corresponding MSEs (shown in parentheses) are summarized in
Table 1. In the table, the minimal MSE for each component of is indicated by the symbol ∙ (some values of the MSE have been rounded up, e.g. in Table1, , , the last column, in parentheses, the value is 0.088 instead of 0.09).
These results show that the MLE has the minimal MSEs for all the parameters, except for when and when .
For the parameters (when ) and (when ), the MDPDE with has the minimal MSE, but the values are close to those of the MLE.
One can also observe that the MSEs of the MDPDEs increase with .
These findings confirm the fact that the MLE generally outperforms the MDPDE when no outliers exist.
However, as increases, the performances of the MDPDE increase for each .
Now, we evaluate the robustness of the estimators by considering the case where the data are contaminated by additive outliers. Assume that we observe the contaminated process such that , where is generated from (4.1), is an i.i.d Bernoulli random variable with a success probability and is an i.i.d Poisson random variable with a mean . In the sequel, the variables , and are assumed to be independent. For and , the corresponding results are summarized in Table 2.
From this table, one can see that the MDPDE has smaller MSEs than the MLE, except for the estimations obtained
with and (see for instance );
which indicates that the MDPDE is more robust to outliers and overall outperforms the MLE in such cases.
We also observe that the selected optimal value of decreases as increases for all parameters.
4.2 NB-INGARCH-X process
Consider the negative-binomial-INGARCH-X (NB-INGARCH-X) model defined by
(4.2) |
where is the true parameter, is the covariate vector, (for ) is an autoregressive process satisfying
denotes the indicator function and denotes the negative binomial distribution with parameters and . For this model, we consider the cases of , and . We first generate a data of model (4.2) (without outliers). To evaluate the robustness of the estimators, we consider the contaminated data (presence of outliers) as follows: , where is an i.i.d Bernoulli random variable with a success probability and is an i.i.d . The results are presented in Tables 3 and 4. Once again, in the absence of outliers (see Table 3), the MLE outperforms the MDPDE and the efficiency of the MDPDE decreases as increases. When the data are contaminated by outliers (see Table 4), one can see that the MDPDE has smaller MSEs than the MLE; that is, the MDPDE is more robust than the MLE. As in the model (4.1), when increases, the symbol ∙ overall tends to move upwards.
4.3 A -knot dynamic model
We consider the -knot nonlinear dynamic model defined by (see also Davis and Liu (2016))
(4.3) |
where , , is a non-negative integer (so-called knot) and is the positive part of . The model (4.3) is a particular case of the models (2.1) and (3.1) with . The true parameter is . In this model, we consider the cases where and . We generate a data from (4.3) and a contaminated data such that , where is an i.i.d Bernoulli random variable with a success probability and is an i.i.d Poisson random variable with a mean . For each , the knot is estimated by minimizing the function over the set of integer values where is an upper bound of the true knot given by . The estimation can be summarized as follows:
-
•
For each fixed, compute the MDPDE of denoted .
-
•
Estimate the knot by the relation:
Some empirical statistics of the estimator are reported in Table 5. These results show that the estimation of the knot is reasonably good in terms of the mean and the quantiles. In addition, the empirical probability of selecting the true knot increases with . From Table 6, we can see that when the data are without outliers, the MLE displays the minimal MSE (except for the estimation of ); whereas in the presence of outliers (see Table 7), the MDPDE outperforms the MLE. Further, for the parameter , the MDPDE with has the minimal MSE; these results reveal that the estimation of is more damaged than that of the other parameters. This can be explained by the fact that the term (in the relation (4.3)) is very sensitive to outliers.
Sample size | Mean | SD | Min | Med | Max | |||
5 Real data application
The aim of this section is to apply the model (2.1) to analyze the number of transactions per minute for the stock Ericsson B during July 21, 2002. There are observations which represent trading from 09:35 to 17:14.
This time series is a part of a large dataset which has already been the subject of many works in the literature.
See, for instance, Fokianos et al. (2009), Fokianos and Neumann (2013), Davis and Liu (2016) (the series of July 2, 2002),
Doukhan and Kengne (2015) (the series of July 16, 2002), Diop and Kengne (2017) (the series of July 5, 2002)
and Brännäs and Quoreshi (2010).
The data (the transaction during July 21, 2002) and its autocorrelation function displayed on Figure 1 (see (a) and (b)) show three stylized facts: (i) a positive temporal dependence; (ii) the data are overdispersed (the empirical mean is while the empirical variance is ); (iii) presence of outliers is suspected.
Our purpose is to fit these data by taking into account a possible relationship between the number of transactions and the volume-volatility.
This question has been investigated in several financial studies during the past two decades; see, for example, Takaishi and Chen (2016), Belhaj et al. (2015) and Louhichi (2011).
In these works, the volume-volatility is found to exhibit a statistically significant impact on the trading volume (number of transactions or trade size).
Now, we consider the Poisson-INGARCH-X model given by
(5.1) |
where , and represents the volume-volatility at time . Observe that if , then the model (5.1) reduces to the classical Poisson-INGARCH model. We first examine the adequacy of the fitted (based on the MLE) model with exogenous covariate by comparing it with the Poisson-INGARCH model (without exogenous covariate). As evaluation criteria, we consider the estimated counterparts of the Pearson residuals defined by . Under the true model, the process is close to a white noise sequence with constant variance (see for instance, Kedem and Fokianos (2002)). The comparison of these two models is based on the MSE of the Pearson residuals which is defined by , where denotes the number of the estimated parameters. The approximated MSE of the Pearson residuals is for the model (5.1) and for the Poisson-INGARCH; which indicates a preference for the model with covariate. The test of the significance of the exogenous covariate of Pedersen and Rahbek (2018), applied to the series also confirms these results. Under the model (5.1) (i.e, with ), the cumulative periodogram plot of the Pearson residuals is displayed in Figure 1(d). From this figure, the associated residuals appear to be uncorrelated over time. This lends a substantial support to the choice of the model with exogenous covariate for fitting these data.
Since outliers are suspected in the data (see Figure 1(a)), we apply the MDEPE to estimate the parameters of the model. To choose the optimal tuning parameter , we adopt the idea of Warwick and Jones (2005). It is based on the minimization of an Asymptotic approximation of the summed Mean Squared Error (AMSE) defined by
where is the MDPDE obtained with and is a consistent estimator of the covariance matrix given in Theorem 2.2. To compute , the initial value is set to be the empirical mean of the data and is set to be the null vector. For different values of , the corresponding are displayed in Table 8. Based on the findings of this table, the optimal tuning parameter chosen is , which provides the minimum value of the (indicated by the symbol ∙). Thus, the MDPDE is more accurate than the MLE for this data. With , the MDPDE applied to the model (5.1) yields:
(5.2) |
where in parentheses are the standard errors of the estimators obtained from the robust sandwich matrix. Figure 1(c) displays the number of transactions (), the fitted values () and the -prediction interval based on the underlying Poisson distribution. This figure shows that the fitted values capture reasonably well the dynamics of the observed process.

6 Proofs of the main results
Without loss of generality, we only provide the proofs of Theorems 2.1 and 2.2 for .
The proofs in the case of the maximum likelihood estimators (i.e., ) can be done conventionally by using the classical methods.
Throughout the sequel, denotes a positive constant whom value may differ from an inequality to another.
6.1 Proof of Theorem 2.1
We consider the following lemma.
It suffices to show that and .
-
For any , we apply the mean value theorem at the function . For any , there exists between and such that
We deduce that
By using Kounias and Weng (1969), it suffices to show that
(6.2) -
By applying the mean value theorem at the function and from (A6), for all , there exists between and such that
According to [35], a sufficient condition for that is
(6.3) According to (3.6) and by using the same arguments as above, we get
We deduce that
Hence, (6.3) holds and thus, . This achieves the proof of Lemma 6.1.
To complete the proof of Theorem 2.1, we will show that: (1.) and (2.) the function has a unique minimum at .
-
(1.)
For any , we have
Hence, .
-
(2.)
Let , with . We have
where the equality holds a.s. if and only if according to (A0), (A3), (A6). Thus, the function has a unique minimum at .
Recall that since is stationary and ergodic, the process is also a stationary and ergodic sequence. Then, according to (1.), by the uniform strong law of large number applied on the process , it holds that
(6.6) |
Hence, from Lemma 6.1 and (6.6), we have
(6.9) |
(2.), (6.9) and standard arguments lead to the conclusion.
6.2 Proof of Theorem 2.2
The following lemma is needed.
Proof of Lemma 6.2
Remark that for all ,
where the function is defined in (2.3). can be computed in the same way and by using the relation , we get,
(6.11) |
The mean value theorem applied to the function gives,
where is between and ; and the function is defined in (2.4). Thus,
(6.12) |
From Assumption A, for all , we have
(6.13) |
and
(6.14) |
From (6.13), the Hölder’s inequality and the assumption (A7), we get
where the last convergence holds since . Therefore, the first term of the right hand side of (6.12) converges to 0. Moreover, according to (3.6), (6.2), the Hölder’s inequality and the assumption (A7), we have
Hence, the second term of the right hand side of (6.12) converges to 0. This complete the proof of the lemma.
The following lemma is also needed.
Proof of Lemma 6.3
Recall that . Then, for , we have
Let us show that , for all . From the Hölder’s inequality and the assumption (A7), we have
(6.16) |
Moreover, according to the assumption A, for all , we get
Hence, from the stationarity and ergodicity properties of the sequence and the uniform strong law of large numbers, it holds that
Thus, Lemma 6.3 is verified.
Now, we use the results of Lemma 6.2 and 6.3 to prove Theorem 2.1. For , by applying the Taylor expansion to the function , there exists between and such that
It comes that
(6.17) |
where
We can rewrite (6.17) as
According to Lemma 6.2, it holds that
Moreover, for large enough, , since
is a local minimum of the function .
So, for large enough, we have
(6.18) |
To complete the proof of Theorem 2.2, we will show that
-
(i)
is a stationary ergodic martingale difference sequence and .
-
(ii)
.
-
(iii)
The matrix is invertible.
-
(i)
Recall that and , where
Since the functions and are -measurable, we have
Thus, . Moreover, since , and have -order moment, by using (A7) and the Hölder’s inequality, we get
-
(ii)
For any , we have
This holds for any . Thus,
-
(iii)
Let be a non-zero vector of . We have
according to
and the assumption (A8). This implies that the matrix is symmetric and positive definite. Thus, is invertible.
Now, from (i), we apply the central limit theorem for stationary ergodic martingale difference sequence. It follows that
where
According to (6.18), for large enough, (ii) and (iii) imply that
This completes the proof of Theorem 2.2.
6.3 Proof of Proposition 3.1
Let be the cumulative distribution function of , with , and its inverse for all . Let be a sequence of independent uniform random variables and such that is independent and identically distributed over time. Let us prove the existence of a -weakly dependent stationary solution of (3.1) and (3.2) satisfying
Note that, by the Proposition A.2 of [14], we get for and ,
(6.19) |
For a solution that fulfills (3.1), (3.2) and (6.19), set . We get
(6.20) |
For a vector , define the norm for some . According to Doukhan and Wintenberger (2008), it suffices to show that: (i) for some and (ii) there exists a non-negative sequence satisfying such that for all .
References
- [1] Agosto, A., Cavaliere, G., Kristensen, D. and Rahbek, A. Modeling corporate defaults: Poisson autoregressions with exogenous covariates (PARX). Journal of Empirical Finance 38(B), (2016), 640-663.
- [2] Ahmad, A. and Francq, C. Poisson QMLE of count time series models. Journal of Time Series Analysis 37, (2016), 291-314.
- [3] Al-Osh MA, Alzaid AA. First order integer-valued autoregressive (INAR(1)) process. Journal of Time Series Analysis 8, (1987), 261-275.
- [4] Al-Osh MA, Alzaid AA. Integer-valued -order autoregressive structure. J. Appl. Probab. 27(2), (1990), 314-324.
- [5] Aknouche A. and Francq C. Count and duration time series with equal conditional stochastic and mean orders. Econometric Theory, (2020), 1-33.
- [6] Basu, S., Lindsay, B.G. Minimum disparity estimation for continuous models: efficiency, distributions and robustness. Ann. Inst. Statist. Math. 48, (1994), 683-705.
- [7] Basu, A., Harris, I.R., Hjort, N.L. and Jones, M.C. Robust and efficient estimation by minimizing a density power divergence. Biometrika 85, (1998), 549-559.
- [8] Belhaj, F., Abaoub, E. and Mahjoubi, M. Number of Transactions, Trade Size and the Volume-Volatility Relationship: An Interday and Intraday Analysis on the Tunisian Stock Market. International Business Research 8(6), (2015).
- [9] Beran, R. Minimum Hellinger distance estimates for parametric models. Ann. Statist. 5, (1977), 445-463.
- [10] Brännäs, K. and Shahiduzzaman Quoreshi, A.M.M. Integer-valued moving average modelling of the number of transactions in stocks. Applied Financial Economics 20, (2010), 1429-1440.
- [11] Cox, D.R. Statistical analysis of time series: Some recent developments. Scandinavian Journal of Statistics 8, (1981), 93-115.
- [12] Cui, Y. and Zheng, Q. Conditional maximum likelihood estimation for a class of observation-driven time series models for count data. Statistics & Probability Letters 123, (2017), 193-201.
- [13] Davis, R.A. and Wu, R. A negative binomial model for time series of counts. Biometrika 96, (2009), 735-749.
- [14] Davis, R.A. and Liu, H. Theory and Inference for a Class of Observation-Driven Models with Application to Time Series of Counts. Statistica Sinica 26, (2016), 1673-1707.
- [15] Diop, M.L. and Kengne, W. Testing parameter change in general integer-valued time series. J. Time Ser. Anal. 38, (2017), 880-894.
- [16] Douc, R., Fokianos, K., and Moulines, E. Asymptotic properties of quasi-maximum likelihood estimators in observation-driven time series models. Electronic Journal of Statistics 11, (2017), 2707-2740.
- [17] Doukhan, P., Fokianos, K., and Tjøstheim, D. On Weak Dependence Conditions for Poisson autoregressions. Statist. and Probab. Letters 82, (2012), 942-948.
- [18] Doukhan, P., Fokianos, K., and Tjøstheim, D. Correction to ”On weak dependence conditions for Poisson autoregressions” [Statist. Probab. Lett. 82 (2012) 942-948]. Statist. and Probab. Letters 83, (2013), 1926-1927.
- [19] Doukhan, P. and Kengne, W. Inference and testing for structural change in general poisson autoregressive models. Electronic Journal of Statistics 9, (2015), 1267-1314.
- [20] Doukhan, P. and Wintenberger, O. Weakly dependent chains with infinite memory. Stochastic Process. Appl. 118, (2008) 1997-2013.
- [21] Ferland, R., Latour, A. and Oraichi, D. Integer-valued GARCH process. J. Time Ser. Anal. 27, (2006), 923-942.
- [22] Fokianos, K., Rahbek, A. and Tjøstheim, D. Poisson autoregression. Journal of the American Statistical Association 104, (2009), 1430-1439.
- [23] Fokianos, K. and Fried, R. Interventions in INGARCH processes. J. Time Ser. Anal. 31, (2010), 210-225.
- [24] Fokianos, K. and Tjøstheim, D. Log-linear Poisson autoregression. Journal of multivariate analysis 102, (2011) 563-578.
- [25] Fokianos, K. and Fried, R. Interventions in log-linear Poisson autoregression. Statistical Modelling 12, (2012), 1-24.
- [26] Fokianos, K. and Neumann, M. A goodness-of-fit test for Poisson count processes. Electronic Journal of Statistics 7, (2013), 793-819.
- [27] Fokianos, K. and Truquet, L. On categorical time series models with covariates. Stochastic processes and their applications 129, (2019), 3446-3462.
- [28] Fokianos, K., Støve, B., Tjøstheim, D., and Doukhan, P. Multivariate count autoregression. Bernoulli 26, (2020), 471-499.
- [29] Francq, C. and Thieu, L.Q. QML inference for volatility models with covariates. Econometric Theory 35, (2019), 37-72.
- [30] Fried, R., Agueusop, I., Bornkamp, B., Fokianos, K., Fruth, J., Ickstadt, K. Retrospective Bayesian outlier detection in INGARCH series. Statistics and Computing 25, (2015), 365-374.
- [31] Kang, J. and Lee, S. Minimum density power divergence estimator for Poisson autoregressive models. Computational Statistics and Data Analysis 80, (2014), 44-56.
- [32] Kedem, B. and Fokianos, K. Regression Models for Time Series Analysis. Hoboken, Wiley, NJ, (2002).
- [33] Kim, B. and Lee, S. Robust estimation for zero-inflated Poisson autoregressive models based on density power divergence. Journal of Statistical Computation and Simulation 87, (2017), 2981-2996.
- [34] Kim B. and Lee, S. Robust estimation for general integer-valued time series models. Annals of the Institute of Statistical Mathematics, (2019), 1-26.
- [35] Kounias, E.G. and Weng, T.-S An inequality and almost sure convergence. Annals of Mathematical Statistics 33, (1969), 1091-1093.
- [36] Liboschik, T., Fokianos, K. and Fried, R. tscount: An R package for analysis of count time series following generalized linear models. Journal of Statistical Software 82, (2017), 1-51.
- [37] Louhichi, W. What drives the volume-volatility relationship on Euronext Paris? International Review of Financial Analysis 20, (2011) 200-206.
- [38] McKenzie, E. Some simple models for discrete variate time series. Water Resour Bull. 21, (1985), 645-650.
- [39] Pedersen, R.S. and Rahbek, A. Testing Garch-X Type Models. Econometric Theory 35(5), (2018), 1-36.
- [40] Simpson, D.G. Minimum Hellinger distance estimation for the analysis of count data. J. Amer. Statist. Assoc. 82, (1987), 802-807.
- [41] Takaishi, T., and Chen, T.T. The relationship between trading volumes, number of transactions, and stock volatility in GARCH models. Journal of Physics: Conference Series 738 (1), (2016).
- [42] Tamura, R.N. and Boos, D.D. Minimum Hellinger distance estimation for multivariate location and covariance. J. Amer. Statist. Assoc. 89, (1986), 223-239.
- [43] Warwick, J. and Jones, M.C. Choosing a robustness tuning parameter. J. Statist. Comput. Simul. 75, (2005), 581-588.