Kernel Estimation for Panel Data with Heterogeneous Dynamics
First Version: February, 2018.
Abstract
This paper proposes nonparametric kernel-smoothing estimation for panel data to examine the degree of heterogeneity across cross-sectional units. We first estimate the sample mean, autocovariances, and autocorrelations for each unit and then apply kernel smoothing to compute their density functions. The dependence of the kernel estimator on the bandwidth causes asymptotic biases of very high order to affect the required condition on the relative magnitudes of the cross-sectional sample size ($N$) and the time-series length ($T$). In particular, it makes the condition on $N$ and $T$ stronger and more complicated than those typically observed in the long-panel literature without kernel smoothing. We also consider a split-panel jackknife method for bias correction and the construction of confidence intervals. An empirical application and Monte Carlo simulations illustrate our procedure in finite samples.
Keywords: autocorrelation, density estimation, heterogeneity, incidental parameter, jackknife, kernel smoothing.
JEL Classification: C13, C14, C23.
1 Introduction
The characteristics of heterogeneity across economic units are informative for many econometric applications. For example, there is an interest in heterogeneity in the dynamics of price deviations or changes (e.g., Klenow and Malin, 2010; Crucini et al., 2015). As another example, allowing for the presence of heterogeneity may make a crucial difference in identification and estimation of production functions (e.g., Ackerberg et al., 2007; Kasahara et al., 2017). Thus, there are many econometric studies that investigate the degree of heterogeneity using panel data (e.g., Hsiao et al., 1999; Fernández-Val and Lee, 2013; Jochmans and Weidner, 2019; Okui and Yanagi, 2019).
This paper proposes kernel-smoothing estimation for panel data to analyze heterogeneity across cross-sectional units. An R package to implement the proposed procedure is available from the authors’ websites. After estimating the mean, autocovariances, and autocorrelations of each unit, we compute kernel density estimates based on these estimated quantities. This easy-to-implement procedure provides useful visual information about heterogeneity in a model-free manner. For example, the densities of the heterogeneous mean, variance, and first-order autocorrelation of price deviations visually indicate the characteristics of heterogeneity in the long-run level, variance, and persistence of the price deviations across items (goods and services), which are the cross-sectional units in this example. Indeed, several empirical studies have used such estimation for various applications (e.g., Kasahara et al., 2017, Figure 2 and Roca and Puga, 2017, Figure 8), but there has been no theoretical foundation for kernel smoothing as a tool for examining heterogeneity in long-panel data.
We show consistency and asymptotic normality of the kernel density estimator based on double asymptotics under which both the cross-sectional size $N$ and the time-series length $T$ tend to infinity while the bandwidth $h$ shrinks to zero. More precisely, in our setting $T$ may diverge along any monotonic sequence as $N$ grows, and the bandwidth may shrink along any monotonic sequence; each theoretical result in this paper specifies additional conditions on the relative magnitudes of $N$, $T$, and $h$. The asymptotic properties exhibit several unique features that have not been well examined in the long-panel literature. Most importantly, asymptotic biases of even very high order affect the conditions on the relative magnitudes of $N$, $T$, and $h$ required for consistency and asymptotic normality. As a result, different orders of asymptotic expansion yield different relative magnitude conditions. This feature contrasts our analysis with existing analyses without kernel smoothing, in which the required relative magnitude conditions do not depend on the order of the expansion (e.g., the conditions for asymptotic normality in Hsiao et al., 1999 and Okui and Yanagi, 2019). The weakest condition (i.e., how small $T$ can be relative to $N$) is obtained by executing an infinite-order expansion. Even in that case, the required condition is stronger than those typically observed in the literature without kernel smoothing. Moreover, the infinite-order expansion requires nontrivial arguments (e.g., the summability of the infinite-order series). We clarify that these unique features are caused by the presence of the bandwidth and by the use of estimated quantities.
Based on an infinite-order expansion, we characterize three asymptotic biases for the density estimation. The first is the standard kernel-smoothing bias (see, e.g., Li and Racine, 2007). The second is caused by the incidental parameter problem (Neyman and Scott, 1948; Nickell, 1981). The third results from the nonlinearity of the kernel function combined with the difference between the estimated and the true quantity; its order can be established only by executing an infinite-order expansion. By deriving these asymptotic biases, we obtain the relative magnitude conditions for consistency and asymptotic normality when using the standard bandwidth in density estimation with second-order kernels.
We propose to apply the split-panel jackknife method of Dhaene and Jochmans (2015) to reduce these biases. In particular, we formally show that the half-panel jackknife (HPJ) corrects the incidental parameter bias and the second-order nonlinearity bias without inflating the asymptotic variance. While the jackknife is useful for bias reduction especially when $T$ is small, we also show that it does not weaken the relative magnitude conditions for consistency and asymptotic normality.
We also develop confidence interval (CI) estimation and bandwidth selection. To construct CIs, we extend the robust bias-corrected (RBC) procedure in Calonico et al. (2018a) to split-panel jackknife bias-corrected estimation. This method explicitly corrects all three biases above. For bandwidth selection, we can apply any standard procedure in the literature, because, under the relative magnitude conditions, the asymptotic mean squared error (AMSE) and asymptotic distribution of the split-panel jackknife bias-corrected estimator are the same as those of the infeasible estimator based on the true quantities.
We also examine the properties of the cumulative distribution function (CDF) estimator constructed by integrating the kernel density estimator. This kernel CDF estimator also exhibits asymptotic biases that vary with the order of the asymptotic expansion we execute, and we derive a closed-form formula for the asymptotic bias. This is an interesting result from a theoretical viewpoint because the corresponding formula for the empirical distribution is available only for Gaussian errors (Jochmans and Weidner, 2019) and has not been derived in general form (Okui and Yanagi, 2019). However, the required conditions on $N$ and $T$ for the kernel CDF estimation turn out to be stronger than those for the empirical CDF estimation derived in those studies.
We illustrate our procedures with an empirical application to heterogeneity in price deviations from the law of one price (LOP). Our procedures reveal significant heterogeneity in the price deviation dynamics. The split-panel jackknife bias-corrected density estimates imply much more volatile and persistent dynamics than the estimates without bias correction, and the difference is visually noticeable. This result highlights the importance of the bias correction: the bias-corrected densities can provide visual information about heterogeneity that is distinct from the densities without bias correction.
Related literature.
Our setting and motivation closely relate to Okui and Yanagi (2019), but there are several important distinctions in both theoretical and practical aspects. First, our relative magnitude conditions differ from those in Okui and Yanagi (2019), in which second-order expansions suffice to derive the conditions for estimating the moments of the heterogeneous quantities (e.g., the variance of the heterogeneous mean). This feature distinguishes the theoretical contributions of the two papers, and our relative magnitude conditions are new in the literature. Second, we provide the new insight that the split-panel jackknife is applicable even to kernel estimation. Third, because it is well known that bootstrap inference does not capture kernel-smoothing bias (see, e.g., Hall and Horowitz, 2013), we extend the RBC inference in Calonico et al. (2018a) instead of the cross-sectional bootstrap in Okui and Yanagi (2019); the failure of the cross-sectional bootstrap inference in our kernel estimation is formally shown in the previous version of this study uploaded to arXiv (arXiv:1802.08825v2). Finally, while Okui and Yanagi (2019) do not clarify the asymptotic biases of their empirical CDFs, we formalize those of our kernel estimators.
Our CDF estimation relates to Jochmans and Weidner (2019) who derive the bias of the empirical distribution based on noisy measurements (e.g., estimated quantities) for the true variables of interest. Their results are complementary to ours. They consider a situation where observations exhibit Gaussian errors. We do not assume such errors. The kernel smoothing allows us to derive bias under much weaker distributional assumptions at the price of creating additional higher-order biases.
Many econometric studies examine heterogeneity in panel data (e.g., Pesaran and Smith, 1995; Hsiao et al., 1999; Pesaran et al., 1999; Fernández-Val and Lee, 2013). Among them, Horowitz and Markatou (1996), Arellano and Bonhomme (2012), and Mavroeidis et al. (2015) propose to estimate the densities of heterogeneous quantities with short-panel data based on deconvolution techniques under some model specifications. Compared with them, we propose model-free kernel-smoothing estimation with long-panel data.
Several studies propose model-free analyses for panel data, but do not focus on the degree of heterogeneity in the dynamics. For example, Okui (2008, 2011, 2014) and Lee et al. (2018) consider homogeneous dynamics, and Galvao and Kato (2014) study the properties of the possibly misspecified fixed effects estimator in the presence of heterogeneous dynamics.
Kernel density estimation using estimated quantities is also examined in the literature on structural estimation of auction models. For example, Ma et al. (2019) and Guerre et al. (2000) estimate the density of bidders' valuations of auctioned goods. In their first stage, the individual valuations are estimated nonparametrically; the second stage applies kernel density estimation to the estimated valuations. They also observe that the estimation errors from the first stage affect the asymptotic behavior of the second-stage estimator in a nonstandard way. However, their problems are different from ours. Their main issue is the cross-sectional correlation caused by using the same set of observations to estimate the individual valuations. As a result, their estimation errors affect the precision and the convergence rate of the second-stage estimator. In our case, the estimation errors in the first stage are cross-sectionally independent and affect the bias but not the (first-order) variance of the second-stage estimator.
Paper organization.
Section 2 introduces our setting and density estimation. Section 3 develops the asymptotic theory, bias correction, CI estimation, bandwidth selection, and CDF estimation. Section 4 presents the application. Section 5 concludes. The supplementary appendix contains the proofs of the theorems, technical lemmas, other technical discussions, and Monte Carlo simulations.
2 Kernel density estimation
This section describes the setting and the proposed estimation. We explain our setting and motivation succinctly because they are similar to those in Okui and Yanagi (2019). Several remarks and possible extensions can be found in the previous version of this study and in Okui and Yanagi (2019); for example, we can allow for covariates and time effects, and we can estimate densities of other heterogeneous quantities, such as random coefficients in linear models, with minor modifications. In this paper, we explain our estimation briefly to save space.
We observe panel data $\{y_{it}: i = 1, \dots, N;\ t = 1, \dots, T\}$, where $y_{it}$ is a scalar random variable. We assume that $y_{it}$ is strictly stationary across time and that each individual time series $\{y_{it}\}_{t=1}^{T}$ is generated from some unknown probability distribution indexed by $\alpha_i$, where $\alpha_i$ is a (possibly infinite-dimensional) random variable specifying the dynamics of unit $i$. We note that $\alpha_i$ is an abstract parameter and does not appear in the actual implementation of our proposed procedure. Characterizing heterogeneous dynamics using this abstract parameter is mathematically convenient because it allows us to keep an i.i.d. assumption across units. Existing studies without model specifications also employ this approach (e.g., Galvao and Kato, 2014). We denote the conditional expectation given $\alpha_i$ by $E(\cdot \mid \alpha_i)$.
Our goal is to examine the degree of heterogeneity of the dynamics of $y_{it}$ across units in a model-free manner. To this end, we focus on estimating the densities of the heterogeneous mean, the $k$-th autocovariance, and the $k$-th autocorrelation, each of which we first estimate for every unit by the corresponding sample analogue computed from that unit's time series. Throughout the paper, we write $\theta_i$ for one of the mean, the $k$-th autocovariance, or the $k$-th autocorrelation of unit $i$, and $\hat\theta_i$ for the corresponding sample analogue. The kernel estimator for the density $f_\theta$ of $\theta_i$ is given by:
\[
\hat f_{\hat\theta}(x) = \frac{1}{Nh}\sum_{i=1}^{N} K\!\left(\frac{\hat\theta_i - x}{h}\right),
\qquad (1)
\]
where $x$ is a fixed point, $K$ is a kernel function, and $h$ is a bandwidth satisfying $h \to 0$. The joint density of the mean, autocovariances, and autocorrelations can be estimated in the same manner. This is a standard kernel density estimator except that we replace the true $\theta_i$ with the estimated $\hat\theta_i$.
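To make the two-step structure concrete, the following R sketch computes the unit-level sample means, lag-$k$ autocovariances, and autocorrelations from a balanced panel stored as an $N \times T$ matrix and then smooths the estimated quantities with an Epanechnikov kernel. The function names, the matrix layout, the divisor in the autocovariance, and the simulated example are our own illustrative choices and are not taken from the authors' R package.

# Minimal sketch (not the authors' package): two-step kernel density estimation
# for heterogeneous means, autocovariances, and autocorrelations.

# y: N x T matrix of a balanced panel (row i = time series of unit i); k: lag order
unit_stats <- function(y, k = 1) {
  T_len <- ncol(y)
  mu   <- rowMeans(y)                     # unit-specific sample means
  dev  <- y - mu                          # within-unit deviations from the unit mean
  gam0 <- rowMeans(dev^2)                 # sample variances (lag-0 autocovariances)
  gamk <- rowSums(dev[, (k + 1):T_len] * dev[, 1:(T_len - k)]) / T_len  # lag-k autocovariances (divisor T is an illustrative choice)
  rhok <- gamk / gam0                     # lag-k autocorrelations
  list(mu = mu, gamk = gamk, rhok = rhok)
}

# Kernel density estimate of the estimated quantities theta_hat at grid points x0
epanechnikov <- function(u) 0.75 * (1 - u^2) * (abs(u) <= 1)
kde <- function(x0, theta_hat, h) {
  sapply(x0, function(x) mean(epanechnikov((theta_hat - x) / h)) / h)
}

# Example usage with simulated heterogeneous AR(1) data
set.seed(1)
N <- 500; T_len <- 100
phi <- runif(N, 0.2, 0.8)                 # heterogeneous persistence
y <- t(sapply(phi, function(p) arima.sim(list(ar = p), n = T_len)))
est  <- unit_stats(y, k = 1)
grid <- seq(-1, 1, length.out = 200)
dens <- kde(grid, est$rhok, h = 0.1)      # density of first-order autocorrelations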
3 Asymptotic theory
This section develops our asymptotic theory, CI estimation, bandwidth selection, and CDF estimation based on the density estimator . We define the notations and . By construction, . Note that , , and .
3.1 Unique features in asymptotic investigations
Before formally showing the asymptotic properties, we explore the unique features of our asymptotic investigations in an informal manner. By doing so, we clarify the mechanism behind the observation that even very high orders of asymptotic bias matter for our asymptotic analysis.
We here focus on the density estimator for the heterogeneous mean, but similar discussions are also relevant for the autocovariances and autocorrelations. Recalling the definition of the estimator in (1), we examine the $J$-th order Taylor expansion of the estimator around the true means:
(2) |
where $K^{(j)}$ denotes the $j$-th order derivative of the kernel and the argument of the remainder term lies between the estimated and the true mean.
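For concreteness, a $J$-th order expansion of this type can be written as follows, in assumed notation: $\hat\mu_i$ is the unit-$i$ sample mean, $\mu_i$ the true mean, and $\tilde\mu_i$ a value between $\hat\mu_i$ and $\mu_i$. This is a generic form, and the grouping into the terms discussed below is one natural choice.

\[
\frac{1}{Nh}\sum_{i=1}^{N} K\!\left(\frac{\hat\mu_i - x}{h}\right)
= \frac{1}{Nh}\sum_{i=1}^{N} K\!\left(\frac{\mu_i - x}{h}\right)
+ \sum_{j=1}^{J-1} \frac{1}{j!\,Nh}\sum_{i=1}^{N} K^{(j)}\!\left(\frac{\mu_i - x}{h}\right)\!\left(\frac{\hat\mu_i - \mu_i}{h}\right)^{j}
+ \frac{1}{J!\,Nh}\sum_{i=1}^{N} K^{(J)}\!\left(\frac{\tilde\mu_i - x}{h}\right)\!\left(\frac{\hat\mu_i - \mu_i}{h}\right)^{J}.
\]

The first term is the infeasible estimator based on the true means, the middle sum collects the lower-order derivative terms, and the last term is the Lagrange remainder.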
The first term in (2) is the infeasible density estimator based on the true means, and its asymptotic behavior is standard and well known in the kernel-smoothing literature. It converges in probability to the density of interest as $N, T \to \infty$ and $h \to 0$ with $Nh \to \infty$. In addition, under a standard additional bandwidth condition, it is asymptotically normal at the rate $\sqrt{Nh}$.
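For reference, the textbook statement of this result for an i.i.d. sample with density $f$ and a second-order kernel (see, e.g., Li and Racine, 2007) is, in assumed notation,
\[
\sqrt{Nh}\left(\frac{1}{Nh}\sum_{i=1}^{N}K\!\left(\frac{\mu_i - x}{h}\right) - f(x) - \frac{h^{2}}{2}\,f''(x)\!\int u^{2}K(u)\,du\right)
\;\xrightarrow{d}\; N\!\left(0,\; f(x)\!\int K(u)^{2}\,du\right),
\]
provided $Nh \to \infty$, $h \to 0$, and $Nh^{5} = O(1)$.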
The unique features of our situation are caused by the second and third terms in (2). For the second term, under regularity conditions, the mean can be evaluated as:
where we used (see Assumption 7 below and Lemma 1 in the supplement). Noting that , the bias caused from the second term in (2) can be written as . This bias is negligible when , which is identical to the relative magnitude condition when using the standard bandwidth in the density estimation with second-order kernels. For the third term in (2), the absolute mean can be evaluated as:
where denotes a generic positive constant and we use (see Lemma 1). Hence, the third term in (2) is by Markov’s inequality. Remarkably, this term does not vanish even when under which the lower-order terms are negligible. This term can be negligible only if , which implies when . Note that is “stronger” than .
The asymptotic investigation above exhibits several unique features. First, it implies that the relative magnitude condition for consistency (and also that for asymptotic normality) varies with the order of the expansion; the condition required for consistency based on the $J$-th order expansion depends on $J$. Second, we can obtain the "weakest" relative magnitude condition for consistency only if we execute the infinite-order expansion (that is, by letting $J \to \infty$). Finally, while we can derive the suitable condition via the infinite-order expansion, it requires the existence of higher-order moments: the evaluation based on the infinite-order expansion demands the existence of moments of every order. Hence, there is a trade-off between the relative magnitude condition and the existence of higher-order moments.
Asymptotic normality requires a yet stronger condition. Because the rate of convergence of the kernel estimator is $\sqrt{Nh}$, asymptotic normality requires that the biases vanish faster than $1/\sqrt{Nh}$. The weakest such condition is obtained under an infinite-order expansion with the standard bandwidth of order $N^{-1/5}$. Note that, as in the consistency argument above, the highest order of the expansion determines the required condition for asymptotic normality. Such very high orders of bias cannot be corrected in practice, even though methods to correct the first few orders of bias are available in the long-panel literature (e.g., Dhaene and Jochmans, 2015). This result is in stark contrast to existing studies in which bias correction improves the conditions on the relative magnitudes of $N$ and $T$.
The main reason behind these unique features is that the curvature of the summand $h^{-1}K((\hat\theta_i - x)/h)$ depends on the bandwidth $h$. Roughly speaking, as $h \to 0$, the summand becomes steeper and more "nonlinear." This exacerbates the bias caused by the nonlinearity, and it turns out that even very high order derivatives of the kernel affect the bias. Alternatively, we may interpret this problem through the identity $(\hat\theta_i - x)/h = (\theta_i - x)/h + (\hat\theta_i - \theta_i)/h$: the contribution of the error from using the estimated $\hat\theta_i$ is $(\hat\theta_i - \theta_i)/h$, which increases as $h \to 0$. Hence, the bias of the density estimator depends heavily on the magnitude of the estimation error and on the nonlinearity of the kernel.
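The scaling of the derivatives of the summand makes this mechanism explicit: each derivative with respect to the argument that is being estimated contributes another factor of $1/h$,
\[
\frac{\partial^{j}}{\partial \theta^{j}}\left[\frac{1}{h}\,K\!\left(\frac{\theta - x}{h}\right)\right]
= \frac{1}{h^{j+1}}\,K^{(j)}\!\left(\frac{\theta - x}{h}\right),
\]
so an estimation error of order $T^{-1/2}$ in $\hat\theta_i$ enters the $j$-th order term of the expansion scaled by $h^{-j}$, and a smaller bandwidth amplifies the higher-order nonlinearity terms.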
3.2 Asymptotic biases for the density estimation
We here formally show the presence of asymptotic biases of the kernel density estimator in (1). We conduct asymptotic investigations based on an infinite-order expansion, under which the weakest possible condition on the relative magnitude of $N$ and $T$ is obtained.
We assume the following basic conditions for the data-generating process. These are essentially the same as the assumptions in Okui and Yanagi (2019).
Assumption 1.
The sample space of is some Polish space and is a scalar real random variable. is i.i.d. across .
Assumption 2.
For each , is strictly stationary and -mixing given with mixing coefficients . For any natural number , there exists a sequence such that for any and , and for some .
Assumption 3.
For any natural number , it holds that for some .
Assumption 4.
There exists a constant such that almost surely.
Assumptions 1 and 2 require that the individual time series given $\alpha_i$ be strictly stationary across time and i.i.d. across units. The identical distribution across units is essential for our analysis. The independence assumption across units makes our asymptotic investigations tractable, while the consistency result and the same asymptotic biases could be derived even under weak cross-sectional dependence. Note that the i.i.d. assumption does not exclude the presence of heterogeneity in panel data: in our setting, heterogeneity is caused by differences in the realized values of $\alpha_i$ across units. Assumption 2 also restricts the degree of persistence of the individual time series. The conditions for stationarity and the degree of persistence require that the time series for each unit not be a unit root process and that the initial value of each time series be generated from the stationary distribution. Assumption 3 requires the existence of the moments and allows us to derive the asymptotic biases of the estimators. While we can develop the theoretical properties of the estimators in situations where Assumptions 2 and 3 do not hold for some orders, we cannot derive the higher-order biases based on infinite-order expansions in such situations; as a result, we then require stronger conditions on the relative magnitudes, as discussed in the previous section. Assumption 4 allows us to derive the asymptotic properties of the kernel estimators for the autocorrelations. All of the assumptions can be satisfied in popular panel data models; for example, they all hold when each unit's time series follows a heterogeneous stationary panel autoregressive moving-average model with Gaussian errors, as in the example below.
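As a concrete example (the notation here is ours and the parameter restrictions are one sufficient choice), consider a heterogeneous stationary Gaussian panel AR(1), a special case of the panel ARMA class mentioned above:
\[
y_{it} = \mu_i + \phi_i\,(y_{i,t-1} - \mu_i) + \varepsilon_{it},
\qquad \varepsilon_{it} \mid \alpha_i \sim \text{i.i.d. } N(0, \sigma_i^{2}),
\]
where $\alpha_i = (\mu_i, \phi_i, \sigma_i^{2})$ is i.i.d. across units with $|\phi_i| \le \bar\phi < 1$ and $0 < \underline\sigma^{2} \le \sigma_i^{2} \le \bar\sigma^{2} < \infty$ almost surely, and $y_{i0}$ is drawn from the stationary distribution given $\alpha_i$. Conditional on $\alpha_i$, the process is then strictly stationary and geometrically mixing, with Gaussian moments of every order.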
We also assume the following additional conditions.
Assumption 5.
The kernel function is bounded, symmetric, and infinitely differentiable. It satisfies , , , , and for any nonnegative integer .
Assumption 5 includes the standard conditions for the kernel function, except for the infinite differentiability. We require this differentiability in order to expand the kernel estimator evaluated at the estimated quantities around the true quantities via an infinite-order expansion. Note that the symmetry of $K$ implies that $\int u^{j} K(u)\,du = 0$ for any odd $j$.
Assumption 6.
The random variables , , and are continuously distributed. The densities with , , and are bounded away from zero near and three-times boundedly continuously differentiable near .
Assumption 6 requires that the heterogeneous quantity be continuously distributed without probability mass. The continuity of the random variable is essential for implementing kernel-smoothing estimation, as it rules out situations in which there is no heterogeneity (that is, the quantity takes a common constant value for all units) and situations with finitely grouped heterogeneity (that is, the quantity takes one of finitely many values across units).
Assumption 7.
The following functions are twice boundedly continuously differentiable near for any with finite limits at as :
for any nonnegative integers .
Assumption 7 states the existence and smoothness of the conditional expectations. This assumption allows us to derive the exact forms of the asymptotic biases. The convergence rates of the terms are standard and guaranteed by Lemmas 1 and 3 in Appendix B. For example, the assumption requires that and the convergence rate is consistent with the result in Lemma 1.
The following theorem shows that the kernel density estimators are consistent and asymptotically normal but exhibit asymptotic biases. While the theorem assumes an infinite-order Taylor expansion and the summability of the infinite series of the asymptotic biases directly, we can show their validity under unrestrictive regularity conditions. Because these discussions are highly technical and demand lengthy explanations, they appear in Appendices C and D.
Theorem 1.
Let be an interior point in the support of , , or . Suppose that Assumptions 1, 2, 3, 5, 6, and 7 hold. In addition, if , suppose that Assumption 4 also holds. Suppose that the infinite-order Taylor expansion of at holds and that the infinite series of the asymptotic biases below is well defined. When and with , , and , it holds that:
where is a nonrandom bias term that depends on and satisfies for any (the formula of is given in the proof). As a result, when also holds, it holds that:
The density estimator can be written as the sum of the infeasible estimator based on the true , say , and the asymptotic biases. The convergence rate of the estimator is the standard order of , and the asymptotic distribution is the same as that of the infeasible estimator . However, the feasible estimator exhibits asymptotic biases. These results also require the relative magnitude conditions of , , and ; that is, and for consistency and asymptotic normality, respectively.
The density estimator for the heterogeneous mean has two main asymptotic biases, whereas the density estimators for the autocovariances and autocorrelations have three main asymptotic biases, in addition to the higher-order biases. The first is the standard kernel-smoothing bias. The second is the incidental parameter bias caused by estimating the autocovariances and autocorrelations; their estimation involves estimating the unit-specific mean for each unit, which becomes a source of the incidental parameter bias. The third is the second-order nonlinearity bias caused by the second-order term of the Taylor expansion around the true quantities, and the higher-order terms of the expansion generate the corresponding higher-order nonlinearity biases.
We need two conditions to ensure the asymptotic negligibility of the higher-order nonlinearity biases. If we use the standard bandwidth with second-order kernels, these conditions can be combined into a single relative magnitude condition on $N$ and $T$. Note that while the incidental parameter bias and the second-order nonlinearity bias are also asymptotically negligible under these conditions, the practical magnitudes of these biases would be larger than those of the higher-order nonlinearity biases.
We have already discussed the source of the nonlinearity bias in Section 3.1, so here we provide a slightly more detailed discussion of the incidental parameter bias. It does not appear in the density estimator for the mean because the estimation error of the unit-specific sample mean has zero conditional mean. However, the errors in the estimated autocovariances and autocorrelations are not mean-zero: their conditional means are nonzero, although they converge to zero at the rate $1/T$. This is the source of the incidental parameter bias.
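A worked special case (in assumed notation, with $\bar y_i$ the unit-$i$ sample mean and $\hat\gamma_{0,i} = T^{-1}\sum_{t=1}^{T}(y_{it} - \bar y_i)^{2}$ the sample variance) illustrates the source of this bias:
\[
E\!\left[\hat\gamma_{0,i} \mid \alpha_i\right] - \gamma_{0,i}
= -\operatorname{Var}\!\left(\bar y_i \mid \alpha_i\right)
= -\frac{1}{T}\sum_{|j| < T}\left(1 - \frac{|j|}{T}\right)\gamma_{j,i}
= O\!\left(T^{-1}\right),
\]
so the estimation error in the sample variance has a nonzero conditional mean of order $T^{-1}$, which is what feeds into the density estimator as the incidental parameter bias.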
Remark 1.
Some might surmise that our kernel smoothing requires a "weaker" condition than those in the existing literature because the kernel estimator essentially averages over the observations in a local neighborhood, which contains on the order of $Nh$ observations. However, the theorem above clarifies that this conjecture is not true. The failure of the conjecture stems from the fact that, as $h \to 0$, the summands become more nonlinear, which inflates the nonlinearity biases and necessitates a stronger assumption to make the higher-order nonlinearity biases negligible.
Remark 2.
When using higher-order kernels, the relative magnitude conditions on $N$ and $T$ for consistency and asymptotic normality are altered. For example, when using fourth-order kernels, the optimal bandwidth is of order $N^{-1/9}$ (see, e.g., Li and Racine, 2007, Section 1.11), and the resulting relative magnitude conditions are weaker than those with second-order kernels. Thus, one may employ higher-order kernels especially when $T$ is much smaller than $N$. Nonetheless, our Monte Carlo simulations show that the performance of the jackknife bias-corrected estimator with a second-order kernel is satisfactory even when $T$ is small.
3.3 Split-panel jackknife bias correction for density estimation
As the incidental parameter bias and the nonlinearity biases may be severe in practice, we propose to apply the split-panel jackknife to correct them. Among split-panel jackknives, we consider here half-panel jackknife (HPJ) bias correction. For simplicity, suppose that $T$ is even; the bias correction with odd $T$ is similar (see Dhaene and Jochmans, 2015, page 999, for details). For each heterogeneous quantity, we obtain two estimators of its density based on the two half-panels formed by the first and second halves of each unit's time series. The HPJ bias-corrected estimator equals twice the full-panel estimator minus the average of the two half-panel estimators, where the difference between this average and the full-panel estimator estimates the bias in the original estimator. Importantly, the bandwidth used to compute the half-panel estimators must be the same as that used for the original estimator in order to reduce the biases.
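The following R sketch implements HPJ bias correction for the density of the heterogeneous means; it assumes a balanced $N \times T$ panel with even $T$, repeats the simple kernel-density helper from the earlier sketch, and uses the same bandwidth for the full-panel and half-panel estimates. It is an illustration under our own naming conventions, not the authors' implementation.

# HPJ bias-corrected kernel density of the unit-specific means
# (illustrative sketch; balanced N x T panel y with even T).
epanechnikov <- function(u) 0.75 * (1 - u^2) * (abs(u) <= 1)
kde <- function(x0, theta_hat, h) {
  sapply(x0, function(x) mean(epanechnikov((theta_hat - x) / h)) / h)
}

hpj_density <- function(y, x0, h) {
  T_len <- ncol(y); half <- T_len / 2
  f_full  <- kde(x0, rowMeans(y), h)                      # full-panel estimate
  f_half1 <- kde(x0, rowMeans(y[, 1:half]), h)            # first half of each time series
  f_half2 <- kde(x0, rowMeans(y[, (half + 1):T_len]), h)  # second half
  2 * f_full - (f_half1 + f_half2) / 2                    # HPJ: 2 x full minus average of halves
}

# Example usage: grid <- seq(-2, 2, length.out = 200); hpj_density(y, grid, h = 0.1)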
The next theorem formally shows that the HPJ bias-corrected estimator does not suffer from incidental parameter bias and second-order bias, and does not alter the asymptotic variance of the estimator.
Theorem 2.
Suppose that the assumptions in Theorem 1 hold. When and with , , , and , it holds that:
Note that HPJ bias correction does not weaken the relative magnitude condition on $N$, $T$, and $h$ for asymptotic normality in Theorem 1. This is because HPJ bias correction cannot eliminate the higher-order nonlinearity biases. This result is in stark contrast to the existing literature, where bias correction typically weakens the condition on the relative magnitudes of $N$ and $T$ (see, e.g., Dhaene and Jochmans, 2015).
Remark 3.
We can also consider higher-order jackknives to eliminate higher-order biases, as in Dhaene and Jochmans (2015) and Okui and Yanagi (2019). For example, we can consider the third-order jackknife (TOJ) in the same manner as in Okui and Yanagi (2019), which is slightly different from the original TOJ in Dhaene and Jochmans (2015) because the two studies treat different higher-order biases. We investigate its performance by Monte Carlo simulations in the appendix, which show that the TOJ can work better than the HPJ, especially when the naive estimator without bias correction exhibits a large bias. Hence, in practice, we recommend considering higher-order jackknives as well as HPJ bias correction.
3.4 Confidence interval and bandwidth selection for density estimation
This section considers CI estimation and the selection of optimal bandwidth for density estimation.
CI estimation.
We propose to apply the RBC procedure in Calonico et al. (2018a) for CI estimation. It allows us to construct a valid CI for the density while correcting the kernel-smoothing bias.
The RBC procedure based on the naive estimator is almost the same as the original procedure in Calonico et al. (2018a). We first note that the kernel-smoothing bias can be estimated using a kernel estimator of the second derivative of the density, computed with a (possibly different) kernel function and a pilot bandwidth $b$. The estimator that corrects the kernel-smoothing bias subtracts this estimated bias from the original density estimator. Choosing $b$ so that the ratio $h/b$ converges to a positive constant enables us to capture the variance inflation caused by the bias correction while removing the kernel-smoothing bias. In practice, one can set $b = h$, following the suggestion in Calonico et al. (2018a). The RBC statistic is given by:
where is the estimator of the nonasymptotic variance of :
Under conditions similar to those of Theorem 1, the RBC statistic is asymptotically standard normal, so that we can construct the CI in the usual manner.
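Schematically, and in assumed notation (the exact construction is in Calonico et al., 2018a), the bias-corrected estimator and the RBC statistic take the form
\[
\hat f^{\mathrm{bc}}(x) = \hat f_{\hat\theta}(x) - \frac{h^{2}}{2}\left(\int u^{2}K(u)\,du\right)\hat f''_{\hat\theta}(x; b),
\qquad
\hat f''_{\hat\theta}(x; b) = \frac{1}{Nb^{3}}\sum_{i=1}^{N} L''\!\left(\frac{\hat\theta_i - x}{b}\right),
\]
\[
T^{\mathrm{rbc}}(x) = \frac{\hat f^{\mathrm{bc}}(x) - f_\theta(x)}{\sqrt{\widehat V^{\mathrm{bc}}(x)}},
\]
where $L$ is the second kernel, $b$ the pilot bandwidth, and $\widehat V^{\mathrm{bc}}(x)$ an estimate of the nonasymptotic variance of $\hat f^{\mathrm{bc}}(x)$. Studentizing by the variance of the bias-corrected estimator, rather than that of $\hat f_{\hat\theta}(x)$, is what captures the variance inflation from the bias estimate.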
The RBC procedure based on the split-panel jackknife bias-corrected estimator demands some modifications. To see this, the HPJ bias-corrected estimator that also reduces the kernel-smoothing bias can be written as follows:
where and are the estimators based on the half-series and , respectively. Then, the nonasymptotic variance of can be estimated by:
As a result, the RBC statistic based on the HPJ estimator is:
Note that this variance estimator differs from the one above because it also captures the finite-sample variability of the HPJ bias correction. We can construct the CI based on the RBC statistic for the HPJ estimator in the usual manner. We can also consider similar RBC procedures based on higher-order split-panel jackknife bias correction.
Remark 5.
Undersmoothing is often used to construct CIs for kernel density estimators. However, it is not desirable in our context. Undersmoothing means using a bandwidth that converges to zero faster than the AMSE-optimal rate so that the smoothing bias does not appear in the asymptotic distribution. In our setting, the smaller the bandwidth, the larger the higher-order nonlinearity biases, which in turn calls for a stronger assumption on the relative magnitudes of $N$ and $T$. We thus prefer the method based on Calonico et al. (2018a) because we can still use a bandwidth of the standard order $N^{-1/5}$. Note also that Calonico et al. (2018a) demonstrate that their method provides better coverage than undersmoothing.
Bandwidth selection.
We can select the bandwidth for the density estimation using any standard procedure applied to the estimated quantities. This is because Theorem 2 shows that the AMSE and asymptotic distribution of the HPJ bias-corrected estimator are identical to those of the infeasible estimator based on the true quantities. In our application and Monte Carlo simulations, we apply the coverage error optimal bandwidth selection procedure in Calonico et al. (2018a) because of its desirable properties as shown in that paper. Furthermore, their bandwidth tends to be larger than the bandwidth that minimizes the AMSE and would be more suitable in our context because a larger bandwidth makes the nonlinearity biases smaller. Our Monte Carlo simulations also confirm the appropriate finite-sample properties of this procedure.
3.5 Asymptotic biases for CDF estimation
In this section, we consider the smoothed CDF estimator and derive its asymptotic biases. The CDF can be estimated by integrating the kernel density estimator. It is convenient to write this kernel CDF estimator as an average of the integrated kernel evaluated at the estimated quantities, where the integrated kernel is a Borel-measurable CDF.
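In assumed notation, with $\bar K(v) = \int_{-\infty}^{v} K(u)\,du$ denoting the integrated kernel (a CDF, since $K$ integrates to one), the smoothed CDF estimator is
\[
\hat F_{\hat\theta}(x) = \int_{-\infty}^{x} \hat f_{\hat\theta}(u)\,du
= \frac{1}{N}\sum_{i=1}^{N} \bar K\!\left(\frac{x - \hat\theta_i}{h}\right),
\]
where the second equality uses the symmetry of $K$ imposed in Assumption 5.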
For the CDF estimation, we need the following condition instead of Assumption 6. The continuity of the random variable is essential, even for the kernel-smoothing CDF estimation.
Assumption 8.
The random variables , , and are continuously distributed. The CDFs with , , and are three-times boundedly continuously differentiable near .
The following theorem shows the presence of asymptotic biases for the kernel CDF estimator.
Theorem 3.
Let be an interior point in the support of , , or . Suppose that Assumptions 1, 2, 3, 5, 7, and 8 hold. In addition, if , suppose that Assumption 4 also holds. Suppose that the infinite-order Taylor expansion of at holds and that the infinite series of the asymptotic biases below is well defined. When and with and , it holds that:
where is a nonrandom bias term that depends on and that satisfies for any (the formula of is given in the proof). As a result, when also holds, it holds that:
The CDF estimator can be rearranged as the sum of the infeasible estimator based on the true quantities and the asymptotic biases. We present the result based on an infinite-order expansion because it yields the best possible condition on the relative magnitudes of $N$ and $T$; this requires the validity of the infinite-order expansion, in particular the summability of the infinite series, which hold under technical regularity conditions as in the case of the density estimation in Theorem 1. The leading biases are the incidental parameter bias and the second-order nonlinearity bias; as in the density estimation, the CDF estimator for the heterogeneous mean does not exhibit the incidental parameter bias. We also note that the standard kernel-smoothing bias does not appear in the asymptotic distribution because it is asymptotically negligible under the stated bandwidth conditions (see Lemma 8 in Appendix B). Consistency and asymptotic normality require conditions that asymptotically eliminate the higher-order biases, and when using the standard bandwidth in the CDF estimation with second-order kernels, these conditions combine into a single relative magnitude condition on $N$ and $T$. Note that we can weaken the relative magnitude condition by using higher-order kernels, which lead to a larger bandwidth, as in the density estimation.
The relative magnitude conditions for the kernel CDF estimation with second-order kernels are stronger than those for the empirical CDF estimation in Jochmans and Weidner (2019) and Okui and Yanagi (2019), and the empirical CDF estimation is also easier to implement. Hence, one would probably employ empirical CDF estimation in practice, and we therefore do not explore the split-panel jackknife, CI estimation, or bandwidth selection for the kernel CDF estimation (although they are feasible). Nonetheless, the asymptotic biases for the kernel CDF estimation in Theorem 3 are new in the literature and are interesting in their own right.
Remark 6.
The leading bias of the CDF estimator for the heterogeneous mean corresponds to the result in Jochmans and Weidner (2019), who derive the asymptotic bias of the empirical distribution under Gaussian errors. Suppose, as in Jochmans and Weidner (2019), that the errors are Gaussian. The formula for this bias is available in the proof of Theorem 3 and, in this case, becomes identical to the bias formula in Jochmans and Weidner (2019); the correspondence follows from an integration by parts. Note that the corresponding biases for the autocovariance and autocorrelation include additional terms that do not appear in Jochmans and Weidner (2019).
Remark 7.
While we obtain the same leading bias formula for the CDF estimator for the heterogeneous mean as that in Jochmans and Weidner (2019), it is still not clear whether the bias formulas including higher-order terms correspond to each other in the two papers. The empirical CDF can be regarded as the kernel CDF obtained by letting the bandwidth tend to zero in the given sample (i.e., $h \to 0$ while keeping $N$ and $T$ fixed). However, the higher-order biases of our kernel CDF estimator are derived under the joint asymptotics (i.e., $N, T \to \infty$ and $h \to 0$) and explode as $h \to 0$, so it is not trivial how those higher-order asymptotic biases behave as $h \to 0$ with $N$ and $T$ fixed (or as $h \to 0$ after $N, T \to \infty$). Therefore, although we obtain the same leading bias formula, we hesitate to conclude definitively that our bias formula, including higher-order terms, corresponds exactly to that in Jochmans and Weidner (2019).
4 Empirical application
We apply our procedure to panel data on prices of items in US cities. Our procedure allows us to examine the heterogeneous properties of the deviations of prices from the LOP across items and cities, and the difference in the degree of heterogeneity between goods and services.
Many empirical studies examine the heterogeneous properties of the level and variance of price deviations and the speed of price adjustment toward the long-run LOP deviation (see Anderson and Van Wincoop, 2004 for a review). For example, Engel and Rogers (2001), Parsley and Wei (2001), and Crucini et al. (2015) examine such heterogeneous properties and find that the LOP deviation dynamics are significantly heterogeneous across items and cities based on regression models. Our investigation below complements such empirical analyses by using our model-free procedure, as it provides visual information concerning the degree of heterogeneity.
We estimate the densities of the heterogeneous mean, variance, and first-order autocorrelation. We use the Epanechnikov kernel with the coverage error optimal bandwidth in Calonico et al. (2018a); we observed similar results with different kernels and different bandwidths. The codes to compute the CIs and the optimal bandwidths are developed based on the nprobust package for R (Calonico et al., 2018b).
Data.
We use data from the American Chamber of Commerce Researchers Association Cost of Living Index produced by the Council of Community and Economic Research. Mototsugu Shintani kindly provided us with the data set ready for analysis; the same data set is used by Parsley and Wei (1996), Yazgan and Yilmazkuday (2011), Crucini et al. (2015), Lee et al. (2018), and Okui and Yanagi (2019). The data set contains quarterly price series of 48 consumer price index items (goods and services) for 52 US cities from 1990Q1 to 2007Q4. While the original data source contains price information for more items in additional cities, we restrict the observations to obtain a balanced panel data set, as in Crucini et al. (2015). The categorization of goods and services can be found in Okui and Yanagi (2019, Table 2).
We define the LOP deviation for an item in a city at a given time in terms of the price of that item in that city relative to the corresponding price in the benchmark city of Albuquerque, NM. We regard each item–city pair as a cross-sectional unit, so that we focus on the degree of heterogeneity of the LOP deviations across item–city pairs. The length of the time series is 72 quarters, and the number of cross-sectional units is the number of item–city pairs in the resulting balanced panel.
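A common convention in this literature, which we take as the working definition here, is the log price relative to the benchmark city:
\[
q_{ict} = \ln p_{ict} - \ln p_{i0t},
\]
where $p_{ict}$ is the price of item $i$ in city $c$ at time $t$ and $p_{i0t}$ is the corresponding price in Albuquerque.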
Results.
Figure 2 depicts the density estimates for the heterogeneous mean, variance, and first-order autocorrelation. In each panel, the solid black line indicates the density estimates without split-panel jackknife bias correction, the red dashed line shows the HPJ estimates, and the blue dotted line shows the TOJ estimates.
The estimation results with and without bias correction show that the LOP deviation dynamics are significantly heterogeneous across items. The density estimates without bias correction for the mean are similar to those with bias correction, and the results show that the mode of the heterogeneous long-run LOP deviations is close to zero, with a nearly symmetric, unimodal distribution. In contrast, the estimates without bias correction for the variance and the first-order autocorrelation are very different from the bias-corrected estimates. The bias-corrected estimates for the variance indicate larger variances of the LOP deviation dynamics, while the bias-corrected estimates for the autocorrelation indicate more persistent dynamics with a more left-skewed distribution. These results suggest a severe impact of the incidental parameter biases, which highlights the importance of bias correction methods.
Figure 2 depicts 95% point-wise confidence bands based on the HPJ estimates. The confidence bands are narrow, implying that our HPJ estimates seem to be precise and reliable.
Figure 3 illustrates the HPJ estimates for the mean, variance, and first-order autocorrelation for goods and services separately. The solid black lines are the HPJ estimates for goods, and the dashed red lines are those for services. The estimated densities and CDFs show that the heterogeneous properties are significantly different between goods and services. The densities of the mean show that the long-run LOP deviation for goods generally tends to be larger (in an absolute sense) than that for services. The estimation results for the variance and the autocorrelation show that the LOP deviation for goods tends to be more volatile but less persistent than that for services. These results suggest that goods tend to have more volatile processes with faster adjustment speeds toward their nonnegligible long-run LOP deviations.
If we seek to examine the degree of heterogeneity of the LOP deviations across items and cities, as in Crucini et al. (2015), our model-free results are informative in their own right. There are several possible sources of differences in the degree of heterogeneity, including differences in trade costs across items (e.g., Anderson and Van Wincoop, 2004) and differences in sale and nonsale prices across goods and services (e.g., Nakamura and Steinsson, 2008). Furthermore, our model-free results also suggest how we should model heterogeneity when implementing structural estimation for price deviations or price changes. For example, as our procedure demonstrates that the heterogeneous properties of goods and services differ, we should model unobserved heterogeneity differently for goods and services.
5 Conclusion
This paper presented nonparametric kernel-smoothing estimation to examine the degree of heterogeneity in panel data. The kernel density and CDF estimators are consistent and asymptotically normal under conditions on the relative magnitudes of the cross-sectional size $N$, the time-series length $T$, and the bandwidth $h$. Because of the presence of the incidental parameter bias and the nonlinearity biases, the relative magnitude conditions vary with the order of the expansion. Via infinite-order expansions, we derived the weakest relative magnitude conditions, which are suitable for microeconometric applications. We discussed the split-panel jackknife to correct biases, the construction of CIs, and the selection of the bandwidth. We also illustrated our procedure with an application to price deviations from the law of one price.
Acknowledgments
The authors greatly appreciate the assistance of Mototsugu Shintani in providing the price panel data. The authors would also like to thank Stephane Bonhomme, Kazuhiko Hayakawa, Koen Jochmans, Shin Kanaya, Hiroyuki Kasahara, Yoonseok Lee, Oliver Linton, Jun Ma, Yukitoshi Matsushita, Martin Weidner, Yohei Yamamoto, Yu Zhu and the participants of many conferences for helpful discussions and comments. Sebastian Calonico kindly helped us better understand the use of the nprobust package. All remaining errors are our own. Part of this research was conducted while Okui was at Vrije Universiteit Amsterdam, Kyoto University, and NYU Shanghai and while Yanagi was at Hitotsubashi University. This work was supported by the New Faculty Startup Fund from Seoul National University, JSPS KAKENHI Grant Numbers JP25780151, JP25285067, JP15H03329, JP16K03598, JP15H06214, and JP17K13715.
References
- Ackerberg et al. (2007) D. Ackerberg, C. L. Benkard, S. Berry, and A. Pakes. Econometric tools for analyzing market outcomes. In J. J. Heckman and E. E. Leamer, editors, Handbook of Econometrics, volume 6, chapter 63, pages 4171–4276. Elsevier, 2007.
- Anderson and Van Wincoop (2004) J. E. Anderson and E. Van Wincoop. Trade costs. Journal of Economic Literature, 42(3):691–751, 2004.
- Arellano and Bonhomme (2012) M. Arellano and S. Bonhomme. Identifying distributional characteristics in random coefficients panel data models. Review of Economic Studies, 79:987–1020, 2012.
- Calonico et al. (2018a) S. Calonico, M. D. Cattaneo, and M. H. Farrell. On the effect of bias estimation on coverage accuracy in nonparametric inference. Journal of the American Statistical Association, 113(522):767–779, 2018a.
- Calonico et al. (2018b) S. Calonico, M. D. Cattaneo, and M. H. Farrell. nprobust: Nonparametric Robust Estimation and Inference Methods using Local Polynomial Regression and Kernel Density Estimation, 2018b. URL https://CRAN.R-project.org/package=nprobust. R package version 0.1.3.
- Crucini et al. (2015) M. J. Crucini, M. Shintani, and T. Tsuruga. Noisy information, distance and law of one price dynamics across US cities. Journal of Monetary Economics, 74:52–66, 2015.
- Dhaene and Jochmans (2015) G. Dhaene and K. Jochmans. Split-panel jackknife estimation of fixed effects models. Review of Economic Studies, 82:991–1030, 2015.
- Engel and Rogers (2001) C. Engel and J. H. Rogers. Deviations from purchasing power parity: causes and welfare costs. Journal of International Economics, 55(1):29–57, 2001.
- Fernández-Val and Lee (2013) I. Fernández-Val and J. Lee. Panel data models with nonadditive unobserved heterogeneity: Estimation and inference. Quantitative Economics, 4:453–481, 2013.
- Galvao and Kato (2014) A. F. Galvao and K. Kato. Estimation and inference for linear panel data models under misspecification when both n and T are large. Journal of Business and Economic Statistics, 32(2):285–309, 2014.
- Guerre et al. (2000) E. Guerre, I. Perrigne, and Q. Vuong. Optimal nonparametric estimation of first-price auctions. Econometrica, 68(3):525–574, 2000.
- Hall and Horowitz (2013) P. Hall and J. Horowitz. A simple bootstrap method for constructing nonparametric confidence bands for functions. The Annals of Statistics, 41(4):1892–1921, 2013.
- Horowitz and Markatou (1996) J. L. Horowitz and M. Markatou. Semiparametric estimation of regression models for panel data. Review of Economic Studies, 63(1):145–168, 1996.
- Hsiao et al. (1999) C. Hsiao, M. H. Pesaran, and A. K. Tahmiscioglu. Bayes estimation of short-run coefficients in dynamic panel data models. In C. Hsiao, K. Lahiri, L.-F. Lee, and M. H. Pesaran, editors, Analysis of Panels and Limited Dependent Variables Models, pages 268–296. Cambridge University Press, 1999.
- Jochmans and Weidner (2019) K. Jochmans and M. Weidner. Inference on a distribution from noisy draws. Cambridge Working Paper in Economics 19/46, 2019.
- Kasahara et al. (2017) H. Kasahara, P. Schrimpf, and M. Suzuki. Identification and estimation of production function with unobserved heterogeneity. mimeo, 2017.
- Klenow and Malin (2010) P. J. Klenow and B. A. Malin. Microeconomic evidence on price-setting. Handbook of Monetary Economics, 3:231–284, 2010.
- Lee et al. (2018) Y.-J. Lee, R. Okui, and M. Shintani. Asymptotic inference for dynamic panel estimators of infinite order autoregressive processes. Journal of Econometrics, 204:147–158, 2018.
- Li and Racine (2007) Q. Li and J. S. Racine. Nonparametric Econometrics: Theory and Practice. Princeton University Press, 2007.
- Ma et al. (2019) J. Ma, V. Marmer, and A. Shneyerov. Inference for first-price auctions with Guerre, Perrigne, and Vuong’s estimator. forthcoming in the Journal of Econometrics, 2019.
- Mavroeidis et al. (2015) S. Mavroeidis, Y. Sasaki, and I. Welch. Estimation of heterogenous autoregressive parameters using short panel data. Journal of Econometrics, 188:219–235, 2015.
- Nakamura and Steinsson (2008) E. Nakamura and J. Steinsson. Five facts about prices: A reevaluation of menu cost models. The Quarterly Journal of Economics, 123(4):1415–1464, 2008.
- Neyman and Scott (1948) J. Neyman and E. L. Scott. Consistent estimates based on partially consistent observations. Econometrica, 16:1–32, 1948.
- Nickell (1981) S. Nickell. Biases in dynamic models with fixed effects. Econometrica, 49(6):1417–1426, 1981.
- Okui (2008) R. Okui. Panel AR(1) estimators under misspecification. Economics Letters, 101:210–213, 2008.
- Okui (2011) R. Okui. Asymptotically unbiased estimation of autocovariances and autocorrelations for panel data with incidental trends. Economics Letters, 112:49–52, 2011.
- Okui (2014) R. Okui. Asymptotically unbiased estimation of autocovariances and autocorrelations with panel data in the presence of individual and time effects. Journal of Time Series Econometrics, 6(2):129–181, 2014.
- Okui and Yanagi (2019) R. Okui and T. Yanagi. Panel data analysis with heterogeneous dynamics. forthcoming in Journal of Econometrics, 2019.
- Pagan and Ullah (1999) A. Pagan and A. Ullah. Nonparametric Econometrics. Cambridge University Press, 1999.
- Parsley and Wei (1996) D. C. Parsley and S.-J. Wei. Convergence to the law of one price without trade barriers or currency fluctuations. Quarterly Journal of Economics, 111(4):1211–1236, 1996.
- Parsley and Wei (2001) D. C. Parsley and S.-J. Wei. Explaining the border effect: the role of exchange rate variability, shipping costs, and geography. Journal of International Economics, 55(1):87–105, 2001.
- Pesaran and Smith (1995) M. H. Pesaran and R. Smith. Estimating long-run relationships from dynamic heterogeneous panels. Journal of Econometrics, 68(1):79–113, 1995.
- Pesaran et al. (1999) M. H. Pesaran, Y. Shin, and R. P. Smith. Pooled mean group estimation of dynamic heterogeneous panels. Journal of the American Statistical Association, 94(446):621–634, 1999.
- Quenouille (1949) M. H. Quenouille. Approximate tests of correlation in time-series 3. In Mathematical Proceedings of the Cambridge Philosophical Society, volume 45-03, pages 483–484. Cambridge Univ Press, 1949.
- Quenouille (1956) M. H. Quenouille. Notes on bias in estimation. Biometrika, 43(3 and 4):353–360, 1956.
- Roca and Puga (2017) J. D. L. Roca and D. Puga. Learning by working in big cities. Review of Economic Studies, 84(1):106–142, 2017.
- Yazgan and Yilmazkuday (2011) M. E. Yazgan and H. Yilmazkuday. Price-level convergence: New evidence from U.S. cities. Economics Letters, 110(2):76–78, 2011.
Supplementary Appendix of “Kernel Estimation for Panel Data with Heterogeneous Dynamics”
Ryo Okui and Takahide Yanagi
May, 2019
This supplementary appendix contains technical discussions omitted from the main text and Monte Carlo simulation results. Appendix A presents the proofs of the theorems in the main body of the paper. Appendix B presents the technical lemmas used in the proofs of the theorems. Appendices C and D present the technical discussions on the validity of the infinite-order expansions in Theorem 1. Appendix E presents the simulation results.
Appendix A Appendix: Proofs of the theorems
This appendix collects the proofs of the theorems. In the following, we denote a generic positive constant by .
A.1 Proof of Theorem 1
The density of .
We evaluate each term in the following Taylor expansion:
(A.1)
(A.2)
(A.3) |
For (A.1), we use the standard results for the kernel density estimation. Lemma 7 under Assumptions 1, 5, and 6 shows that as and with . Furthermore, Lemma 7 also shows that:
as and with and .
For (A.2), the mean is zero by the law of iterated expectations, because and . The variance is:
by Lemmas 1 and 6. Therefore, (A.2) is by Markov inequality.
For the term in (A.3), the mean is:
by the law of iterated expectations and Lemma 6 with the definition of
The variance is:
by Lemmas 1 and 6. Thus, it holds that:
Consequently, we obtain the desired result for by Slutsky’s theorem.
The density of .
We evaluate each term in the following Taylor expansion:
(A.4)
(A.5)
(A.6) |
For (A.4), the consistency and asymptotic normality of the term are established by the same argument as for the density of .
For (A.5), we have the following equation based on the expansion for :
(A.7)
(A.8)
(A.9)
(A.10) |
For (A.7), the mean is zero by the law of iterated expectations given . The variance is:
by Lemmas 3 and 6. Thus, (A.7) is . For (A.8), denoting , the mean is expanded as:
by the law of iterated expectations, Lemma 6, and with the definition of:
The variance is:
by the Cauchy–Schwarz inequality and Lemmas 1 and 6. Thus, (A.8) is . For (A.9), the triangle inequality and the Cauchy–Schwarz inequality lead to:
by Lemmas 1 and 6. Thus, (A.9) is . In the same manner, we can show that (A.10) is also . These results mean that (A.5) is .
For (A.6), it is easy to see that:
by the same procedures to show the order of the terms in (A.5). The mean of the term is:
by the law of iterated expectations and Lemma 6 with the definition of
The variance of the term is:
by Lemmas 3 and 6. Thus, it holds that:
Consequently, we obtain the desired result for by Slutsky’s theorem.
The density of .
We regard as a function of two variables . Taylor’s theorem for multivariate functions leads to:
(A.11)
(A.12)
(A.13) |
We evaluate each term below.
For (A.11), the consistency and asymptotic normality of the term are established by the same argument as for the density of .
(A.12) contains two terms. Of these, we consider only , because the other term can be evaluated by the same argument. However, this term is analogous to that in (A.5), so it can be evaluated by the same argument. This means that (A.12) can be written as for a nonrandom .
For (A.13), we evaluate the mean of the term:
which contains terms. Among these, we consider only , as the other terms can be evaluated in the same manner. However, this term is analogous to that in (A.6), so it can be evaluated by the same argument. This means that (A.13) can be written as:
for a nonrandom .
Consequently, we have the desired result for by Slutsky’s theorem. ∎
A.2 Proof of Theorem 2
We show the proof for the density estimator of the heterogeneous mean only; the proofs for the autocovariances and autocorrelations are analogous. The proof of Theorem 1 has shown that:
This result implies that the estimators based on the half-panel data are:
for . As a result, the HPJ bias-corrected estimator satisfies:
Therefore, the same argument as for the term in (A.4) leads to the desired result. ∎
A.3 Proof of Theorem 3
The CDF of .
We evaluate each term in the following Taylor expansion:
(A.14)
(A.15)
(A.16)
(A.17) |
For the term in (A.14), Lemma 8 under Assumptions 1, 5, and 8 shows that:
as and . Moreover, Lemma 8 also shows that:
as and with .
For (A.15), the mean is zero given and . The variance is:
For (A.16), we define . The mean is:
by the law of iterated expectations, Lemma 6, and with the definition of
The variance is:
For the term in (A.17), the mean is:
by the law of iterated expectations and Lemmas 1 and 6 with the definition of
The variance is:
by Lemmas 2 and 6. Thus, (A.17) can be written as:
Consequently, we obtain the desired result for by Slutsky’s theorem.
The CDF of .
We evaluate each term in the following Taylor expansion:
(A.18)
(A.19)
(A.20)
(A.21) |
For (A.18), consistency and asymptotic normality are established by the same arguments as for the CDF of .
For (A.19), we have the following equation by the expansion for :
(A.22)
(A.23)
(A.24)
(A.25) |
For the term in (A.22), the mean is zero and the variance is:
by Lemmas 3 and 6. Thus, (A.22) is . For (A.23), the mean is:
by the law of iterated expectations and Lemma 6 with the definition of
The variance is:
by Lemmas 1 and 6. Thus, (A.23) is . For (A.24), the absolute mean is:
by Lemmas 1 and 6. Thus, (A.24) is . For (A.25), we can show that it is by the same argument. Thus, (A.19) is .
For (A.20), it is easy to see that:
by similar procedures, to show the orders of (A.22), (A.23), (A.24), and (A.25). Introducing the shorthand notation , the mean of the term is:
by Lemma 6 and with the definition of
The variance of the term is:
For (A.21), it is easy to see that:
by the same argument as for (A.20). The mean of the term is:
by Lemma 6 with the definition of
The variance is:
by Lemmas 4 and 6. Thus, (A.21) can be written as:
Consequently, we obtain the desired result for by Slutsky’s theorem.
The CDF of .
We regard as a function of two variables . Taylor’s theorem for multivariate functions leads to:
(A.26)
(A.27)
(A.28)
(A.29) |
For (A.26), the consistency and asymptotic normality of the term are established by the same argument as for the CDF of .
(A.27) contains two terms. Of these, we focus only on , as the other term can be evaluated in the same manner. However, this term is analogous to that in (A.19), so it can be evaluated by the same argument. This means that (A.27) can be written as for a nonrandom .
(A.28) contains three terms. Of these, we focus only on , as the other terms can be evaluated in the same manner. However, this term is analogous to that in (A.20), so it can be evaluated by the same argument. This means that (A.28) is also for a nonrandom .
For (A.29), we evaluate the mean of the term
which contains terms. Of these terms, we consider only:
as the other terms can be evaluated in the same manner. This term is analogous to that in (A.21) and can be evaluated by the same argument. This means that (A.29) can be written as:
for a nonrandom .
Consequently, we have the desired result for by Slutsky’s theorem. ∎
Appendix B Appendix: Lemmas
This appendix contains the technical lemmas used to prove the theorems in the main body.
We first present the lemmas for which the proofs are given in Okui and Yanagi (2019).
Lemma 1.
Lemma 2.
Lemma 3.
Lemma 4.
Lemma 5.
We repeatedly use the following lemmas to prove our theorems. The proofs are similar to those in Pagan and Ullah (1999) and Li and Racine (2007), and are omitted.
Lemma 6.
Consider a continuous random variable , a random vector , and an interior point . Suppose that a function satisfies , , and , and that and the density are twice boundedly continuously differentiable at . It holds that
where .
Note that the above result implies that, if we set (constant):
Suppose that is a random sample of a continuous random variable . We denote the density and CDF of by and , respectively.
Lemma 7.
Let be the kernel density estimator. Let be a fixed interior point in the support of . Suppose that the kernel function is symmetric and satisfies , , , and , and that is bounded away from zero and three-times boundedly continuously differentiable near . When and with , it holds that and . Moreover, when and , it holds that .
Lemma 8.
Let be the kernel CDF estimator. Let be a fixed interior point in the support of . Let be the derivative. Suppose that is symmetric and satisfies , , , and , and that is three-times boundedly continuously differentiable near . When and , then and . Moreover, when also holds, it holds that .
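To fix ideas, the kernel density and kernel-smoothed CDF estimators referred to in Lemmas 7 and 8 can be sketched as follows; the Epanechnikov kernel and the function names are illustrative choices, and the bandwidth is taken as given.

```python
import numpy as np

def epanechnikov(u):
    """Epanechnikov kernel: symmetric, integrates to one on [-1, 1]."""
    return np.where(np.abs(u) <= 1.0, 0.75 * (1.0 - u**2), 0.0)

def kernel_density(x, data, bandwidth):
    """Kernel density estimator evaluated at the point x."""
    u = (x - np.asarray(data)) / bandwidth
    return epanechnikov(u).mean() / bandwidth

def kernel_cdf(x, data, bandwidth):
    """Kernel-smoothed CDF estimator: average of the integrated kernel,
    where the integral of the Epanechnikov kernel from -1 to u is
    0.75 * (u - u**3 / 3) + 0.5."""
    u = np.clip((x - np.asarray(data)) / bandwidth, -1.0, 1.0)
    return (0.75 * (u - u**3 / 3.0) + 0.5).mean()
```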
Appendix C Appendix: The validity of the infinite-order Taylor expansion
This appendix discusses the validity of the infinite-order Taylor expansion for the density estimation in Theorem 1. The discussion for the expansion of the CDF estimation in Theorem 3 is similar.
The infinite-order Taylor expansion of is:
The expansion holds if the remainder term of the finite-order Taylor expansion converges to zero as the order of the expansion increases. We show that this is the case with probability approaching one. The remainder term is given by:
where is between and . It suffices to show that the remainder converges to zero, as , with probability approaching one. We observe that:
(A.30)
We argue that the term in the first parenthesis of (A.30) converges to zero, as , with probability approaching one. Note that the convergence holds when . For this, we observe that for any fixed and positive integer :
(A.31)
by Assumption 1, Markov’s inequality, and Lemma 2, 4, or 5 with fixed . The probability on the left-hand side of (A.31) thus converges to one if . By the binomial theorem, we observe that:
As a result, the probability on the left-hand side of (A.31) converges to one if for sufficiently large as and . By taking , we obtain the desired result. We note that this condition is significantly weaker than the relative magnitude condition in Theorem 1. Hence, the term in the first parenthesis of (A.30) converges to zero, as , with probability approaching one.
In a similar manner, we can observe that the term in the second parenthesis of (A.30) converges to zero with probability approaching one under regularity conditions.
Therefore, the infinite-order Taylor expansion in Theorem 1 holds under regularity conditions.
Appendix D Appendix: The infinite series of the asymptotic biases
This appendix discusses the conditions under which the infinite series of the asymptotic biases is well defined (i.e., summable and convergent). We focus on the density estimator for only, because the discussions for the other estimators are similar.
To examine the series of the asymptotic biases, we focus on the nonlinearity biases of the density estimator . Let . For the nonlinearity bias in (A.3) of the proof of the density estimation, we observe that:
by the law of iterated expectations, a change of variables, and Taylor’s theorem with located between and . The equation for any odd is equal to:
because of the symmetry of . In contrast, the equation for any even is equal to:
We focus on the summability of the series of the biases for odd only, because the discussion for even is the same. The partial sum of the series of the biases for odd can be written as:
where we define the variables and .
We examine the series of only. The discussion for is the same. The ratio test implies that the series is summable and convergent if . We observe that:
for any . It converges to zero as if over and if . The former is a regularity condition, and we can simply assume it. The latter is also a regularity condition, and its validity can be verified directly when, for example, is the Gaussian kernel function. Hence, the condition for the ratio test is satisfied, implying that the series is summable and convergent.
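For completeness, the ratio test used in this argument states, for a generic series with nonzero terms $a_k$ (not the paper’s notation), that
\[
\limsup_{k \to \infty} \left| \frac{a_{k+1}}{a_k} \right| < 1
\;\Longrightarrow\;
\sum_{k=1}^{\infty} |a_k| < \infty ,
\]
so verifying that the ratio of successive bias terms vanishes is sufficient for summability.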
The above discussions imply that the infinite series of the asymptotic biases is well defined under regularity conditions.
Appendix E Appendix: Monte Carlo simulations
This section presents the results of the Monte Carlo simulations. We focus here on the density estimation only. The number of simulation replications is 5,000.
Design.
We generate the data using the AR(1) process where , , and . Note that this design satisfies , , and . We generate the unit-specific random variables , , and . We consider and .
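A minimal sketch of this data-generating step is given below; the distributions assigned to the unit-specific intercepts, AR(1) coefficients, and innovation standard deviations are placeholders and do not reproduce the paper’s design.

```python
import numpy as np

def simulate_heterogeneous_ar1(n_units, n_periods, burn_in=100, seed=0):
    """Simulate a panel in which each unit follows its own AR(1) process."""
    rng = np.random.default_rng(seed)
    intercept = rng.uniform(-1.0, 1.0, n_units)   # placeholder distribution
    ar_coef = rng.uniform(0.2, 0.8, n_units)      # placeholder distribution
    sigma = rng.uniform(0.5, 1.5, n_units)        # placeholder distribution
    y = np.zeros((n_units, n_periods + burn_in))
    for t in range(1, n_periods + burn_in):
        y[:, t] = intercept + ar_coef * y[:, t - 1] + rng.normal(0.0, sigma)
    return y[:, burn_in:]  # discard the burn-in so each series is near stationarity
```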
Estimators.
We estimate the densities of , , and at their 20%, 40%, 60%, and 80% quantiles based on four estimators. The first is the naive estimator (NE) without split-panel jackknife bias correction. The second and third are the HPJ and TOJ estimators. The fourth is the infeasible estimator (IE) based on the true , , and . For all estimators, we use the Epanechnikov kernel and the coverage-error-optimal bandwidth of Calonico et al. (2018a).
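The following sketch illustrates how the NE, HPJ, and TOJ density estimates could be assembled from unit-level statistics. The TOJ weights (3, −3, 1) over the full, half, and third panels shown here are an assumption (the standard choice for removing first- and second-order bias terms under a polynomial bias expansion), and bandwidth selection (e.g., the coverage-error-optimal rule) is taken as given.

```python
import numpy as np

def epanechnikov_density(x, estimates, h):
    """NE: kernel density of the unit-level estimates (e.g., unit-wise means)."""
    u = (x - np.asarray(estimates)) / h
    return np.where(np.abs(u) <= 1.0, 0.75 * (1.0 - u**2), 0.0).mean() / h

def subpanel_average(x, panel, stat, n_splits, h):
    """Average the NE over n_splits consecutive subpanels along the time axis."""
    pieces = np.array_split(panel, n_splits, axis=1)
    return np.mean([epanechnikov_density(x, stat(p), h) for p in pieces])

def hpj(x, panel, stat, h):
    """HPJ: 2 * full-panel estimate minus the average over the two half panels."""
    full = epanechnikov_density(x, stat(panel), h)
    return 2.0 * full - subpanel_average(x, panel, stat, 2, h)

def toj(x, panel, stat, h):
    """TOJ sketch with assumed weights (3, -3, 1) over full, half, and third panels."""
    full = epanechnikov_density(x, stat(panel), h)
    return (3.0 * full
            - 3.0 * subpanel_average(x, panel, stat, 2, h)
            + subpanel_average(x, panel, stat, 3, h))
```

For the density of the heterogeneous means, for instance, one would pass a panel of shape (N, T) and `stat = lambda p: p.mean(axis=1)`.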
Results.
Tables 1, 2, and 3 present the simulation results for the densities of , , and , respectively. The tables report the true values of the parameters and the bias and standard deviation (std) of each estimator. They also report the coverage probability (cp) of the 95% CI computed by the RBC procedure based on each estimator. Table 4 reports the mean and the standard deviation of the selected bandwidths for the NE and IE. Note that the HPJ and TOJ use the same bandwidth as the NE, as discussed in the main body.
The NE exhibits large biases, especially with small . In particular, the biases of the density estimates for and are severe because of the incidental parameter biases. As a result, the coverage probabilities of the NE are much smaller than 0.95. The performance of the NE improves as grows, but it remains unsatisfactory for several parameters even when . These results highlight the importance of bias correction even with relatively large .
The HPJ and TOJ perform significantly better than the NE, especially when the NE exhibits large biases. The TOJ outperforms the HPJ when the HPJ suffers from relatively large biases, which stem from relatively large higher-order nonlinearity biases. Furthermore, for several parameters, the TOJ performs as well as the IE in terms of bias reduction and coverage probability. Note that the TOJ may inflate the estimation variability, especially when is small, but such a cost is unavoidable when the goal is unbiased inference.
These simulation results demonstrate the severity of the incidental parameter and nonlinearity biases, and the success of the split-panel jackknife bias correction and the RBC inference. We thus recommend RBC inference based on the split-panel jackknife bias-corrected estimators.
Table 1: true value, bias, standard deviation (std), and 95% CI coverage probability (cp) of the NE, HPJ, TOJ, and IE at the 20%, 40%, 60%, and 80% quantiles.
Table 2: true value, bias, standard deviation (std), and 95% CI coverage probability (cp) of the NE, HPJ, TOJ, and IE at the 20%, 40%, 60%, and 80% quantiles.
Table 3: true value, bias, standard deviation (std), and 95% CI coverage probability (cp) of the NE, HPJ, TOJ, and IE at the 20%, 40%, 60%, and 80% quantiles.
Table 4: mean and standard deviation (std) of the selected bandwidths for the NE and IE at the 20%, 40%, 60%, and 80% quantiles.