Abstract
In this paper we propose new panel tests to detect changes in persistence. The test statistics are used to test the null hypothesis of stationarity against the alternative of a change in persistence from I(0) to I(1), from I(1) to I(0), and in an unknown direction. The limiting null distributions of the tests are derived and evaluated in small samples by means of Monte Carlo simulations. An empirical illustration is also provided.
Similar content being viewed by others
1 Introduction
Over the last two decades, a vast literature has investigated whether economic and financial time series may be characterized by a change in persistence between separate I(1) and I(0) regimes rather than simply I(1) or I(0) behavior. Changes of this kind in macroeconomic variables are well documented; see the literature reviews in Kim (2000) and Leybourne et al. (2003). A non-exhaustive list of the variables for which such phenomena have been observed includes inflation, real output, budgetary deficits, interest rates and exchange rates. Interestingly, while many data sets are in fact panels of multiple time series, the way that existing tests are constructed requires that the series are tested one at a time. This is wasteful in the sense that each time a test is carried out the information contained in the other series is effectively ignored. The current paper can be seen as a reaction to this. The purpose is to develop tests for changes in persistence that explores the multiplicity of series, and that can be seen as panel extensions of the time series tests of Kim (2000), Kim et al. (2002), and Busetti and Taylor (2004). The tests can be used to flexibly test the null hypothesis of stationarity against the alternative of a change in persistence not only from I(0) to I(1), and from I(1) to I(0), but also when the direction is unknown. The data generating process (DGP) considered is quite general. Some of the allowances are unit-specific constant and trend terms, cross-section heteroskedasticity, error serial correlation and cross-section dependence in the form of common factors. The asymptotic distributions of the tests are derived and evaluated in small samples using Monte Carlo simulation. An empirical illustration is also provided showing how how inflation of 20 developed countries has undergone a shift from I(0) to I(1).
The rest of the paper is organized as follows. Sections 2 and 3 present the model, the test statistics, and their asymptotic distributions, which are evaluated using simulations in Sect. 4. Section 5 reports the results from the empirical application. Section 6 concludes. Proofs of important results are provided in the Appendix.
2 Model and assumptions
Consider the panel data variable \(Y_{i,t}\), where \(i = 1,...,N\) and \(t = 1,...,T\) index the time-series and cross-sectional units, respectively. The DGP of this variable is given by
where \(D_{t,p} = (1,t,...,t^p)'\) is a p-order trend polynomial such that \(D_{t,p} = 0\) is \(p=-1\), \(F_t\) is an \(r\times 1\) vector of common factors with \(\lambda _{i}\) being the corresponding vector of factor loadings, and \(\varepsilon _{i,t}\) is a mean zero and I(0) error term. The following three specifications of \(\mu _{i,t}\) are considered, where 1(A), \(\lfloor x\rfloor \), \(\eta _{i,t}\) and \(\tau _i^0\in [0,1]\) denote the indicator function of the event A, the integer part of x, a mean zero I(0) error term, and the break fraction, respectively:
-
MU1.
I(0) \(\rightarrow \) I(1): \(\mu _{i,t}=\mu _{i,t-1}+1(t>\lfloor T\tau _{i}^0\rfloor )\eta _{i,t}\).
-
MU2.
I(1) \(\rightarrow \) I(0): \(\mu _{i,t}=\mu _{i,t-1}+1(t \le \lfloor T\tau _{i}^0\rfloor )\eta _{i,t}\).
-
MU3.
Unknown direction: I(0) \(\rightarrow \) I(1) or I(1) \(\rightarrow \) I(0).
Under MU1 \(Y_{i,t}\) is I(0) up to and including time \(\lfloor \tau _{i}^0 T\rfloor \) but is I(1) after the break, provided that \(\sigma _{\eta , i}^2 = \mathrm {var}(\eta _{i,t}) > 0\). Under MU2 \(Y_{i,t}\) is I(1) up to and including time \(\lfloor \tau _{i}^0 T\rfloor \) but it is I(0) after the break, provided again that \(\sigma _{\eta ,i}^2 > 0\). Therefore, the hypothesis of stationarity against a shift in persistence from I(0) to I(1) or viceversa can be stated as \(H_0:\sigma _{\eta ,1}^{2}= ...= \sigma _{\eta ,N}^{2}=0\) versus \(H_1:\sigma _{\eta ,i}^{2} > 0\) for at least some i. Whenever the alternative is I(1) \(\rightarrow \) I(0) we write “\(H_1:\) I(1) \(\rightarrow \) I(0)”, whereas if the alternative is I(1) \(\rightarrow \) I(0), we write “\(H_{1}:\) I(1) \(\rightarrow \) I(0)”.
The conditions placed on the above DGP are given in Assumption 1, where \(C <\infty \), tr(A), \(||{A}|| = \sqrt{tr({A}' {A})}\), \(\rightarrow _p\) and \(\mathcal {F}_{i,t}\) denote a generic positive constant, the trace and Euclidean norm of the (generic) matrix A, convergence in probability, and the sigma-field generated by \(\{(\varepsilon _{i,n},\eta _{i,n})\}_{n=1}^t\), respectively.
Assumption 1
-
(i)
\(\varepsilon _{i,t} = \gamma _i(L) v_{i,t}\), where \(v_{i,t}\) is independent and identically distribution (iid) with \(E(v_{i,t}) = 0\), \(E(v_{i,t}^2) = 1\), \(E(v_{i,t}^8) \le C\), \(\gamma _i(L) = \sum _{j=0}^\infty \gamma _{ji}L^j\), \(\sum _{j=0}^\infty j||\gamma _{ji}|| \le C\) and \(\gamma _i(1)^2 > 0\);
-
(ii)
\(\eta _{i,t} = \phi _i(L) w_{i,t}\), where \(w_{i,t}\) is iid with \(E(w_{i,t}) = 0\), \(E(w_{i,t}^2) = 1\), \(E(w_{i,t}^8) \le C\), \(\phi _i(L) = \sum _{j=0}^\infty \phi _{ji}L^j\), \(\sum _{j=0}^\infty j||\phi _{ji}|| \le C\) and \(\phi _i(1)^2 > 0\);
-
(iii)
\(F_t\) is I(0) such that \(E(||F_t||^4) \le C\) and \(T^{-1} \sum _{t=1}^{T} F_t F'_t \rightarrow _p \Sigma _F > 0\);
-
(iv)
\(\varepsilon _{i,t}\), \(\eta _{i,t}\) and \(F_t\) are mutually independent;
-
(v)
\(\mu _{1,0} = ... = \mu _{N,0} = 0\);
-
(vi)
\(\lambda _i\) is deterministic such that \(||\lambda _i||^4 \le C\), \(N^{-1}\sum _{i=1}^N\lambda _i\lambda _{i}' \rightarrow \Sigma _\lambda > 0\) as \(N \rightarrow \infty \).
Remark 1
Assumption 1 puts restrictions on the time series and cross-sectional properties of \(\varepsilon _{i,t}\) and \(\eta _{i,t}\). The restrictions are very similar to the ones of Bai and Ng (2004), and we therefore refer to this other paper for a detailed discussion. The main difference when compared to Bai and Ng (2004) is that here \(F_t\) cannot be I(1). Thus, while \(Y_{i,t}\) may be cross-correlated, it cannot be affected by common stochastic trends. However, we would like to point out that this assumption is mainly for ease of interpretation of the test outcome, for if \(F_t\) is allowed to be I(1) the persistence of \(Y_{i,t}\) cannot be inferred from \(e_{i,t}\) alone, and in the present paper we focus on the testing of \(e_{i,t}\). Hence, analogous to the PANIC approach of Bai and Ng (2004), if \(F_t\) is permitted to be I(1), then we also need to test this variable.
3 The test statistics
The general testing idea is to first purge the effect of \(F_t\), and then to submit the resulting residuals to a test for a change in persistence. The implementation of the first step depends on whether \(F_t\) is known or not.
3.1 \(F_t\) known
Consider the generic variable \(X_{i,t}\). The detrended version of this variables is henceforth denoted \(X_{i,t}^p = X_{i,t} - \sum _{n=1}^T X_{i,n}a_{n,t,p}\), where \(a_{n,k,p}=D_{n,p}' (\sum _{t=1}^T D_{t,p}D_{t,p}' )^{-1}D_{k,p}\) and \(p\ge 0\). If \(p= -1\), then we define \(X_{i,t}^p = X_{i,t}\). In this notation, the detrended and defactored version of \(Y_{i,t}\) is given by \(\hat{e}_{i,t} = Y_{i,t}^p - \hat{\lambda }_i'F_t^p\), where \(\hat{\lambda }_i\) is the least squares (LS) slope estimator in a regression of \(Y_{i,t}^p\) onto \(F_t^p\). Thus, while in this section \(F_t\) is assumed to be known, \(\lambda _i\) is still treated as unknown. Consider the following test statistic, which is suitable for testing if cross-section unit i is I(0) versus I(1) \(\rightarrow \) I(0) (see, for example, Kim 2000; Kim et al. 2002; Busetti and Taylor 2004):
where \(\tau \in [0,1]\), \(S_{i,t}^{0}(\tau )=\sum _{n=1}^{t} \hat{e}_{i,n}\) and \(S_{i,t}^{1}(\tau )=\sum _{n= \lfloor T\tau \rfloor +1}^t \hat{e}_{i,n}\). The error sequences \(\{\hat{e}_{i,n}\}_{n=1}^{\lfloor T\tau \rfloor }\) and \(\{\hat{e}_{i,n}\}_{n=\lfloor T\tau \rfloor + 1}^T\) come from two separate regressions; while the former uses only the first \(\lfloor T\tau \rfloor \) observations, the latter uses only the last \(\lfloor T(1-\tau )\rfloor \) observations.
Remark 2
The \(K_{i,T}(\tau )\) test considered here is in the spirit of Kwiatkowski et al. (1992) in which the constant I(0) null is tested versus the constant I(1) alternative. An alternative approach is to follow Banerjee et al. (1992) and Leybourne et al. (2003) who use the Dickey–Fuller statistic, in which the null and the alternative hypotheses are reversed. Panel variants of these can be constructed in the same way as the one suggested below for \(K_{i,T}(\tau )\) (see Demetrescu and Hanck 2013, for such a proposal).
Let \(\mathcal {C} = [\tau _{min},\tau _{max}]\subseteq (0,1)\). In this paper, we consider three transformations to eliminate the dependence on \(\tau \) in \(K_{i,T}(\tau )\) (see, for example, Kim, 2000);
-
T1.
The maximum-Chow transformation:
$$\begin{aligned} K_{i,T}^1 = \max _{s = \lfloor T\tau _{min}\rfloor ,..., \lfloor T\tau _{max}\rfloor } K_{i}(s/T). \end{aligned}$$ -
T2.
The mean-exponential transformation:
$$\begin{aligned} K_{i,T}^2 = \ln \left( (\lfloor T(\tau _{max} - \tau _{min})\rfloor +1)^{-1}\sum _{s = \lfloor T\tau _{min}\rfloor }^{\lfloor T\tau _{max}\rfloor } \exp [K_{i}(s/T)] \right) . \end{aligned}$$ -
T3.
The mean score transformation:
$$\begin{aligned} K_{i,T}^3 = (\lfloor T(\tau _{max} - \tau _{min})\rfloor +1)^{-1}\sum _{s = \lfloor T\tau _{min}\rfloor }^{\lfloor T\tau _{max}\rfloor } K_{i}(s/T). \end{aligned}$$
In Appendix (Proof of Theorem 1), we show that \(K_{i,T}(\tau ) \rightarrow _w K_i(\tau )\) as \(T\rightarrow \infty \), where \(\rightarrow _w\) signifies weak convergence and \(K_i(\tau )\) is a certain ratio of stochastic integrals. Since \(K_1(\tau ),...,K_N(\tau )\) are iid, we may define \(\mu _{K,j}= E(K_{i}^j)\) and \(\sigma _{K,j}^2 = \mathrm {var}(K_{i}^j)\) for \(j\in \{ 1,2,3 \}\). Numerical values of \(\mu _{K,j}\) and \(\sigma _{K,j}\) are reported in Table 1. The proposed panel test statistic for testing \(H_0\) versus \(H_1:\) I(0) \(\rightarrow \) I(1) is given by
For testing if cross-section unit i is I(0) versus I(1) \(\rightarrow \) I(0), the following “reverse” test statistic can be used (see Kim 2000; Kim et al. 2002; Busetti and Taylor 2004):
which can be transformed using T1–T3 to eliminate the dependence on \(\tau \). The resulting transformed statistic is written in an obvious notation as \(R_{i}^j\). Based on this test statistic, we may define \(R_{NT}^j = \sigma _{R,j}^{-1} N^{-1/2} \sum _{i=1}^N (R_{i,T}^j-\mu _{R,j} )\) with obvious definitions of \(\sigma _{R,j}^{2}\) and \(\mu _{R,j}\). When the direction of the persistency is unknown, the following maximum statistic may be used:
which can again be normalized to obtain \(M_{NT}^j = \sigma _{M,j}^{-1}N^{-1/2} \sum _{i=1}^N (M_{i,T}^j-\mu _{M,j} )\).
Theorem 1
Under \(H_0\) and Assumption 1, as \(N,\,T \rightarrow \infty \) with \(N/T\rightarrow 0\),
where\(\rightarrow _d\)signifies convergence in distribution.
Remark 3
While the test statistics considered here are independent of \(\tau _1^0,...,\tau _N^0\), in applications it is sometimes useful to be able to estimate these parameters. This can be accomplished using the proposal of Kim (2000, Sect. 3.2), which basically amounts to setting \(\hat{\tau }_i^0\) equal to the suitably maximizing or minimizing value of \(K_{i,T}(\tau )\), depending on whether it is I(0) \(\rightarrow \) I(1) or I(1) \(\rightarrow \) I(0) that is being tested. Alternatively, we may follow Busetti and Taylor (2004, Sect. 6.2), who suggest setting \(\hat{\tau }_i^0\) equal to the value of \(\tau _i^0\) that minimizes the sum of squares of \(\hat{e}_{i,t}\).
Remark 4
The requirement that \(N/T \rightarrow 0\) is sufficient but not necessary and is needed to make sure that certain remainder terms are negligible. However, the order of these terms is not the sharpest possible. A more elaborate asymptotic analysis would be required to obtain the exact order. In Sect. 4, we use Monte Carlo simulation to evaluate the effect of N / T in small samples.
3.2 \(F_t\) unknown
The estimation of \(F_t\) can be performed in two ways; (i) unrestrictedly, or (ii) restricted under \(H_0\). In both cases, we follow the bulk of the previous literature and use the principal components method (see, for example, Bai and Ng 2004). The restricted estimator of \(F = ( F_{1},..., F_{T})'\), denoted \(\hat{F}^0 = (\hat{F}_{1}^0,...,\hat{F}_{T}^0)'\), is \(\sqrt{T}\) times the eigenvectors corresponding to the first r largest eigenvalues of the \(T \times T\) matrix \(Y^p(Y^p)'\), where \(Y^p = ( Y_{1}^p,..., Y_{N}^p)\) and \(Y_i^p = ( Y_{i,1}^p,..., Y_{i,T}^p)'\) are \(T\times N\) and \(T\times 1\), respectively. Under the normalization \(T^{-1}\hat{F}^0(\hat{F}^0)'= I_r \), the estimated loading matrix is \((\hat{\lambda }^0)' = (\hat{\lambda }_1^0,...,\hat{\lambda }_N^0)=T^{-1}(\hat{F}^0)' Y^p\). The restricted estimator of \(e_{i,t}\) that we will be considering can now be constructed as
Let \(X_{i,t}^{p-1}\) be \(X_{i,t}\) when detrended using a trend polynomial of order \(p-1\). Hence, \(X_{i,t}^{p-1} = X_{i,t}\) if \(p = 0\). Let \(f_t = \Delta F_t\) and \(y_{i,t} = \Delta Y_{i,t}\) (for \(t=2,...,T\)). The unrestricted estimators \(\hat{f}_t^1\) and \(\hat{\lambda }_i^1\) of (the space spanned by) \(f_t^{p-1}\) and \(\lambda _i\) are \(\hat{F}_t^0\) and \(\hat{\lambda }_i^0\), respectively, but with \(Y_{i,t}^p\) replaced by \(y_{i,t}^{p-1}\). Let
where \(\tilde{e}_{i,1}^1 = 0\). The unrestricted estimator \(\hat{e}_{i,t}^1\) of \(e_{i,t}\) is given by \(\hat{e}_{i,t}^1 = (\tilde{e}_{i,t}^1)^p\). The appropriate test statistics to consider when \(F_t\) is unknown, henceforth denoted \(K_{hNT}^j\), \(R_{hNT}^j\) and \(M_{hNT}^j\) for \(h \in \{ 0,1 \}\), are given by \(K_{NT}^j\), \(R_{NT}^j\) and \(M_{NT}^j\), respectively, with \(\hat{e}_{i,t}\) replaced by \(\hat{e}_{i,t}^h\).
Theorem 2
Under \(H_0\) and Assumptions 1, as \(N,\,T \rightarrow \infty \) with \(N/T\rightarrow 0\),
As Theorem 2 makes clear, the factors can be unknown and still the asymptotic distributions of the test statistics are N(0, 1). This is in agreement with the results reported by Bai and Ng (2004) for their pooled panel unit root tests.
4 Monte Carlo simulations
A small-scale Monte Carlo study was conducted to investigate the properties of the new tests in small samples. The DGP is given by a restricted version of (1)–(2) that sets \(\varepsilon _{i,t} \sim N(0,\sigma _{\varepsilon , i}^2)\), \(\eta _{i,t} \sim N(0,\sigma _{\eta }^2)\), \(\sigma _{\eta }\in \{0,0.25,0.5\}\), \(\tau _i^0 \sim U(0.3,0.7)\), \(r = 1\), and \(F_t=\rho F_{t-1} + v_t\), where \(v_t\sim N(0,1)\) and \(\rho \in \{0.3,0.6\}\) (see, for example, Gengenbach et al. 2010, for a similar parametrization). For \(\sigma _{\varepsilon , i}\), we consider two cases. In the first, \(\sigma _{\varepsilon , i} = 1\) for all i, while in the second, \(\sigma _{\varepsilon , i} \sim U (1,2)\). Since a more volatile idiosyncratic error will make \(F_t\) more difficult to discern, we expect that the results for the second case will deteriorate when compared to the first. All results are based on 1,000 replications of samples of size \(N\in \{ 5, 10, 20 \}\) and \(T\in \{50,100\}\). Also, following Kim (2000), \(\mathcal {C}=[0.20,0.80]\). Results were obtained for \(p \in \{ 0,1\}\), although in this paper we focus on the results for the empirically most common specification with \(p = 0\) (a constant but no trend). The results for \(p=1\) (constant and trend) can be obtained upon request. Both the restricted and unrestricted factor estimation methods were simulated. Interestingly, the restricted method led to better results in terms of both size accuracy and power. In this paper, we therefore only report the results for the restricted method, where the number of common factors is determined using the \(IC_2\) criterion of Bai and Ng (2002) with a maximum of three factors.Footnote 1
The 5% size and power results are reported in Tables 2, 3, 4 and 5. While Tables 2 (\(\rho = 0.3\)) and 3 (\(\rho = 0.6\)) contain the results for the tests of I(0) \(\rightarrow \) I(1), Tables 4 (\(\rho = 0.3\)) and 5 (\(\rho = 0.6\)) contain the corresponding results for I(1) \(\rightarrow \) I(0). The information content of these tables may be summarized as follows.
-
All tests have good size accuracy when \(\sigma _{\varepsilon , i} = 1\) and \(\rho = 0.3\). This is true for all constellations of T and N considered, although the distortions do have a tendency to increase slightly in N, which is consistent with the previous panel unit root literature (see Westerlund and Breitung 2013, for a discussion). While there are no big differences, the best size accuracy is generally obtained by using \(K_{NT}^2\), \(R_{NT}^2\) and \(M_{NT}^2\), whereas \(K_{NT}^3\), \(R_{NT}^1\) and \(R_{NT}^3\) generally leads to the worst accuracy.
-
As expected, increases in \(\rho \) and/or \(\sigma _{\varepsilon , i}\) generally lead to reduced size accuracy, although the distortions are never very large. This is true regardless of the direction of the change in persistence. In fact, the results are remarkably stable, given that the test statistics do not require any corrections to account for nuisance parameters.
-
All tests perform quite well in terms of power, and there are clear improvements as N and/or T increases. The fact that power is not only increasing in T, but also in N illustrates the advantage of accounting for the cross-sectional variation of the data. Power is also increasing in the distance to the null, as measured by \(\sigma _\eta \), which is again just as expected.
5 Empirical illustration
The question of whether inflation should be considered as I(0) or I(1) has been subject to a long debate. According to recent studies (see, for example, Kim 2000; Busetti and Taylor 2004), however, inflation may be better characterized by a change in persistence between separate I(1) and I(0) regimes rather than simply I(1) or I(0) behavior. The purpose of this illustration is to test this hypothesis using a large panel of quarterly CPI inflation data covering 20 countries (Australia, Austria, Belgium, Canada, Denmark, Finland, France, Germany, Greece, Italy, Japan, Korea, the Netherlands, New Zealand, Norway, Spain, Sweden, Switzerland, the UK and the US) between 1970:1 and 2013:4. All data are taken from OECD Main Economic Indicators.
The number of common factors is determined in the same way as in the simulations. As is customary when dealing with inflation (see, for example, Leybourne et al. 2003), the tests are fitted with a constant but no trend. The results are reported in Table 6. The first thing to note is that while in case of \(K_{NT}^1\), \(K_{NT}^2\) and \(K_{NT}^3\) there is no evidence against the I(0) null, \(R_{NT}^1\), \(R_{NT}^2\) and \(R_{NT}^3\) all lead to a clear rejection. This is true even at the most conservative 1% level. We therefore conclude that inflation has been subject to a change in persistence from I(1) to I(0), which is in agreement with the recent empirical literature based on US data (see, for example, Busetti and Taylor 2004; Harvey et al. 2006). A common explanation for the observed change in persistence of inflation in the US is that it is due to the stock market collapse of the late 1980s and the recession that followed it. One interpretation of the results reported in the current paper is therefore that they reflect the worldwide recession of the early 1990s, which was to a large extent triggered by the recession in the US. Another possibility is that the results reflect in part monetary policy shifts (see, for example, Davig and Doh 2014, and the references provided therein).
6 Conclusion
This paper develops panel tests that are suitable for testing the null hypothesis of stationarity against the alternative of a change in persistence from I(0) to I(1), from I(1) to I(0), or when the direction is unknown. The DGP used for this purpose is quite general and allows unit-specific constant and trend terms, cross-section heteroskedasticity, error serial correlation and cross-section dependence in the form of common factors.
Notes
See Westerlund and Mishra (2016) for a more elaborate selection approach that uses a data-driven penalty.
References
Bai J (2003) Inferential theory for factor models of large dimensions. Econometrica 71:135–171
Bai J, Ng S (2002) Determining the number of factors in approximate factor models. Econometrica 70:191–221
Bai J, Ng S (2004) A PANIC attack on unit roots and cointegration. Econometrica 72:1127–1177
Banerjee A, Lumsdaine R, Stock J (1992) Recursive and sequential tests of the unit root and trend break hypotheses: theory and international evidence. J Bus Econ Stat 10:271–288
Busetti F, Taylor RAM (2004) Tests of stationarity against a change in persistence. J Economet 123:33–66
Davig T, Doh T (2014) Monetary policy regime shifts and inflation persistence. Rev Econ Stat 96:862–875
Demetrescu M, Hanck C (2013) Nonlinear IV panel unit root testing under structural breaks in the error variance. Stat Pap 54:1043–1066
Gengenbach C, Palm F, Urbain J-P (2010) Panel unit root tests in the presence of cross-sectional dependencies: comparison and implications for modelling. Economet Rev 29:111–145
Harvey DI, Leybourne SJ, Taylor RAM (2006) Modified tests for a change in persistence. J Economet 134:441–469
Kim JY (2000) Detection of change in persistence of a linear time series. J Economet 95:97–116
Kim JY, Franch JB, Amador RB (2002) Corrigendum to “Detection of change in persistence of a linear time series”. J Economet 109:389–392
Kwiatkowski D, Phillips PCB, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root: how sure are we that economic time series have a unit root? J Econ 54:159–178
Leybourne SJ, Kim T, Newbold P, Smith V (2003) Tests for a change in persistence against the null of difference-stationarity. Economet J 6:291–311
Moon HR, Phillips PCB (2000) Estimation of autoregressive roots near unity using panel data. Econ Theory 16:927–997
Westerlund J, Larsson R (2009) A note on the pooling of individual PANIC unit root tests. Economet Theory 25:1851–1868
Westerlund J, Breitung J (2013) Lessons from a decade of IPS and LLC. Economet Rev 32:547–591
Westerlund J, Mishra s (2016) On the determination of the number of factors using information criteria with data-driven penalty. Stat Pap. doi:10.1007/s00362-015-0692-0
Author information
Authors and Affiliations
Corresponding author
Additional information
Previous versions of the paper were presented at the First International Conference in Memory of Carlo Giannini, “Recent Development in Econometric Methodology”, at University of Bergamo, and at seminars at University of Vienna and University of Leicester. The authors would like to thank seminar and conference participants, and in particular David Hendry, Chihwa Kao, Oliver Linton, Robert Kunst, Stephen Pollock, Wojciech Charemza, Panicos Demetriades, Qiang Zhang, Francesco Moscone, and two anonymous referees for many useful comments and suggestions. Westerlund would like to thank the Knut and Alice Wallenberg Foundation for financial support through a Wallenberg Academy Fellowship, and the Jan Wallander and Tom Hedelius Foundation for financial support under research grant number P2014–0112:1.
Appendix: Proofs
Appendix: Proofs
The proofs of Theorems 1 and 2 are established for \(K_{NT}^j\); the proofs for \(R_{NT}^j\) and \(M_{NT}^j\) are entirely analogous.
Proof of Theorem 1
Under MU1, \(\mu _{i,t} = \sum _{k=1}^t 1(k> \lfloor T\tau \rfloor )\eta _{i,k}\), and by further invoking \(H_0\), \(\mu _{i,t} = 0\), giving
It follows that
with obvious definitions of \(F_t^p\) and \(\varepsilon _{i,t}^p\), which in turn implies
Therefore,
Under \(H_0\) and with \(F_t\) known \(Y_{i,t} = \theta _i 'D_{t} + \lambda _i'F_t + \varepsilon _{i,t}\) is just an ordinary time series regression in I(0) variables with exogenous regressors. It follows that \(\sqrt{T}(\hat{\lambda }_i-\lambda _i) = O_p(1)\), and therefore, since \(T^{-1/2}\sum _{n=1}^{t}F_n^p = O_p(1)\),
Hence, using \(\overline{K}_{i,T}(\tau )\) to denote \(K_{i,T}(\tau )\) with \(\hat{e}_{i,n}\) replaces by \(e_{i,n}\), we have
where the first term on the right is the same as in Harvey et al. (2006). It follows from their results that
as \(T\rightarrow \infty \), where \(\rightarrow _w\) signifies weak convergence, and
with \(W_{\varepsilon ,i}(r)\) being a standard Brownian motion, and \(D_p(r)\) is such that \(Q_T^{-1}D_{\lfloor Tr\rfloor ,p} \rightarrow D_p(r)\), where \(Q_T = \mathrm {diag}(1,T,...,T^p)\). Note in particular how \(D_0(r) = 1\) and \(D_1(r) = (1,r)'\). Therefore, by the continuous mapping theorem, and writing \(K_{i,T}^j = H_j(K_{i,T}(\tau ))\) and \(\overline{K}_{i,T}^j = H_j(\overline{K}_{i,T}(\tau ))\) as in Busetti and Taylor (2004),
Let us now consider \(K_{NT}^j\). By using the previous result
where \(O_p(\sqrt{N}T^{-1/2}) = o_p(1)\) under our assumption that \(N/T=o(1)\). We now use the same steps as in Moon and Phillips (2000, p. 994) to verify that \((\overline{K}_{i,T}^j -\mu _{K,j})\) satisfies conditions (i)–(iv) of the central limit theorem of Phillips and Moon (1999, Theorem 2). In so doing we follow their notation and write \(Q_{i,T} = (\overline{K}_{i,T}^j -\mu _{K,j})\), which is iid with mean zero and variance \(\sigma _{K,j}^2 \le C\). We have already shown that \(\overline{K}_{i,T}^j \rightarrow _w \overline{K}_{i}^j\) as \(T\rightarrow \infty \), which implies \(Q_{i,T} \rightarrow _d Q_i = (\overline{K}_{i}^j - \mu _{K,j})\), and it is also not difficult to verify that \(E(Q_{i,T}^2) \rightarrow E(Q_i^2) = \sigma _{K,j}^2\). This verifies conditions (i), (ii) and (iv). Condition (iv) follows from noting that, by the continuous mapping theorem, \(Q_{i,T}^2 \rightarrow _w Q_i^2\). It follows that
as \(N,\,T\rightarrow \infty \) with \(N/T\rightarrow 0\). \(\square \)
Proof of Theorem 2
We begin by considering the case when the estimator of \(e_{i,t}\) is based on the restricted estimators of \(\lambda _i\) and \(F_t\) under \(H_0\). As in Proof of Theorem 1, under MU1 and \(H_0\), \(Y_{i,t}= \theta _i'D_{i,t} + \lambda _i'F_t + \varepsilon _{i,t}\). In order to capture the fact that \(\lambda _i\) and \(F_t\) are not separately identifiable we introduce the \(r\times r\) rotation matrix H such that
Hence,
By Lemmas 1(c) and 2 of Bai and Ng (2004), \(||\hat{\lambda }_i-(H^{-1})'\lambda _i|| = O_p(N^{-1}) + O_p(T^{-1/2})\) and \(||T^{-1/2}\sum _{n=1}^{t}(\hat{F}_n^0-HF_n^p)|| = O_p(N^{-1/2}) + O_p(T^{-3/4})\), where the latter result holds uniformly in t. Hence, since
we can show that
Hence, as in the case when \(F_t\) is known (see Proof of Theorem 1), the estimation and removal of the common component do not affect the asymptotic distribution of the test statistic. Specifically, using \(K_{0i,T}^{j}\) to denote \(K_{i,T}^j\) with \(\hat{e}_{i,n}^0\) in place of \(\hat{e}_{i,n}\), we get
which holds uniformly in (j, i). In order to show that the resulting panel statistic, \(K_{0NT}^{j}\) say, converges to N(0, 1), we may use the same argument as in Westerlund and Larsson (2009).
Consider the unrestricted estimator of \(e_{i,t}\). We have \(\tilde{e}_{i,t}^1 = \sum _{n=2}^t [y_{i,n}^{p-1} - (\hat{\lambda }_i^1)'\hat{f}_n^1]\), where, under \(H_0\), \(y_{i,t} = \Delta Y_{i,t}= \theta _i'\Delta D_{t} + \lambda _i'f_t + \Delta \varepsilon _{i,t}\) with \(f_t = \Delta F_t\). It follows that \(y_{i,t}^{p-1}= \lambda _i'f_t^{p-1} + (\Delta \varepsilon _{i,t})^{p-1}\), and therefore
Consider \(\sum _{n=2}^t\big (\hat{f}_t^1-Hf_t^{p-1}\big )\). From Proof of Theorem 3 in Bai (2003), using V to denote a diagonal matrix consisting of the first r eigenvalues of \((NT)^{-1} y^{p-1}( y^{p-1})'\) in decreasing order,
where we have made use of the fact that \(\sum _{n=2}^t (\Delta \varepsilon _{i,n})^{p-1} = \varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1}\). Now, ||V|| and \(||N^{-1/2}\sum _{i=1}^N \lambda _i (\varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1})||\) are both \(O_p(1)\). Moreover, by Lemma A.1 of Bai (2003),
from which it follows that
By using this and \(\hat{F}_t^1 = \sum _{n=2}^t \hat{f}_t^1 = H(F_t^{p-1}-F_1^{p-1}) + \sum _{n=2}^t (\hat{f}_t^1-Hf_t^{p-1})\), we obtain
suggesting that for \(p\ge 0\),
When appropriately normalized by \(T^{-1/2}\), taking partial sums do not affect the order of the remainder terms. Hence, again, the estimation and removal of the common component do not affect the asymptotic distribution of the test statistic.\(\square \)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Cerqueti, R., Costantini, M., Gutierrez, L. et al. Panel stationary tests against changes in persistence. Stat Papers 60, 1079–1100 (2019). https://doi.org/10.1007/s00362-016-0864-6
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-016-0864-6