Panel stationary tests against changes in persistence

Cerqueti, Roy; Costantini, Mauro; Gutierrez, Luciano; Westerlund, Joakim

doi:10.1007/s00362-016-0864-6

Panel stationary tests against changes in persistence

Regular Article
Open access
Published: 09 December 2016

Volume 60, pages 1079–1100, (2019)
Cite this article

Download PDF

You have full access to this open access article

Statistical Papers Aims and scope Submit manuscript

Panel stationary tests against changes in persistence

Download PDF

Roy Cerqueti¹,
Mauro Costantini²,
Luciano Gutierrez³ &
…
Joakim Westerlund^4,5

1413 Accesses
1 Citation
Explore all metrics

Abstract

In this paper we propose new panel tests to detect changes in persistence. The test statistics are used to test the null hypothesis of stationarity against the alternative of a change in persistence from I(0) to I(1), from I(1) to I(0), and in an unknown direction. The limiting null distributions of the tests are derived and evaluated in small samples by means of Monte Carlo simulations. An empirical illustration is also provided.

Asymptotic distribution of quasi-maximum likelihood estimation of dynamic panels using long difference transformation when both N and T are large

Article 20 February 2016

Cheng Hsiao & Qiankun Zhou

Misspecification in Dynamic Panel Data Models and Model-Free Inferences

Article 01 September 2017

Ryo Okui

A Fluctuation Test for Structural Change Detection in Heterogeneous Panel Data Models

Article 08 April 2024

Fuxiao Li, Yanting Xiao & Zhanshou Chen

1 Introduction

Over the last two decades, a vast literature has investigated whether economic and financial time series may be characterized by a change in persistence between separate I(1) and I(0) regimes rather than simply I(1) or I(0) behavior. Changes of this kind in macroeconomic variables are well documented; see the literature reviews in Kim (2000) and Leybourne et al. (2003). A non-exhaustive list of the variables for which such phenomena have been observed includes inflation, real output, budgetary deficits, interest rates and exchange rates. Interestingly, while many data sets are in fact panels of multiple time series, the way that existing tests are constructed requires that the series are tested one at a time. This is wasteful in the sense that each time a test is carried out the information contained in the other series is effectively ignored. The current paper can be seen as a reaction to this. The purpose is to develop tests for changes in persistence that explores the multiplicity of series, and that can be seen as panel extensions of the time series tests of Kim (2000), Kim et al. (2002), and Busetti and Taylor (2004). The tests can be used to flexibly test the null hypothesis of stationarity against the alternative of a change in persistence not only from I(0) to I(1), and from I(1) to I(0), but also when the direction is unknown. The data generating process (DGP) considered is quite general. Some of the allowances are unit-specific constant and trend terms, cross-section heteroskedasticity, error serial correlation and cross-section dependence in the form of common factors. The asymptotic distributions of the tests are derived and evaluated in small samples using Monte Carlo simulation. An empirical illustration is also provided showing how how inflation of 20 developed countries has undergone a shift from I(0) to I(1).

The rest of the paper is organized as follows. Sections 2 and 3 present the model, the test statistics, and their asymptotic distributions, which are evaluated using simulations in Sect. 4. Section 5 reports the results from the empirical application. Section 6 concludes. Proofs of important results are provided in the Appendix.

2 Model and assumptions

Consider the panel data variable $Y_{i,t}$, where $i = 1,...,N$ and $t = 1,...,T$ index the time-series and cross-sectional units, respectively. The DGP of this variable is given by

$$\begin{aligned}&\displaystyle Y_{i,t} = \theta _i'D_{t,p}+\lambda _{i}'F_t+ e_{i,t},\end{aligned}$$

(1)

$$\begin{aligned}&\displaystyle e_{i,t} = \mu _{i,t} + \varepsilon _{i,t}, \end{aligned}$$

(2)

where $D_{t,p} = (1,t,...,t^p)'$ is a p-order trend polynomial such that $D_{t,p} = 0$ is $p=-1$, $F_t$ is an $r\times 1$ vector of common factors with $\lambda _{i}$ being the corresponding vector of factor loadings, and $\varepsilon _{i,t}$ is a mean zero and I(0) error term. The following three specifications of $\mu _{i,t}$ are considered, where 1(A), $\lfloor x\rfloor $, $\eta _{i,t}$ and $\tau _i^0\in [0,1]$ denote the indicator function of the event A, the integer part of x, a mean zero I(0) error term, and the break fraction, respectively:

MU1.
I(0) $\rightarrow $ I(1): $\mu _{i,t}=\mu _{i,t-1}+1(t>\lfloor T\tau _{i}^0\rfloor )\eta _{i,t}$.
MU2.
I(1) $\rightarrow $ I(0): $\mu _{i,t}=\mu _{i,t-1}+1(t \le \lfloor T\tau _{i}^0\rfloor )\eta _{i,t}$.
MU3.
Unknown direction: I(0) $\rightarrow $ I(1) or I(1) $\rightarrow $ I(0).

Under MU1 $Y_{i,t}$ is I(0) up to and including time $\lfloor \tau _{i}^0 T\rfloor $ but is I(1) after the break, provided that $\sigma _{\eta , i}^2 = \mathrm {var}(\eta _{i,t}) > 0$. Under MU2 $Y_{i,t}$ is I(1) up to and including time $\lfloor \tau _{i}^0 T\rfloor $ but it is I(0) after the break, provided again that $\sigma _{\eta ,i}^2 > 0$. Therefore, the hypothesis of stationarity against a shift in persistence from I(0) to I(1) or viceversa can be stated as $H_0:\sigma _{\eta ,1}^{2}= ...= \sigma _{\eta ,N}^{2}=0$ versus $H_1:\sigma _{\eta ,i}^{2} > 0$ for at least some i. Whenever the alternative is I(1) $\rightarrow $ I(0) we write “$H_1:$ I(1) $\rightarrow $ I(0)”, whereas if the alternative is I(1) $\rightarrow $ I(0), we write “$H_{1}:$ I(1) $\rightarrow $ I(0)”.

The conditions placed on the above DGP are given in Assumption 1, where $C <\infty $, tr(A), $||{A}|| = \sqrt{tr({A}' {A})}$, $\rightarrow _p$ and $\mathcal {F}_{i,t}$ denote a generic positive constant, the trace and Euclidean norm of the (generic) matrix A, convergence in probability, and the sigma-field generated by $\{(\varepsilon _{i,n},\eta _{i,n})\}_{n=1}^t$, respectively.

Assumption 1

(i)
$\varepsilon _{i,t} = \gamma _i(L) v_{i,t}$, where $v_{i,t}$ is independent and identically distribution (iid) with $E(v_{i,t}) = 0$, $E(v_{i,t}^2) = 1$, $E(v_{i,t}^8) \le C$, $\gamma _i(L) = \sum _{j=0}^\infty \gamma _{ji}L^j$, $\sum _{j=0}^\infty j||\gamma _{ji}|| \le C$ and $\gamma _i(1)^2 > 0$;
(ii)
$\eta _{i,t} = \phi _i(L) w_{i,t}$, where $w_{i,t}$ is iid with $E(w_{i,t}) = 0$, $E(w_{i,t}^2) = 1$, $E(w_{i,t}^8) \le C$, $\phi _i(L) = \sum _{j=0}^\infty \phi _{ji}L^j$, $\sum _{j=0}^\infty j||\phi _{ji}|| \le C$ and $\phi _i(1)^2 > 0$;
(iii)
$F_t$ is I(0) such that $E(||F_t||^4) \le C$ and $T^{-1} \sum _{t=1}^{T} F_t F'_t \rightarrow _p \Sigma _F > 0$;
(iv)
$\varepsilon _{i,t}$, $\eta _{i,t}$ and $F_t$ are mutually independent;
(v)
$\mu _{1,0} = ... = \mu _{N,0} = 0$;
(vi)
$\lambda _i$ is deterministic such that $||\lambda _i||^4 \le C$, $N^{-1}\sum _{i=1}^N\lambda _i\lambda _{i}' \rightarrow \Sigma _\lambda > 0$ as $N \rightarrow \infty $.

Remark 1

Assumption 1 puts restrictions on the time series and cross-sectional properties of $\varepsilon _{i,t}$ and $\eta _{i,t}$. The restrictions are very similar to the ones of Bai and Ng (2004), and we therefore refer to this other paper for a detailed discussion. The main difference when compared to Bai and Ng (2004) is that here $F_t$ cannot be I(1). Thus, while $Y_{i,t}$ may be cross-correlated, it cannot be affected by common stochastic trends. However, we would like to point out that this assumption is mainly for ease of interpretation of the test outcome, for if $F_t$ is allowed to be I(1) the persistence of $Y_{i,t}$ cannot be inferred from $e_{i,t}$ alone, and in the present paper we focus on the testing of $e_{i,t}$. Hence, analogous to the PANIC approach of Bai and Ng (2004), if $F_t$ is permitted to be I(1), then we also need to test this variable.

3 The test statistics

The general testing idea is to first purge the effect of $F_t$, and then to submit the resulting residuals to a test for a change in persistence. The implementation of the first step depends on whether $F_t$ is known or not.

3.1 $F_t$ known

Consider the generic variable $X_{i,t}$. The detrended version of this variables is henceforth denoted $X_{i,t}^p = X_{i,t} - \sum _{n=1}^T X_{i,n}a_{n,t,p}$, where $a_{n,k,p}=D_{n,p}' (\sum _{t=1}^T D_{t,p}D_{t,p}' )^{-1}D_{k,p}$ and $p\ge 0$. If $p= -1$, then we define $X_{i,t}^p = X_{i,t}$. In this notation, the detrended and defactored version of $Y_{i,t}$ is given by $\hat{e}_{i,t} = Y_{i,t}^p - \hat{\lambda }_i'F_t^p$, where $\hat{\lambda }_i$ is the least squares (LS) slope estimator in a regression of $Y_{i,t}^p$ onto $F_t^p$. Thus, while in this section $F_t$ is assumed to be known, $\lambda _i$ is still treated as unknown. Consider the following test statistic, which is suitable for testing if cross-section unit i is I(0) versus I(1) $\rightarrow $ I(0) (see, for example, Kim 2000; Kim et al. 2002; Busetti and Taylor 2004):

$$\begin{aligned} K_{i,T}(\tau )=\frac{ (\lfloor T\tau \rfloor )^{2}}{(T-\lfloor T\tau \rfloor )^{2}} \frac{\sum _{t=\lfloor T\tau \rfloor +1}^T S_{i,t}^{ 1 }(\tau )^2}{\sum _{t=1}^{\lfloor T\tau \rfloor } S_{i,t}^{ 0 }(\tau )^2}, \end{aligned}$$

where $\tau \in [0,1]$, $S_{i,t}^{0}(\tau )=\sum _{n=1}^{t} \hat{e}_{i,n}$ and $S_{i,t}^{1}(\tau )=\sum _{n= \lfloor T\tau \rfloor +1}^t \hat{e}_{i,n}$. The error sequences $\{\hat{e}_{i,n}\}_{n=1}^{\lfloor T\tau \rfloor }$ and $\{\hat{e}_{i,n}\}_{n=\lfloor T\tau \rfloor + 1}^T$ come from two separate regressions; while the former uses only the first $\lfloor T\tau \rfloor $ observations, the latter uses only the last $\lfloor T(1-\tau )\rfloor $ observations.

Remark 2

The $K_{i,T}(\tau )$ test considered here is in the spirit of Kwiatkowski et al. (1992) in which the constant I(0) null is tested versus the constant I(1) alternative. An alternative approach is to follow Banerjee et al. (1992) and Leybourne et al. (2003) who use the Dickey–Fuller statistic, in which the null and the alternative hypotheses are reversed. Panel variants of these can be constructed in the same way as the one suggested below for $K_{i,T}(\tau )$ (see Demetrescu and Hanck 2013, for such a proposal).

Let $\mathcal {C} = [\tau _{min},\tau _{max}]\subseteq (0,1)$. In this paper, we consider three transformations to eliminate the dependence on $\tau $ in $K_{i,T}(\tau )$ (see, for example, Kim, 2000);

T1.
The maximum-Chow transformation:
$$\begin{aligned} K_{i,T}^1 = \max _{s = \lfloor T\tau _{min}\rfloor ,..., \lfloor T\tau _{max}\rfloor } K_{i}(s/T). \end{aligned}$$
T2.
The mean-exponential transformation:
$$\begin{aligned} K_{i,T}^2 = \ln \left( (\lfloor T(\tau _{max} - \tau _{min})\rfloor +1)^{-1}\sum _{s = \lfloor T\tau _{min}\rfloor }^{\lfloor T\tau _{max}\rfloor } \exp [K_{i}(s/T)] \right) . \end{aligned}$$
T3.
The mean score transformation:
$$\begin{aligned} K_{i,T}^3 = (\lfloor T(\tau _{max} - \tau _{min})\rfloor +1)^{-1}\sum _{s = \lfloor T\tau _{min}\rfloor }^{\lfloor T\tau _{max}\rfloor } K_{i}(s/T). \end{aligned}$$

Table 1 Simulated mean and standard deviation normalization factors

Full size table

In Appendix (Proof of Theorem 1), we show that $K_{i,T}(\tau ) \rightarrow _w K_i(\tau )$ as $T\rightarrow \infty $, where $\rightarrow _w$ signifies weak convergence and $K_i(\tau )$ is a certain ratio of stochastic integrals. Since $K_1(\tau ),...,K_N(\tau )$ are iid, we may define $\mu _{K,j}= E(K_{i}^j)$ and $\sigma _{K,j}^2 = \mathrm {var}(K_{i}^j)$ for $j\in \{ 1,2,3 \}$. Numerical values of $\mu _{K,j}$ and $\sigma _{K,j}$ are reported in Table 1. The proposed panel test statistic for testing $H_0$ versus $H_1:$ I(0) $\rightarrow $ I(1) is given by

$$\begin{aligned} K_{NT}^j = \frac{1}{\sigma _{K,j} \sqrt{N}} \sum _{i=1}^N( K_{i,T}^j-\mu _{K,j}). \end{aligned}$$

For testing if cross-section unit i is I(0) versus I(1) $\rightarrow $ I(0), the following “reverse” test statistic can be used (see Kim 2000; Kim et al. 2002; Busetti and Taylor 2004):

$$\begin{aligned} R_{i}(\tau )=(K_{i}(\tau ))^{-1}, \end{aligned}$$

which can be transformed using T1–T3 to eliminate the dependence on $\tau $. The resulting transformed statistic is written in an obvious notation as $R_{i}^j$. Based on this test statistic, we may define $R_{NT}^j = \sigma _{R,j}^{-1} N^{-1/2} \sum _{i=1}^N (R_{i,T}^j-\mu _{R,j} )$ with obvious definitions of $\sigma _{R,j}^{2}$ and $\mu _{R,j}$. When the direction of the persistency is unknown, the following maximum statistic may be used:

$$\begin{aligned} M_{i,T}^j = \max \{K_{i,T}^j,R_{i,T}^j\}, \end{aligned}$$

which can again be normalized to obtain $M_{NT}^j = \sigma _{M,j}^{-1}N^{-1/2} \sum _{i=1}^N (M_{i,T}^j-\mu _{M,j} )$.

Theorem 1

Under $H_0$ and Assumption 1, as $N,\,T \rightarrow \infty $ with $N/T\rightarrow 0$,

$$\begin{aligned} K_{NT}^j,\,R_{NT}^j,\,M_{NT}^j \rightarrow _d N(0,1), \end{aligned}$$

where$\rightarrow _d$signifies convergence in distribution.

Remark 3

While the test statistics considered here are independent of $\tau _1^0,...,\tau _N^0$, in applications it is sometimes useful to be able to estimate these parameters. This can be accomplished using the proposal of Kim (2000, Sect. 3.2), which basically amounts to setting $\hat{\tau }_i^0$ equal to the suitably maximizing or minimizing value of $K_{i,T}(\tau )$, depending on whether it is I(0) $\rightarrow $ I(1) or I(1) $\rightarrow $ I(0) that is being tested. Alternatively, we may follow Busetti and Taylor (2004, Sect. 6.2), who suggest setting $\hat{\tau }_i^0$ equal to the value of $\tau _i^0$ that minimizes the sum of squares of $\hat{e}_{i,t}$.

Remark 4

The requirement that $N/T \rightarrow 0$ is sufficient but not necessary and is needed to make sure that certain remainder terms are negligible. However, the order of these terms is not the sharpest possible. A more elaborate asymptotic analysis would be required to obtain the exact order. In Sect. 4, we use Monte Carlo simulation to evaluate the effect of N / T in small samples.

3.2 $F_t$ unknown

The estimation of $F_t$ can be performed in two ways; (i) unrestrictedly, or (ii) restricted under $H_0$. In both cases, we follow the bulk of the previous literature and use the principal components method (see, for example, Bai and Ng 2004). The restricted estimator of $F = ( F_{1},..., F_{T})'$, denoted $\hat{F}^0 = (\hat{F}_{1}^0,...,\hat{F}_{T}^0)'$, is $\sqrt{T}$ times the eigenvectors corresponding to the first r largest eigenvalues of the $T \times T$ matrix $Y^p(Y^p)'$, where $Y^p = ( Y_{1}^p,..., Y_{N}^p)$ and $Y_i^p = ( Y_{i,1}^p,..., Y_{i,T}^p)'$ are $T\times N$ and $T\times 1$, respectively. Under the normalization $T^{-1}\hat{F}^0(\hat{F}^0)'= I_r $, the estimated loading matrix is $(\hat{\lambda }^0)' = (\hat{\lambda }_1^0,...,\hat{\lambda }_N^0)=T^{-1}(\hat{F}^0)' Y^p$. The restricted estimator of $e_{i,t}$ that we will be considering can now be constructed as

$$\begin{aligned} \hat{e}_{i,t}^0 = Y_{i,t}^p - (\hat{\lambda }_i^0)'\hat{F}_t^0. \end{aligned}$$

(3)

Let $X_{i,t}^{p-1}$ be $X_{i,t}$ when detrended using a trend polynomial of order $p-1$. Hence, $X_{i,t}^{p-1} = X_{i,t}$ if $p = 0$. Let $f_t = \Delta F_t$ and $y_{i,t} = \Delta Y_{i,t}$ (for $t=2,...,T$). The unrestricted estimators $\hat{f}_t^1$ and $\hat{\lambda }_i^1$ of (the space spanned by) $f_t^{p-1}$ and $\lambda _i$ are $\hat{F}_t^0$ and $\hat{\lambda }_i^0$, respectively, but with $Y_{i,t}^p$ replaced by $y_{i,t}^{p-1}$. Let

$$\begin{aligned} \tilde{e}_{i,t}^1 = \sum _{n=2}^t \left[ y_{i,n}^{p-1} - (\hat{\lambda }_i^1)'\hat{f}_n^1\right] , \end{aligned}$$

(4)

where $\tilde{e}_{i,1}^1 = 0$. The unrestricted estimator $\hat{e}_{i,t}^1$ of $e_{i,t}$ is given by $\hat{e}_{i,t}^1 = (\tilde{e}_{i,t}^1)^p$. The appropriate test statistics to consider when $F_t$ is unknown, henceforth denoted $K_{hNT}^j$, $R_{hNT}^j$ and $M_{hNT}^j$ for $h \in \{ 0,1 \}$, are given by $K_{NT}^j$, $R_{NT}^j$ and $M_{NT}^j$, respectively, with $\hat{e}_{i,t}$ replaced by $\hat{e}_{i,t}^h$.

Table 2 5% size and power when testing I(0) $\mathrm \rightarrow $ I(1) and $\rho = 0.3$

Full size table

Table 3 5% size and power when testing I(0) $\mathrm \rightarrow $ I(1) and $\rho = 0.6$

Full size table

Table 4 5% size and power when testing I(1) $\mathrm \rightarrow $ I(0) and $\rho = 0.3$

Full size table

Table 5 5% size and power when testing I(1) $\mathrm \rightarrow $ I(0) and $\rho = 0.6$

Full size table

Table 6 Empirical test results

Full size table

Theorem 2

Under $H_0$ and Assumptions 1, as $N,\,T \rightarrow \infty $ with $N/T\rightarrow 0$,

$$\begin{aligned} K_{hNT}^j,\,R_{hNT}^j,\,M_{hNT}^j \rightarrow _d N(0,1). \end{aligned}$$

As Theorem 2 makes clear, the factors can be unknown and still the asymptotic distributions of the test statistics are N(0, 1). This is in agreement with the results reported by Bai and Ng (2004) for their pooled panel unit root tests.

4 Monte Carlo simulations

A small-scale Monte Carlo study was conducted to investigate the properties of the new tests in small samples. The DGP is given by a restricted version of (1)–(2) that sets $\varepsilon _{i,t} \sim N(0,\sigma _{\varepsilon , i}^2)$, $\eta _{i,t} \sim N(0,\sigma _{\eta }^2)$, $\sigma _{\eta }\in \{0,0.25,0.5\}$, $\tau _i^0 \sim U(0.3,0.7)$, $r = 1$, and $F_t=\rho F_{t-1} + v_t$, where $v_t\sim N(0,1)$ and $\rho \in \{0.3,0.6\}$ (see, for example, Gengenbach et al. 2010, for a similar parametrization). For $\sigma _{\varepsilon , i}$, we consider two cases. In the first, $\sigma _{\varepsilon , i} = 1$ for all i, while in the second, $\sigma _{\varepsilon , i} \sim U (1,2)$. Since a more volatile idiosyncratic error will make $F_t$ more difficult to discern, we expect that the results for the second case will deteriorate when compared to the first. All results are based on 1,000 replications of samples of size $N\in \{ 5, 10, 20 \}$ and $T\in \{50,100\}$. Also, following Kim (2000), $\mathcal {C}=[0.20,0.80]$. Results were obtained for $p \in \{ 0,1\}$, although in this paper we focus on the results for the empirically most common specification with $p = 0$ (a constant but no trend). The results for $p=1$ (constant and trend) can be obtained upon request. Both the restricted and unrestricted factor estimation methods were simulated. Interestingly, the restricted method led to better results in terms of both size accuracy and power. In this paper, we therefore only report the results for the restricted method, where the number of common factors is determined using the $IC_2$ criterion of Bai and Ng (2002) with a maximum of three factors.^{Footnote 1}

The 5% size and power results are reported in Tables 2, 3, 4 and 5. While Tables 2 ($\rho = 0.3$) and 3 ($\rho = 0.6$) contain the results for the tests of I(0) $\rightarrow $ I(1), Tables 4 ($\rho = 0.3$) and 5 ($\rho = 0.6$) contain the corresponding results for I(1) $\rightarrow $ I(0). The information content of these tables may be summarized as follows.

All tests have good size accuracy when $\sigma _{\varepsilon , i} = 1$ and $\rho = 0.3$. This is true for all constellations of T and N considered, although the distortions do have a tendency to increase slightly in N, which is consistent with the previous panel unit root literature (see Westerlund and Breitung 2013, for a discussion). While there are no big differences, the best size accuracy is generally obtained by using $K_{NT}^2$, $R_{NT}^2$ and $M_{NT}^2$, whereas $K_{NT}^3$, $R_{NT}^1$ and $R_{NT}^3$ generally leads to the worst accuracy.
As expected, increases in $\rho $ and/or $\sigma _{\varepsilon , i}$ generally lead to reduced size accuracy, although the distortions are never very large. This is true regardless of the direction of the change in persistence. In fact, the results are remarkably stable, given that the test statistics do not require any corrections to account for nuisance parameters.
All tests perform quite well in terms of power, and there are clear improvements as N and/or T increases. The fact that power is not only increasing in T, but also in N illustrates the advantage of accounting for the cross-sectional variation of the data. Power is also increasing in the distance to the null, as measured by $\sigma _\eta $, which is again just as expected.

5 Empirical illustration

The question of whether inflation should be considered as I(0) or I(1) has been subject to a long debate. According to recent studies (see, for example, Kim 2000; Busetti and Taylor 2004), however, inflation may be better characterized by a change in persistence between separate I(1) and I(0) regimes rather than simply I(1) or I(0) behavior. The purpose of this illustration is to test this hypothesis using a large panel of quarterly CPI inflation data covering 20 countries (Australia, Austria, Belgium, Canada, Denmark, Finland, France, Germany, Greece, Italy, Japan, Korea, the Netherlands, New Zealand, Norway, Spain, Sweden, Switzerland, the UK and the US) between 1970:1 and 2013:4. All data are taken from OECD Main Economic Indicators.

The number of common factors is determined in the same way as in the simulations. As is customary when dealing with inflation (see, for example, Leybourne et al. 2003), the tests are fitted with a constant but no trend. The results are reported in Table 6. The first thing to note is that while in case of $K_{NT}^1$, $K_{NT}^2$ and $K_{NT}^3$ there is no evidence against the I(0) null, $R_{NT}^1$, $R_{NT}^2$ and $R_{NT}^3$ all lead to a clear rejection. This is true even at the most conservative 1% level. We therefore conclude that inflation has been subject to a change in persistence from I(1) to I(0), which is in agreement with the recent empirical literature based on US data (see, for example, Busetti and Taylor 2004; Harvey et al. 2006). A common explanation for the observed change in persistence of inflation in the US is that it is due to the stock market collapse of the late 1980s and the recession that followed it. One interpretation of the results reported in the current paper is therefore that they reflect the worldwide recession of the early 1990s, which was to a large extent triggered by the recession in the US. Another possibility is that the results reflect in part monetary policy shifts (see, for example, Davig and Doh 2014, and the references provided therein).

6 Conclusion

This paper develops panel tests that are suitable for testing the null hypothesis of stationarity against the alternative of a change in persistence from I(0) to I(1), from I(1) to I(0), or when the direction is unknown. The DGP used for this purpose is quite general and allows unit-specific constant and trend terms, cross-section heteroskedasticity, error serial correlation and cross-section dependence in the form of common factors.

Notes

See Westerlund and Mishra (2016) for a more elaborate selection approach that uses a data-driven penalty.

References

Bai J (2003) Inferential theory for factor models of large dimensions. Econometrica 71:135–171
Bai J, Ng S (2002) Determining the number of factors in approximate factor models. Econometrica 70:191–221
Article MathSciNet MATH Google Scholar
Bai J, Ng S (2004) A PANIC attack on unit roots and cointegration. Econometrica 72:1127–1177
Article MathSciNet MATH Google Scholar
Banerjee A, Lumsdaine R, Stock J (1992) Recursive and sequential tests of the unit root and trend break hypotheses: theory and international evidence. J Bus Econ Stat 10:271–288
Google Scholar
Busetti F, Taylor RAM (2004) Tests of stationarity against a change in persistence. J Economet 123:33–66
Article MathSciNet MATH Google Scholar
Davig T, Doh T (2014) Monetary policy regime shifts and inflation persistence. Rev Econ Stat 96:862–875
Article Google Scholar
Demetrescu M, Hanck C (2013) Nonlinear IV panel unit root testing under structural breaks in the error variance. Stat Pap 54:1043–1066
Article MathSciNet MATH Google Scholar
Gengenbach C, Palm F, Urbain J-P (2010) Panel unit root tests in the presence of cross-sectional dependencies: comparison and implications for modelling. Economet Rev 29:111–145
Article MathSciNet MATH Google Scholar
Harvey DI, Leybourne SJ, Taylor RAM (2006) Modified tests for a change in persistence. J Economet 134:441–469
Article MathSciNet MATH Google Scholar
Kim JY (2000) Detection of change in persistence of a linear time series. J Economet 95:97–116
Article MathSciNet MATH Google Scholar
Kim JY, Franch JB, Amador RB (2002) Corrigendum to “Detection of change in persistence of a linear time series”. J Economet 109:389–392
Article Google Scholar
Kwiatkowski D, Phillips PCB, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root: how sure are we that economic time series have a unit root? J Econ 54:159–178
Leybourne SJ, Kim T, Newbold P, Smith V (2003) Tests for a change in persistence against the null of difference-stationarity. Economet J 6:291–311
Article MathSciNet MATH Google Scholar
Moon HR, Phillips PCB (2000) Estimation of autoregressive roots near unity using panel data. Econ Theory 16:927–997
Westerlund J, Larsson R (2009) A note on the pooling of individual PANIC unit root tests. Economet Theory 25:1851–1868
Article MathSciNet MATH Google Scholar
Westerlund J, Breitung J (2013) Lessons from a decade of IPS and LLC. Economet Rev 32:547–591
Article MathSciNet Google Scholar
Westerlund J, Mishra s (2016) On the determination of the number of factors using information criteria with data-driven penalty. Stat Pap. doi:10.1007/s00362-015-0692-0

Download references

Author information

Authors and Affiliations

University of Macerata, Macerata, Italy
Roy Cerqueti
Brunel University, London, UK
Mauro Costantini
University of Sassari, Sassari, Italy
Luciano Gutierrez
Lund University, Lund, Sweden
Joakim Westerlund
Centre for Financial Econometrics, Deakin University, Burwood, VIC, Australia
Joakim Westerlund

Authors

Roy Cerqueti
View author publications
You can also search for this author in PubMed Google Scholar
Mauro Costantini
View author publications
You can also search for this author in PubMed Google Scholar
Luciano Gutierrez
View author publications
You can also search for this author in PubMed Google Scholar
Joakim Westerlund
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Joakim Westerlund.

Additional information

Previous versions of the paper were presented at the First International Conference in Memory of Carlo Giannini, “Recent Development in Econometric Methodology”, at University of Bergamo, and at seminars at University of Vienna and University of Leicester. The authors would like to thank seminar and conference participants, and in particular David Hendry, Chihwa Kao, Oliver Linton, Robert Kunst, Stephen Pollock, Wojciech Charemza, Panicos Demetriades, Qiang Zhang, Francesco Moscone, and two anonymous referees for many useful comments and suggestions. Westerlund would like to thank the Knut and Alice Wallenberg Foundation for financial support through a Wallenberg Academy Fellowship, and the Jan Wallander and Tom Hedelius Foundation for financial support under research grant number P2014–0112:1.

Appendix: Proofs

The proofs of Theorems 1 and 2 are established for $K_{NT}^j$; the proofs for $R_{NT}^j$ and $M_{NT}^j$ are entirely analogous.

Proof of Theorem 1

Under MU1, $\mu _{i,t} = \sum _{k=1}^t 1(k> \lfloor T\tau \rfloor )\eta _{i,k}$, and by further invoking $H_0$, $\mu _{i,t} = 0$, giving

$$\begin{aligned} Y_{i,t}= \theta _i'D_{t} + \lambda _i'F_t + \mu _{i,t} + \varepsilon _{i,t} = \theta _i'D_{t} + \lambda _i'F_t + \varepsilon _{i,t}, \end{aligned}$$

(5)

It follows that

$$\begin{aligned} Y_{i,t}^p = \lambda _i'F_t^p + \varepsilon _{i,t}^p, \end{aligned}$$

(6)

with obvious definitions of $F_t^p$ and $\varepsilon _{i,t}^p$, which in turn implies

$$\begin{aligned} \hat{e}_{i,t} = Y_{i,t}^p -\hat{\lambda }_i'F_t^p = \varepsilon _{i,t}^p - (\hat{\lambda }_i-\lambda _i)'F_t^p, \end{aligned}$$

(7)

Therefore,

$$\begin{aligned} T^{-1/2}\sum _{n=1}^{t}\hat{e}_{i,n} = T^{-1/2}\sum _{n=1}^{t}\varepsilon _{i,n}^p - (\hat{\lambda }_i-\lambda _i)'T^{-1/2}\sum _{n=1}^{t}F_n^p. \end{aligned}$$

(8)

Under $H_0$ and with $F_t$ known $Y_{i,t} = \theta _i 'D_{t} + \lambda _i'F_t + \varepsilon _{i,t}$ is just an ordinary time series regression in I(0) variables with exogenous regressors. It follows that $\sqrt{T}(\hat{\lambda }_i-\lambda _i) = O_p(1)$, and therefore, since $T^{-1/2}\sum _{n=1}^{t}F_n^p = O_p(1)$,

$$\begin{aligned} T^{-1/2}\sum _{n=1}^{t}\hat{e}_{i,n} = T^{-1/2}\sum _{n=1}^{t}\varepsilon _{i,n}^p + O_p(T^{-1/2}). \end{aligned}$$

(9)

Hence, using $\overline{K}_{i,T}(\tau )$ to denote $K_{i,T}(\tau )$ with $\hat{e}_{i,n}$ replaces by $e_{i,n}$, we have

$$\begin{aligned} K_{i,T}(\tau )= \overline{K}_{i,T}(\tau )+ O_p(T^{-1/2}), \end{aligned}$$

(10)

where the first term on the right is the same as in Harvey et al. (2006). It follows from their results that

$$\begin{aligned} K_{i,T}(\tau ) \rightarrow _w \overline{K}_{i}(\tau ) = \frac{\overline{A}_{i}(\tau )}{\overline{B}_{i}(\tau )}, \end{aligned}$$

(11)

as $T\rightarrow \infty $, where $\rightarrow _w$ signifies weak convergence, and

$$\begin{aligned}&\displaystyle \overline{A}_{i}(\tau ) = (1-\tau )^{-2}\int _{\tau }^1\overline{a}_i(r)^2 dr,\\&\displaystyle \overline{B}_{i}(\tau ) = \tau ^{-2}\int _0^{\tau } \overline{b}_i(r)^2 dr,\\&\displaystyle \overline{a}_{i}(\tau ) = W_{\varepsilon ,i}(\tau )-W_{\varepsilon ,i}(r)- \int _{\tau }^1dW_{\varepsilon ,i}(r) D_p(r)' \left( \int _{\tau }^1 D_p(r)D_p(r)'dr\right) ^{-1} \int _{\tau }^r D_p(s)ds,\\&\displaystyle \overline{b}_{i}(\tau ) = W_{\varepsilon ,i}(r)- \int _{0}^{\tau }dW_{\varepsilon ,i}(r) D_p(r)' \left( \int _{0}^{\tau } D_p(r)D_p(r)'dr\right) ^{-1} \int _{0}^r D_p(s)ds, \end{aligned}$$

with $W_{\varepsilon ,i}(r)$ being a standard Brownian motion, and $D_p(r)$ is such that $Q_T^{-1}D_{\lfloor Tr\rfloor ,p} \rightarrow D_p(r)$, where $Q_T = \mathrm {diag}(1,T,...,T^p)$. Note in particular how $D_0(r) = 1$ and $D_1(r) = (1,r)'$. Therefore, by the continuous mapping theorem, and writing $K_{i,T}^j = H_j(K_{i,T}(\tau ))$ and $\overline{K}_{i,T}^j = H_j(\overline{K}_{i,T}(\tau ))$ as in Busetti and Taylor (2004),

$$\begin{aligned} K_{i,T}^j = \overline{K}_{i,T}^j(\tau ) + O_p\big (T^{-1/2}\big ) \rightarrow _w H_j(\overline{K}_{i}(\tau )) = \overline{K}_{i}^j. \end{aligned}$$

(12)

Let us now consider $K_{NT}^j$. By using the previous result

$$\begin{aligned} K_{NT}^j = \frac{1}{\sigma _{K,j}\sqrt{N}} \sum _{i=1}^N \Big (K_{i,T}^j -\mu _{K,j} \Big ) = \frac{1}{\sigma _{K,j}\sqrt{N}} \sum _{i=1}^N \Big (\overline{K}_{i,T}^j -\mu _{K,j}\Big ) + O_p(\sqrt{N}T^{-1/2})\nonumber \\ \end{aligned}$$

(13)

where $O_p(\sqrt{N}T^{-1/2}) = o_p(1)$ under our assumption that $N/T=o(1)$. We now use the same steps as in Moon and Phillips (2000, p. 994) to verify that $(\overline{K}_{i,T}^j -\mu _{K,j})$ satisfies conditions (i)–(iv) of the central limit theorem of Phillips and Moon (1999, Theorem 2). In so doing we follow their notation and write $Q_{i,T} = (\overline{K}_{i,T}^j -\mu _{K,j})$, which is iid with mean zero and variance $\sigma _{K,j}^2 \le C$. We have already shown that $\overline{K}_{i,T}^j \rightarrow _w \overline{K}_{i}^j$ as $T\rightarrow \infty $, which implies $Q_{i,T} \rightarrow _d Q_i = (\overline{K}_{i}^j - \mu _{K,j})$, and it is also not difficult to verify that $E(Q_{i,T}^2) \rightarrow E(Q_i^2) = \sigma _{K,j}^2$. This verifies conditions (i), (ii) and (iv). Condition (iv) follows from noting that, by the continuous mapping theorem, $Q_{i,T}^2 \rightarrow _w Q_i^2$. It follows that

$$\begin{aligned} K_{NT}^j= & {} \frac{1}{\sigma _{K,j}\sqrt{N}} \sum _{i=1}^N \Big (K_{i,T}^j -\mu _{K,j} \Big ) \nonumber \\= & {} \frac{1}{\sigma _{K,j}\sqrt{N}} \sum _{i=1}^N \Big (\overline{K}_{i,T}^j -\mu _{K,j} ) + O_p(\sqrt{N}T^{-1/2}\Big ) \rightarrow _d N(0,1) \end{aligned}$$

(14)

as $N,\,T\rightarrow \infty $ with $N/T\rightarrow 0$. $\square $

Proof of Theorem 2

We begin by considering the case when the estimator of $e_{i,t}$ is based on the restricted estimators of $\lambda _i$ and $F_t$ under $H_0$. As in Proof of Theorem 1, under MU1 and $H_0$, $Y_{i,t}= \theta _i'D_{i,t} + \lambda _i'F_t + \varepsilon _{i,t}$. In order to capture the fact that $\lambda _i$ and $F_t$ are not separately identifiable we introduce the $r\times r$ rotation matrix H such that

$$\begin{aligned} \hat{e}_{i,t}^0 = Y_{i,t}^p -(\hat{\lambda }_i^0)'\hat{F}_t^0 = \varepsilon _{i,t}^p - \lambda _i' H^{-1}(\hat{F}_t^0-HF_t^p) - (\hat{\lambda }_i-(H^{-1})'\lambda _i)'\hat{F}_t^0.\qquad \end{aligned}$$

(15)

Hence,

$$\begin{aligned} T^{-1/2}\sum _{n=1}^{t}\hat{e}_{i,n}^0&= T^{-1/2}\sum _{n=1}^{t}\varepsilon _{i,n}^p - \lambda _i' H^{-1}T^{-1/2}\sum _{n=1}^{t}(\hat{F}_n^0-HF_n^p) \nonumber \\&\quad -\, (\hat{\lambda }_i-(H^{-1})'\lambda _i)'T^{-1/2}\sum _{n=1}^{t}\hat{F}_n^0. \end{aligned}$$

(16)

By Lemmas 1(c) and 2 of Bai and Ng (2004), $||\hat{\lambda }_i-(H^{-1})'\lambda _i|| = O_p(N^{-1}) + O_p(T^{-1/2})$ and $||T^{-1/2}\sum _{n=1}^{t}(\hat{F}_n^0-HF_n^p)|| = O_p(N^{-1/2}) + O_p(T^{-3/4})$, where the latter result holds uniformly in t. Hence, since

$$\begin{aligned} T^{-1/2}\sum _{n=1}^{t}\hat{F}_n^0 = HT^{-1/2}\sum _{n=1}^{t}F_n^p + T^{-1/2}\sum _{n=1}^{t}(\hat{F}_n^0-HF_n^p) = O_p(1), \end{aligned}$$

(17)

we can show that

$$\begin{aligned} T^{-1/2}\sum _{n=1}^{t}\hat{e}_{i,n}^0 = T^{-1/2}\sum _{n=1}^{t}\varepsilon _{i,n}^p + O_p(N^{-1/2}) + O_p(T^{-1/2}). \end{aligned}$$

(18)

Hence, as in the case when $F_t$ is known (see Proof of Theorem 1), the estimation and removal of the common component do not affect the asymptotic distribution of the test statistic. Specifically, using $K_{0i,T}^{j}$ to denote $K_{i,T}^j$ with $\hat{e}_{i,n}^0$ in place of $\hat{e}_{i,n}$, we get

$$\begin{aligned} \Big |K_{0i,T}^{j} - K_{i,T}^j\Big | = O_p(N^{-1/2}) + O_p(T^{-1/2}), \end{aligned}$$

(19)

which holds uniformly in (j, i). In order to show that the resulting panel statistic, $K_{0NT}^{j}$ say, converges to N(0, 1), we may use the same argument as in Westerlund and Larsson (2009).

Consider the unrestricted estimator of $e_{i,t}$. We have $\tilde{e}_{i,t}^1 = \sum _{n=2}^t [y_{i,n}^{p-1} - (\hat{\lambda }_i^1)'\hat{f}_n^1]$, where, under $H_0$, $y_{i,t} = \Delta Y_{i,t}= \theta _i'\Delta D_{t} + \lambda _i'f_t + \Delta \varepsilon _{i,t}$ with $f_t = \Delta F_t$. It follows that $y_{i,t}^{p-1}= \lambda _i'f_t^{p-1} + (\Delta \varepsilon _{i,t})^{p-1}$, and therefore

$$\begin{aligned}&\tilde{e}_{i,t}^1 = \sum _{n=2}^t \Big [y_{i,n}^{p-1} - (\hat{\lambda }_i^1)'\hat{f}_n^1\Big ] \nonumber \\&= \sum _{n=2}^t \Big [(\Delta \varepsilon _{i,t})^{p-1} - \lambda _i' H^{-1}(\hat{f}_t^1-Hf_t^{p-1}) - (\hat{\lambda }_i-(H^{-1})'\lambda _i)'\hat{f}_t^1 \Big ]. \end{aligned}$$

(20)

Consider $\sum _{n=2}^t\big (\hat{f}_t^1-Hf_t^{p-1}\big )$. From Proof of Theorem 3 in Bai (2003), using V to denote a diagonal matrix consisting of the first r eigenvalues of $(NT)^{-1} y^{p-1}( y^{p-1})'$ in decreasing order,

$$\begin{aligned}&{ \sum _{n=2}^t\Big (\hat{f}_t^1-Hf_t^{p-1}\Big ) }\nonumber \\&\quad = N^{-1/2}V^{-1}T^{-1}\sum _{n=2}^T\hat{f}_t^1 \Big (f_t^{p-1}\Big )' N^{-1/2}\sum _{i=1}^N \lambda _i \sum _{n=2}^t (\Delta \varepsilon _{i,t})^{p-1} + O_p\Big (N^{-1}\Big ) + O_p\Big (T^{-1}\Big )\nonumber \\&\quad = N^{-1/2}V^{-1}HT^{-1}\sum _{n=2}^Tf_t^{p-1} \Big (f_t^{p-1}\Big )' N^{-1/2}\sum _{i=1}^N \lambda _i \Big (\varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1}\Big ) \nonumber \\&\qquad +\, N^{-1/2}V^{-1}T^{-1}\sum _{n=2}^T\Big (\hat{f}_t^1 - Hf_t^{p-1}\Big )\Big (f_t^{p-1}\Big )' N^{-1/2}\sum _{i=1}^N \lambda _i \Big (\varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1}\Big ) \nonumber \\&\qquad +\, O_p\Big (N^{-1}\Big ) + O_p\Big (T^{-1}\Big ). \end{aligned}$$

(21)

where we have made use of the fact that $\sum _{n=2}^t (\Delta \varepsilon _{i,n})^{p-1} = \varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1}$. Now, ||V|| and $||N^{-1/2}\sum _{i=1}^N \lambda _i (\varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1})||$ are both $O_p(1)$. Moreover, by Lemma A.1 of Bai (2003),

$$\begin{aligned} \left| \left| T^{-1}\sum _{n=2}^T(\hat{f}_t^1 - Hf_t^{p-1})(f_t^{p-1})' \right| \right|\le & {} \left( T^{-1}\sum _{n=2}^T||\hat{f}_t^1 - Hf_t^{p-1}||^2\right) ^{1/2} \left( T^{-1}\sum _{n=2}^T||f_t^{p-1}||^2\right) ^{1/2}\\= & {} O_p(N^{-1/2}) + O_p(T^{-1/2}), \end{aligned}$$

from which it follows that

$$\begin{aligned} \left| \left| \sum _{n=2}^t(\hat{f}_t^1-Hf_t^{p-1})\right| \right| = O_p(N^{-1/2}) + O_p(T^{-1}). \end{aligned}$$

(22)

By using this and $\hat{F}_t^1 = \sum _{n=2}^t \hat{f}_t^1 = H(F_t^{p-1}-F_1^{p-1}) + \sum _{n=2}^t (\hat{f}_t^1-Hf_t^{p-1})$, we obtain

$$\begin{aligned} \tilde{e}_{i,t}^1= & {} \sum _{n=2}^t(\Delta \varepsilon _{i,t})^{p-1} - \lambda _i' H^{-1}\sum _{n=2}^t\Big (\hat{f}_t^1-Hf_t^{p-1}\Big ) - (\hat{\lambda }_i-(H^{-1})'\lambda _i)'\sum _{n=2}^t\hat{f}_t^1 \nonumber \\= & {} \varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1} - \lambda _i' H^{-1}\sum _{n=2}^t\Big (\hat{f}_t^1-Hf_t^{p-1}\Big ) \nonumber \\&-\, (\hat{\lambda }_i-(H^{-1})'\lambda _i)'H(F_t^{p-1}-F_1^{p-1}) - (\hat{\lambda }_i -(H^{-1})'\lambda _i)'\sum _{n=2}^t \Big (\hat{f}_t^1 -Hf_t^{p-1}\Big ) \nonumber \\= & {} \varepsilon _{i,t}^{p-1}-\varepsilon _{i,1}^{p-1} + O_p(N^{-1/2}) + O_p(T^{-1/2}). \end{aligned}$$

(23)

suggesting that for $p\ge 0$,

$$\begin{aligned} \hat{e}_{i,t}^1 = (\tilde{e}_{i,t}^1)^p = \varepsilon _{i,t}^{p} + O_p\big (N^{-1/2}\big ) + O_p\big (T^{-1/2}\big ). \end{aligned}$$

(24)

When appropriately normalized by $T^{-1/2}$, taking partial sums do not affect the order of the remainder terms. Hence, again, the estimation and removal of the common component do not affect the asymptotic distribution of the test statistic.$\square $

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Cerqueti, R., Costantini, M., Gutierrez, L. et al. Panel stationary tests against changes in persistence. Stat Papers 60, 1079–1100 (2019). https://doi.org/10.1007/s00362-016-0864-6

Download citation

Received: 10 February 2016
Revised: 22 November 2016
Published: 09 December 2016
Issue Date: August 2019
DOI: https://doi.org/10.1007/s00362-016-0864-6

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Panel stationary tests against changes in persistence

Abstract

Similar content being viewed by others

Asymptotic distribution of quasi-maximum likelihood estimation of dynamic panels using long difference transformation when both N and T are large

Misspecification in Dynamic Panel Data Models and Model-Free Inferences

A Fluctuation Test for Structural Change Detection in Heterogeneous Panel Data Models

1 Introduction

2 Model and assumptions

Assumption 1

Remark 1

3 The test statistics

3.1 \(F_t\) known

Remark 2

Theorem 1

Remark 3

Remark 4

3.2 \(F_t\) unknown

Theorem 2

4 Monte Carlo simulations

5 Empirical illustration

6 Conclusion

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix: Proofs

Proof of Theorem 1

Proof of Theorem 2

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Panel stationary tests against changes in persistence

Abstract

Similar content being viewed by others

Asymptotic distribution of quasi-maximum likelihood estimation of dynamic panels using long difference transformation when both N and T are large

Misspecification in Dynamic Panel Data Models and Model-Free Inferences

A Fluctuation Test for Structural Change Detection in Heterogeneous Panel Data Models

1 Introduction

2 Model and assumptions

Assumption 1

Remark 1

3 The test statistics

3.1 \(F_t\) known

Remark 2

Theorem 1

Remark 3

Remark 4

3.2 \(F_t\) unknown

Theorem 2

4 Monte Carlo simulations

5 Empirical illustration

6 Conclusion

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix: Proofs

Appendix: Proofs

Proof of Theorem 1

Proof of Theorem 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation