1 Introduction

Over the last two decades, a vast literature has investigated whether economic and financial time series may be characterized by a change in persistence between separate I(1) and I(0) regimes rather than simply I(1) or I(0) behavior. Changes of this kind in macroeconomic variables are well documented; see the literature reviews in Kim (2000) and Leybourne et al. (2003). A non-exhaustive list of the variables for which such phenomena have been observed includes inflation, real output, budgetary deficits, interest rates and exchange rates. Interestingly, while many data sets are in fact panels of multiple time series, the way that existing tests are constructed requires that the series are tested one at a time. This is wasteful in the sense that each time a test is carried out the information contained in the other series is effectively ignored. The current paper can be seen as a reaction to this. The purpose is to develop tests for changes in persistence that explores the multiplicity of series, and that can be seen as panel extensions of the time series tests of Kim (2000), Kim et al. (2002), and Busetti and Taylor (2004). The tests can be used to flexibly test the null hypothesis of stationarity against the alternative of a change in persistence not only from I(0) to I(1), and from I(1) to I(0), but also when the direction is unknown. The data generating process (DGP) considered is quite general. Some of the allowances are unit-specific constant and trend terms, cross-section heteroskedasticity, error serial correlation and cross-section dependence in the form of common factors. The asymptotic distributions of the tests are derived and evaluated in small samples using Monte Carlo simulation. An empirical illustration is also provided showing how how inflation of 20 developed countries has undergone a shift from I(0) to I(1).

The rest of the paper is organized as follows. Sections 2 and 3 present the model, the test statistics, and their asymptotic distributions, which are evaluated using simulations in Sect. 4. Section 5 reports the results from the empirical application. Section 6 concludes. Proofs of important results are provided in the Appendix.

2 Model and assumptions

Consider the panel data variable \(Y_{i,t}\), where \(i = 1,...,N\) and \(t = 1,...,T\) index the time-series and cross-sectional units, respectively. The DGP of this variable is given by

$$\begin{aligned}&\displaystyle Y_{i,t} = \theta _i'D_{t,p}+\lambda _{i}'F_t+ e_{i,t},\end{aligned}$$
(1)
$$\begin{aligned}&\displaystyle e_{i,t} = \mu _{i,t} + \varepsilon _{i,t}, \end{aligned}$$
(2)

where \(D_{t,p} = (1,t,...,t^p)'\) is a p-order trend polynomial such that \(D_{t,p} = 0\) is \(p=-1\), \(F_t\) is an \(r\times 1\) vector of common factors with \(\lambda _{i}\) being the corresponding vector of factor loadings, and \(\varepsilon _{i,t}\) is a mean zero and I(0) error term. The following three specifications of \(\mu _{i,t}\) are considered, where 1(A), \(\lfloor x\rfloor \), \(\eta _{i,t}\) and \(\tau _i^0\in [0,1]\) denote the indicator function of the event A, the integer part of x, a mean zero I(0) error term, and the break fraction, respectively:

  1. MU1.

    I(0) \(\rightarrow \) I(1): \(\mu _{i,t}=\mu _{i,t-1}+1(t>\lfloor T\tau _{i}^0\rfloor )\eta _{i,t}\).

  2. MU2.

    I(1) \(\rightarrow \) I(0): \(\mu _{i,t}=\mu _{i,t-1}+1(t \le \lfloor T\tau _{i}^0\rfloor )\eta _{i,t}\).

  3. MU3.

    Unknown direction: I(0) \(\rightarrow \) I(1) or I(1) \(\rightarrow \) I(0).

Under MU1 \(Y_{i,t}\) is I(0) up to and including time \(\lfloor \tau _{i}^0 T\rfloor \) but is I(1) after the break, provided that \(\sigma _{\eta , i}^2 = \mathrm {var}(\eta _{i,t}) > 0\). Under MU2 \(Y_{i,t}\) is I(1) up to and including time \(\lfloor \tau _{i}^0 T\rfloor \) but it is I(0) after the break, provided again that \(\sigma _{\eta ,i}^2 > 0\). Therefore, the hypothesis of stationarity against a shift in persistence from I(0) to I(1) or viceversa can be stated as \(H_0:\sigma _{\eta ,1}^{2}= ...= \sigma _{\eta ,N}^{2}=0\) versus \(H_1:\sigma _{\eta ,i}^{2} > 0\) for at least some i. Whenever the alternative is I(1) \(\rightarrow \) I(0) we write “\(H_1:\) I(1) \(\rightarrow \) I(0)”, whereas if the alternative is I(1) \(\rightarrow \) I(0), we write “\(H_{1}:\) I(1) \(\rightarrow \) I(0)”.

The conditions placed on the above DGP are given in Assumption 1, where \(C <\infty \), tr(A), \(||{A}|| = \sqrt{tr({A}' {A})}\), \(\rightarrow _p\) and \(\mathcal {F}_{i,t}\) denote a generic positive constant, the trace and Euclidean norm of the (generic) matrix A, convergence in probability, and the sigma-field generated by \(\{(\varepsilon _{i,n},\eta _{i,n})\}_{n=1}^t\), respectively.

Assumption 1

  1. (i)

    \(\varepsilon _{i,t} = \gamma _i(L) v_{i,t}\), where \(v_{i,t}\) is independent and identically distribution (iid) with \(E(v_{i,t}) = 0\), \(E(v_{i,t}^2) = 1\), \(E(v_{i,t}^8) \le C\), \(\gamma _i(L) = \sum _{j=0}^\infty \gamma _{ji}L^j\), \(\sum _{j=0}^\infty j||\gamma _{ji}|| \le C\) and \(\gamma _i(1)^2 > 0\);

  2. (ii)

    \(\eta _{i,t} = \phi _i(L) w_{i,t}\), where \(w_{i,t}\) is iid with \(E(w_{i,t}) = 0\), \(E(w_{i,t}^2) = 1\), \(E(w_{i,t}^8) \le C\), \(\phi _i(L) = \sum _{j=0}^\infty \phi _{ji}L^j\), \(\sum _{j=0}^\infty j||\phi _{ji}|| \le C\) and \(\phi _i(1)^2 > 0\);

  3. (iii)

    \(F_t\) is I(0) such that \(E(||F_t||^4) \le C\) and \(T^{-1} \sum _{t=1}^{T} F_t F'_t \rightarrow _p \Sigma _F > 0\);

  4. (iv)

    \(\varepsilon _{i,t}\), \(\eta _{i,t}\) and \(F_t\) are mutually independent;

  5. (v)

    \(\mu _{1,0} = ... = \mu _{N,0} = 0\);

  6. (vi)

    \(\lambda _i\) is deterministic such that \(||\lambda _i||^4 \le C\), \(N^{-1}\sum _{i=1}^N\lambda _i\lambda _{i}' \rightarrow \Sigma _\lambda > 0\) as \(N \rightarrow \infty \).

Remark 1

Assumption 1 puts restrictions on the time series and cross-sectional properties of \(\varepsilon _{i,t}\) and \(\eta _{i,t}\). The restrictions are very similar to the ones of Bai and Ng (2004), and we therefore refer to this other paper for a detailed discussion. The main difference when compared to Bai and Ng (2004) is that here \(F_t\) cannot be I(1). Thus, while \(Y_{i,t}\) may be cross-correlated, it cannot be affected by common stochastic trends. However, we would like to point out that this assumption is mainly for ease of interpretation of the test outcome, for if \(F_t\) is allowed to be I(1) the persistence of \(Y_{i,t}\) cannot be inferred from \(e_{i,t}\) alone, and in the present paper we focus on the testing of \(e_{i,t}\). Hence, analogous to the PANIC approach of Bai and Ng (2004), if \(F_t\) is permitted to be I(1), then we also need to test this variable.

3 The test statistics

The general testing idea is to first purge the effect of \(F_t\), and then to submit the resulting residuals to a test for a change in persistence. The implementation of the first step depends on whether \(F_t\) is known or not.

3.1 \(F_t\) known

Consider the generic variable \(X_{i,t}\). The detrended version of this variables is henceforth denoted \(X_{i,t}^p = X_{i,t} - \sum _{n=1}^T X_{i,n}a_{n,t,p}\), where \(a_{n,k,p}=D_{n,p}' (\sum _{t=1}^T D_{t,p}D_{t,p}' )^{-1}D_{k,p}\) and \(p\ge 0\). If \(p= -1\), then we define \(X_{i,t}^p = X_{i,t}\). In this notation, the detrended and defactored version of \(Y_{i,t}\) is given by \(\hat{e}_{i,t} = Y_{i,t}^p - \hat{\lambda }_i'F_t^p\), where \(\hat{\lambda }_i\) is the least squares (LS) slope estimator in a regression of \(Y_{i,t}^p\) onto \(F_t^p\). Thus, while in this section \(F_t\) is assumed to be known, \(\lambda _i\) is still treated as unknown. Consider the following test statistic, which is suitable for testing if cross-section unit i is I(0) versus I(1) \(\rightarrow \) I(0) (see, for example, Kim 2000; Kim et al. 2002; Busetti and Taylor 2004):

$$\begin{aligned} K_{i,T}(\tau )=\frac{ (\lfloor T\tau \rfloor )^{2}}{(T-\lfloor T\tau \rfloor )^{2}} \frac{\sum _{t=\lfloor T\tau \rfloor +1}^T S_{i,t}^{ 1 }(\tau )^2}{\sum _{t=1}^{\lfloor T\tau \rfloor } S_{i,t}^{ 0 }(\tau )^2}, \end{aligned}$$

where \(\tau \in [0,1]\), \(S_{i,t}^{0}(\tau )=\sum _{n=1}^{t} \hat{e}_{i,n}\) and \(S_{i,t}^{1}(\tau )=\sum _{n= \lfloor T\tau \rfloor +1}^t \hat{e}_{i,n}\). The error sequences \(\{\hat{e}_{i,n}\}_{n=1}^{\lfloor T\tau \rfloor }\) and \(\{\hat{e}_{i,n}\}_{n=\lfloor T\tau \rfloor + 1}^T\) come from two separate regressions; while the former uses only the first \(\lfloor T\tau \rfloor \) observations, the latter uses only the last \(\lfloor T(1-\tau )\rfloor \) observations.

Remark 2

The \(K_{i,T}(\tau )\) test considered here is in the spirit of Kwiatkowski et al. (1992) in which the constant I(0) null is tested versus the constant I(1) alternative. An alternative approach is to follow Banerjee et al. (1992) and Leybourne et al. (2003) who use the Dickey–Fuller statistic, in which the null and the alternative hypotheses are reversed. Panel variants of these can be constructed in the same way as the one suggested below for \(K_{i,T}(\tau )\) (see Demetrescu and Hanck 2013, for such a proposal).

Let \(\mathcal {C} = [\tau _{min},\tau _{max}]\subseteq (0,1)\). In this paper, we consider three transformations to eliminate the dependence on \(\tau \) in \(K_{i,T}(\tau )\) (see, for example, Kim, 2000);

  1. T1.

    The maximum-Chow transformation:

    $$\begin{aligned} K_{i,T}^1 = \max _{s = \lfloor T\tau _{min}\rfloor ,..., \lfloor T\tau _{max}\rfloor } K_{i}(s/T). \end{aligned}$$
  2. T2.

    The mean-exponential transformation:

    $$\begin{aligned} K_{i,T}^2 = \ln \left( (\lfloor T(\tau _{max} - \tau _{min})\rfloor +1)^{-1}\sum _{s = \lfloor T\tau _{min}\rfloor }^{\lfloor T\tau _{max}\rfloor } \exp [K_{i}(s/T)] \right) . \end{aligned}$$
  3. T3.

    The mean score transformation:

    $$\begin{aligned} K_{i,T}^3 = (\lfloor T(\tau _{max} - \tau _{min})\rfloor +1)^{-1}\sum _{s = \lfloor T\tau _{min}\rfloor }^{\lfloor T\tau _{max}\rfloor } K_{i}(s/T). \end{aligned}$$
Table 1 Simulated mean and standard deviation normalization factors

In Appendix (Proof of Theorem 1), we show that \(K_{i,T}(\tau ) \rightarrow _w K_i(\tau )\) as \(T\rightarrow \infty \), where \(\rightarrow _w\) signifies weak convergence and \(K_i(\tau )\) is a certain ratio of stochastic integrals. Since \(K_1(\tau ),...,K_N(\tau )\) are iid, we may define \(\mu _{K,j}= E(K_{i}^j)\) and \(\sigma _{K,j}^2 = \mathrm {var}(K_{i}^j)\) for \(j\in \{ 1,2,3 \}\). Numerical values of \(\mu _{K,j}\) and \(\sigma _{K,j}\) are reported in Table 1. The proposed panel test statistic for testing \(H_0\) versus \(H_1:\) I(0) \(\rightarrow \) I(1) is given by

$$\begin{aligned} K_{NT}^j = \frac{1}{\sigma _{K,j} \sqrt{N}} \sum _{i=1}^N( K_{i,T}^j-\mu _{K,j}). \end{aligned}$$

For testing if cross-section unit i is I(0) versus I(1) \(\rightarrow \) I(0), the following “reverse” test statistic can be used (see Kim 2000; Kim et al. 2002; Busetti and Taylor 2004):

$$\begin{aligned} R_{i}(\tau )=(K_{i}(\tau ))^{-1}, \end{aligned}$$

which can be transformed using T1–T3 to eliminate the dependence on \(\tau \). The resulting transformed statistic is written in an obvious notation as \(R_{i}^j\). Based on this test statistic, we may define \(R_{NT}^j = \sigma _{R,j}^{-1} N^{-1/2} \sum _{i=1}^N (R_{i,T}^j-\mu _{R,j} )\) with obvious definitions of \(\sigma _{R,j}^{2}\) and \(\mu _{R,j}\). When the direction of the persistency is unknown, the following maximum statistic may be used:

$$\begin{aligned} M_{i,T}^j = \max \{K_{i,T}^j,R_{i,T}^j\}, \end{aligned}$$

which can again be normalized to obtain \(M_{NT}^j = \sigma _{M,j}^{-1}N^{-1/2} \sum _{i=1}^N (M_{i,T}^j-\mu _{M,j} )\).

Theorem 1

Under \(H_0\) and Assumption 1, as \(N,\,T \rightarrow \infty \) with \(N/T\rightarrow 0\),

$$\begin{aligned} K_{NT}^j,\,R_{NT}^j,\,M_{NT}^j \rightarrow _d N(0,1), \end{aligned}$$

where\(\rightarrow _d\)signifies convergence in distribution.

Remark 3

While the test statistics considered here are independent of \(\tau _1^0,...,\tau _N^0\), in applications it is sometimes useful to be able to estimate these parameters. This can be accomplished using the proposal of Kim (2000, Sect. 3.2), which basically amounts to setting \(\hat{\tau }_i^0\) equal to the suitably maximizing or minimizing value of \(K_{i,T}(\tau )\), depending on whether it is I(0) \(\rightarrow \) I(1) or I(1) \(\rightarrow \) I(0) that is being tested. Alternatively, we may follow Busetti and Taylor (2004, Sect. 6.2), who suggest setting \(\hat{\tau }_i^0\) equal to the value of \(\tau _i^0\) that minimizes the sum of squares of \(\hat{e}_{i,t}\).

Remark 4

The requirement that \(N/T \rightarrow 0\) is sufficient but not necessary and is needed to make sure that certain remainder terms are negligible. However, the order of these terms is not the sharpest possible. A more elaborate asymptotic analysis would be required to obtain the exact order. In Sect. 4, we use Monte Carlo simulation to evaluate the effect of N / T in small samples.

3.2 \(F_t\) unknown

The estimation of \(F_t\) can be performed in two ways; (i) unrestrictedly, or (ii) restricted under \(H_0\). In both cases, we follow the bulk of the previous literature and use the principal components method (see, for example, Bai and Ng 2004). The restricted estimator of \(F = ( F_{1},..., F_{T})'\), denoted \(\hat{F}^0 = (\hat{F}_{1}^0,...,\hat{F}_{T}^0)'\), is \(\sqrt{T}\) times the eigenvectors corresponding to the first r largest eigenvalues of the \(T \times T\) matrix \(Y^p(Y^p)'\), where \(Y^p = ( Y_{1}^p,..., Y_{N}^p)\) and \(Y_i^p = ( Y_{i,1}^p,..., Y_{i,T}^p)'\) are \(T\times N\) and \(T\times 1\), respectively. Under the normalization \(T^{-1}\hat{F}^0(\hat{F}^0)'= I_r \), the estimated loading matrix is \((\hat{\lambda }^0)' = (\hat{\lambda }_1^0,...,\hat{\lambda }_N^0)=T^{-1}(\hat{F}^0)' Y^p\). The restricted estimator of \(e_{i,t}\) that we will be considering can now be constructed as

$$\begin{aligned} \hat{e}_{i,t}^0 = Y_{i,t}^p - (\hat{\lambda }_i^0)'\hat{F}_t^0. \end{aligned}$$
(3)

Let \(X_{i,t}^{p-1}\) be \(X_{i,t}\) when detrended using a trend polynomial of order \(p-1\). Hence, \(X_{i,t}^{p-1} = X_{i,t}\) if \(p = 0\). Let \(f_t = \Delta F_t\) and \(y_{i,t} = \Delta Y_{i,t}\) (for \(t=2,...,T\)). The unrestricted estimators \(\hat{f}_t^1\) and \(\hat{\lambda }_i^1\) of (the space spanned by) \(f_t^{p-1}\) and \(\lambda _i\) are \(\hat{F}_t^0\) and \(\hat{\lambda }_i^0\), respectively, but with \(Y_{i,t}^p\) replaced by \(y_{i,t}^{p-1}\). Let

$$\begin{aligned} \tilde{e}_{i,t}^1 = \sum _{n=2}^t \left[ y_{i,n}^{p-1} - (\hat{\lambda }_i^1)'\hat{f}_n^1\right] , \end{aligned}$$
(4)

where \(\tilde{e}_{i,1}^1 = 0\). The unrestricted estimator \(\hat{e}_{i,t}^1\) of \(e_{i,t}\) is given by \(\hat{e}_{i,t}^1 = (\tilde{e}_{i,t}^1)^p\). The appropriate test statistics to consider when \(F_t\) is unknown, henceforth denoted \(K_{hNT}^j\), \(R_{hNT}^j\) and \(M_{hNT}^j\) for \(h \in \{ 0,1 \}\), are given by \(K_{NT}^j\), \(R_{NT}^j\) and \(M_{NT}^j\), respectively, with \(\hat{e}_{i,t}\) replaced by \(\hat{e}_{i,t}^h\).

Table 2 5% size and power when testing I(0) \(\mathrm \rightarrow \) I(1) and \(\rho = 0.3\)
Table 3 5% size and power when testing I(0) \(\mathrm \rightarrow \) I(1) and \(\rho = 0.6\)
Table 4 5% size and power when testing I(1) \(\mathrm \rightarrow \) I(0) and \(\rho = 0.3\)
Table 5 5% size and power when testing I(1) \(\mathrm \rightarrow \) I(0) and \(\rho = 0.6\)
Table 6 Empirical test results

Theorem 2

Under \(H_0\) and Assumptions 1, as \(N,\,T \rightarrow \infty \) with \(N/T\rightarrow 0\),

$$\begin{aligned} K_{hNT}^j,\,R_{hNT}^j,\,M_{hNT}^j \rightarrow _d N(0,1). \end{aligned}$$

As Theorem 2 makes clear, the factors can be unknown and still the asymptotic distributions of the test statistics are N(0, 1). This is in agreement with the results reported by Bai and Ng (2004) for their pooled panel unit root tests.

4 Monte Carlo simulations

A small-scale Monte Carlo study was conducted to investigate the properties of the new tests in small samples. The DGP is given by a restricted version of (1)–(2) that sets \(\varepsilon _{i,t} \sim N(0,\sigma _{\varepsilon , i}^2)\), \(\eta _{i,t} \sim N(0,\sigma _{\eta }^2)\), \(\sigma _{\eta }\in \{0,0.25,0.5\}\), \(\tau _i^0 \sim U(0.3,0.7)\), \(r = 1\), and \(F_t=\rho F_{t-1} + v_t\), where \(v_t\sim N(0,1)\) and \(\rho \in \{0.3,0.6\}\) (see, for example, Gengenbach et al. 2010, for a similar parametrization). For \(\sigma _{\varepsilon , i}\), we consider two cases. In the first, \(\sigma _{\varepsilon , i} = 1\) for all i, while in the second, \(\sigma _{\varepsilon , i} \sim U (1,2)\). Since a more volatile idiosyncratic error will make \(F_t\) more difficult to discern, we expect that the results for the second case will deteriorate when compared to the first. All results are based on 1,000 replications of samples of size \(N\in \{ 5, 10, 20 \}\) and \(T\in \{50,100\}\). Also, following Kim (2000), \(\mathcal {C}=[0.20,0.80]\). Results were obtained for \(p \in \{ 0,1\}\), although in this paper we focus on the results for the empirically most common specification with \(p = 0\) (a constant but no trend). The results for \(p=1\) (constant and trend) can be obtained upon request. Both the restricted and unrestricted factor estimation methods were simulated. Interestingly, the restricted method led to better results in terms of both size accuracy and power. In this paper, we therefore only report the results for the restricted method, where the number of common factors is determined using the \(IC_2\) criterion of Bai and Ng (2002) with a maximum of three factors.Footnote 1

The 5% size and power results are reported in Tables 2, 3, 4 and 5. While Tables 2 (\(\rho = 0.3\)) and 3 (\(\rho = 0.6\)) contain the results for the tests of I(0) \(\rightarrow \) I(1), Tables 4 (\(\rho = 0.3\)) and 5 (\(\rho = 0.6\)) contain the corresponding results for I(1) \(\rightarrow \) I(0). The information content of these tables may be summarized as follows.

  • All tests have good size accuracy when \(\sigma _{\varepsilon , i} = 1\) and \(\rho = 0.3\). This is true for all constellations of T and N considered, although the distortions do have a tendency to increase slightly in N, which is consistent with the previous panel unit root literature (see Westerlund and Breitung 2013, for a discussion). While there are no big differences, the best size accuracy is generally obtained by using \(K_{NT}^2\), \(R_{NT}^2\) and \(M_{NT}^2\), whereas \(K_{NT}^3\), \(R_{NT}^1\) and \(R_{NT}^3\) generally leads to the worst accuracy.

  • As expected, increases in \(\rho \) and/or \(\sigma _{\varepsilon , i}\) generally lead to reduced size accuracy, although the distortions are never very large. This is true regardless of the direction of the change in persistence. In fact, the results are remarkably stable, given that the test statistics do not require any corrections to account for nuisance parameters.

  • All tests perform quite well in terms of power, and there are clear improvements as N and/or T increases. The fact that power is not only increasing in T, but also in N illustrates the advantage of accounting for the cross-sectional variation of the data. Power is also increasing in the distance to the null, as measured by \(\sigma _\eta \), which is again just as expected.

5 Empirical illustration

The question of whether inflation should be considered as I(0) or I(1) has been subject to a long debate. According to recent studies (see, for example, Kim 2000; Busetti and Taylor 2004), however, inflation may be better characterized by a change in persistence between separate I(1) and I(0) regimes rather than simply I(1) or I(0) behavior. The purpose of this illustration is to test this hypothesis using a large panel of quarterly CPI inflation data covering 20 countries (Australia, Austria, Belgium, Canada, Denmark, Finland, France, Germany, Greece, Italy, Japan, Korea, the Netherlands, New Zealand, Norway, Spain, Sweden, Switzerland, the UK and the US) between 1970:1 and 2013:4. All data are taken from OECD Main Economic Indicators.

The number of common factors is determined in the same way as in the simulations. As is customary when dealing with inflation (see, for example, Leybourne et al. 2003), the tests are fitted with a constant but no trend. The results are reported in Table 6. The first thing to note is that while in case of \(K_{NT}^1\), \(K_{NT}^2\) and \(K_{NT}^3\) there is no evidence against the I(0) null, \(R_{NT}^1\), \(R_{NT}^2\) and \(R_{NT}^3\) all lead to a clear rejection. This is true even at the most conservative 1% level. We therefore conclude that inflation has been subject to a change in persistence from I(1) to I(0), which is in agreement with the recent empirical literature based on US data (see, for example, Busetti and Taylor 2004; Harvey et al. 2006). A common explanation for the observed change in persistence of inflation in the US is that it is due to the stock market collapse of the late 1980s and the recession that followed it. One interpretation of the results reported in the current paper is therefore that they reflect the worldwide recession of the early 1990s, which was to a large extent triggered by the recession in the US. Another possibility is that the results reflect in part monetary policy shifts (see, for example, Davig and Doh 2014, and the references provided therein).

6 Conclusion

This paper develops panel tests that are suitable for testing the null hypothesis of stationarity against the alternative of a change in persistence from I(0) to I(1), from I(1) to I(0), or when the direction is unknown. The DGP used for this purpose is quite general and allows unit-specific constant and trend terms, cross-section heteroskedasticity, error serial correlation and cross-section dependence in the form of common factors.