Autoregressive conditional heteroskedasticity

In econometrics, the autoregressive conditional heteroskedasticity (ARCH) model is a statistical model for time series data that describes the variance of the current error term or innovation as a function of the actual sizes of the previous time periods' error terms; often the variance is related to the squares of the previous innovations. The ARCH model is appropriate when the error variance in a time series follows an autoregressive (AR) model; if an autoregressive moving average (ARMA) model is assumed for the error variance, the model is a generalized autoregressive conditional heteroskedasticity (GARCH) model.

ARCH models are commonly employed in modeling financial time series that exhibit time-varying volatility and volatility clustering, i.e. periods of swings interspersed with periods of relative calm (this is, when the time series exhibits heteroskedasticity). ARCH-type models are sometimes considered to be in the family of stochastic volatility models, although this is strictly incorrect since at time t the volatility is completely predetermined (deterministic) given previous values.

Model specification

To model a time series using an ARCH process, let <math> ~\epsilon_t~ </math>denote the error terms (return residuals, with respect to a mean process), i.e. the series terms. These <math> ~\epsilon_t~ </math> are split into a stochastic piece <math>z_t</math> and a time-dependent standard deviation <math>\sigma_t</math> characterizing the typical size of the terms so that

:<math> ~\epsilon_t=\sigma_t z_t ~</math>

The random variable <math>z_t</math> is a strong white noise process. The series <math> \sigma_t^2 </math> is modeled by

:<math> \sigma_t^2=\alpha_0+\alpha_1 \epsilon_{t-1}^2+\cdots+\alpha_q \epsilon_{t-q}^2 = \alpha_0 + \sum_{i=1}^q \alpha_{i} \epsilon_{t-i}^2 </math>,

:where <math> ~\alpha_0>0~ </math> and <math> \alpha_i\ge 0,~i>0</math>.

An ARCH(q) model can be estimated using ordinary least squares. A method for testing whether the residuals <math> \epsilon_t </math> exhibit time-varying heteroskedasticity using the Lagrange multiplier test was proposed by Engle (1982). This procedure is as follows:

Estimate the best fitting autoregressive model AR(q) <math> y_t = a_0 + a_1 y_{t-1} + \cdots + a_q y_{t-q} + \epsilon_t = a_0 + \sum_{i=1}^q a_i y_{t-i} + \epsilon_t </math>.
Obtain the squares of the error <math> \hat \epsilon^2 </math> and regress them on a constant and q lagged values:
: <math> \hat \epsilon_t^2 = \alpha_0 + \sum_{i=1}^{q} \alpha_i \hat \epsilon_{t-i}^2</math>
: where q is the length of ARCH lags.
The null hypothesis is that, in the absence of ARCH components, we have <math> \alpha_i = 0 </math> for all <math> i = 1, \cdots, q </math>. The alternative hypothesis is that, in the presence of ARCH components, at least one of the estimated <math> \alpha_i </math> coefficients must be significant. In a sample of T residuals under the null hypothesis of no ARCH errors, the test statistic T'R² follows <math> \chi^2 </math> distribution with q degrees of freedom, where <math> T' </math> is the number of equations in the model which fits the residuals vs the lags (i.e. <math> T'=T-q </math>). If T'R² is greater than the Chi-square table value, we reject the null hypothesis and conclude there is an ARCH effect in the ARMA model. If T'R² is smaller than the Chi-square table value, we do not reject the null hypothesis.

GARCH

If an autoregressive moving average (ARMA) model is assumed for the error variance, the model is a generalized autoregressive conditional heteroskedasticity (GARCH) model.|date=October 2017

NAGARCH

Nonlinear Asymmetric GARCH(1,1) (NAGARCH) is a model with the specification:

This model should not be confused with the NARCH model, together with the NGARCH extension, introduced by Higgins and Bera in 1992.

IGARCH

Integrated Generalized Autoregressive Conditional heteroskedasticity (IGARCH) is a restricted version of the GARCH model, where the persistent parameters sum up to one, and imports a unit root in the GARCH process. The condition for this is

<math>

\sum^p_{i=1} ~\beta_{i} +\sum_{i=1}^q~\alpha_{i} = 1

</math>.

EGARCH

The exponential generalized autoregressive conditional heteroskedastic (EGARCH) model by Nelson & Cao (1991) is another form of the GARCH model. Formally, an EGARCH(p,q):

<math>\log\sigma_{t}^2=\omega+\sum_{k=1}^{q}\beta_{k}g(Z_{t-k})+\sum_{k=1}^{p}\alpha_{k}\log\sigma_{t-k}^{2}</math>

where <math>g(Z_{t})=\theta Z_{t}+\lambda(|Z_{t}|-E(|Z_{t}|))</math>, <math>\sigma_{t}^{2}</math> is the conditional variance, <math>\omega</math>, <math>\beta</math>, <math>\alpha</math>, <math>\theta</math> and <math>\lambda</math> are coefficients. <math>Z_{t}</math> may be a standard normal variable or come from a generalized error distribution. The formulation for <math>g(Z_{t})</math> allows the sign and the magnitude of <math>Z_{t}</math> to have separate effects on the volatility. This is particularly useful in an asset pricing context.

Since <math>\log\sigma_{t}^{2}</math> may be negative, there are no sign restrictions for the parameters.

GARCH-M

The GARCH-in-mean (GARCH-M) model adds a heteroskedasticity term into the mean equation. It has the specification:

<math>

y_t = ~\beta x_t + ~\lambda ~\sigma_t + ~\epsilon_t

</math>

The residual <math> ~\epsilon_t </math> is defined as:

<math>

~\epsilon_t = ~\sigma_t ~\times z_t

</math>

QGARCH

The Quadratic GARCH (QGARCH) model by Sentana (1995) is used to model asymmetric effects of positive and negative shocks.

In the example of a GARCH(1,1) model, the residual process <math> ~\sigma_t </math> is

<math>

~\epsilon_t = ~\sigma_t z_t

</math>

where <math> z_t </math> is i.i.d. and

<math>

~\sigma_t^2 = K + ~\alpha ~\epsilon_{t-1}^2 + ~\beta ~\sigma_{t-1}^2 + ~\phi ~\epsilon_{t-1}

</math>

GJR-GARCH

Similar to QGARCH, the Glosten-Jagannathan-Runkle GARCH (GJR-GARCH) model by Glosten, Jagannathan and Runkle (1993) also models asymmetry in the ARCH process. The suggestion is to model <math> ~\epsilon_t = ~\sigma_t z_t </math> where <math> z_t </math> is i.i.d., and

<math>

~\sigma_t^2 = K + ~\delta ~\sigma_{t-1}^2 + ~\alpha ~\epsilon_{t-1}^2 + ~\phi ~\epsilon_{t-1}^2 I_{t-1}

</math>

where <math> I_{t-1} = 0 </math> if <math> ~\epsilon_{t-1} \ge 0 </math>, and <math> I_{t-1} = 1 </math> if <math> ~\epsilon_{t-1} < 0 </math>.

TGARCH model

The Threshold GARCH (TGARCH) model by Zakoian (1994) is similar to GJR GARCH. The specification is one on conditional standard deviation instead of conditional variance:

<math>

~\sigma_t = K + ~\delta ~\sigma_{t-1} + ~\alpha_1^{+} ~\epsilon_{t-1}^{+} + ~\alpha_1^{-} ~\epsilon_{t-1}^{-}

</math>

where <math> ~\epsilon_{t-1}^{+} = ~\epsilon_{t-1} </math> if <math> ~\epsilon_{t-1} > 0 </math>, and <math> ~\epsilon_{t-1}^{+} = 0 </math> if <math> ~\epsilon_{t-1} \le 0 </math>. Likewise, <math> ~\epsilon_{t-1}^{-} = ~\epsilon_{t-1} </math> if <math> ~\epsilon_{t-1} \le 0 </math>, and <math> ~\epsilon_{t-1}^{-} = 0 </math> if <math> ~\epsilon_{t-1} > 0 </math>.

fGARCH

Hentschel's fGARCH model, also known as Family GARCH, is an omnibus model that nests a variety of other popular symmetric and asymmetric GARCH models including APARCH, GJR, AVGARCH, NGARCH, etc.

COGARCH

In 2004, Claudia Klüppelberg, Alexander Lindner and Ross Maller proposed a continuous-time generalization of the discrete-time GARCH(1,1) process. The idea is to start with the GARCH(1,1) model equations

:<math>\epsilon_t = \sigma_t z_t,</math>

:<math>\sigma_t^2 = \alpha_0 + \alpha_1 \epsilon^2_{t-1} + \beta_1 \sigma^2_{t-1} = \alpha_0 + \alpha_1 \sigma_{t-1}^2 z_{t-1}^2 + \beta_1 \sigma^2_{t-1}, </math>

and then to replace the strong white noise process <math> z_t </math> by the infinitesimal increments <math> \mathrm{d}L_t </math> of a Lévy process <math> (L_t)_{t\geq0} </math>, and the squared noise process <math> z^2_t </math> by the increments <math> \mathrm{d}[L,L]^\mathrm{d}_t </math>, where

:<math> [L,L]^\mathrm{d}_t = \sum_{s\in[0,t]} (\Delta L_t)^2,\quad t\geq0, </math>

is the purely discontinuous part of the quadratic variation process of <math> L </math>. The result is the following system of stochastic differential equations:

:<math>\mathrm{d}G_t = \sigma_{t-} \,\mathrm{d}L_t,</math>

:<math>\mathrm{d}\sigma_t^2 = (\beta - \eta \sigma^2_t)\,\mathrm{d}t + \varphi \sigma_{t-}^2 \,\mathrm{d}[L,L]^\mathrm{d}_t, </math>

where the positive parameters <math> \beta </math>, <math> \eta </math> and <math> \varphi </math> are determined by <math> \alpha_0 </math>, <math> \alpha_1 </math> and <math> \beta_1 </math>. Now given some initial condition <math> (G_0,\sigma^2_0) </math>, the system above has a pathwise unique solution <math> (G_t,\sigma^2_t)_{t\geq0} </math> which is then called the continuous-time GARCH (COGARCH) model.

MF2-GARCH

The multiplicative factor multi-frequency GARCH (MF2-GARCH) was proposed by Conrad and Engle (2025), and it features stationary returns and allows for recursive long-term volatility forecasts. They exploit the fact that daily standardized volatility forecast errors of one-component GARCH models are essentially unpredictable based on past daily standardized forecast errors, but a rolling window moving average of past daily standardized forecast errors does have predictive power. The MF2-GARCH, <math>\epsilon_t=\sqrt{\sigma_t^2 \tau_t} z_t</math>, where <math> z_t</math> is standard Gaussian, combines a short-term GJR-GARCH component

:<math>

h_{t} = (1-\phi) + \left(\alpha + \gamma \mathbf{1}_{\{\eta_{d,t-1}<0\\right) \frac{\eta_{d,t-1}^2}{\tau_{t-1 + \beta h_{t-1}

</math>

with <math> ~\alpha >0, \alpha+\gamma>0, \beta>0</math> and <math> ~\phi = \alpha+\gamma/2+\beta < 1</math>, and a long-term component specified as a multiplicative error model (MEM) for the past forecast errors of the GARCH component, exploiting the predictability in the averaged standardized forecast errors of the short-term component.

:<math>

\tau_t = \lambda_0 + \lambda_1 \frac{1}{m} \sum_{j=1}^{m} \frac{\eta_{d,t-j}^2}{h_{t-j + \lambda_2 \tau_{t-1}

</math>

with <math> ~\lambda_0 >0, \lambda_1 >0, \lambda_2>0</math> and <math> ~\lambda_1 + \lambda_2 <1</math>. <math> m </math> is chosen by minimizing the Bayesian Information Criterion (BIC, SIC).

Empirically, the long-term volatility component is closely linked to news about macroeconomics and monetary policy. The immediate reaction of stock market indices to U.S. macroeconomic announcements (e.g., initial jobless claims or incoming orders) depends on the level of long-term stock market volatility.

ZD-GARCH

An ARCH model without intercept was proposed by Hafner and Preminger (2015), who set the intercept term to zero (<math>~\omega=0</math>), in the first order ARCH model <math> ~\epsilon_t = ~\sigma_t z_t </math>, where <math> z_t </math> is i.i.d., and the conditional variance is:

:<math>

~\sigma_t^2 = ~\alpha_{1} ~\epsilon_{t-1}^2.

</math>

This model was extended by Li, Zhang, Zhu and Ling (2018) which consider the Zero-Drift GARCH (ZD-GARCH) with the specification:

:<math>

~\sigma_t^2 = ~\alpha_{1} ~\epsilon_{t-1}^2 + ~\beta_{1} ~\sigma_{t-1}^2.

</math>

The ZD-GARCH model does not require <math> ~\alpha_{1} + ~\beta_{1}= 1 </math>, and hence it nests the Exponentially weighted moving average (EWMA) model in "RiskMetrics". Since <math> ~\omega= 0 </math>, the ZD-GARCH model is always non-stationary, and its statistical inference methods are quite different from those for the classical GARCH model. Based on the historical data, the parameters <math> ~\alpha_{1} </math> and <math> ~\beta_{1} </math> can be estimated by the generalized QMLE method.

Spatial and Spatiotemporal GARCH

Spatial GARCH processes by Otto, Schmid and Garthoff (2018) are considered as the spatial equivalent to the temporal generalized autoregressive conditional heteroscedasticity (GARCH) models. In contrast to the temporal ARCH model, in which the distribution is known given the full information set for the prior periods, the distribution is not straightforward in the spatial and spatiotemporal setting due to the contemporaneous dependence between neighboring spatial locations. The spatial model is given by <math> ~\epsilon(s_i) = ~\sigma(s_i) z(s_i) </math> and

:<math>

~\sigma(s_i)^2 = ~\alpha_i + \sum_{v=1}^{n} \rho w_{iv} \epsilon(s_v)^2,

</math>

where <math> ~s_i</math> denotes the <math> i</math>-th spatial location and <math> ~w_{iv}</math> refers to the <math> iv</math>-th entry of a spatial weight matrix and <math> w_{ii}=0</math> for <math>~i = 1, ..., n </math>. The spatial weight matrix defines which locations are considered to be adjacent.

In spatiotemporal extensions, the conditional variance is modelled as a joint function of spatially lagged past squared observations and temporally lagged volatilities, allowing for both cross-sectional and serial dependence. These models have been applied in fields such as environmental statistics, regional economics, and financial econometrics, where shocks can propagate over space and time. Recent reviews summarise methodological developments, estimation techniques, and applications across disciplines. This results in a nonparametric modelling scheme, which allows for: (i) advanced robustness to overfitting, since the model marginalises over its parameters to perform inference, under a Bayesian inference rationale; and (ii) capturing highly-nonlinear dependencies without increasing model complexity.

Autoregressive conditional heteroskedasticity

Model specification

GARCH

NAGARCH

IGARCH

EGARCH

GARCH-M

QGARCH

GJR-GARCH

TGARCH model

fGARCH

COGARCH

MF2-GARCH

ZD-GARCH

Spatial and Spatiotemporal GARCH

References

Further reading