Inferring variance dynamics from an appropriate variance proxy is an alternative to the classical approach. Popular examples of such proxies are the realized variance and the Parkinson daily range estimator (Parkinson, 1980).
The realized variance for a certain trading day is defined as the sum of the squared intraday returns. The intraday returns used by V-Lab are computed in five minutes intervals.
The Parkinson daily range estimator is defined as:
where and are, respectively, the maximum and minimum log prices between the opening and the closing of day . Under the assumption of constant variance and zero drift within the day, the range is an unbiased estimator of the latent variance of returns.
Both the realized variance and the range are estimators that only use information recorded between the opening and the closing of the trading day, so they do not take into account the overnight information. There are several solutions suggested in the literature that account for the overnight information. V-Lab corrects the variance proxy (the realized variance or the daily range) by multiplying the series by a time varying adjustment factor in order to ensure that the average of the variance proxy approximately matches the sample variance of the returns. The adjustment factor is obtained as follows.
Let be variance proxy at time , i.e. is the realized variance or the high-low range at time . And let be the exponentially weighted moving average of the variance proxy . That is:
for some . Analogously, let be the exponentially weighted moving average of the squared return at time , so:
V-Lab uses and, for the initial conditions of the recursion, V-Lab sets and .
Then, the adjustment factor is defined as:
and, finally, the adjusted variance proxy is defined as:
From now on, when we say variance proxy, we actually mean the adjusted variance proxy , and that is the notation we use in the following subsections.
Consider a variance proxy (realized variance or high-low range, for instance) time series , where and is any distribution with non-negative support with unit mean and variance . So, is the conditional mean of .
Despite of having serially uncorrelated error terms, the variance proxy time series, , does not need to be serially independent. For instance, its conditional mean may depend on past information. The Multiplicative Error Model () assumes a specific parametric form for this dependence. More specifically, we say that if:
V-Lab assumes that the distribution of the fundamental innovation is Chi-Square, i.e. . Notice that it does not imply that the variance proxy (realized variance or high-low range) is Chi-Squared distributed. Even though its conditional distribution is Chi-Square, its unconditional distribution may be far more complex. In fact, assuming that the conditional distribution is Chi-Square is not as restrictive as it seems, since the parameters can be estimated by Quasi-Maximum Likelihood (QML). Anyway, in a more general scenario, one could assume that the conditional distribution is Gamma, and its parameters could also be estimated by QML. This would be more general because the Gamma distribution nests both the Chi- Square and the Exponential distributions.
Notice that modeling the variance proxy (realized variance or high-low range) with the model captures a stylized fact in financial time series, variance (hence, volatility) clustering. The variance proxy is more likely to be high at time if it was also high at time . Another way of seeing this is noting that a shock at time also impacts the variance proxy at time . However, if , the variance proxy itself is mean reverting, and it fluctuates around:
the unconditional variance proxy implied by the model.
Usual restrictions on the parameters are . Though, it is possible to have and ; the conditional variance proxy is then an integrated process (shocks to the variance are persistent).
Let be the last observation in the sample, and let , and be the QML estimators of the parameters , and respectively. The model implies that the forecast of the conditional expectation of the variance proxy (realized variance or high-low range) at time is:
and so, by applying the above formula iteratively, we can forecast the conditional expectation of the variance proxy for any horizon . Then, the forecast of the proxy for the compound volatility at time is:
Notice that, for large , this forecast of the proxy for the compound volatility converges to:
scaling over the horizon with the well known square-root law, times the square root of the estimate of the unconditional expectation of the variance proxy implied by the model.
The Asymmetric (), as the , is able to capture the stylized fact that negative shocks at time have a stronger impact in the variance proxy at time , than positive shocks. This asymmetry used to be called leverage effect because the increase in risk was believed to come from the increased leverage induced by the negative shocks, but nowadays we know that this channel is just too small. The specification is:
where , and:
where is the return at time . So, the effective coefficient associated with a negative shock is . In financial time series, we generally find that is positive and statistically significant. Notice that the is a restricted version of the , with .
The specific model just described can be generalized to account for more lags in the conditional variance proxy. A model assumes that:
The best model ( and ) can be chosen, for instance, by Bayesian Information Criterion (BIC), also known as Schwarz Information Criterion (SIC), or by Akaike Information Criterion (AIC). The former tends to be more parsimonious than the latter. V-Lab uses and though, because this is usually the option that best fits financial time series.
An analogous generalization could be done with the model.
Engle, R. F., 2002. New Frontiers for ARCH Models. Journal of Applied Econometrics 17: 425-446. https://doi.org/10.1002/jae.683
Parkinson, M., 1980. The Extreme Value Method for Estimating The Variance of The Rate of Return. Journal of Business 53: 61-65. https://www.jstor.org/stable/2352357