ARFIMA - Autoregressive Fractionally Integrated Moving Average (Hosking 1984)¶
| Type | Parametric |
| Resolution | Monthly / Annual |
| Sites | Univariate |
| Class | ARFIMAGenerator |
Overview¶
ARFIMA(p,d,q) extends classical ARMA models by allowing the differencing parameter d to take fractional values in (0, 0.5), enabling the model to reproduce long-range dependence (LRD) observed in hydrologic timeseries. The fractional differencing operator produces hyperbolic decay in the autocorrelation function, matching the Hurst phenomenon that standard AR/ARMA models fundamentally cannot capture. This makes ARFIMA particularly important for generating synthetic flows that preserve multi-year drought persistence and low-frequency variability.
The Hurst exponent H is related to the fractional differencing parameter by H = d + 0.5, providing a direct link between the model parameter and the observed long-memory behavior of the timeseries.
Algorithm¶
Preprocessing¶
- Validate input as univariate timeseries with at least 30 timesteps (50+ recommended for reliable d estimation).
- Deseasonalize if monthly data: remove monthly means and divide by monthly standard deviations to produce stationary residuals.
Fitting¶
- Estimate fractional differencing parameter d using one of:
- Whittle estimator (frequency-domain MLE, recommended; Fox & Taqqu 1986): where I(w_j) is the periodogram and the spectral density is: derived from |1 - e^{-iw}|^{-2d} = [2(1 - cos(w))]^{-d} (Hosking 1981, eq. 2.3).
- R/S analysis for Hurst exponent: compute H, then d = H - 0.5.
-
GPH (Geweke-Porter-Hudak) log-periodogram regression (Geweke & Porter-Hudak 1983):
OLS regression on the m = sqrt(n) lowest Fourier frequencies; slope = -d. -
Apply fractional differencing to obtain the fractionally differenced series:
where the fractional differencing coefficients are: Truncate the infinite sum at lag K (default: 100). -
Fit ARMA(p,q) to the differenced series:
- If q = 0: Yule-Walker equations for AR(p) (classical approach).
- If q > 0: Conditional Sum of Squares (CSS) estimation, which jointly estimates AR and MA coefficients by minimizing the sum of squared one-step-ahead prediction errors (Chung and Baillie, 1993). Initial AR guess from Yule-Walker; optimized via L-BFGS-B with bounds (-0.99, 0.99) for stationarity/invertibility.
- If
auto_order=True: grid search over p in {0, 1, 2} and q in {0, 1, 2}, selecting by BIC (proven consistent for ARFIMA; Huang et al. 2022). -
Store AR coefficients phi, MA coefficients theta, and innovation variance sigma_eps^2.
-
Store all fitted parameters: d, phi, theta, sigma_eps^2, seasonal means/stds (if monthly), truncation lag K.
Generation¶
-
Generate ARMA innovations for the differenced series:
-
Invert fractional differencing via MA convolution (FIR filter) to recover the long-memory process:
where the inverse coefficients are: Note: this is a moving-average (convolution) over the ARMA series W, not an autoregressive recursion over X. -
Re-seasonalize if monthly: multiply by monthly standard deviations and add monthly means.
-
Enforce non-negativity: clip to zero or apply back-transformation if log-space was used.
Parameters¶
| Parameter | Type | Default | Description |
|---|---|---|---|
Q_obs |
pd.Series or pd.DataFrame |
- | Observed streamflow with DatetimeIndex |
p |
int |
1 |
AR order for the short-memory component |
q |
int |
0 |
MA order for the short-memory component |
auto_order |
bool |
False |
Select (p, q) via BIC grid search (Huang et al. 2022) |
d_method |
str |
'whittle' |
Estimation method for d: 'whittle', 'gph', or 'rs' |
truncation_lag |
int |
100 |
Truncation lag K for fractional differencing coefficients |
deseasonalize |
bool |
True |
Remove monthly seasonality before fitting (set False for annual data) |
name |
str |
None |
Optional name identifier for this generator instance |
debug |
bool |
False |
Enable debug logging |
Properties Preserved¶
- Long-range dependence / Hurst exponent (directly parameterized via d)
- Lag-1 through lag-p short-memory autocorrelation (via AR component)
- Short-memory MA structure when q > 0 (via CSS-estimated MA component)
- Monthly means and standard deviations (via deseasonalization)
- Power spectrum at low frequencies (hyperbolic decay)
Not preserved: - Spatial correlations (univariate method) - Non-Gaussian marginal distributions (Gaussian innovations assumed) - Non-stationarity or trends
Limitations¶
- Univariate only. For multisite applications, combine with spatial correlation methods
- Requires long records (50+ years) for reliable estimation of d
- Gaussian innovation assumption may underrepresent extreme events
- Truncation of infinite fractional differencing series introduces approximation error
- CSS estimation for MA component has known small-sample bias (Chung and Baillie, 1993)
References¶
Primary: Hosking, J.R.M. (1984). Modeling persistence in hydrological time series using fractional differencing. Water Resources Research, 20(12), 1898-1908. https://doi.org/10.1029/WR020i012p01898
See also: - Granger, C.W.J., and Joyeux, R. (1980). An introduction to long-memory time series models and fractional differencing. Journal of Time Series Analysis, 1(1), 15-29. https://doi.org/10.1111/j.1467-9892.1980.tb00297.x - Geweke, J., and Porter-Hudak, S. (1983). The estimation and application of long memory time series models. Journal of Time Series Analysis, 4(4), 221-238. https://doi.org/10.1111/j.1467-9892.1983.tb00371.x - Fox, R., and Taqqu, M.S. (1986). Large-sample properties of parameter estimates for strongly dependent stationary Gaussian time series. The Annals of Statistics, 14(2), 517-532. - Montanari, A., Rosso, R., and Taqqu, M.S. (1997). Fractionally differenced ARIMA models applied to hydrologic time series: Identification, estimation, and simulation. Water Resources Research, 33(5), 1035-1044. https://doi.org/10.1029/97WR00043 - Koutsoyiannis, D. (2002). The Hurst phenomenon and fractional Gaussian noise made easy. Hydrological Sciences Journal, 47(4), 573-595. - Chung, C.-F., and Baillie, R.T. (1993). Small sample bias in conditional sum-of-squares estimators of fractionally integrated ARMA models. Empirical Economics, 18, 791-806. - Huang, J., Chan, N.H., Chen, S., and Ing, C.-K. (2022). Consistent order selection for ARFIMA processes. Annals of Statistics, 50(3).
Implementation: src/synhydro/methods/generation/parametric/arfima.py
Tests: tests/test_arfima_generator.py