Alysha M De Livera, Rob J Hyndman and Ralph D Snyder
Journal of the American Statistical Association (2011) 106(496), 1513-1527.
A new innovations state space modeling framework, incorporating Box-Cox transformations, Fourier series with time varying coefficients and ARMA error correction, is introduced for forecasting complex seasonal time series that cannot be handled using existing forecasting models. Such complex time series include time series with multiple seasonal periods, high frequency seasonality, non-integer seasonality and dual-calendar effects. Our new modelling framework provides an alternative to existing exponential smoothing models, and is shown to have many advantages. The methods for initialization and estimation, including likelihood evaluation, are presented, and analytical expressions for point forecasts and interval predictions under the assumption of Gaussian errors are derived, leading to a simple, comprehensible approach to forecasting complex seasonal time series. Our trigonometric formulation is also presented as a means of decomposing complex seasonal time series, which cannot be decomposed using any of the existing decomposition methods. The approach is useful in a broad range of applications, and we illustrate its versatility in three empirical studies where it demonstrates excellent forecasting performance over a range of prediction horizons. In addition, we show that our trigonometric decomposition leads to the identification and extraction of seasonal components, which are otherwise not apparent in the time series plot itself.
Keywords: exponential smoothing, Fourier series, prediction intervals, seasonality, state space models, time series decomposition.
To read the data into R:
library(forecast) # (a) U.S. finished motor gasoline products supplied # (thousands of barrels per day), # weekly data from February 1991 to July 2005. gas <- read.csv("http://robjhyndman.com/data/gasoline.csv")[,1] gas <- ts(gas, start=1991+31/365.25, frequency = 365.25/7) # (b) Number of calls handled on weekdays between 7:00 am and 9:05 pm # Five-minute call volume from March 3, 2003, to May 23, 2003 # in a large North American commercial bank. calls <- unlist(read.csv("http://robjhyndman.com/data/callcenter.txt", header=TRUE,sep="\t")) calls <- msts(calls, start=2003 + (31+28+2)/365.25, seasonal.periods = c(169, 169*5)) # (c) Turkish electricity demand data. # Daily data from 1 January 2000 to 31 December 2008. telec <- read.csv("http://robjhyndman.com/data/turkey_elec.csv") telec <- msts(telec, start=2000, seasonal.periods = c(7,354.37,365.25))
- p.1517. .