3.5 Methods used by official statistics agencies
Official statistics agencies (such as the US Census Bureau and the Australian Bureau of Statistics) are responsible for a large number of official economic and social time series. These agencies have developed their own decomposition procedures, which are used for seasonal adjustment. Most of them use variants of the X-11 method, or the SEATS method, or a combination of the two. These methods are designed specifically to work with quarterly and monthly data, which are the most common series handled by official statistics agencies. They will not handle other kinds of seasonality, such as that found in daily, hourly, or weekly data. We will use the latest implementation of this group of methods, known as “X-13ARIMA-SEATS”. For the methods discussed in this section, you will need to have installed the seasonal package in R.
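A minimal setup sketch for the requirement above. It installs the seasonal package only if it is missing and then asks the package to check that its X-13 binary is usable. Loading fpp3 is an assumption: the text does not name the package that supplies model(), components() and the example data, but the code in this section relies on functions of that kind.

```r
# Install 'seasonal' only if it is not already available.
if (!requireNamespace("seasonal", quietly = TRUE)) {
  install.packages("seasonal")
}

# Ask 'seasonal' to verify that the underlying X-13ARIMA-SEATS binary runs.
seasonal::checkX13()

# Assumption: the modelling and plotting functions used in this section
# come from the fpp3 collection of packages.
library(fpp3)
```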
The X-11 method originated in the US Census Bureau and was further developed by Statistics Canada. It is based on classical decomposition, but includes many extra steps and features in order to overcome the drawbacks of classical decomposition that were discussed in the previous section. In particular, trend-cycle estimates are available for all observations including the end points, and the seasonal component is allowed to vary slowly over time. X-11 also handles trading day variation, holiday effects and the effects of known predictors. There are methods for both additive and multiplicative decomposition. The process is entirely automatic and tends to be highly robust to outliers and level shifts in the time series. The details of the X-11 method are described in Dagum & Bianconcini (2016).
# Requires the seasonal package; us_retail_employment was created earlier in the chapter.
x11_dcmp <- us_retail_employment %>%
  model(x11 = X_13ARIMA_SEATS(Employed ~ x11())) %>%
  components()
autoplot(x11_dcmp) +
  labs(title = "Decomposition of total US retail employment using X-11.")
Compare this decomposition with the STL decomposition shown in Figure 3.7 and the classical decomposition shown in Figure 3.13. The X-11 trend-cycle has captured the sudden fall in the data due to the 2007–2008 global financial crisis better than either of the other two methods (where the effect of the crisis has leaked into the remainder component). Also, the unusual observation in 1996 is now more clearly seen in the X-11 remainder component.
Figure 3.15 shows the trend-cycle component and the seasonally adjusted data, along with the original data. The seasonally adjusted data is very similar to the trend-cycle component in this example, so it is hard to distinguish them on the plot.
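The closeness of the two lines can also be checked numerically. The sketch below is illustrative only: it rebuilds the decomposition and summarises the absolute gap between the seasonally adjusted series and the trend-cycle. The definition of us_retail_employment is an assumption (retail trade employment from 1990 onwards, taken from the us_employment data set), as is loading everything through fpp3; the names gap and gap_summary are hypothetical.

```r
library(fpp3)

# Assumed definition of the series used in this section.
us_retail_employment <- us_employment %>%
  filter(year(Month) >= 1990, Title == "Retail Trade") %>%
  select(-Series_ID)

# X-11 decomposition, as in the text.
x11_dcmp <- us_retail_employment %>%
  model(x11 = X_13ARIMA_SEATS(Employed ~ x11())) %>%
  components()

# Gap between seasonally adjusted data and trend-cycle, in thousands of persons.
gap_summary <- x11_dcmp %>%
  as_tibble() %>%
  mutate(gap = season_adjust - trend) %>%
  summarise(
    mean_abs_gap = mean(abs(gap)),
    max_abs_gap = max(abs(gap))
  )
gap_summary
```

Small values relative to the level of the series would be consistent with the two lines being hard to tell apart on the plot.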
# Overlay the original data, the seasonally adjusted series and the trend-cycle.
x11_dcmp %>%
  ggplot(aes(x = Month)) +
  geom_line(aes(y = Employed, colour = "Data")) +
  geom_line(aes(y = season_adjust, colour = "Seasonally Adjusted")) +
  geom_line(aes(y = trend, colour = "Trend")) +
  labs(y = "Persons (thousands)", title = "Total employment in US retail") +
  scale_colour_manual(
    values = c("gray", "#0072B2", "#D55E00"),
    breaks = c("Data", "Seasonally Adjusted", "Trend")
  )
Seasonal plots and seasonal sub-series plots of the seasonal component can help us visualise how that component varies over time. Figure 3.16 shows a seasonal sub-series plot of the seasonal component from Figure 3.14. In this case, there are only small changes over time.
x11_dcmp %>%
  gg_subseries(seasonal)
“SEATS” stands for “Seasonal Extraction in ARIMA Time Series” (ARIMA models are discussed in Chapter 9). This procedure was developed at the Bank of Spain, and is now widely used by government agencies around the world. The details are beyond the scope of this book. However, a complete discussion of the method is available in Dagum & Bianconcini (2016).
# Same series as before, decomposed with SEATS instead of X-11.
seats_dcmp <- us_retail_employment %>%
  model(seats = X_13ARIMA_SEATS(Employed ~ seats())) %>%
  components()
autoplot(seats_dcmp) +
  labs(title = "Decomposition of total US retail employment using SEATS")
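One way to see how much the choice of method matters for this series is to put the two seasonally adjusted series side by side. The following is a sketch under stated assumptions, not a procedure from the text: the definition of us_retail_employment is assumed, both decompositions are assumed to return a season_adjust column, and the names x11_sa, seats_sa and sa_compare are hypothetical.

```r
library(fpp3)

# Assumed definition of the series used in this section.
us_retail_employment <- us_employment %>%
  filter(year(Month) >= 1990, Title == "Retail Trade") %>%
  select(-Series_ID)

# Seasonally adjusted series from X-11.
x11_sa <- us_retail_employment %>%
  model(x11 = X_13ARIMA_SEATS(Employed ~ x11())) %>%
  components() %>%
  as_tibble() %>%
  select(Month, sa_x11 = season_adjust)

# Seasonally adjusted series from SEATS.
seats_sa <- us_retail_employment %>%
  model(seats = X_13ARIMA_SEATS(Employed ~ seats())) %>%
  components() %>%
  as_tibble() %>%
  select(Month, sa_seats = season_adjust)

# Join by month and summarise how far apart the two adjustments are.
sa_compare <- left_join(x11_sa, seats_sa, by = "Month") %>%
  mutate(abs_diff = abs(sa_x11 - sa_seats))

sa_compare %>%
  summarise(mean_abs_diff = mean(abs_diff), max_abs_diff = max(abs_diff))
```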
The X_13ARIMA_SEATS() function calls the seasonal package, which has many options for handling variations of X-11 and SEATS. See the package website for a detailed introduction to the options and features available.
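As an illustration of passing such options, the sketch below requests an additive X-11 decomposition instead of the automatically chosen one. It assumes that the terms on the right-hand side of the formula correspond to X-13 spec names with their arguments, here transform(`function` = "none") to switch off the transformation and x11(mode = "add") for additive mode; check the function's documentation before relying on these. The definition of us_retail_employment and the name fit_add are also assumptions.

```r
library(fpp3)

# Assumed definition of the series used in this section.
us_retail_employment <- us_employment %>%
  filter(year(Month) >= 1990, Title == "Retail Trade") %>%
  select(-Series_ID)

# Additive X-11: no transformation of the data, additive seasonal mode.
fit_add <- us_retail_employment %>%
  model(x11_add = X_13ARIMA_SEATS(
    Employed ~ transform(`function` = "none") + x11(mode = "add")
  ))

# Summary of the fitted specification, then the extracted components.
report(fit_add)
x11_add_dcmp <- components(fit_add)
x11_add_dcmp
```

Comparing autoplot(x11_add_dcmp) with the earlier X-11 figure is one way to judge whether the additive form suits this series.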