ARIMA Modelling: Theory

Introduction

ARIMA models, short for AutoRegressive Integrated Moving Average, are a great fit for forecasting time series data with a complex nature. Unlike basic regression models, which may struggle to capture that complexity with just a few variables (usually due to autocorrelation, trend and seasonality), ARIMA handles autocorrelation, trend and seasonality through a built-in differencing step (the "Integrated" part of ARIMA); this removes the need for specifying explanatory variables and extra transformations. Examples of applications include finance, weather prediction, and even anticipating website traffic.

Define the model

Let's build the ARIMA model from scratch. As discussed, the ARIMA model is made of three parts: an autoregressive part, an integrated part and a moving average part.

For example, given some time series

$$X = \{X_1, X_2, \ldots\}$$

The autoregressive (AR) component says that an observation $X_t$ at a certain point in time $t$ can be described as a linear combination of its lagged observations (i.e. prior time points):

$$X_t = \alpha_1 X_{t-1} + \alpha_2 X_{t-2} + \ldots$$

where $\alpha_i$ are the parameters or coefficients of such a regression.
(Note: if you find the terms linear combination and linear regression confusing, a linear regression is a model that outputs predictions based on a linear combination of input features.)
Using the lag operator ($L^p X_t = X_{t-p}$), you can also express the above as:
$$X_t = \alpha_1 X_{t-1} + \alpha_2 X_{t-2} + \ldots + \alpha_p X_{t-p}$$

$$X_t = \alpha_1 L X_t + \alpha_2 L^2 X_t + \ldots + \alpha_p L^p X_t$$

$$X_t = \left( \sum_{i=1}^{p} \alpha_i L^i \right) X_t \hspace{1cm} (1)$$
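A quick aside: in code, the lag operator is just a shift of the series. Here is a minimal sketch with pandas (the toy series is illustrative):

```python
import pandas as pd

# A toy series X_1, ..., X_5
x = pd.Series([1.0, 2.0, 3.0, 4.0, 5.0])

# L^p X_t = X_{t-p}: applying the lag operator p times
# shifts the series by p positions.
lag1 = x.shift(1)  # L X_t
lag2 = x.shift(2)  # L^2 X_t

print(pd.DataFrame({"X_t": x, "L X_t": lag1, "L^2 X_t": lag2}))
```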

Optimising the AR part of the ARIMA model means optimising the number of previous terms, that is, the number of time lags included in the linear combination. We call $p$ the order of the autoregressive model.
For example, when we say the AR part has an order of two, we symbolise it as $AR(2)$ and express it as:

$$X_t = \alpha_1 L X_t + \alpha_2 L^2 X_t$$
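To make this concrete, here is a minimal sketch that simulates an $AR(2)$ process with numpy, with a white-noise error term $\epsilon_t$ added at each step (the coefficients 0.5 and -0.3 are illustrative, chosen so the process stays stationary):

```python
import numpy as np

rng = np.random.default_rng(42)

alpha1, alpha2 = 0.5, -0.3  # illustrative AR(2) coefficients
n = 200

eps = rng.normal(size=n)  # white-noise error terms
x = np.zeros(n)

# X_t = alpha_1 * X_{t-1} + alpha_2 * X_{t-2} + eps_t
for t in range(2, n):
    x[t] = alpha1 * x[t - 1] + alpha2 * x[t - 2] + eps[t]
```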

Now, alternatively, $X_t$ can also be expressed as a combination of previous error terms. This is described in the Moving Average (MA) part of the ARIMA model:

$$X_t = \epsilon_t + \theta_1 \epsilon_{t-1} + \theta_2 \epsilon_{t-2} + \ldots + \theta_q \epsilon_{t-q}$$

$$X_t = \epsilon_t + \theta_1 L \epsilon_t + \theta_2 L^2 \epsilon_t + \ldots + \theta_q L^q \epsilon_t$$

$$X_t = \epsilon_t + \left( \sum_{i=1}^{q} \theta_i L^i \right) \epsilon_t \hspace{1cm} (2)$$

where:
$\theta_i$ are the parameters/coefficients of the linear combination of the error terms,
$\epsilon_t$ is the error term at time $t$,
$q$ is the order of this moving average.
Note that the term moving average can be misleading here: in this context it simply refers to the moving window of previous errors; nothing is actually averaged.
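As with the AR part, a short numpy sketch makes the $MA(2)$ case concrete (the coefficients 0.4 and 0.2 are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

theta1, theta2 = 0.4, 0.2  # illustrative MA(2) coefficients
n = 200

eps = rng.normal(size=n)  # white-noise error terms
x = np.zeros(n)

# X_t = eps_t + theta_1 * eps_{t-1} + theta_2 * eps_{t-2}
for t in range(2, n):
    x[t] = eps[t] + theta1 * eps[t - 1] + theta2 * eps[t - 2]
```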

Since both (1) and (2) describe $X_t$, we can combine them into a single equation:

$$X_t = \left( \sum_{i=1}^{p} \alpha_i L^i \right) X_t + \epsilon_t + \left( \sum_{i=1}^{q} \theta_i L^i \right) \epsilon_t$$

This is often rearranged as:

$$X_t - \left( \sum_{i=1}^{p} \alpha_i L^i \right) X_t = \epsilon_t + \left( \sum_{i=1}^{q} \theta_i L^i \right) \epsilon_t$$

$$\left( 1 - \sum_{i=1}^{p} \alpha_i L^i \right) X_t = \left( 1 + \sum_{i=1}^{q} \theta_i L^i \right) \epsilon_t$$

We say that, given time series data $X_t$, where $t$ is an integer index and the $X_t$ are real numbers, an $ARMA(p, q)$ model is given by:

$$\left( 1 - \sum_{i=1}^{p} \alpha_i L^i \right) X_t = \left( 1 + \sum_{i=1}^{q} \theta_i L^i \right) \epsilon_t$$

The "Integrated" part from the introduction is the final ingredient: differencing the series $d$ times before applying the model, which the lag operator expresses as $(1 - L)^d$. This gives the full $ARIMA(p, d, q)$ model:

$$\left( 1 - \sum_{i=1}^{p} \alpha_i L^i \right) (1 - L)^d X_t = \left( 1 + \sum_{i=1}^{q} \theta_i L^i \right) \epsilon_t$$
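In practice, you rarely solve these equations by hand; a library estimates the coefficients for you. Here is a minimal sketch using statsmodels (the generated series and the order (2, 1, 2) are illustrative):

```python
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

rng = np.random.default_rng(1)

# An illustrative series: a random walk, so one round of
# differencing (d = 1) makes it stationary.
series = np.cumsum(rng.normal(size=300))

# order = (p, d, q): AR order, differencing order, MA order
model = ARIMA(series, order=(2, 1, 2))
result = model.fit()

print(result.summary())          # estimated alpha and theta coefficients
print(result.forecast(steps=5))  # forecast the next five points
```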

Why do we need both an AR component and an MA component?

The AR(p) component is a combination of the past 'p' values of the series, modelling how each observation depends on its own history; it therefore captures the trend and autocorrelation in the series.
Meanwhile, the MA(q) part is a combination of the past 'q' forecast errors, so it instead captures "shocks" (unexpected changes) to the series. The impact of a shock decreases over time (called "shock decay"); thus, the effect of an error made at a particular point in time diminishes as we move further away from that point.
In short, the AR(p) part describes the overall, long-term changes, whereas the MA(q) part describes short-term changes. Together, they allow the ARIMA model to capture a wide range of time series patterns.
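One way to see both the shock decay and the long-term versus short-term split is to feed a single unit shock into each component and watch the response. A minimal numpy sketch (the coefficients are illustrative):

```python
import numpy as np

n = 10
shock = np.zeros(n)
shock[0] = 1.0  # a single unit shock at t = 0, no other noise

# AR(1): the shock echoes through every later value,
# decaying geometrically (1, 0.6, 0.36, 0.216, ...).
alpha = 0.6
ar = np.zeros(n)
for t in range(n):
    ar[t] = (alpha * ar[t - 1] if t > 0 else 0.0) + shock[t]

# MA(1): the shock is felt only at t = 0 and t = 1, then vanishes.
theta = 0.6
ma = np.zeros(n)
for t in range(n):
    ma[t] = shock[t] + (theta * shock[t - 1] if t > 0 else 0.0)

print("AR(1) response:", np.round(ar, 3))
print("MA(1) response:", np.round(ma, 3))
```

The AR response never fully dies out (it only decays), while the MA response is exactly zero after $q$ lags; this is the long-term versus short-term division described above.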
