I have created a timeseries SARIMAX model using the statsmodels library in python.
Currently, we are following the below pipeline:
- Train the model
- Make a forecast for one step in the future and use the inbuilt statsmodels functionality, to produce a confidence interval for this mean prediction.
- Repeat this process each day to predict the next day.
Our overall aim is to only train the model once per week. This means that we want to produce forecasts for more than 1 step on from the date the model was trained. As we will be doing each prediction, after the actual data for the day before has been collected, we want to use the parameters from SARIMAX to produce a prediction. This step is fairly straightforward, you just need to use the coefficents and multiply them by the first level difference for the lag components.
However I am struggling to then recreate the confidence intervals for my predictions, does anyone have any idea on the maths that statsmodels uses to create these or the maths that I should be using?
Thanks in advance, James