python - How to use a Keras RNN model to forecast for future dates or events?

Question

Here is my code fore training the complete model and saving it:

num_units = 2
activation_function = 'sigmoid'
optimizer = 'adam'
loss_function = 'mean_squared_error'
batch_size = 10
num_epochs = 100

# Initialize the RNN
regressor = Sequential()

# Adding the input layer and the LSTM layer
regressor.add(LSTM(units = num_units, activation = activation_function, input_shape=(None, 1)))

# Adding the output layer
regressor.add(Dense(units = 1))

# Compiling the RNN
regressor.compile(optimizer = optimizer, loss = loss_function)

# Using the training set to train the model
regressor.fit(x_train, y_train, batch_size = batch_size, epochs = num_epochs)
regressor.save('model.h5')

After that I have seen that most of the time people our suggesting the test dataset for checking the prediction which I have attempted as well and got good result.

But the problem is in the usage of the model that I have created. I want to have a forecast for next 30 days or every minute whatsoever. Now I have the trained model but I am not getting what I can do or what code do I use to use the model and forecast the prices for next 30 days or one minute.

Please suggest me the way out. I am stuck at this problem since a week and not able to make any successful attempts.

Here is the link of the repository where one can find the complete runnable code, the model, and the dataset: My repository link

Is there someone who can help me with the issue? Please let me know. — Jaffer Wilson
I have doubt regarding the dataset. You provides prices as input which explain how accurate you model is. There is more complexe models not able to predict evolution with such a good accuracy. I guess the second input is the date (not visible in the head (block 3) ?. I'll take a look when I have time but you can take a look at this video (youtube.com/watch?v=EqWm8A-dRYg). He is predicting BTC price with bidirectionnal LSTM and social networks sentiments. To finish, you can predict the days after (as you have a daily timestep). If you need mode days, you can create a loop. — Nicolas M.
@NicolasM.Sure.. thank you for your time and consideration. But have a look at the code so I can improve. Thanks. — Jaffer Wilson
Why downvote? I do not understand, why people not having any answer start downvoting the question or sometimes try to close it. — Jaffer Wilson
I've slightly modified the dataframe compare to what you've done. Nevertheless, I've added a shifted column to predict the next price but the prediction is still to accurate. I don't know exactly what is wrong but I guess this is due to the fact that we have 1 batch of around 700 timesteps of 1 input and the model is able to understand that there is only a shift. Everytime I tried RNN, I used multiple batches compare to this exercice so I'm also stuck :s. I'll continue to take a look — Nicolas M.

Daniel Möller Daniel Möller · Accepted Answer · 2018-02-15T12:53:58

Well, you need a stateful=True model, so you can feed it one prediction after another to get the next and keep the model thinking that each input is not a new sequence, but a sequel to the previous.

Fixing the code and training

I see in the code that there is an attempt to make your y be a shifte x (a good option for predicting the next steps). But there is also a big problem in the preprocessing here:

training_set = df_train.values
training_set = min_max_scaler.fit_transform(training_set)

x_train = training_set[0:len(training_set)-1]
y_train = training_set[1:len(training_set)]
x_train = np.reshape(x_train, (len(x_train), 1, 1))

Data for LSTM layers must be shaped as (number_of_sequences, number_of_steps,features).

So, you're clearly creating sequences of 1 step only, meaning that your LSTM is not learning sequences at all. (There is no sequence with only one step).

Assuming that your data is a single unique sequence with 1 feature, it should definitely be shaped as (1, len(x_train), 1).

Naturally, y_train should also have the same shape.

This, in its turn, will require that your LSTM layers be return_sequences=True - The only way to make y have a length in steps. Also, for having a good prediction, you may need a more complex model (because now it will be trully learning).

This done, you train your model until you get a satisfactory result.

Predicting the future

For predicting the future, you will need stateful=True LSTM layers.

Before anything, you reset the model's states: model.reset_states() - Necessary every time you're inputting a new sequence into a stateful model.

Then, first you predict the entire X_train (this is needed for the model to understand at which point of the sequence it is, in technical words: to create a state).

predictions = model.predict(`X_train`) #this creates states

And finally you create a loop where you start with the last step of the previous prediction:

future = []
currentStep = predictions[:,-1:,:] #last step from the previous prediction

for i in range(future_pred_count):
    currentStep = model.predict(currentStep) #get the next step
    future.append(currentStep) #store the future steps    

#after processing a sequence, reset the states for safety
model.reset_states()

Example

This code does this with a 2-feature sequence, a shifted future step prediction, and a method that is a little different from this answer, but based on the same principle.

I created two models (one stateful=False, for training without needing to reset states every time - never forget to reset states when you're starting a new sequence - and the other stateful=True, copying the weights from the trained model, for predicting the future)

https://github.com/danmoller/TestRepo/blob/master/TestBookLSTM.ipynb

python - How to use a Keras RNN model to forecast for future dates or events?

3 Answers

How can I create a dataset for RNNs?