0 votes

When looking at the RNN example in the TensorFlow tutorial, I'm having an issue with how the initial state is constructed. At graph-build time, the graph is limited to handling input of one fixed batch size. This is a problem for me because I want to be able to feed in a single example and get a prediction for that single example.

The part of the code that restricts this is:

# batch_size is baked into the graph when this tensor is created
initial_state = state = tf.zeros([batch_size, lstm.state_size])

So my question is: how can I change the example to use a variable batch size, so that I can train the model with a full batch size and then feed in a single example for prediction?


2 Answers

2 votes

This is how I do it: pass the batch_size in as a placeholder, like this:

batch_size = tf.placeholder(tf.int32)  # scalar, fed at run time
init_state = cell.zero_state(batch_size, tf.float32)

where cell is one of the RNN cells (BasicLSTMCell, BasicGRUCell, MultiRNNCell, etc.). However, if you want to preserve the state across multiple batches, this won't work, because the state tensor's size has to stay constant.
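
For context, here is a minimal sketch of how that fits together with dynamic_rnn in TF 1.x graph mode. The layer sizes and placeholder names are just assumptions for illustration:

import tensorflow as tf  # TF 1.x graph-mode API

# Hypothetical sizes, for illustration only.
input_dim = 50
num_units = 128

# [batch, time, features]; both batch and time are left dynamic.
inputs = tf.placeholder(tf.float32, [None, None, input_dim])
batch_size = tf.placeholder(tf.int32, [])

cell = tf.nn.rnn_cell.BasicLSTMCell(num_units)
init_state = cell.zero_state(batch_size, tf.float32)

# dynamic_rnn unrolls over the time dimension at run time.
outputs, final_state = tf.nn.dynamic_rnn(cell, inputs, initial_state=init_state)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # Training: feed a batch, e.g. shape [32, T, input_dim], with batch_size: 32.
    # Prediction: feed one example of shape [1, T, input_dim], with batch_size: 1.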

0 votes

The TensorFlow text generation tutorial explains how to do this (now on TF 2.0). Because batch_size becomes part of the built model, you have to rebuild the model and restore it from the saved weights with the new batch size:

https://www.tensorflow.org/tutorials/text/text_generation#restore_the_latest_checkpoint

To keep this prediction step simple, use a batch size of 1.

Because of the way the RNN state is passed from timestep to timestep, the model only accepts a fixed batch size once built.

To run the model with a different batch_size, we need to rebuild the model and restore the weights from the checkpoint.

# Rebuild the same architecture with batch_size=1, then restore the trained weights.
model = build_model(vocab_size, embedding_dim, rnn_units, batch_size=1)
model.load_weights(tf.train.latest_checkpoint(checkpoint_dir))
model.build(tf.TensorShape([1, None]))  # [batch=1, sequence_length=None]
model.summary()

I don't know for sure why this is required, but I've always assumed it's because batching in recurrent layers means managing multiple parallel hidden-state pipelines, so the model preallocates them for a fixed batch size.
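
That preallocation shows up in the tutorial's model definition: the recurrent layer is stateful, so its state tensors are allocated with the batch size baked in. A rough sketch of what build_model looks like there (details may differ from the current tutorial):

import tensorflow as tf

def build_model(vocab_size, embedding_dim, rnn_units, batch_size):
    # stateful=True makes the GRU keep its hidden state between calls,
    # which requires state tensors of shape [batch_size, rnn_units]
    # allocated up front; that is what fixes the batch size.
    return tf.keras.Sequential([
        tf.keras.layers.Embedding(vocab_size, embedding_dim,
                                  batch_input_shape=[batch_size, None]),
        tf.keras.layers.GRU(rnn_units,
                            return_sequences=True,
                            stateful=True,
                            recurrent_initializer='glorot_uniform'),
        tf.keras.layers.Dense(vocab_size),
    ])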