python - Which seeds have to be set where to realize 100% reproducibility of training results in tensorflow?

Question

In a general tensorflow setup like

model = construct_model()
with tf.Session() as sess:
    train_model(sess)

Where construct_model() contains the model definition including random initialization of weights (tf.truncated_normal) and train_model(sess) executes the training of the model -

Which seeds do I have to set where to ensure 100% reproducibility between repeated runs of the code snippet above? The documentation for tf.random.set_random_seed may be concise, but left me a bit confused. I tried:

tf.set_random_seed(1234)
model = construct_model()
    with tf.Session() as sess:
        train_model(sess)

But got different results each time.

You also need to remove parallelism from your computation because that is often non-deterministic, turn off GPU and use sess = tf.Session(config=tf.ConfigProto(inter_op_parallelism_threads=1,intra_op_parallelism_threads=1) — Yaroslav Bulatov
Also, some non-determinism is caused by using modern instruction sets like SSE (see here ), so to get 100% reproducibility you may need to recompile TF without using SSE — Yaroslav Bulatov
Just for clarification, the above sess = tf.Session... in the comments does not turn off the GPU, as observed by watch nvidia-smi (in the case of an nvidia gpu, as on AWS EC2 p2.xlarge instances) — Shadi

eugen eugen · Accepted Answer · 2019-10-16T08:33:14

The best solution which works as of today with GPU is to install tensorflow-determinism with the following:

pip install tensorflow-determinism

Then include the following code to your code

import tensorflow as tf
import os
os.environ['TF_DETERMINISTIC_OPS'] = '1'

source: https://github.com/NVIDIA/tensorflow-determinism

python - Which seeds have to be set where to realize 100% reproducibility of training results in tensorflow?

3 Answers