In Tensorflow models example (slim) , when we specify the batch size is it per GPU or is that the total batch size split among the gpus ?
python train_image_classifier.py --dataset_name=flowers --dataset_split_name=train --dataset_dir=/tmp/flowers-data --model_name=inception_v3 --log_every_n_steps=10 --batch_size=64 --max_number_of_steps=100 --num_clones=4