16
votes

I am training deep neural networks on a GPU. If I make the samples too large, the batches too large, or the network too deep, I get an out-of-memory error. In those cases it is sometimes possible to reduce the batch size and still train.

Is it possible to calculate the GPU memory required for training, and so determine the batch size, beforehand?

UPDATE

If I print the network summary, it displays the number of "trainable parameters". Can't I estimate the memory from this value? For example, take it, multiply by the batch size, double it for gradients, etc.?
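A back-of-envelope version of that idea can be sketched as follows (a rough lower bound only; note that parameter memory does not scale with batch size, while activation memory does):

```python
def estimate_param_memory_gb(n_params, bytes_per_param=4, multiplier=3):
    # multiplier=3 roughly covers weights + gradients + one optimizer-state
    # buffer (e.g. SGD with momentum); Adam keeps two state buffers per
    # parameter, so multiplier=4 would be closer there.
    return n_params * bytes_per_param * multiplier / 1024**3

# e.g. a model with ~268M float32 parameters:
print(estimate_param_memory_gb(1024**3 // 4))  # 3.0 GB, before activations
```

This ignores activations, framework overhead, and CUDA context memory, so the true requirement is always higher.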

3
From the link Salvador gave, you also need to account for the intermediate memory that holds the image and its transforms. The total is 3 * 4 * (intermediate memory * num_images + trainable parameters) / 1024**3 GB. – user3226167
Ideally, I think it should be possible. Each layer could report the amount of memory it will occupy, given its input size and whether it is in training mode, and the model class could then compute the total by summing over its individual layers. These frameworks (PyTorch/TensorFlow) would just need to add this functionality. – saurabheights
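user3226167's formula above can be written out as a small sketch (all inputs are counts of float32 elements; 3 roughly covers values, gradients, and optimizer state, and 4 is bytes per float32):

```python
def estimate_total_memory_gb(intermediate_elems, num_images, trainable_params):
    # 3 * 4 * (intermediate memory * num_images + trainable parameters) / 1024**3
    return 3 * 4 * (intermediate_elems * num_images + trainable_params) / 1024**3

# e.g. 1M intermediate elements per image, a batch of 256 images,
# ignoring parameters for the moment:
print(estimate_total_memory_gb(1024**2, 256, 0))  # 3.0 GB
```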

3 Answers

5
votes

No, it is not possible to do this automatically, so if you want your batch to be as large as possible you need to go through some trial and error to find an appropriate size.
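The trial and error can at least be automated: binary-search for the largest batch size that does not run out of memory. A sketch, where the `fits` callable is a placeholder for one forward/backward pass wrapped in a try/except around the CUDA out-of-memory error:

```python
def max_batch_size(fits, lo=1, hi=65536):
    """Binary search for the largest batch size for which fits(bs) is True.

    `fits` is a placeholder: in practice it would run one training step at
    the given batch size and return False if an OOM error is raised.
    """
    best = 0
    while lo <= hi:
        mid = (lo + hi) // 2
        if fits(mid):
            best = mid      # mid fits; try something larger
            lo = mid + 1
        else:
            hi = mid - 1    # mid is too big; try something smaller
    return best

print(max_batch_size(lambda b: b <= 100))  # 100
```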

Stanford's CNN class provides some guidance on how to estimate the memory size, but all the suggestions there are specific to CNNs (I'm not sure what you are training).
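In the spirit of that class's sizing exercise, the per-image activation memory of a conv stack can be tallied by hand. A sketch (the layer plan is made up for illustration; it assumes 3x3, stride-1, pad-1 convolutions, which preserve spatial size):

```python
def conv_out(size, kernel, stride=1, pad=0):
    # Standard convolution output-size formula.
    return (size + 2 * pad - kernel) // stride + 1

def activation_bytes(in_hw, channel_plan, bytes_per_elem=4):
    """Sum float32 activation memory per image for a stack of 3x3,
    stride-1, pad-1 conv layers."""
    h, w = in_hw
    total = 0
    for channels in channel_plan:
        h = conv_out(h, 3, stride=1, pad=1)
        w = conv_out(w, 3, stride=1, pad=1)
        total += h * w * channels * bytes_per_elem
    return total

# e.g. a single 64-channel conv on a 224x224 input:
print(activation_bytes((224, 224), [64]))  # 12845056 bytes, ~12 MB per image
```

Multiply by the batch size (and roughly double for the backward pass) and it becomes clear why activations, not parameters, usually dominate.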

5
votes

PyTorch Lightning recently added a feature called "auto batch size" for exactly this! It computes the largest batch size that fits into the memory of your GPU :)

More info can be found in the PyTorch Lightning documentation.

Original PR: https://github.com/PyTorchLightning/pytorch-lightning/pull/1638
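At the time of that PR the feature was enabled roughly like this (a sketch, not runnable as-is: `MyLightningModule` is a placeholder for your own LightningModule subclass, and the `auto_scale_batch_size`/`tune` API reflects Lightning versions of that era and has since been moved to a separate Tuner class):

```python
import pytorch_lightning as pl

model = MyLightningModule()  # placeholder: your LightningModule subclass

# "binsearch" binary-searches for the largest batch size that fits;
# the found value is written back to the model's batch_size attribute.
trainer = pl.Trainer(auto_scale_batch_size="binsearch")
trainer.tune(model)
```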

3
votes

I think Salvador here means that it is not possible to analytically compute the best-suited batch size. However, as with all things in ML, it is just another hyperparameter that can be added to your grid search and tuned automatically. Simply evaluate your model's loss or accuracy (however you measure performance) for several batch sizes, say some powers of 2 such as 64, 256, 1024, etc., and keep the one that gives the best and most stable (least variable) result. Then use that batch size.

Note that the best batch size can depend on your model's architecture, your machine's hardware, etc. For example, if you move your modeling from a local PC to some cloud compute engine (GCP, AWS, Azure, ...), then a batch size that was too large for your PC's RAM easily becomes suitable for practically limitless cloud RAM/CPU/GPU (mind the costs).
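That search can be sketched in a few lines (the `evaluate` callable is a placeholder that would train briefly at the given batch size and return a validation loss):

```python
def pick_batch_size(evaluate, candidates=(64, 256, 1024)):
    """Return the candidate batch size with the lowest validation loss.

    `evaluate` is a placeholder: in practice it trains the model for a few
    epochs at the given batch size and returns the validation loss.
    """
    return min(candidates, key=evaluate)

# e.g. with a toy loss curve that bottoms out at 256:
print(pick_batch_size(lambda b: abs(b - 256)))  # 256
```

For a stability criterion, `evaluate` could instead return the variance of the loss over the last few epochs.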