6
votes

I am starting out with deep learning using Keras and TensorFlow. At the very first stage I am stuck on a doubt: I use tf.contrib.layers.flatten (API 1.8) for flattening an image (which could be multichannel as well).

How is this different from using the flatten function from numpy? How does this affect training? I can see that tf.contrib.layers.flatten takes longer than numpy's flatten. Is it doing something more?

This is a closely related question, but there the accepted answer involves Theano and does not resolve my doubt exactly.

Example: Let's say I have training data of shape (10000, 2, 96, 96). Now I need the output to be of shape (10000, 18432). I can do this using tensorflow's flatten, or with numpy like

X_reshaped = X_train.reshape(X_train.shape[0], -1)  # -1 infers 2 * 96 * 96 = 18432

What difference does this make in training, and which is the best practice?
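
For concreteness, a minimal numpy-only check of the reshape above (dummy data, shapes as in the question):

import numpy as np

# dummy data shaped like the training set: (num_images, channels, height, width)
X_train = np.zeros((10000, 2, 96, 96), dtype=np.float32)

# keep the batch axis, let numpy infer the rest: 2 * 96 * 96 = 18432
X_reshaped = X_train.reshape(X_train.shape[0], -1)
print(X_reshaped.shape)  # (10000, 18432)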

What do you get when you run X_reshaped.print()? – Mohammad Athar
Hi, (10000, 18432) is the shape of X_reshaped. – Haramoz
Trying to understand your network: am I right that your training shape (10000, 2, 96, 96) refers to (num_images, num_colourchannels, x_pixel, y_pixel)? On several different occasions I have seen shapes as (num_images, x_pixel, y_pixel, num_colourchannels). Does your choice make a difference, and how did you motivate it? Thanks! – NeStack
Ah, you are right, both are possible. It does not make a difference if processed correctly; it is only a matter of your Keras settings. Simply edit your keras.json file (in <yourUserFolder>/.keras) and set the default configuration to 'channels_first' or 'channels_last'. This setting will then only apply to your machine. – Haramoz

4 Answers

6
votes

The biggest difference between np.flatten and tf.layers.flatten (or tf.contrib.layers.flatten) is that numpy operations are applicable only to static n-dimensional arrays, while tensorflow operations can work with dynamic tensors. Dynamic in this case means that the exact shape will be known only at runtime (during training or testing).

So my recommendation is pretty simple:

  • If the input data is a static numpy array, e.g. in pre-processing, use np.flatten. This avoids unnecessary overhead and returns a numpy array as well.
  • If the data is already a tensor, use any of the flatten ops provided by tensorflow. Between those, tf.layers.flatten is the better choice, since the tf.layers API is more stable than tf.contrib.* (see the sketch after this list).
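
A minimal sketch of the dynamic case, assuming TF 1.x (where tf.layers.flatten is available) and the shapes from the question; the batch dimension is left as None, so the exact shape is known only at runtime:

import numpy as np
import tensorflow as tf  # assuming TF 1.x, as in the question

# batch size unknown until runtime
x = tf.placeholder(tf.float32, shape=(None, 2, 96, 96))
y = tf.layers.flatten(x)  # trailing axes collapsed, batch axis kept
print(y.shape)  # (?, 18432)

with tf.Session() as sess:
    batch = np.zeros((32, 2, 96, 96), dtype=np.float32)
    print(sess.run(y, feed_dict={x: batch}).shape)  # (32, 18432)
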
5
votes
  • Use numpy directly on your data, without the participation of a neural network. This is for preprocessing and postprocessing only.
  • Use TF or Keras layers inside models if this operation is needed for some reason in the model. This will ensure model connectivity and proper backpropagation.

Models are symbolic graphs meant to create neural networks that can be trained. Backpropagation will work properly only when you have a graph that is properly connected from input to output.

If you don't intend to create a network, don't use a TF layer. If your goal is just to flatten an array, you don't need a neural network.

Now, if inside a model you need to change the format of the data without losing the connection and backpropagation, then go for the flatten layer.
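
For example, a minimal sketch with the Keras API (assuming TF 1.x with the bundled Keras; the layer sizes are illustrative, not from the question):

from tensorflow import keras  # assuming TF 1.x with the bundled Keras API

# Flatten lives inside the model, so the graph stays connected from
# input to output and backpropagation flows through it.
model = keras.Sequential([
    keras.layers.Flatten(input_shape=(2, 96, 96)),  # (2, 96, 96) -> 18432
    keras.layers.Dense(128, activation='relu'),
    keras.layers.Dense(10, activation='softmax'),
])
model.compile(optimizer='adam', loss='categorical_crossentropy')
model.summary()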

1
vote

The flatten function in numpy does a complete array flattening, meaning that you end up with a single axis of data (one dimension only). For example:

import numpy as np

a = np.arange(20).reshape((5, 4))
print(a)                  # 2-d array of shape (5, 4)

print(a.flatten().shape)  # (20,): every axis collapsed into one

In the previous example, you end up with a 1-d array of 20 elements. In tensorflow, the flatten layer (tf.layers.flatten) preserves the batch axis (axis 0), so for the same input you would still get a shape of (5, 4).
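
A sketch of the tensorflow counterpart (assuming TF 1.x), showing the batch axis being preserved:

import numpy as np
import tensorflow as tf  # assuming TF 1.x

a = np.arange(20).reshape((5, 4)).astype(np.float32)

flat2d = tf.layers.flatten(tf.constant(a))                   # (5, 4): nothing to collapse
flat3d = tf.layers.flatten(tf.constant(a.reshape(5, 2, 2)))  # (5, 2, 2) -> (5, 4)

with tf.Session() as sess:
    print(sess.run(flat2d).shape)  # (5, 4)
    print(sess.run(flat3d).shape)  # (5, 4)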

In any case, there is no effect on training if you use flatten in an equivalent way. However, you should avoid using numpy when working with tensorflow, since almost all numpy operations have tensorflow counterparts. Tensorflow and numpy rely on different runtime libraries, and combining both can be inefficient at runtime.

Moreover, avoid using layers from the contrib package when they already exist in the main package (use tf.layers.flatten instead of tf.contrib.layers.flatten).

For a more general performance comparison between numpy and tensorflow, have a look at this question: Tensorflow vs. Numpy Performance

1
vote

Difference

When you use tensorflow's flatten, it gets added as an operation (op) in the graph, and it can operate only on tensors. Numpy, on the other hand, works on actual numpy arrays. The usage is completely different.

Usage

You would use the tensorflow op if the flattening is an operation in the training process, such as reshaping before feeding the data to the next layer.

You would use the numpy op when you want to operate on an actual value at that moment, like reshaping the output for calculating accuracy at the end of a training step.

So if you had a task of

tensor A -> reshape -> matrix_mul

If you use tensorflow for the reshape, you can run matrix_mul directly from the session in a single call.

If you use numpy, however, you'd have to split the work across two session calls:

  1. You run the graph to compute tensor A (first session call).

  2. You reshape the result in numpy.

  3. You run matrix_mul by "feeding" in the reshaped array (second session call).
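
A minimal sketch of both pipelines, assuming TF 1.x; the tensors and shapes are illustrative:

import numpy as np
import tensorflow as tf  # assuming TF 1.x

a = tf.constant(np.arange(12, dtype=np.float32).reshape(3, 2, 2))  # tensor A
w = tf.constant(np.ones((4, 1), dtype=np.float32))

# all-tensorflow: reshape is an op in the graph, one session call
result = tf.matmul(tf.reshape(a, (3, 4)), w)

# numpy in between: a placeholder to feed the reshaped array back in
b = tf.placeholder(tf.float32, shape=(3, 4))
result_fed = tf.matmul(b, w)

with tf.Session() as sess:
    # single call, reshape handled inside the graph
    print(sess.run(result))

    # two calls: compute A, reshape in numpy, feed it back
    a_val = sess.run(a)
    print(sess.run(result_fed, feed_dict={b: a_val.reshape(3, 4)}))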

Performance

I haven't benchmarked anything, but I'd say that for a standalone reshape operation numpy would be faster (ignoring the GPU), whereas in a process where the reshape is an intermediate op, tensorflow should be faster.