I have been trying to get a deeper understanding of convolutional operation as I am implementing a convolutional neural network. But I am stuck while trying to calculate the backward pass or deconvolution.
Lets say the input is a 3 dimensional RGB image with dimension 3x7x7
The filter has the dimension 3x3x3
. On convolving with stride set to 2 we will get an output of dimension 3x3
.
Now here is my problem. I have read that deconvolution is the convolution of the output with flipped kernel. But on flipping the kernel, it will still be of dimension 3x3x3
and output is of dimension 3x3
which . The input was of dimension 3x7x7
. So, how is deconvolution calculated ?
3x3x3
filter shape? – Maxim