4 votes

In papers such as ImageNet Classification with Deep Convolutional Neural Networks

http://www.cs.toronto.edu/~fritz/absps/imagenet.pdf

the training method seems to be basic backpropagation with stochastic gradient descent.
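
For concreteness, here is a minimal sketch of what that training loop amounts to: backpropagation computes the gradient, and stochastic gradient descent applies it one example at a time. It uses a toy linear least-squares model, illustrative only and not the paper's actual network:

    import numpy as np

    # Toy linear least-squares model; the "network" is just w, so the
    # gradient is exact and short. Illustrative only, not the paper's CNN.
    np.random.seed(0)
    X = np.random.randn(100, 3)
    true_w = np.array([1.0, -2.0, 0.5])
    y = X @ true_w

    w = np.zeros(3)
    lr = 0.1
    for epoch in range(50):
        for i in np.random.permutation(len(X)):   # stochastic: one example at a time
            grad = (X[i] @ w - y[i]) * X[i]       # gradient of 0.5 * (pred - y)^2
            w -= lr * grad                        # the SGD update
    print(w)   # converges towards true_w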

Even though CNNs are counted among deep neural networks, is that purely because of the large number of hidden layers? And does the backpropagation here fall under the category of deep learning simply because the network is deep, even though it does not follow the same pattern as, say, a DBN trained with greedy layer-wise training, a 'true' deep learning technique?

Thanks for the help and advice.

3
These networks are called deep because they have more hidden layers than their predecessors. They are successful because people have found more ways to deal with the vanishing gradient problem in deeper NN models (ReLU, Dropout, Maxout, Response Normalization...). They are viable thanks to GPUs. - erogol
@Erogol Do you know any recent summary paper that describes the techniques addressing the vanishing gradient problem? - ziggystar

3 Answers

2 votes

If you read the Wikipedia page on Deep Learning, it says: "Deep Learning is a branch of machine learning based on a set of algorithms that attempt to model high-level abstractions in data by using multiple processing layers, with complex structures or otherwise, composed of multiple non-linear transformations".

A CNN has multiple layers of non-linear transformations, so it qualifies as a Deep Learning model.
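
A quick way to see why the non-linear part (and not just the layer count) matters: stacked purely linear layers collapse into a single linear map, while non-linear ones do not. A minimal numpy sketch with random, purely illustrative weights:

    import numpy as np

    np.random.seed(0)
    W1 = np.random.randn(3, 4)
    W2 = np.random.randn(2, 3)
    x = np.random.randn(4)

    # Two linear layers equal one linear layer with matrix W2 @ W1 ...
    print(np.allclose(W2 @ (W1 @ x), (W2 @ W1) @ x))          # True
    # ... but a non-linear transformation in between breaks the collapse.
    print(np.allclose(W2 @ np.tanh(W1 @ x), (W2 @ W1) @ x))   # False in general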

Also, in the MIT Press book http://www.deeplearningbook.org/, CNNs are presented as Deep Learning models.

There is an important difference between a DBN and a CNN: the first is an unsupervised model and the second is not; besides, a DBN is typically used for pre-initialization of the weights.

If you read about RNNs or LSTMs, which are also Deep Learning models, you will find that both are basically trained with a modified version of backpropagation called backpropagation through time.
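
For intuition, here is a minimal sketch of backpropagation through time on a scalar RNN (tanh hidden state, squared error on the final step; all sizes and numbers are illustrative). The backward pass is ordinary backpropagation applied through the unrolled time steps:

    import numpy as np

    np.random.seed(0)
    T = 4                       # sequence length
    x = np.random.randn(T)      # inputs
    target = 1.0

    w_x, w_h = 0.5, 0.8         # input and recurrent weights
    h = np.zeros(T + 1)         # hidden states; h[0] is the initial state
    for t in range(T):          # forward pass: unroll the recurrence
        h[t + 1] = np.tanh(w_x * x[t] + w_h * h[t])

    loss = 0.5 * (h[T] - target) ** 2

    dh = h[T] - target          # error enters only at the last step
    gw_x = gw_h = 0.0
    for t in reversed(range(T)):
        da = dh * (1 - h[t + 1] ** 2)   # through the tanh
        gw_x += da * x[t]               # gradients accumulate across time
        gw_h += da * h[t]
        dh = da * w_h                   # propagate to the previous hidden state

    print(loss, gw_x, gw_h)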

So, remember the key concept: multiple non-linear transformations to model high-level abstractions in data.

Also, Deep Learning refers to the model, not to the training method.

0 votes

According to my knowledge:

1 neuron, 1 layer -> known as a perceptron

2 or more neurons, 1 layer, 1 input -> the equation of a line, y = w*x + w0

2 or more neurons, 1 layer, multiple inputs -> a hyperplane

When you apply a sigmoid to the outputs of these neurons and combine them in another perceptron (or several neurons), you get a non-linear combination of hyperplanes.
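
A minimal sketch of that idea (hand-picked weights, nothing learned): two hyperplane units squashed by sigmoids, then combined by a third unit into a non-linear decision function:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    x = np.array([0.5, -1.2])                   # one 2-d input instance
    W1 = np.array([[1.0, -1.0], [0.5, 2.0]])    # two hyperplanes (one per row)
    b1 = np.array([0.0, -0.5])
    h = sigmoid(W1 @ x + b1)                    # non-linear unit outputs

    w2 = np.array([1.5, -2.0])                  # combine the two units
    b2 = 0.1
    y = sigmoid(w2 @ h + b2)                    # non-linear combination of hyperplanes
    print(y)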

What makes a CNN a CNN, rather than a plain deep neural network, is that you learn weights over a local neighborhood that can be applied at any position in the image. So there is weight sharing between neuron activations.

  • Suppose you had N different 20x20 grayscale images and 5 hidden units in 1 layer.
  • If you implemented a fully connected deep neural network, you would learn a 400x5 weight matrix - hence 2000 parameters - for the first hidden layer, and the output would be a 5-dimensional vector for each of the N images.
  • But in a CNN you decide on a reasonable patch size within these 20x20 images, say 4x4, and learn 5 different 4x4 filters - hence 80 parameters. The output of this first hidden layer is then 5 different 17x17 maps for each of the N images; viewed another way, it is a 1445-dimensional vector per image (the arithmetic is checked in the sketch below). So you learn fewer parameters, but the output has more dimensions to learn from.
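
Here is a quick sanity check of that arithmetic (bias terms ignored, as in the counts above; weights are random and illustrative):

    import numpy as np

    H = W = 20; n_units = 5; k = 4

    fc_params = H * W * n_units     # 400 * 5 = 2000 for the fully connected layer
    conv_params = k * k * n_units   # 16 * 5 = 80 shared weights for the CNN

    # A 'valid' 4x4 convolution over a 20x20 image gives a 17x17 map per filter.
    img = np.random.randn(H, W)
    filt = np.random.randn(k, k)
    out = np.array([[np.sum(img[i:i + k, j:j + k] * filt)
                     for j in range(W - k + 1)]
                    for i in range(H - k + 1)])

    print(fc_params, conv_params, out.shape, n_units * out.size)
    # -> 2000 80 (17, 17) 1445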

So when I look at your questions,

  1. If there were only 1 layer and the classification/regression were made right after that single layer, it would not be called a DBN. But it could still be called a CNN, because the concepts of 'convolution' and 'neural network' are both there. And of course with a single layer there would be no 'backpropagation'.

  2. Backpropagation is needed when more than 1 layer is present, because the idea is to propagate some error back into the middle layers, where we have no 'real/expected output'; we only have ground truth for the last layer, where the regression/classification happens (a minimal sketch follows this list). Hence, if I understood your question correctly, the backprop here falls under the deep network category.

  3. We cannot say 'CNNs do not follow the same pattern as DBNs', because they are certainly deep networks, just with weight-sharing functionality added.
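
To illustrate point 2, a minimal sketch of how the output error, the only place with ground truth, is assigned to a hidden layer via the chain rule (random, illustrative weights):

    import numpy as np

    np.random.seed(0)
    x = np.random.randn(3)          # input
    t = 1.0                         # ground truth exists for the output only

    W1 = np.random.randn(4, 3)      # hidden layer weights
    w2 = np.random.randn(4)         # output layer weights

    h = np.tanh(W1 @ x)             # hidden activations (no target for these)
    y = w2 @ h                      # prediction
    loss = 0.5 * (y - t) ** 2

    dy = y - t                      # error at the layer that has ground truth
    gw2 = dy * h                    # output layer gradient
    dh = dy * w2                    # the 'error' backpropagated to the hidden layer
    gW1 = np.outer(dh * (1 - h ** 2), x)
    print(loss, gw2, gW1)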

I hope this answers your questions. Additionally, I think it is worth pointing out the difference between a DBN and a DNN here.

Quoting from another website (https://www.quora.com/Deep-Learning/What-are-the-difference-and-relevance-between-DNN-and-DBN)

“Deep Belief Networks construct beliefs (probabilistic relationships between instances) based on unsupervised data, and then apply those relationships to a problem when presented with supervised data.

Basically, learn on unsupervised, where there is more data to learn from, then use on a problem.

Deep neural networks are simply large multilayer neural networks.”

0 votes

Deep learning techniques are a recent technology in Artificial Intelligence. In particular, Convolutional Neural Networks (CNNs) are very effective at pattern recognition, such as object or face recognition. Many libraries are available for CNNs, e.g. iTorch, Theano, and DIGITS.