Linear or Non-linear is not a question of HOG or any other feature. It is simply related to number of instances + number of clusters + number of feature dimensions. In general Linear models are preferential for datasets including more feature dimensions than the instances. If the case is reversal, than you should go for non-linear like kernel SVM since it implicitly project your data into another space where again your instances are represented with more dimensions.
In most cases, you get very good number of feature dimensions by applying HOG to images. Therefore you can simply use linear models. However if you have 100000... classes and 10000000... images then HOG + Linear model will insuffice. Therfore no one for instance in ImageNet challenge uses HOG with linear SVM.