I am classifying medical images using bag-of-words model. I did the following to extract the feature vector:
- extract features from small image patches and then apply BOW on those features
- extract pixel values from small image patches then apply BOW on those pixel values
After the feature extraction I tried PCA, feature selection, changing no of clusters for KMeans etc to improve the accuracy. But in my case BOW learned on pixel values (1) outperforms (90%) than the BOW learned on features(2) (70%). My features are good and when I use those features to classify the images using some other framework I was able to get more than 95% accuracy.
My question is why BOW learned on pixels performs better than BOW learned on features?


Normal-abnormal colonoscopy image classification
Figure 1: a normal colon image
Figure 2: an image with polyp