I'm coding a program in OpenCV, which is supposed to detect objects in a scene,namely products in a supermarket.
I plan to use SURF descriptors for this purpose, however everything I've found so far is related to finding 1 occurrence of an object in a scene (generally with nearest neighbor matching) and I've found absolutely nothing about detecting objects in a scene with multiple instances of the same object (NN obviously doesn't work, since the best match for each point may be in different instances).
I also need to use a classifier, like Artificial Neural Networks, which could be more helpful in finding multiple instances of the object, however I don't understand how to use an ANN (or any other classifier) with keypoints.
Should I use the ¿64? values of each SURF point as the input of the ANN, and each of, say 5 products, as the output? Meaning that all the points (which are not similar) within one object would produce the same output.
I've read that that's the way to go, but I don't see how it could work since all the keypoints in one object may (and should) have different characteristics. But I can't think of any other way to do it.
Sorry if I haven't explained it very well, I'll try to clarify if something's not clear enough.