What is the simplest way to make object detector on C++ with Fast/Faster-RCNN and Caffe?
As known, we can use follow RCNN (Region-based Convolutional Neural Networks) with Caffe:
scores, boxes = im_detect(net, im, obj_proposals)
which calls to def im_detect(net, im, boxes):
for this used rbgirshick/caffe-fast-rcnn, ROIPooling-layers and output bbox_pred
scores, boxes = im_detect(net, im)
which calls to def im_detect(net, im, boxes=None):
for this used rbgirshick/caffe-fast-rcnn, ROIPooling-layers and output bbox_pred
All of these use Python and Caffe, but how to do it on C++ and Caffe?
There is only C++ example for classification (to say what on image), but there is not for detecton (to say what and where on image): https://github.com/BVLC/caffe/tree/master/examples/cpp_classification
Is it enough to simply clone rbgirshick/py-faster-rcnn repository with
rbgirshick/caffe-fast-rcnn, download the pre-tained model ./data/scripts/fetch_faster_rcnn_models.sh
, use this coco/VGG16/faster_rcnn_end2end/test.prototxt and done a small change in CaffeNet C++ Classification example?
And how can I get output data from two layers bbox_pred and cls_score ?
Will I have all (bbox_pred & cls_score) in one array:
const vector<Blob<float>*>& output_blobs = net_->ForwardPrefilled();
Blob<float>* output_layer = output_blobs[0];
const float* begin = output_layer->cpu_data();
const float* end = begin + output_layer->channels();
std::vector<float> bbox_and_score_array(begin, end);
Or in two arrays?
const vector<Blob<float>*>& output_blobs = net_->ForwardPrefilled();
Blob<float>* bbox_output_layer = output_blobs[0];
const float* begin_b = bbox_output_layer ->cpu_data();
const float* end_b = begin_b + bbox_output_layer ->channels();
std::vector<float> bbox_array(begin_b, end_b);
Blob<float>* score_output_layer = output_blobs[1];
const float* begin_c = score_output_layer ->cpu_data();
const float* end_c = begin_c + score_output_layer ->channels();
std::vector<float> score_array(begin_c, end_c);