
I was reading the Fast R-CNN Caffe code. Inside SmoothL1LossLayer, I found that the implementation is not the same as the equation in the paper. Is that how it should be?

The paper equation:

$$
L_{loc}(t^u, v) = \sum_{i \in \{x, y, w, h\}} \mathrm{smooth}_{L_1}(t_i^u - v_i),
\qquad
\mathrm{smooth}_{L_1}(x) =
\begin{cases}
0.5\,x^2 & \text{if } |x| < 1 \\
|x| - 0.5 & \text{otherwise}
\end{cases}
$$
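To make sure I read it right, in NumPy terms my understanding of the paper's loss is roughly this (my own names, not anything from the paper or the repo):

```python
import numpy as np

def smooth_l1(x):
    """smooth_L1(x) = 0.5 x^2 if |x| < 1, else |x| - 0.5 (element-wise)."""
    ax = np.abs(x)
    return np.where(ax < 1, 0.5 * ax ** 2, ax - 0.5)

def loc_loss_paper(t_u, v):
    """L_loc from the paper: sum of smooth L1 over the 4 coordinates
    (tx, ty, tw, th) of the prediction t^u for the ground-truth class u."""
    return np.sum(smooth_l1(t_u - v))

# toy example: one box, predicted offsets for class u vs. its targets
t_u = np.array([0.2, -0.3, 1.5, 0.1])
v = np.zeros(4)
print(loc_loss_paper(t_u, v))  # 0.02 + 0.045 + 1.0 + 0.005 = 1.07
```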

For each labeled bounding box with class u, the paper sums the smooth L1 error over tx, ty, tw, th, but in the code we have:

[image: the Forward pass of SmoothL1LossLayer in the Fast R-CNN Caffe code]

No class label information is used there. Can anyone explain why?
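As far as I can tell, the layer effectively does something like the following over the whole bbox_pred/bbox_targets blobs, with no class index anywhere (again just my own sketch, not the actual Caffe code):

```python
import numpy as np

def smooth_l1(x):
    ax = np.abs(x)
    return np.where(ax < 1, 0.5 * ax ** 2, ax - 0.5)

def smooth_l1_loss_forward(bbox_pred, bbox_targets):
    """Element-wise smooth L1 over the full blobs, e.g. shape (N, 84),
    summed and (I believe) normalized by the batch size N."""
    diff = bbox_pred - bbox_targets
    return np.sum(smooth_l1(diff)) / bbox_pred.shape[0]
```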

And in the backpropagation step,

[image: the Backward pass of SmoothL1LossLayer, which loops over an index i]

why is there an i here?


1 Answer

  1. In train.prototxt, bbox_pred has an output size of 84 = 4 (x, y, w, h) × 21 (number of classes), and so does bbox_targets, so every class is covered. Only the four entries that belong to the ground-truth class u are filled (and weighted) in the target blob, so the per-class selection from the paper happens when the targets are built, not inside the loss layer; see the target-layout sketch below.
  2. As for the i: like other loss layers, it loops over the bottom blobs to find which ones to propagate the gradient through, and here only one of the propagate_down[i] flags is true (the one for bbox_pred); see the backward sketch below.
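To illustrate point 1, here is a rough NumPy sketch (my own helper names, not the ones in the repo) of how the per-class selection from the paper can be baked into the 84-wide target: only the 4 slots belonging to the ground-truth class u are filled and weighted, so an element-wise smooth L1 over the whole vector gives the same value as the paper's L_loc. The actual code gets this masking from a separate weight blob that is built together with bbox_targets; I'm just inlining it here.

```python
import numpy as np

NUM_CLASSES = 21   # 20 object classes + background
COORDS = 4         # (x, y, w, h)

def smooth_l1(x):
    ax = np.abs(x)
    return np.where(ax < 1, 0.5 * ax ** 2, ax - 0.5)

def expand_target(u, v):
    """Put the 4 regression targets v into the slots of class u of an
    84-wide vector; every other class's slots stay zero."""
    target = np.zeros(NUM_CLASSES * COORDS)
    weights = np.zeros(NUM_CLASSES * COORDS)
    target[u * COORDS:(u + 1) * COORDS] = v
    weights[u * COORDS:(u + 1) * COORDS] = 1.0  # only class u contributes
    return target, weights

u = 3                                  # ground-truth class of this RoI
v = np.array([0.2, -0.3, 1.5, 0.1])    # its 4 regression targets
bbox_pred = np.random.randn(NUM_CLASSES * COORDS)  # 84-wide layer output
target, weights = expand_target(u, v)

# element-wise loss over the whole 84-wide vector, masked by the weights ...
loss_code = np.sum(smooth_l1(weights * (bbox_pred - target)))
# ... equals the paper's L_loc over just the 4 coordinates of class u
loss_paper = np.sum(smooth_l1(bbox_pred[u * COORDS:(u + 1) * COORDS] - v))
print(np.isclose(loss_code, loss_paper))  # True
```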
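For point 2, here is a Python sketch of the loop structure (not the actual C++ code): i indexes the bottom blobs, and gradients are only written for the ones whose propagate_down[i] flag is set.

```python
import numpy as np

def smooth_l1_grad(x):
    """Derivative of smooth L1: x where |x| < 1, else sign(x)."""
    return np.where(np.abs(x) < 1, x, np.sign(x))

def smooth_l1_loss_backward(diff, propagate_down, top_grad=1.0):
    """diff = bottom[0] - bottom[1]; return the gradients for the two bottoms.
    Gradients are only written for the bottoms whose propagate_down flag is
    set -- normally just bottom[0], the bbox_pred blob."""
    grads = [None, None]
    for i in range(2):                   # i runs over the two bottom blobs
        if not propagate_down[i]:
            continue
        sign = 1.0 if i == 0 else -1.0   # d(diff)/d bottom[0] = +1, d(diff)/d bottom[1] = -1
        grads[i] = sign * top_grad * smooth_l1_grad(diff)
    return grads
```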