I am training for Custom Object Detection using Mask RCNN in TensorFlow Object Detection. Therefore, I am to predict the object instance mask along with the bounding box.
Pre-trained model : mask_rcnn_inception_v2_coco
Following is a snapshot of my training.
INFO:tensorflow:global step 4181: loss = 0.0031 (3.290 sec/step)
INFO:tensorflow:global step 4181: loss = 0.0031 (3.290 sec/step)
INFO:tensorflow:global step 4182: loss = 0.0030 (2.745 sec/step)
INFO:tensorflow:global step 4182: loss = 0.0030 (2.745 sec/step)
In this case, can you please tell me what is the loss here?
My questions is not related to training loss and its variation w.r.t. the steps.
I am just unclear about what is meant by this loss while training a Mask RCNN? In a Mask RCNN, there are 3 parallel heads at the last layer,
- for detecting the class
- for predicting bounding box
- for predicting instance masks
In such a case, what is loss?