OpenCV: counting bounding boxes in a video

Question

I have some videos that are to be considered as ground truths for people detection: this is an example.

I also have the original video (without any detections). I have to run my people detection algorithm on it and compare my results with the ground truth video.

The problem is that I would like to have not only a qualitative comparison, but also a quantitative one. Since I am able to count the number of detections produced by my own algorithm, I need a reliable way to count the number of bounding boxes that appear in each frame of the ground truth video.

I have looked at this link and this one as well, but they are meant to find the contours of a shape, not a bounding box. I know it may sound like nonsense to detect the number of detections, but this is the only way I have to get a numerical ground truth.

Are you sure there is no data attached to the video with the frame number and a list of bounding boxes (I would expect that something like this has to exist)? Have you looked at bitbucket.org/amilan/motchallenge-devkit ? — wdudzik
The validation videos from data sets have their ground-truths with them. Please look for one such file. It generally is .xml or .csv. — mibrahimy
@wdudzik yes, you are right: ground truth files are attached here and I also found how to use them. I apologize for asking because I could have searched before, but I had been googling for good datasets for days and then I found this, it looked perfect for my code, but I blindly got lost in this ground truth trouble. As an excuse, I will provide my GitHub repo containing the detection as soon as I have something concrete. — Lorenzo

Stephen Meschke · Accepted Answer · 2019-03-15T18:33:46

Use a pedestrian dataset that has both source video and ground truth. The source video will be a video file (like .avi) and the ground truth will be a spreadsheet (like .csv). The x,y coordinates, width, and height of the bounding boxes around pedestrians are saved in the spreadsheet.
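As a rough sketch of how such a spreadsheet could give the per-frame box counts the question asks for, the Python below groups rows by frame number. The column order (frame, x, y, width, height, no header row) and the name `load_boxes_per_frame` are assumptions for illustration; real datasets differ, so check the dataset's documentation before relying on this layout.

```python
import csv
import io
from collections import defaultdict


def load_boxes_per_frame(csv_file):
    """Group ground-truth boxes by frame number.

    Assumed row layout (hypothetical): frame, x, y, width, height.
    """
    boxes = defaultdict(list)
    for row in csv.reader(csv_file):
        if not row:
            continue  # skip blank lines
        frame, x, y, w, h = (float(v) for v in row[:5])
        boxes[int(frame)].append((x, y, w, h))
    return dict(boxes)


# Made-up sample data standing in for a real ground-truth file.
sample = io.StringIO("1,10,20,50,100\n1,200,40,45,90\n2,12,22,50,100\n")
ground_truth = load_boxes_per_frame(sample)

# Number of ground-truth bounding boxes in each frame.
counts = {frame: len(frame_boxes) for frame, frame_boxes in ground_truth.items()}
print(counts)  # {1: 2, 2: 1}
```

With a real file, the `io.StringIO` object would be replaced by `open(path, newline="")`, and the resulting counts can be compared frame by frame with the number of detections your own algorithm reports.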

To visually check your results, draw the ground truth and your results on the same video.

Use an algorithm to quantitatively check your results. The accuracy function I used was:

overlap / ((ground_truth_area + my_results_area)/2)

The overlap is shown in gray in the gif. How I calculated overlap.
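The accuracy function above can be sketched in Python as follows. This is only an illustration: it assumes boxes are axis-aligned tuples of (x, y, width, height) and computes the overlap as the area of their rectangular intersection, which may not match how the overlap was calculated for the gif. The names `overlap_area` and `accuracy` are hypothetical.

```python
def overlap_area(a, b):
    """Intersection area of two axis-aligned boxes given as (x, y, width, height)."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = min(ax + aw, bx + bw) - max(ax, bx)
    inter_h = min(ay + ah, by + bh) - max(ay, by)
    if inter_w <= 0 or inter_h <= 0:
        return 0.0  # the boxes do not overlap
    return float(inter_w * inter_h)


def accuracy(ground_truth_box, my_results_box):
    """overlap / ((ground_truth_area + my_results_area) / 2)"""
    overlap = overlap_area(ground_truth_box, my_results_box)
    ground_truth_area = ground_truth_box[2] * ground_truth_box[3]
    my_results_area = my_results_box[2] * my_results_box[3]
    mean_area = (ground_truth_area + my_results_area) / 2
    return overlap / mean_area if mean_area > 0 else 0.0


# Two 10x10 boxes, the second shifted 5 pixels to the right.
score = accuracy((0, 0, 10, 10), (5, 0, 10, 10))
print(score)  # 0.5
```

The score is 1.0 when the two boxes coincide and 0.0 when they do not overlap at all. Algebraically it equals 2 * overlap / (ground_truth_area + my_results_area), so it is not the same number as intersection over union and the two should not be compared directly.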
