Marine Search and Rescue

Search and rescue missions out in the ocean can be very complex, with the number of people in need of help ranging to hundreds at a time. It can become especially challenging to keep count of the people needing help in such a dynamic environment.

Demonstration and Discussion

Faster-RCNN Predictions

We aim to use a popular architecture for object detection, Faster-RCNN, to be able to recognize humans in the scene and classify them in one of six categories. The model uses a ResNet-50 backbone which condenses the input image down into a feature map which is the basis for generating proposals. There is a classifier head, regression head, and confidence head which predict the category of the object, the bounding box and its offsets, and the confidence of the initial classifcation respectively. Using this approach we are able to count quite accurately the number of humans in the scene and further split it into "Rescued" and "To be rescued".

Challenges and Future Work

The model struggles to make accurate predictions when the humans in the scene are only a few pixels big. We can experiment with the ratios and scales of the anchor boxes generated in the RPN to combat this particular issue and use a deeper backbone which has been finetuned on this dataset. The problem statement can be taken one step further from object detection to multi-object tracking. In this we would assign IDs to all the humans in the scenes and could even store their temporal movement to predict their trajectories and point out other risks. A good starting point would be to see the performance of a model such as Deep SORT.

Techonologies: Python, PyTorch, OpenCV, Numpy
GitHub Code Repository: Link