TLDR: A study compared YOLOv7, YOLOv8, YOLOv9, YOLOv10, and Faster R-CNN for underwater waste detection using a 15-class dataset. YOLOv8 achieved the highest accuracy (80.9% mAP), demonstrating its potential for improving underwater cleanup operations due to its advanced architecture.
The increasing global population and industrial growth have led to significant environmental concerns, particularly underwater pollution. This issue, largely driven by plastics and chemical contaminants, severely impacts marine biodiversity, ecosystems, and human health. The economic consequences are also substantial, affecting industries like tourism, fishing, and shipping. For instance, marine debris is estimated to cost the Asia-Pacific region approximately $1.26 billion annually.
Addressing this widespread problem requires advanced technological solutions. Recent breakthroughs in machine learning, Artificial Intelligence (AI), and autonomous systems offer promising avenues for detecting, classifying, and removing underwater contaminants. AI-powered image recognition systems, such as convolutional neural networks (CNNs), can significantly improve the identification of various types of garbage, leading to more precise and efficient cleanup operations.
Comparing Object Detection Models
A recent study aimed to compare the performance of several cutting-edge object detection models for underwater garbage detection. The researchers evaluated YOLOv7, YOLOv8, YOLOv9, YOLOv10, and Faster R-CNN. Their goal was to identify which model is most effective at recognizing waste materials in challenging marine environments, including conditions with low visibility and variable depths.
The study utilized a large dataset comprising 5130 images across fifteen different classes of underwater garbage, such as masks, cans, cell phones, electronics, glass bottles, gloves, metal, nets, polythene bags, plastic bottles, rods, sunglasses, and tires. This comprehensive dataset allowed for a thorough training and testing of the models.
Also Read:
- Advancing Solar Panel Detection with Large Language Models
- Knowledge Grafting: A New Method for Efficient AI on Resource-Limited Hardware
YOLOv8 Emerges as Top Performer
After training and testing, the YOLOv8 model emerged as the top performer. It achieved a mean Average Precision (mAP) of 80.9%, outperforming all other models, including YOLOv7, YOLOv9, YOLOv10, and Faster R-CNN. YOLOv8’s superior performance is attributed to its advanced architecture, which incorporates features like improved anchor-free mechanisms and self-supervised learning. These enhancements enable more precise and efficient recognition of objects in diverse settings.
The findings highlight YOLOv8’s significant potential as a powerful tool in the global effort against pollution. Its effectiveness can greatly enhance the capabilities and scalability of underwater cleanup operations. While the study showed promising results, the authors noted limitations, such as the dataset not fully capturing all real-world undersea conditions like extreme low visibility or cluttered backgrounds. Future work could involve larger datasets from multiple locations and categories, and integrating techniques like domain adaptation and transfer learning to improve model generalization across different underwater settings. Combining these real-time detection capabilities with underwater robotic systems could also revolutionize environmental monitoring and cleanup efforts.
For more detailed information, you can refer to the full research paper available at this link.


