A seabed garbage target detection method based on improved YOLOv8
-
摘要: 针对复杂海底环境下垃圾目标呈现多尺度形态分布、与海洋生物特征高度相似导致的类间模糊性,以及由此引发的特征提取能力不足、检测精度不高和定位不准确等问题,提出一种基于改进的YOLOv8的海底垃圾目标检测算法。首先,将ODConv全维度动态卷积融合到颈部网络的C2f中,形成新模块C2f_ODConv,使模型能够实现对卷积核的全方位动态调整,更精细地适应输入数据的特征,从而提高特征提取的效果;其次,在C2f_ODConv后引入可变形注意力transformer(deformable attention transformer,DAT),有效捕捉图像中的局部细节,提高模型检测精度;最后,使用UIoU Loss取代CIoU Loss并采用线性衰退策略,进一步准确定位目标,提高模型泛化能力。在TrashCan-Instance公开数据集上进行实验,结果表明:改进后模型的召回率和平均精度分别为64.4%、70.4%,相比基线YOLOv8提升4.5、2.2百分点,进一步满足了海底垃圾实时检测需求。Abstract: Aiming at the problems of complex seafloor environment where litter targets present multi-scale morphological distribution, inter-class ambiguity, due to high similarity with marine organisms' features, and the resulting issues of insufficient feature extraction capability, poor detection accuracy, and inaccurate localization, a seabed litter target detection algorithm based on the improved YOLOv8 was proposed. First, the ODConv full-dimensional dynamic convolution plus was fused into the C2f of the neck network, to form a new module C2f_ODConv, which enables the model to realize all-round dynamic adjustment of the convolution kernel and more finely adapt to the features of the input data, thus improving the effectiveness of the feature extraction; second, a deformable attention was introduced after the C2f_ODConv, which effectively captured local details in the image and improves the model detection accuracy; finally, UIoU Loss was used instead of CIoU Loss and a linear recession strategy was adopted to localize the target and improve the model generalization ability further accurately. Experiments were conducted on the public dataset TrashCan-Instance, and the experimental results showed that the improved model had a Recall and mAP of 64.4% and 70.4% respectively, which were 4.5 and 2.2 percentage points higher than the baseline model YOLOv8, and also met the underwater spam detection demand.
-
Key words:
- target detection /
- seabed debris detection /
- ODConv /
- deformable attention /
- UIoU loss function
-
[1] MAJCHROWSKA S,MIKOLAJCZYK A,FERLIN M,et al. Waste detection in Pomerania:non-profit project for detecting waste in environment[J]. Waste Management,2022,138:274-284. [2] DI CIACCIO F. TROISI S. Monitoring marine environments with autonomous underwater vehicles:a bibliometric analysis[J]. Results in Engineering,2021,9:100205. [3] DUANGSUWAN S,PRAPRUETDEE P. Drone-enabled AI edge computing and 5G communication network for real-time coastal litter detection[J]. Drones,2024,8(12):750. [4] HIPOLITO J C,ALON A S,AMORADO R V,et al. Detection of underwater marine plastic debris using an augmented low sample size dataset for machine vision system:a deep transfer learning approach[C]// 2021 IEEE 19th Student Conference on Research and Development(SCOReD). 2021. [5] Yuan H C,Zang T Q. Underwater garbage target detection based on attention mechanism and Ghost-YOLOv5[J]. Environmental Engineering,2023,41(7):214-221. 袁红春,臧天祺. 基于注意力机制及Ghost-YOLOv5的水下垃圾目标检测[J]. 环境工程,2023,41(7):214-221. [6] Pang M,Wang G,Zhan Y,et al. Marine underwater garbage detection method based on improved YOLOv5 algorithm[J]. Computer and Modernization,2024(7):120-126. 庞梅,汪珙,詹泳,等. 基于 YOLOv5 改进算法的海洋水下垃圾检测方法[J]. 计算机与现代化,2024(7):120-126. [7] YASEEN M. What is YOLOv8:an in-depth exploration of the internal features of the next-generation object detector[J]. arXiv preprint,2024,arXiv:2408.15857. [8] LI C,ZHOU A,YAO A. Omni-dimensional dynamic convolution[J]. arXiv,2022,abs/2209.07947. [9] XIA Z,PAN X,SONG S,al et,Vision transformer with deformable attention[C]// 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR),2022.[10] [10] 唐智贤,李镇,郭俏,等. 基于可变形注意力transformer的胃肠癌病理图像细胞核自动分割方法[J]. 复旦学报(医学版),2024,51(3): 396-403. [11] LUO X,CAI Z,SHAO B,et al. Unified-IoU:for high-quality object detection[J]. arXiv,2024.arXiv:2310.13616. [12] HONG J,FULTON M,SATTAR J. TrashCan:A semantically-segmented dataset towards visual detection of marine debris[J]. arXiv preprint,2020,arXiv:2007.08097. [13] REN J X,WU J L,WANG J R. PCB defect detection based on improved YOLOv5s with ECA and BiFPN[J]. Manufacturing Technology& Machine Tool,2024(8):78-82. 任金霞,吴吉林,王金荣. 基于ECA和BiFPN改进YOLOv5s的PCB缺陷检测[J]. 制造技术与机床,2024(8):78-82. [14] WANG Z,ZHANG S,CHEN Y,et al. Detection of small foreign objects in Pu-erh sun-dried green tea:an enhanced YOLOv8 neural network model based on deep learning[J]. Food Control,2025,168:110890. -
点击查看大图
计量
- 文章访问数: 10
- HTML全文浏览量: 3
- PDF下载量: 0
- 被引次数: 0
登录
注册
E-alert
登录
注册
E-alert
下载: