水下模糊目标的自适应检测方法

Adaptive detection methods for fuzzy underwater objects

  • 摘要: 水下环境存在光线衰弱、色彩失真、复杂背景干扰、目标尺度多样以及目标特征模糊等问题。针对目标尺度多样性和特征模糊难定位等问题,提出了一种基于Faster R-CNN (faster region-based convolutional neural networks)改进的水下目标检测算法。首先,在特征提取中引入可切换的空洞卷积,解决了特征提取过程中图像全局上下文信息丢失而导致的特征损失问题;其次,使用递归特征金字塔使高层特征与底层特征多次交互融合,增强模型对水下小目标以及复杂形状目标的检测能力;最后,引入了一种基于引导锚框的建议网络,该网络根据图像的语义特征,动态生成锚点更为稀疏且形状自适应的锚框,显著地提高了模型对水下目标检测的准确性及定位能力。实验表明:改进后的算法在水下数据集DUO上mAP(mean average precision)提高了5.7%,并且在通用目标检测数据集VOC上也有较好的表现。

     

    Abstract: Underwater environments usually suffer from issues such as light attenuation, color distortion, complex background interference, diverse target scales, and blurred target features. This paper addresses challenges related to diverse target scales and difficulties in feature localization by proposing an improved underwater target detection algorithm based on Faster R-CNN ((sg_Faster R-CNN). Firstly, we introduced switchable atrous convolution in feature extraction to address the problem of feature loss due to the absence of global contextual information during feature extraction. Secondly, we used a recursive feature pyramid to enable multiple interactions between high-level and low-level features, enhancing the capability of model to detect small underwater targets and complex-shaped objects. Lastly, we introduced a proposal network based on guided anchor boxes, which could dynamically generate anchors that were sparser and shape-adaptive based on the image's semantic features, significantly improving the accuracy and localization ability of the model for underwater target detection. Experiments demonstrate that the improved algorithm achieves a 5.7% increase in mAP (mean average precision) on the DUO underwater dataset and also performs well on the general-purpose object detection dataset VOC.

     

/

返回文章
返回