改进边缘感知门控与三路自适应融合的红外弱小目标检测模型

    Infrared dim and small target detection model based on improved edge-aware gating and three-path adaptive fusion

    • 摘要: 红外弱小目标检测作为计算机视觉在国防与民用监测领域的重要研究方向,是远程预警、海域监视、无人装备环境感知等任务的关键技术支撑,对提升感知系统的环境适配性和任务响应精度具有重要意义。针对复杂场景下红外弱小目标检测中存在的信噪比低、背景杂波干扰、目标特征不显著等问题,提出一种改进边缘感知门控与三路自适应融合的红外弱小目标检测模型。首先,采用U型编解码架构实现多尺度语义融合,以Swin-Transformer为骨干,统一建模局部细节与长程依赖;然后,为有效抑制背景噪声并增强目标边缘响应,改进设计递推增强型边缘语义门控层(recursive enhanced edge-semantic gated contextual layer, REES-GCL)与空间空洞金字塔池化(atrous spatial pyramid pooling, ASPP)协同模块,自适应抑制复杂背景杂波,精准提取目标边缘信息;最后,针对传统特征融合中语义鸿沟显著,权重分配缺乏针对性,改进设计门控引导的双阶段注意力融合模块(gated dual-stage attention fusion, GDSAF),实现三路特征的精准协同。实验表明:该模型在复杂背景与低对比度场景下鲁棒性显著提升,检测精度与召回率分别较最优对比模型提高7.3%和3.1%,验证了递推增强型边缘语义门控层与ASPP协同机制、双阶段注意力融合模块的有效性。

       

      Abstract: As a vital research direction of computer vision in the fields of national defense and civil monitoring, infrared dim and small target detection acts as a key technical support for tasks such as long-range early warning, maritime surveillance and environmental perception of unmanned equipment, which is of great significance for improving the environmental adaptability and task response accuracy of perception systems. Aiming at the problems of low signal-to-noise ratio, background clutter interference and inconspicuous target features in infrared dim and small target detection under complex scenarios, an improved infrared dim and small target detection model with edge-aware gating and three-path adaptive fusion was proposed. Firstly, a U-shaped encoder-decoder architecture was adopted to realize multi-scale semantic fusion, with Swin-Transformer as the backbone to uniformly model local details and long-range dependencies. Then, to effectively suppress background noise and enhance target edge responses, a collaborative module integrating the recursive enhanced edge-semantic gated contextual layer (REES-GCL) and atrous spatial pyramid pooling (ASPP) was improved and designed, which ccould adaptively suppress complex background clutter and accurately extract target edge information. Finally, addressing the issues of significant semantic gap and lack of targeted weight allocation in traditional feature fusion, a gated dual-stage attention fusion (GDSAF) module was improved and designed to achieve the precise collaboration of three-path features. Experimental results show that the proposed model achieves a significant improvement in robustness under complex backgrounds and low-contrast scenarios, with the detection precision and recall rate increased by 7.3% and 3.1% respectively, compared with the state-of-the-art comparison model, which verifies the effectiveness of the collaborative mechanism of REES-GCL and ASPP as well as the dual-stage attention fusion module.

       

    /

    返回文章
    返回