改进UNet++融合注意力机制的舰船图像识别方法

Improving the Ship Image Recognition Method of UNet++ Fusion Attention Mechanism

  • 摘要: 【目的】针对复杂战场环境下舰船目标识别面临的小样本、高标注成本及复杂海况干扰等难题,提出了一种融合 SE-Net 通道注意力机制的改进UNet++ 模型,实现端到端的舰船图像精准分类。【方法】以 ResNet50 为编码器主干,构建编码器-解码器双向特征精炼机制,嵌入SE-Net模块动态校准通道特征响应,引入联合损失函数增强特征判别性。采用弱监督学习策略,仅需图像级标签,显著降低标注成本。在自建七类舰船数据集上,改进模型分类准确率达92.0%,【结果】训练损失从初始1.40降至0.20,验证损失从1.50降至0.23,优于主流主干网络与多种注意力机制,在尺度变化、薄雾、浓雾等复杂海况下的适用性实验中,准确率较基线提升8.9%~18.5%,消融实验与可视化分析验证了各模块的有效性。【结论】本研究为复杂海下舰船目标精准识别提供了有效的技术解决方案。

     

    Abstract: Aiming at the problems of small samples, high annotation costs and complex sea interference faced by ship target recognition in complex battlefield environments, an improved UNet++ model integrating the attention mechanism of SE-Net channel is proposed to achieve end-to-end accurate classification of ship images. ResNet50 is used as the backbone of the encoder to construct a bidirectional feature refining mechanism of encoder-decoder, the SE-Net module is embedded to dynamically calibrate the channel feature response, and the joint loss function is introduced to enhance feature discrimination. The weakly supervised learning strategy is adopted, and only image-level labels are required, which significantly reduces the annotation cost. On the self-built seven types of ship datasets, the improved model classification accuracy reached 92.0%, the training loss was reduced from the initial 1.40 to 0.20, and the verification loss was reduced from 1.50 to 0.23, which was better than the mainstream backbone network and a variety of attention mechanisms, and the accuracy was improved by 8.9%~18.5% compared with the baseline in the applicability experiments under complex sea conditions such as scale changes, mist, and dense fog, and the effectiveness of each module was verified by ablation experiments and visual analysis. This study provides an effective technical solution for accurate target identification of complex undersea ships.

     

/

返回文章
返回