SSD-YOLO:一种基于新型抗噪与方向敏感注意力融合的合成孔径雷达船舶检测方法

SSD-YOLO: a SAR ship detection via a novel anti-noise and direction-sensitive attention fusion approach

  • 摘要:
    目的 为解决合成孔径雷达(SAR)图像中的强噪声和目标尺度差异问题,提出一种改进型YOLOv8n模型(SSD-YOLO),旨在提升复杂海洋场景下的船舶检测性能。
    方法 该模型以YOLOv8n为基线,融合三大创新模块:SAR_SPPF模块利用Ghost卷积和双径池化高效抗噪;C2f_SimAM模块嵌入无参数SimAM注意力机制,强化目标响应;C2f_DSConv模块则采用方向敏感的深度可分离卷积,精细捕捉舰船纹理与方向信息。
    结果 在SSDD数据集上的实验结果表明,模型精确率97.0%、召回率96.4%,mAP@0.5达99.0%,mAP@0.5:0.95达74.3%。该模型参数量约3 M,FLOPs为7.7 G,保持了轻量化。在更具挑战性的HRSID数据集上的泛化实验表明,模型的mAP@0.5达到94.9%,进一步证明了其鲁棒性与泛化能力。消融实验证明了各模块的有效性。
    结论 本研究通过融合轻量级抗噪、目标强化及方向敏感特征捕捉策略,为SAR舰船检测提供了一种高效、鲁棒的解决方案。

     

    Abstract:
    Objective Synthetic Aperture Radar (SAR) has established itself as an indispensable tool for marine monitoring due to its all-weather, day-and-night imaging capabilities. However, automatic ship detection in SAR imagery remains a significant challenge, primarily stemming from intense coherent speckle noise, extreme variations in target scales, and complex background interference in inshore scenarios. Existing lightweight models often struggle to maintain high detection precision and robustness under these varying conditions. To address these issues, this paper proposes an improved lightweight detection model, SSD-YOLO, based on the YOLOv8n architecture, designed to enhance performance in complex marine environments.
    Method The SSD-YOLO model establishes a hierarchical "Reinforce-Purify-Fuse" synergistic framework by integrating three theoretically grounded innovative modules. First, to address the issue of weak target signals amidst noise, a Parameter-free Target Response Enhancement Module (C2f_SimAM) is embedded in the backbone network. By leveraging an energy-function-based SimAM attention mechanism rooted in neuroscience, this module adaptively calculates 3D attention weights. It effectively enhances the linear separability of target neurons and suppresses background clutter without adding any model parameters. Second, a Lightweight Anti-Noise Feature Aggregation Module (SAR_SPPF) is deployed at the end of the backbone. Distinguished from traditional spatial pyramid pooling, this module utilizes Ghost Convolutions for computational efficiency and introduces a novel dual-path pooling strategy. Theoretical derivation based on the Gamma distribution properties of SAR speckle noise demonstrates that combining small-kernel Average Pooling (to smooth background variance) with large-kernel Max Pooling (to preserve strong scattering peaks) maximizes the signal-to-noise ratio. Third, a Direction-Sensitive Multi-Scale Fusion Module (C2f_DSConv) is introduced in the neck network. This module replaces standard convolutions with direction-sensitive depthwise separable convolutions. By employing specific kernel orientations (0°, 45°, and 90°), it finely captures the anisotropic texture and geometric morphology of ships, significantly improving detection capabilities for densely arranged and multi-scale targets.
    Results Extensive experiments were conducted on the SSDD and HRSID datasets. On the SSDD dataset, the model achieved 97.0% precision, 96.4% recall, 99.0% mAP@0.5, and 74.3% mAP@0.5:0.95. Despite these significant performance gains, the model remains highly lightweight, maintaining approximately 3.01M parameters and 7.9G FLOPs. Generalization experiments on the more challenging high-resolution HRSID dataset demonstrated superior robustness, achieving an mAP@0.5 of 94.9%. Quantitative analysis of feature map SNR further validated the internal logic of the proposed synergistic framework.
    Conclusion This research provides an efficient and robust solution for SAR ship detection by combining lightweight anti-noise,target enhancement,and direction-sensitive feature capture strategies.

     

/

返回文章
返回