• 摘要: 为了解决遥感图像中小目标特征不明显、背景复杂及目标密集分布导致的检测遗漏问题,提出一种融合注意力与特征增强的遥感图像小目标检测算法。首先,构造多层级特征协同策略,结合通道注意力和空间注意力机制并利用门控机制动态调整信息流动,有效捕捉不同层级特征间的潜在关联,加强模型在复杂场景中的鲁棒性。其次,设计一个多尺度特征增强模块,将三维卷积和多尺度特征编码结合形成一个增强型颈部网络,优化信息提取能力与小目标的特征表达。最后,采用NWD结合SIoU作为损失函数,在提升模型训练效率的同时进一步加强小目标的定位准确性。在NWPU VHR-10数据集、DIOR数据集和RSOD数据集上的实验结果表明,改进算法相较于原始模型,mAP@0.5数值分别提高了9.9%、3.1%和3.2%,可视化实验结果显示改进算法可以检测出原始模型漏检的小目标,与其他主流检测算法相比具有更优的检测精度。

       

      Abstract:
      Objective Remote sensing image target detection plays a crucial role in resource exploration, urban planning and management, and disaster emergency response. However, targets in remote sensing images are typically characterized by small size, large scale variations, complex backgrounds, blurred edges, and insufficient texture information. These challenges are further exacerbated in low-resolution images, where the distinction between targets and the background becomes less pronounced. Although significant progress has been achieved in remote sensing target detection, small target detection still suffers from missed detections and limited feature representation. Therefore, this study aims to improve the accuracy and robustness of small target detection in remote sensing images while enhancing training efficiency and localization precision.
      Methods To address the issues of unclear features, complex backgrounds, and densely distributed small targets, a remote sensing small target detection algorithm integrating attention mechanisms and feature enhancement is proposed. First, a multi-level feature collaboration strategy is constructed by combining channel attention and spatial attention mechanisms with a gating mechanism to dynamically regulate information flow, thereby strengthening cross-level feature interaction and suppressing background noise. Second, a multi-scale feature enhancement module is designed by integrating three-dimensional convolution and multi-scale feature encoding to form an enhanced neck network, which improves the feature extraction and representation capabilities for small targets. Finally, SIoU and NWD are jointly employed as regression loss functions to optimize bounding box matching, accelerate convergence, and enhance localization accuracy.
      Results and Discussions Experiments conducted on the NWPU VHR-10, DIOR, and RSOD datasets demonstrate the effectiveness of the proposed method. Compared with the original baseline model, the improved algorithm achieves improvements in mAP@0.5 of 9.9%, 3.1%, and 3.2%, respectively. Visualization results further indicate that the proposed method can successfully detect small targets that were previously missed, particularly in complex scenes with densely distributed targets. These results verify that the integration of multi-level attention collaboration and multi-scale feature enhancement effectively improves small target representation and reduces missed detections.
      Conclusions The proposed attention- and feature-enhancement-based small target detection algorithm significantly improves detection accuracy and localization performance in remote sensing images. By enhancing cross-level feature interaction and strengthening multi-scale representation, the method effectively addresses the challenges of small target detection in complex scenarios. The experimental results confirm its robustness and superiority over the baseline model, demonstrating its potential for practical remote sensing applications.