• 摘要: 针对相似背景干扰场景下目标检测精度低、实时性不强等问题,提出了一种基于改进RT-DETR的目标检测算法。分析了浮点运算量(floating point operations, FLOPs)和每秒浮点运算次数(floating point operations per second, FLOPS)对实时性的影响,采用速度、效率和轻量化方面都表现出色的FasterNet模块作为主干网络,实现了网络结构轻量化、算法实时性的提升。引入了能够处理图像高频和低频特征的HiLo (high frequency attention and low frequency attention)模块作为尺度内融合模块,实现了算法检测精度和抗干扰能力的提升。选取相似背景干扰场景下的CAMO-D数据集、CottonInsect数据集和自建病虫害数据集展开实验,结果表明相对于RT-DETR (real-time detection transformer)算法,本文算法检测准确率分别提高了8.0%、6.6%和9.0%,F1分数分别提高了13.8%、3.2%和4.2%。在保持帧率基本不变的情况下,浮点运算量减少了49.6%,参数量减少了45.7%,计算准确性和效率方面提升明显。本文算法能有效应对相似背景干扰下的目标检测挑战,可以应用于更加广泛的实际场景。

       

      Abstract: In response to the problems of low detection precision and poor real-time performance caused by the extreme similarity between object and background in complex scenarios, an object detection algorithm based on improved RT-DETR is proposed. The influence of FLOPs (floating point operations) and FLOPS (floating point operations per second) on real-time performance is analyzed. FasterNet module, which is excellent in speed, efficiency and lightweight, is used as the backbone network to realize a lightweight network structure and real-time algorithm improvement. The HiLo (high- and low-frequency attention) module, which can process high-frequency and low-frequency image features simultaneously, is introduced as the intra-scale fusion module to improve algorithm detection precision and anti-interference ability. Experiments are carried out on CAMO-D, CottonInsect and a self-built pest and disease dataset in complex scenarios, and the results show that the improved algorithm is superior to the RT-DETR (real time detection transformer) algorithm in that the detection precision has increased by 8.0%, 6.6%, and 9.0%, and F1 score has increased by 13.8%, 4.2%, and 3.2%, respectively. Under the premise of frame rate being unchanged, the floating-point operations of the improved algorithm have reduced by 49.6%, and number of parameters reduced by 45.7%, which makes the calculation accuracy and efficiency improve significantly. The algorithm effectively addresses the challenge of object detection under similar background interference and holds promise for broader application in various practical scenarios.