基于多任务学习框架的红外行人检测算法

苟于涛; 马梁; 宋怡萱; 靳雷; 雷涛

doi:10.12086/oee.2021.210358

基于多任务学习框架的红外行人检测算法

Multi-task learning for thermal pedestrian detection

摘要: 与高质量可见光图像相比，红外图像在行人检测任务中往往存在较高的虚警率。其主要原因在于红外图像受成像分辨率及光谱特性限制，缺乏清晰的纹理特征，同时部分样本的特征质量较差，干扰网络的正常学习。本文提出基于多任务学习框架的红外行人检测算法，其在多尺度检测框架的基础上，做出以下改进：1) 引入显著性检测任务作为协同分支与目标检测网络构成多任务学习框架，以共同学习的方式侧面强化检测器对强显著区域及其边缘信息的关注。2) 通过将样本显著性强度引入分类损失函数，抑制噪声样本的学习权重。在公开KAIST数据集上的检测结果证实，本文的算法相较于基准算法RetinaNet能够降低对数平均丢失率(MR^-2)4.43%。

Abstract: Compared with high-quality RGB images, thermal images tend to have a higher false alarm rate in pedestrian detection tasks. The main reason is that thermal images are limited by imaging resolution and spectral characteristics, lacking clear texture features, while some samples have poor feature quality, which interferes with the network training. We propose a thermal pedestrian algorithm based on a multi-task learning framework, which makes the following improvements based on the multiscale detection framework. First, saliency detection tasks are introduced as an auxiliary branch with the target detection network to form a multitask learning framework, which side-step the detector's attention to illuminate salient regions and their edge information in a co-learning manner. Second, the learning weight of noisy samples is suppressed by introducing the saliency strength into the classification loss function. The detection results on the publicly available KAIST dataset confirm that our learning method can effectively reduce the log-average miss rate by 4.43% compared to the baseline, RetinaNet.

基于多任务学习框架的红外行人检测算法

Multi-task learning for thermal pedestrian detection

相关链接

目录

基于多任务学习框架的红外行人检测算法

Multi-task learning for thermal pedestrian detection

相关链接

目录

微信二维码