• 摘要: 为解决疲劳驾驶检测模型计算量大、参数量需求高以及人脸关键点提取能力不足的问题,提出了一种基于改进YOLOv8n的算法。首先,引入DRB (dilated reparameterization block)网络替代原始骨干网络,通过扩大感受野,有效捕捉不同尺度的关键特征,显著增强特征提取能力。其次,在Detect检测头中融入轻量化全卷积单级目标检测器(fully convolutional one-stage object detector, FCOS)架构,通过共享卷积,显著减少参数量,同时提升定位与分类性能,降低计算需求。实验总训练轮次为100,在同一硬件配置下改进后的模型平均精度均值(mAP)达81.2%,较基准模型提升1.9%;计算量(GFLOPs)和模型大小(model size)分别减少32.1%和33.3%,延迟时间缩短71.53%,检测速度提升35 f/s。改进算法在驾驶员面部目标检测任务中具有一定参考价值。

       

      Abstract: In order to solve the problems of the fatigue driving detection model's high computational volume, high parameter count requirement, and insufficient extraction ability of key points of the human face, an algorithm based on improved YOLOv8n is proposed. Firstly, a dilated reparameterization block (DRB) network is introduced to replace the original backbone network, and by expanding the receptive field, the DRB module can effectively capture key features at different scales, which significantly improves the feature extraction ability. Secondly, a lightweight fully convolutional one-stage object detector (FCOS) architecture is integrated into the detection head. Through shared convolution, the architecture significantly reduces the number of parameters, improves the localization and classification performance, and reduces the computational requirements. The total number of training rounds in the experiment is 100, and the mean average precision (mAP) of the improved model under the same hardware configuration reaches 81.2%, which is 1.9% higher than that of the benchmark model; the amount of computation (GFLOPs) and the weight file (model size) are reduced by 32.1% and 33.3%, respectively, and the latency is shortened by 71.53%, with an increase in detection speed of 35 f/s. The improved algorithm has some reference value in the driver's facial target detection task.