Liang L M, Chen K Q, Chen L J, et al. Improving the lightweight FCM-YOLOv8n for steel surface defect detection[J]. Opto-Electron Eng, 2025, 52(2): 240280. doi: 10.12086/oee.2025.240280
Citation: Liang L M, Chen K Q, Chen L J, et al. Improving the lightweight FCM-YOLOv8n for steel surface defect detection[J]. Opto-Electron Eng, 2025, 52(2): 240280. doi: 10.12086/oee.2025.240280

Improving the lightweight FCM-YOLOv8n for steel surface defect detection

    Fund Project: National Natural Science Foundation of China (51365017, 61463018), Natural Science Foundation of Jiangxi Province (20192BAB205084), and Jiangxi Provincial Department of Education Science and Technology Research Youth Project (GJJ2200848)
More Information
  • In response to the deficiencies of existing steel surface defect detection algorithms in terms of resource consumption, detection accuracy, and efficiency, a lightweight steel defect detection algorithm based on YOLOv8n (FCM-YOLOv8n) is proposed. First, a frequency-aware feature fusion network is utilized to efficiently extract and integrate high-frequency information, reducing computational costs while enhancing detection speed. Second, a lightweight feature interaction module (Cc-C2f) is restructured to effectively preserve spatial and channel dependencies while reducing feature redundancy, thereby lowering model parameters and computational complexity. Finally, a multi-spectrum attention mechanism is applied to mitigate feature information loss in the frequency domain, improving the accuracy of detecting complex defects. Experimental results on the Severstal and NEU-DET steel defect datasets show that, compared to YOLOv8n, the FCM-YOLOv8n algorithm achieves a 2.2% and 1.5% improvement in mAP@0.5, respectively, with a 0.5 M and 1.5 G reduction in parameters and computational complexity. The FPS reaches 143 f/s and 154 f/s, respectively, demonstrating excellent real-time performance. The algorithm achieves an optimal balance between detection accuracy, computational cost, and efficiency, providing robust support for edge device applications. Further validation on the GC10-DET dataset shows a 2.9% improvement in mAP@0.5 compared to the baseline model, fully demonstrating the algorithm's exceptional generalization ability.
  • 加载中
  • [1] 黄硕清, 黄金贵. 基于RFB和YOLOv5特征增强融合改进的钢材缺陷检测方法[J]. 计算机工程, 2024. doi: 10.19678/j.issn.1000-3428.0068476

    CrossRef Google Scholar

    Huang S Q, Huang J G. Improved steel defect detection method based on enhanced fusion of RFB and YOLOv5 features[J]. Comput Eng, 2024. doi: 10.19678/j.issn.1000-3428.0068476

    CrossRef Google Scholar

    [2] 梁礼明, 龙鹏威, 卢宝贺, 等. 改进GBS-YOLOv7t的钢材表面缺陷检测[J]. 光电工程, 2024, 51(5): 240044. doi: 10.12086/oee.2024.240044

    CrossRef Google Scholar

    Liang L M, Long P W, Lu B H, et al. Improvement of GBS-YOLOv7t for steel surface defect detection[J]. Opto-Electron Eng, 2024, 51(5): 240044. doi: 10.12086/oee.2024.240044

    CrossRef Google Scholar

    [3] 梁礼明, 龙鹏威, 冯耀, 等. 改进轻量化VTG-YOLOv7-tiny的钢材表面缺陷检测[J]. 光学 精密工程, 2024, 32(8): 1227−1240. doi: 10.37188/OPE.20243208.1227

    CrossRef Google Scholar

    Liang L M, Long P W, Feng Y, et al. Improving the lightweight VTG-YOLOv7-tiny for steel surface defect detection[J]. Opt Precis Eng, 2024, 32(8): 1227−1240. doi: 10.37188/OPE.20243208.1227

    CrossRef Google Scholar

    [4] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014: 580–587. https://doi.org/10.1109/CVPR.2014.81.

    Google Scholar

    [5] Liu W, Anguelov D, Erhan D, et al. SSD: single shot multibox detector[C]//Proceedings of the 14th European Conference on Computer Vision–ECCV 2016, 2016: 21–37. https://doi.org/10.1007/978-3-319-46448-0_2.

    Google Scholar

    [6] Reis D, Kupec J, Hong J, et al. Real-time flying object detection with YOLOv8[Z]. arXiv: 2305.09972, 2023. https://doi.org/10.48550/arXiv.2305.09972.

    Google Scholar

    [7] Wang X Q, Gao H B, Jia Z M, et al. BL-YOLOv8: an improved road defect detection model based on YOLOv8[J]. Sensors, 2023, 23(20): 8361. doi: 10.3390/s23208361

    CrossRef Google Scholar

    [8] Zeng S, Yang W Z, Jiao Y Y, et al. SCA-YOLO: a new small object detection model for UAV images[J]. Vis Comput, 2024, 40(3): 1787−1803. doi: 10.1007/s00371-023-02886-y

    CrossRef Google Scholar

    [9] 李刚, 邵瑞, 周鸣乐, 等. 基于注意力的轻量级工业产品缺陷检测网络[J]. 计算机工程, 2023, 49(11): 275−283. doi: 10.19678/j.issn.1000-3428.0066270

    CrossRef Google Scholar

    Li G, Shao R, Zhou M L, et al. Lightweight industrial products defect detection network based on attention[J]. Comput Eng, 2023, 49(11): 275−283. doi: 10.19678/j.issn.1000-3428.0066270

    CrossRef Google Scholar

    [10] 刘毅, 蒋三新. 基于改进YOLOX的钢材表面缺陷检测研究[J]. 现代电子技术, 2024, 47(9): 131−138. doi: 10.16652/j.issn.1004-373x.2024.09.024

    CrossRef Google Scholar

    Liu Y, Jiang S X. Steel surface defect detection algorithm based on improved YOLOX[J]. Mod Electron Tech, 2024, 47(9): 131−138. doi: 10.16652/j.issn.1004-373x.2024.09.024

    CrossRef Google Scholar

    [11] 马冬梅, 朱佳浩. 面向热轧带钢表面缺陷检测的YOLOv5算法优化分析[J]. 制造技术与机床, 2024, (6): 153−160. doi: 10.19287/j.mtmt.1005-2402.2024.06.023

    CrossRef Google Scholar

    Ma D M, Zhu J H. The optimization of YOLOv5 algorithm for detecting surface defects on hot rolled strips[J]. Manuf Technol Mach Tool, 2024, (6): 153−160. doi: 10.19287/j.mtmt.1005-2402.2024.06.023

    CrossRef Google Scholar

    [12] 徐薪羽, 沈通, 吕佳. 基于改进YOLOv8算法的钢材表面缺陷检测[J]. 自动化应用, 2024, 65(15): 6−10. doi: 10.19769/j.zdhy.2024.15.002

    CrossRef Google Scholar

    Xu X Y, Shen T, Lv J. Steel Surface Defect Detection Based On Improved YOLOv8 algorithm[J]. Autom Appl, 2024, 65(15): 6−10. doi: 10.19769/j.zdhy.2024.15.002

    CrossRef Google Scholar

    [13] Chen L W, Fu Y, Gu L, et al. Frequency-aware feature fusion for dense image prediction[J]. IEEE Trans Pattern Anal Mach Intell, 2024, 46(12): 10763−10780. doi: 10.1109/TPAMI.2024.3449959

    CrossRef Google Scholar

    [14] Zhang T F, Li L, Zhou Y, et al. CAS-ViT: convolutional additive self-attention vision transformers for efficient mobile applications[Z]. arXiv: 2408.03703, 2024. https://doi.org/10.48550/arXiv.2408.03703.

    Google Scholar

    [15] Shi D. TransNeXt: robust foveal visual perception for vision transformers[C]//Proceedings of 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024: 17773–17783. https://doi.org/10.1109/CVPR52733.2024.01683.

    Google Scholar

    [16] Qin Z Q, Zhang P Y, Wu F, et al. FcaNet: frequency channel attention networks[C]//Proceedings of 2021 IEEE/CVF International Conference on Computer Vision, 2021: 783–792. https://doi.org/10.1109/ICCV48922.2021.00082.

    Google Scholar

    [17] Yeung C C, Lam K M. Efficient fused-attention model for steel surface defect detection[J]. IEEE Trans Instrum Meas, 2022, 71: 2510011. doi: 10.1109/TIM.2022.3176239

    CrossRef Google Scholar

    [18] Wang X, Zhuang K Y. An improved YOLOX method for surface defect detection of steel strips[C]//Proceedings of 2023 IEEE 3rd International Conference on Power, Electronics and Computer Applications, 2023: 152–157. https://doi.org/10.1109/ICPECA56706.2023.10075827.

    Google Scholar

  • In response to the deficiencies of existing steel surface defect detection algorithms in terms of resource consumption, detection accuracy, and efficiency, a lightweight steel defect detection algorithm based on YOLOv8n (FCM-YOLOv8n) is proposed. This algorithm incorporates three principal innovative elements. First, a frequency-aware feature fusion network is utilized to efficiently extract and integrate high-frequency information, reducing computational costs while enhancing detection speed. This network ingeniously integrates an adaptive low-pass filter generator (ALPF), an offset generator, and an adaptive high-pass filter generator (AHPF). The ALPF generator forecasts spatially-variant low-pass filters, which serve to attenuate high-frequency constituents within objects, thereby diminishing intra-class disparities during the up-sampling procedure. The offset generator plays a pivotal role in refining pronounced inconsistent features and tenuous boundaries. It achieves this by substituting inconsistent elements with more congruous ones via resampling. Meanwhile, the AHPF generator functions to augment the high-frequency detailed boundary information that is otherwise lost during down-sampling. Collectively, this fusion paradigm substantially augments feature consistency and sharpens object boundaries. Secondly, a lightweight feature interaction module (Cc-C2f) is restructured to effectively preserve spatial and channel dependencies while reducing feature redundancy, lowering model parameters and computational complexity. The Cc-C2f module integrates the lightweight convolutional additive self-attention mechanism (CDSA) and the lightweight convolutional gated linear unit (CGLU). The CDSA module takes into account both channel and spatial information, and employs fast linear transformation to reduce the number of model parameters and computational complexity. The CGLU module combines local and global information to enhance the network's representational ability. Finally, a multi-spectrum attention mechanism is applied to mitigate feature information loss in the frequency domain, improving the accuracy of detecting complex defects. Experimental results on the Severstal and NEU-DET steel defect datasets show that, compared to YOLOv8n, the FCM-YOLOv8n algorithm achieves a 2.2% and 1.5% improvement in mAP@0.5, respectively, with a 0.5 M and 1.5 G reduction in parameters and computational complexity. The FPS reaches 143 f/s and 154 f/s, respectively, demonstrating excellent real-time performance. The algorithm achieves an optimal balance between detection accuracy, computational cost, and efficiency, providing robust support for edge device applications. Further validation on the GC10-DET dataset shows a 2.9% improvement in mAP@0.5 compared to the baseline model, demonstrating the algorithm's exceptional generalization ability. Through comparative analysis with disparate algorithms, the superiority of the proposed algorithm's performance is further accentuated.

  • 加载中
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Figures(10)

Tables(4)

Article Metrics

Article views() PDF downloads() Cited by()

Access History

Other Articles By Authors

Article Contents

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint