LI Y T, Zhu W K, Li H J, et al. Image recognition of complex transmission lines based on lightweight encoder-decoder networks[J]. Opto-Electron Eng, 2024, 51(10): 240158. doi: 10.12086/oee.2024.240158
Citation: LI Y T, Zhu W K, Li H J, et al. Image recognition of complex transmission lines based on lightweight encoder-decoder networks[J]. Opto-Electron Eng, 2024, 51(10): 240158. doi: 10.12086/oee.2024.240158

Image recognition of complex transmission lines based on lightweight encoder-decoder networks

    Fund Project: Special Funds for Basic Scientific Research Business Expenses of Zhejiang Provincial Universities (2020YW29)
More Information
  • To address the problems of too many parameters and much time consumption in existing recognition networks for transmission lines, a lightweight encoder-decoder network is constructed to discern complex transmission line images featured with multiple intersections quickly and accurately. The encoder is based on the first 16 layers of conventional MobileNetV3 to reduce network parameters. The convolutional block attention module is used to replace the squeeze and excitation attention module to improve the network's ability to extract the feature information of transmission lines. The decoder is constructed by combining deeply separable convolution and deep atrous spatial pyramid pooling to expand the receptive field and improve the network's ability to aggregate contextual information with different scales. Moreover, the training network is sparse by using the L1 regularization method. The pruning threshold is determined according to the product of the scaling factor and the corresponding output of each channel to remove redundant channels and compress the network effectively, which improves the recognition speed of transmission lines. Experimental results demonstrate that the mean pixel accuracy, mean intersection over union , and recognition speed of the lightweight encoder-decoder network are 92.11%, 84.19%, and 41 frames per second, respectively, which are better than PSPNet, U2Net, and existing improved transmission lines recognition networks.
  • 加载中
  • [1] 樊星, 李小彭, 彭健文, 等. 可变重心线路巡检机器人摆振分析及抑制[J]. 振动与冲击, 2024, 43(2): 323−333. doi: 10.13465/j.cnki.jvs.2024.02.035

    CrossRef Google Scholar

    Fan X, Li X P, Peng J W, et al. Pendulum vibration analysis and suppression of a power transmission line inspection robot with variable center of gravity[J]. J Vib Shock, 2024, 43(2): 323−333. doi: 10.13465/j.cnki.jvs.2024.02.035

    CrossRef Google Scholar

    [2] Xu B Y, Zhao Y L, Wang T, et al. Development of power transmission line detection technology based on unmanned aerial vehicle image vision[J]. SN Appl Sci, 2023, 5(3): 72. doi: 10.1007/S42452-023-05299-7

    CrossRef Google Scholar

    [3] 刘传洋, 吴一全. 基于深度学习的输电线路视觉检测方法研究进展[J]. 中国电机工程学报, 2023, 43(19): 7423−7445. doi: 10.13334/j.0258-8013.pcsee.221139

    CrossRef Google Scholar

    Liu C Y, Wu Y Q. Research progress of vision detection methods based on deep learning for transmission lines[J]. Proc CSEE, 2023, 43(19): 7423−7445. doi: 10.13334/j.0258-8013.pcsee.221139

    CrossRef Google Scholar

    [4] 赵延峰, 胡耀垓, 王先培, 等. 复杂场景下的电力线自动提取[J]. 测绘通报, 2021, (8): 1−6. doi: 10.13474/j.cnki.11-2246.2021.0231

    CrossRef Google Scholar

    Zhao Y F, Hu Y G, Wang X P, et al. Automatic power line extraction algorithm in complex scene[J]. Bull Surv Mapp, 2021, (8): 1−6. doi: 10.13474/j.cnki.11-2246.2021.0231

    CrossRef Google Scholar

    [5] 陈雪云, 夏瑾, 杜珂. 基于多线型特征增强网络的架空输电线检测[J]. 浙江大学学报(工学版), 2021, 55(12): 2382−2389. doi: 10.3785/j.issn.1008-973X.2021.12.019

    CrossRef Google Scholar

    Chen X Y, Xia J, Du K. Overhead transmission line detection based on multiple linear-feature enhanced detector[J]. J Zhejiang Univ (Eng Sci), 2021, 55(12): 2382−2389. doi: 10.3785/j.issn.1008-973X.2021.12.019

    CrossRef Google Scholar

    [6] 李运堂, 李恒杰, 张坤, 等. 基于新型编码解码网络的复杂输电线识别[J]. 浙江大学学报(工学版), 2024, 58(6): 1133−1141. doi: 10.3785/j.issn.1008-973X.2024.06.004

    CrossRef Google Scholar

    Li Y T, Li H J, Zhang K, et al. Recognition of complex power lines based on novel encoder-decoder network[J]. J Zhejiang Univ (Eng Sci), 2024, 58(6): 1133−1141. doi: 10.3785/j.issn.1008-973X.2024.06.004

    CrossRef Google Scholar

    [7] Jaffari R, Hashmani M A, Reyes-aldasoro C C. A novel focal phi loss for power line segmentation with auxiliary classifier U-Net[J]. Sensors, 2021, 21(8): 2803. doi: 10.3390/S21082803

    CrossRef Google Scholar

    [8] Chen G K, Hao K, Wang B B, et al. A power line segmentation model in aerial images based on an efficient multibranch concatenation network[J]. Expert Syst Appl, 2023, 228: 120359. doi: 10.1016/j.eswa.2023.120359

    CrossRef Google Scholar

    [9] Yang L, Fan J F, Huo B Y, et al. PLE-Net: automatic power line extraction method using deep learning from aerial images[J]. Expert Syst Appl, 2022, 198: 116771. doi: 10.1016/j.eswa.2022.116771

    CrossRef Google Scholar

    [10] 许刚, 李果. 轻量化航拍图像电力线语义分割[J]. 中国图象图形学报, 2021, 26(11): 2605−2618. doi: 10.11834/jig.200690

    CrossRef Google Scholar

    Xu G, Li G. Research on lightweight neural network of aerial powerline image segmentation[J]. J Image Graphics, 2021, 26(11): 2605−2618. doi: 10.11834/jig.200690

    CrossRef Google Scholar

    [11] 陈梅林, 王逸舟, 戴彦, 等. SaSnet: 基于自监督学习的电力线实时分割网络[J]. 中国电机工程学报, 2022, 42(4): 1365−1374. doi: 10.13334/j.0258-8013.pcsee.210504

    CrossRef Google Scholar

    Chen M L, Wang Y Z, Dai Y, et al. Small and strong: power line segmentation network in real time based on self-supervised learning[J]. Proc CSEE, 2022, 42(4): 1365−1374. doi: 10.13334/j.0258-8013.pcsee.210504

    CrossRef Google Scholar

    [12] Gao Z S, Yang G D, Li E, et al. Efficient parallel branch network with multi-scale feature fusion for real-time overhead power line segmentation[J]. IEEE Sensors J, 2021, 21(10): 12220−12227. doi: 10.1109/JSEN.2021.3062660

    CrossRef Google Scholar

    [13] Zhu K J, Xu C H, Wei Y C, et al. Fast-PLDN: fast power line detection network[J]. J Real-Time Image Proc, 2022, 19(1): 3−13. doi: 10.1007/s11554-021-01154-3

    CrossRef Google Scholar

    [14] Han G J, Zhang M, Li Q, et al. A lightweight aerial power line segmentation algorithm based on attention mechanism[J]. Machines, 2022, 10(10): 881. doi: 10.3390/MACHINES10100881

    CrossRef Google Scholar

    [15] Zhu F W, Sun Y, Zhang Y Q, et al. An improved MobileNetV3 mushroom quality classification model using images with complex backgrounds[J]. Agronomy, 2023, 13(12): 2924. doi: 10.3390/agronomy13122924

    CrossRef Google Scholar

    [16] Yang K, Zhang Y, Zhang X, et al. YOLOX with CBAM for insulator detection in transmission lines[J]. Multimed Tools Appl, 2024, 83(14): 43419−43437. doi: 10.1007/s11042-023-17245-1

    CrossRef Google Scholar

    [17] Tang G, Zhao H R, Claramunt C, et al. PPA-Net: pyramid pooling attention network for multi-scale ship detection in SAR images[J]. Remote Sens, 2023, 15(11): 2855. doi: 10.3390/rs15112855

    CrossRef Google Scholar

    [18] Saida S J, Sahoo S P, Ari S. Deep convolution neural network based semantic segmentation for ocean eddy detection[J]. Expert Syst Appl, 2023, 219: 119646. doi: 10.1016/j.eswa.2023.119646

    CrossRef Google Scholar

  • Transmission line inspection is a key link in the regular maintenance of the power grid, which is very important to ensure the safe and stable operation of the power system. There are many problems with manual inspection, such as high cost, low efficiency, and high risk due to geographical influence. In contrast, unmanned aerial vehicle (UAV) intelligent inspection has advantages such as low cost, high efficiency, and good maneuverability, and has experienced rapid development in recent years. The important premise of realizing intelligent inspection of UAVs is to identify transmission lines quickly and accurately in complex backgrounds. At present, there are two main methods for transmission line recognition: conventional image processing and intelligent image processing based on deep learning. However, conventional image processing for identifying transmission lines is susceptible to background interference, which leads to low recognition accuracy of wrong detection or missed detection; Intelligent image recognition based on deep learning still has some problems, such as many network parameters, long time consumption and difficulty in deploying end devices. To solve these problems, a lightweight encoder-decoder network is constructed to realize fast and accurate identification of multiple intersecting complex transmission lines. The encoder is based on the first 16 layers of conventional MobileNetV3, reduces network parameters, and uses convolutional block attention modules to replace the squeezing and excitation attention modules of conventional MobileNetV3 networks, improving the network's ability to extract transmission line feature information. Combining deeply separable convolution and deep atrous spatial pyramid pooling, a decoder is constructed to expand the receptive field and improve the ability of the network to aggregate context information of different scales. Utilizing L1 regularization for sparse training of the network, setting pruning thresholds based on the product of scaling factors and corresponding channel outputs, effectively removes redundant channels from the network, compressing network volume and improving transmission line recognition speed. Experimental results show that the mean pixel accuracy, mean intersection over union, and the recognition speed of the lightweight encoder-decoder network are 92.11%, 84.19%, and 41 frames/sec, respectively, which are better than those of PSPNetU2Net and the improved transmission lines recognition network is helpful for the deployment of end equipment.

  • 加载中
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Figures(10)

Tables(5)

Article Metrics

Article views() PDF downloads() Cited by()

Access History
Article Contents

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint