LI Jun, ZHOU Keyu, ZOU Jun, ZENG Wenbing
Abstract:
In view of the problems of protective equipment detection, such as information interference, uneven illumination and occlusion in the construction scene, a lightweight algorithm with improved YOLOv8n was proposed, which was called YOLO-LA . Firstly, the weighted bidirectional feature pyramid network BiFPN was introduced into the neck, and the underlying details and high-level semantic information were improved through multi-path interactive fusion, the multi-scale feature fusion performance was enhanced, and the detection accuracy of the model for small targets in complex scenes was improved. Secondly, the C2f-ContextGuided module was used to transform the backbone network in the baseline model, and the global context information was used to calculate the weight vector, to refine the joint features of the local features and the surrounding context features, so as to improve the feature extraction ability of the model and reduce the complexity of the model. Then, a new LSCD lightweight detection head was proposed, which used shared convolution to reduce the number of parameters and computations of the model. Finally, EIoU was used to replace the original CIoU, and the border regression was optimized, and the convergence speed and regression accuracy of the algorithm were improved. Compared with the baseline model YOLOv8n, the number of parameters, the amount of computation, and the size of the model were reduced by 61.5%, 43.2% and 58.7%, respectively, and the mAP@0.5 was increased by 1.4 percentage points, and the FPS was 253 frames/s, which could meet the requirements of real-time, accuracy and lightweight of protective equipment wearing detection.