Advances in International Computer Science
Advances in International Computer Science. 2023; 3(3). DOI: 10.12208/j.aics.20230023.
He University (辽宁何氏医学院), Shenyang, Liaoning
*Corresponding author: Sun Zhihua (孙志华), He University (辽宁何氏医学院), Shenyang, Liaoning
This paper proposes a small-object recognition method based on adversarial training to improve model robustness in complex environments. The method uses YOLOv3 as the base framework, freezing the first few layers and updating only the subsequent layers. The model is trained on both real images and generated complex images so that it adapts to both types of data. Training uses a carefully tuned Adam optimizer with a small learning rate and a small batch size. Experimental results show that the method achieves a higher mAP than YOLOv3 on small-object datasets and high accuracy on complex test sets, indicating that adversarial training does enhance robustness. However, its speed drops to 0.7 times that of YOLOv3, because the adversarial images are more complex and require a longer forward-propagation time. Adversarial training can therefore significantly improve the robustness of small-object recognition models, but at the cost of reduced speed and increased dataset dependency. Further improvements to the model and training strategy are needed to limit these speed and dataset effects while maintaining robustness.
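The training recipe summarized above (frozen early layers, Adam with a small learning rate, small batches mixing real and generated images) can be illustrated with a minimal, framework-free sketch. This is not the authors' implementation: the toy "layers" are plain lists of floats, the gradients are supplied by the caller rather than computed from a YOLOv3 loss, and the names `make_model`, `mixed_batch`, `train_step`, the 50/50 real-to-generated split, and the default `lr=1e-4` are all assumptions made for illustration.

```python
import math
import random


def adam_step(params, grads, state, lr=1e-4, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one standard Adam update to a flat list of floats."""
    state["t"] += 1
    t = state["t"]
    updated = []
    for i, (p, g) in enumerate(zip(params, grads)):
        # Exponential moving averages of the gradient and its square.
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * g
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * g * g
        # Bias-corrected estimates.
        m_hat = state["m"][i] / (1 - beta1 ** t)
        v_hat = state["v"][i] / (1 - beta2 ** t)
        updated.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
    return updated


def make_model(num_layers=6, width=4, num_frozen=2, seed=0):
    """Hypothetical toy model: the first `num_frozen` layers are frozen."""
    rng = random.Random(seed)
    return [
        {
            "weights": [rng.uniform(-1.0, 1.0) for _ in range(width)],
            "frozen": i < num_frozen,
            "adam": {"t": 0, "m": [0.0] * width, "v": [0.0] * width},
        }
        for i in range(num_layers)
    ]


def mixed_batch(real, generated, batch_size, rng):
    """Assumed 50/50 mix: half the batch real, the rest generated."""
    half = batch_size // 2
    return rng.sample(real, half) + rng.sample(generated, batch_size - half)


def train_step(model, grads_per_layer, lr=1e-4):
    """Update only the trainable layers; frozen layers are skipped."""
    for layer, grads in zip(model, grads_per_layer):
        if layer["frozen"]:
            continue
        layer["weights"] = adam_step(layer["weights"], grads, layer["adam"], lr=lr)


# Usage: three steps with placeholder gradients on small mixed batches.
rng = random.Random(0)
model = make_model()
real_images = [f"real_{i}" for i in range(20)]
generated_images = [f"generated_{i}" for i in range(20)]
for _ in range(3):
    batch = mixed_batch(real_images, generated_images, batch_size=4, rng=rng)
    # Placeholder gradients; a real detector would backpropagate its loss on `batch`.
    grads = [[rng.uniform(-1.0, 1.0) for _ in layer["weights"]] for layer in model]
    train_step(model, grads)
```

Skipping frozen layers in the update loop keeps their weights exactly at their initial values, while the small learning rate limits how far the remaining layers move per step. Note that freezing alone does not reduce inference cost, which is consistent with the reported slowdown being attributed to the input images rather than to the training scheme.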