改进YOLOv11与多模态融合机制的钢材表面缺陷检测方法*

doi:10.16731/j.cnki.1671-3133.2026.04.013

现代制造工程 ›› 2026, Vol. 547 ›› Issue (4): 103-116.doi: 10.16731/j.cnki.1671-3133.2026.04.013

• 仪器仪表/检测/监控 • 上一篇下一篇

改进YOLOv11与多模态融合机制的钢材表面缺陷检测方法^*

徐慧玲¹, 尚冠宇²

1 开封大学信息工程学院,开封 475004;
2 郑州西亚斯学院电信与智能制造学院,郑州 451150

收稿日期:2025-05-28 发布日期:2026-05-07
通讯作者: 尚冠宇,硕士,副教授,主要研究方向为人工智能、计算机科学与技术。E-mail:sgyu80@126.com
作者简介:徐慧玲,硕士,讲师,主要研究方向为计算机应用技术与人工智能。E-mail:huilingx@126.com
基金资助:
^*河南省科技发展计划项目(252102110375);河南省专创融合特色示范课程项目(教办高(2024)144号-190);河南省教育厅河南省本科高校2023年度产教融合研究项目(教办高(2024)13号-重点项目57)

Steel surface defect detection method based on improved YOLOv11 and multimodal fusion mechanism

XU Huiling¹, SHANG Guanyu²

1 School of Information Engineering,Kaifeng University,Kaifeng 475004,China;
2 School of Telecommunications and Intelligent Manufacturing,Zhengzhou Sias University,Zhengzhou 451150,China

Received:2025-05-28 Published:2026-05-07

摘要/Abstract

摘要： 针对钢材表面缺陷检测中存在的复杂背景干扰和缺陷多样性等问题,提出了一种改进YOLOv11与多模态融合机制的钢材表面缺陷检测方法。首先,在YOLOv11的基础上设计了双流架构的网络模型,分别提取RGB图像流和深度图像流的多模态特征信息,并利用空洞卷积扩大深度图像的感受野,通过深度增强模块自适应调整各通道权重来强化钢材表面几何结构的感知能力;其次,利用自注意力机制增加对钢材缺陷关键区域的关注度,确保多模态信息有效融合;然后,设计了梯度感知区域平衡损失函数,通过引入梯度一致性损失和特征对齐损失,并将多个损失项加权求和,旨在提升钢材表面缺陷检测的精度、鲁棒性和适应性;最后,设计了自适应多模态协同量化策略,确保在不损失检测精度的同时提高双流网络模型的推理速度。实验结果表明,提出的改进方法与几种主流的缺陷检测算法相比具有更优的检测性能,且能够较好地平衡检测精度与速度,mAP为92.1 %,检测速度达到了48.3 FPS,较原始YOLOv11分别提升了6.7 %和11.8 FPS,并通过可视化结果展现出其能够适应复杂环境和缺陷多样性的检测任务,为钢材制造和加工领域的自动化质量检测提供了技术支撑。

关键词: 钢材表面缺陷检测, YOLOv11, 双流架构, 多模态

Abstract: To address complex background interference and defect diversity in steel surface defect detection,a steel surface defect detection method based on improved YOLOv11 and a multimodal fusion mechanism was proposed.Firstly,a dual-stream architecture network model was designed based on YOLOv11,which used to extract multi-modal feature information from RGB image stream and depth image stream,respectively. The dilated convolution was used to expand the receptive field of the depth image,and the depth enhancement module adaptively adjusted the weights of each channel to enhance the perception ability of the geometric structure of the steel surface. A self-attention mechanism was used to focus on critical defect areas and enhance the fusion of multimodal information.Next,the gradient aware region balance loss function was designed,which introduced gradient consistency loss and feature alignment loss,and weighted and summed multiple loss terms to improve the accuracy,robustness,and adaptability of steel surface defect detection. Finally,an adaptive multimodal collaborative quantization strategy was designed,which used to ensure improved inference speed of the dual-stream network model without sacrificing detection accuracy. The experimental results showed that the proposed improved method had better detection performance,compared to several mainstream defect detection algorithms,and could balance detection accuracy and speed well. The mAP reached 92.1 %,and the detection speed reached 48.3 FPS,showing improvements of 6.7 % and 11.8 FPS over the original YOLOv11,respectively. The visualization results demonstrated it′s adaptability to complex environments and diverse defect types,supporting automated quality inspection in steel manufacturing.

Key words: steel surface defect detection, YOLOv11, dual-stream architecture, multimodal

中图分类号:

TP391.4

徐慧玲, 尚冠宇. 改进YOLOv11与多模态融合机制的钢材表面缺陷检测方法^*[J]. 现代制造工程, 2026, 547(4): 103-116.

XU Huiling, SHANG Guanyu. Steel surface defect detection method based on improved YOLOv11 and multimodal fusion mechanism[J]. Modern Manufacturing Engineering, 2026, 547(4): 103-116.

参考文献

[1] 伍麟,郝鸿宇,宋友. 基于计算机视觉的工业金属表面缺陷检测综述[J]. 自动化学报,2024,50(7):1261-1283.
[2] KURSAT D,MUSTAFA A,MEHMET C,et al. Automated steel surface defect detection and classification using a new deep learning-based approach[J]. Neural Computing and Applications,2022,35(11):8389-8406.
[3] 宿磊,王立建,祁阳,等. 基于IADSA深度迁移网络的金属表面缺陷检测[J]. 机械工程学报,2023,59(24):46-55.
[4] LI Y H,HAN Z Y,WANG W M,et al. Steel surface defect detection based on sparse global attention transformer[J]. Pattern Analysis and Applications,2024,27(4):152.
[5] 李键,李华,胡翔坤,等. 基于深度学习的表面缺陷检测技术研究进展[J]. 计算机集成制造系统,2024,30(3):774-790.
[6] KUROMIZU M,MOTOYAMA A,KOMATSUBARA K,et al. Detection and location estimate of external surface defects using velocity effects from double-sided quadrupole permanent magnets[J]. IEEE Transactions on Magnetics,2024,60(12):1-4.
[7] 胡广华,涂千禧. 基于光度立体和双流特征融合网络的工业产品表面缺陷检测方法[J]. 华南理工大学学报(自然科学版),2024,52(10):112-123.
[8] DUTTA C,SAGAR S P,KUMAR A,et al. An adaptive sampling protocol for real-time defect assessment using eddy current sensor and machine learning algorithm[J]. IEEE Transactions on Industry Applications,2023,59(5):5682-5690.
[9] SANTO E A,KHOR W,CIAMPA F. Statistical and machine learning-based imaging with long pulse thermography for the detection of non-standardised defects in CFRP composites[J]. Journal of Nondestructive Evaluation,2024,44(1):6.
[10] 梁海波,王怡,贾武升. 基于机器学习的天然气钢质管道缺陷检测方法研究[J]. 安全与环境学报,2023,23(10):3528-3537.
[11] 左才,张勇斌,齐元胜,等. 基于机器视觉的印刷品表面划痕缺陷检测[J]. 印刷与数字媒体技术研究,2023(5):42-48.
[12] LEE S H. A study on Cascade R-CNN-based dangerous goods detection using X-Ray image[J]. Computers,Materials & Continua,2022,73(2):4245-4260.
[13] 林珊玲,彭雪玲,王栋,等. 多尺度增强特征融合的钢表面缺陷目标检测[J]. 光学精密工程,2024,32(7):1075-1086.
[14] 王伟家,张宇,王京华,等. 基于改进RetinaNet的轻量化钢材表面缺陷检测算法[J]. 模式识别与人工智能,2024,37(8):692-702.
[15] 唐孝育,孙明革. 基于改进SSD模型的手机盖板玻璃缺陷检测[J]. 吉林化工学院学报,2023,40(9):70-74.
[16] SELAMET F,CAKAR S,KOTAN M. Automatic detection and classification of defective areas on metal parts by using adaptive fusion of Faster R-CNN and shape from shading[J]. IEEE Access,2022,10:126030-126038.
[17] 周双喜,袁海强,邓芳明. 基于改进Mask R-CNN钢纤维混凝土裂缝检测模型[J]. 华东交通大学学报,2021,38(6):37-45.
[18] CHOWDHURY S A,TAUFIQUE M F N,WANG J,et al. Automated grain boundary (gb) segmentation and microstructural analysis in 347h stainless steel using deep learning and multimodal microscopy[J]. Integrating Materials and Manufacturing Innovation,2024,13(1):244-256.
[19] 张李辉,刘紫燕. 结合YOLOv8和多模态特征融合的3D目标检测算法[J]. 国外电子测量技术,2024,43(12):91-98.
[20] IKECHUKWU S,AKIN E. High performance network for detection of surface defects on hot-rolled steel strips based on an optimized Yolo V3[C]//Proc. of the 9th International Conference on Electrical and Electronics Engineering (ICEEE). Alanya,Turkey:ICEEE,2022:1-6.
[21] RIZVI Z S,JAMIL M,HUANG W. Enhanced defect detection on wind turbine blades using binary segmentation masks and YOLO[J]. Computers and Electrical Engineering,2024,120(PA):109615.
[22] DEEPTI R G,PRABADEVI B. MoL-YOLOv7:Streamlining industrial defect detection with an optimized YOLOv7 approach[J]. IEEE Access,2024,12:117090-117101.
[23] ZOU J L,WANG H C. Steel surface defect detection method based on improved YOLOv9 network[J]. IEEE Access,2024,12:124160-124170.
[24] LIU T Q. Enhanced zero-shot YOLOv10 for multi-class tiny-object detection of steel surface defects[C]//Proc. of the 6th International Conference on Robotics and Computer Vision (ICRCV). Piscataway,NJ:IEEE Press,2024:44-52.
[25] LIU B Q,LI X F. An improved YOLOv11 model for detecting the metal roofing tiles alongside the railways[C]//Proc. of the 4th International Conference on Artificial Intelligence,Robotics,and Communication (ICAIRC). Piscataway,NJ:IEEE Press,2024:195-199.

改进YOLOv11与多模态融合机制的钢材表面缺陷检测方法^*

Steel surface defect detection method based on improved YOLOv11 and multimodal fusion mechanism

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 1

编辑推荐

Metrics