长沙理工大学学报(自然科学版)
基于双目BEV与改进YOLOv 8的路面裂缝识别方法
CSTR:
作者:
作者单位:

(1. 长沙理工大学 土木与环境工程学院 ,湖南 长沙 410114;2. 湖南水利水电勘测设计规划研究总院有限公司 ,湖南 长沙 410007;3. 湖南拓达结构监测技术有限公司 ,湖南 长沙 410017;4. 天津津港建设有限公司 ,天津 滨海 300456)

作者简介:

通讯作者:

谢海波(1980—)(ORCID:0009-0005-9863-4080),男,讲师,主要从事计算机视觉方面的研究。E-mail:xiehaibo@csust.edu.cn

中图分类号:

U418.6;TP391.4

基金项目:

国家自然科学基金项目(52478495);湖南省自然科学基金项目(2022JJ50324)


A road crack identification method based on stereo BEV and improved YOLOv 8
Author:
Affiliation:

(1. School of Civil and Environmental Engineering , Changsha University of Science & Technology , Changsha 410114,China;2. Hunan Water Resources and Hydropower Survey , Design, Planning and Research Co ., Ltd., Changsha 410007, China;3. Hunan Tuoda Structural Monitoring Technology Co ., Ltd., Changsha 410017, China;4. Tianjin Jingang Construction Company Limited ,Tianjin 300456, China)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    【目的】针对路面裂缝检测过程中目标尺度不一致、全局特征建模能力不足的问题,提出一种基于双目鸟瞰图 (bird’s eye view,BEV)与改进 YOLOv 8模型的路面裂缝识别方法,旨在实现高效、精准的裂缝检测与分割。【方法】首先,通过双目立体视觉与逆透视变换技术生成高精度 BEV图像,解决传统视角下目标尺度不一致问题;其次,提出的 C2f-DRR模块利用区域残差化 -语义残差化的解耦策略有效捕获裂缝的多尺度上下文信息,通过大核卷积与小核空洞卷积协同作用丰富图像的细节信息并减少背景干扰;最后,引入上下文锚点注意力机制,使模型能够动态聚焦裂缝中心区域,并实现对远距离像素间的长程依赖关系的建模。【结果】为验证改进模型的有效性,在测试集上进行了对比试验。改进后模型的平均精度均值 MAP50达83.7%,准确率 P达83.9%,F1分数达 83.5%,较原始的 YOLOv 8n模型的分别提升 4.4、4.0、1.8个百分点。在公开数据集UAV -PDD 2023上验证的 MAP50达70.5%,召回率 R达64.8%,准确度 P达74.1%,较原模型的分别提升了 3.5、4.5、0.6个百分点。改进模型在识别精度、鲁棒性、泛化学习能力方面均优于原始模型。【结论】本研究提出的基于双目 BEV视角的裂缝分割方法有效提升了模型在复杂道路场景下的检测精度与泛化能力,为自动化路面病害检测提供了技术支持。

    Abstract:

    [Purposes ] This study aims to address the issues of inconsistent target scale and insufficient global feature modeling in road crack detection,proposing a road crack identification method based on binocular bird ’s eye view (BEV) and an improved YOLOv 8 model to achieve efficient and accurate crack detection and segmentation.[Methods] High -precision BEV images were generated using binocular stereo vision and inverse perspective mapping (IPM) technology to resolve the scale inconsistency problem in traditional perspectives.The proposed C 2f-DRR module captured multi -scale contextual information of cracks effectively by employing a region -residual and semantic -residual decoupling strategy.It combined large kernel convolutions with small kernel dilated convolutions to enrich image detail and reduce background interference.Additionally,a context anchor point attention mechanism was introduced to dynamically focus the model on the central region of cracks and achieve model long -range dependencies between distant pixels.[Findings] To verify the effectiveness of the improved model,comparative experiments are conducted on the test set.The improved model achieves a mean average precision (MAP50) of 83.7%,accuracy (P) of 83.9%,and F1 score of 83.5%,with improvements of 4.4,4.0,and 1.8 percentage points,respectively,over the original YOLOv 8n model.The model is also tested on the publicly available UAV -PDD 2023 dataset,achieving an MAP50 of 70.5%,recall (R) of 64.8%,and accuracy (P) of 74.1%,with improvements of 3.5,4.5,and 0.6 percentage points,respectively.The improved model outperforms the original model in identification accuracy,robustness,and generalization ability.[Conclusions ] The proposed BEV -based crack segmentation method effectively enhances detection accuracy and generalization in complex road environments,providing reliable technical support for automated road damage detection.

    参考文献
    相似文献
    引证文献
引用本文

谢海波,邱杨航,黄莹颖,等.基于双目BEV与改进YOLOv 8的路面裂缝识别方法[J].长沙理工大学学报(自然科学版),2025,22(5):1-16.
XIE Haibo, QIU Yanghang, HUANG Yingying, et al. A road crack identification method based on stereo BEV and improved YOLOv 8[J]. Journal of Changsha University of Science & Technology (Natural Science),2025,22(5):1-16.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-03-04
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-11-27
  • 出版日期:
文章二维码