Segmentación automática de grietas en pavimento asfáltico mediante redes neuronales convolucionales U-Net

Luz Rashell Soto Olguín; Fredy Gabriel Ramírez Villanueva; Héctor Ramiro Estigarribia Barreto

doi:10.62544/ucomscientia.v3i2.59

Authors

Luz Rashell Soto Olguín Universidad Nacional de Caaguazú, Facultad de Ciencias y Tecnologías, Grupo de Investigación en Ciencia de Datos. Coronel Oviedo, Paraguay. https://orcid.org/0009-0001-3938-2902
Fredy Gabriel Ramírez Villanueva Universidad Nacional de Caaguazú, Facultad de Ciencias y Tecnologías, Grupo de Investigación en Ciencia de Datos. Coronel Oviedo, Paraguay. https://orcid.org/0009-0000-9172-6496
Héctor Ramiro Estigarribia Barreto Universidad Nacional de Caaguazú, Facultad de Ciencias y Tecnologías, Grupo de Investigación en Ciencia de Datos. Coronel Oviedo, Paraguay. https://orcid.org/0000-0002-2954-6053

DOI:

https://doi.org/10.62544/ucomscientia.v3i2.59

Keywords:

Redes neuronales convolucionales, visión artificial, pavimento asfáltico, segmentación de imágenes, mantenimiento vial

Abstract

This article presents the development of an automatic crack segmentation system for asphalt pavement using convolutional neural networks, specifically the U-Net architecture. A proprietary dataset was built with 847 images captured in the city of Coronel Oviedo, Paraguay, of which 505 contained cracks. The images were manually labeled and processed at a resolution of 256x256 pixels. The model was trained for 500 epochs using the binary crossentropy loss function and the Adam optimizer. Outstanding performance metrics were obtained, with an F1 Score of 0.9956 and an Intersection over Union (IoU) index of 0.9913. Additionally, post-segmentation processing was integrated for crack width quantification. In summary, these results demonstrate the high precision and robustness of the developed system.

References

Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., ... Zheng, X. (2016).Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv. https://doi.org/10.48550/ARXIV.1603.04467

Ali, L., AlJassmi, H., Swavaf, M., Khan, W. y Alnajjar, F. (2024). Rs-net: Arquitectura de red U-Net Sharp residual para la segmentación y evaluación de la severidad de grietas en pavimentos. Journal of Big Data,11(1), 116. https://doi.org/10.1186/s40537-024-00981-y

Alzamora, P. (2023). Segmentación semántica de imágenes con Deep Learning. Data Machine Learning Visualization. https://blog.damavis.com/segmentacion-semantica-de-imagenes-con-deep-learning

Astute Analytica. (2024). Road Maintenance Market to Drive Fast to Reach Valuation of USD 23.39 Billion By 2032. GlobeNewsWire: https://www.globenewswire.com/news-release/2024/05/07/2876916/0/en/Road-Maintenance-Market-to-Drive-Fast-to-Reach-Valuation-of-USD-23-39-Billion-By-2032-Astute-Analytica.html

Bradski, G. (2000). The OpenCV Library. Dr. Dobb's Journal of Software Tools. https://drdobbs.com/open-source/the-opencv-library/184404319

Cao, H., Wang, Y., Chen, J., Jiang, D., Zhang, X., Tian, Q. y Wang, M. (2023). Swin-unet: Transformador puro tipo Unet para la segmentación de imágenes médicas. En L. Karlinsky, T. Michaeli y K. Nishino (Eds.),Visión por Computador –Talleres ECCV 2022(pp. 205-218). Springer Nature Suiza. https://doi.org/10.1007/978-3-031-25066-8_9

Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. y Adam, H. (2018). Codificador-decodificador con convolución separable de alto rendimiento para la segmentación semántica de imágenes. En V. Ferrari, M. Hebert, C. Sminchisescu y Y. Weiss (Eds.),Visión por Computador –ECCV 2018(pp. 833-851). Springer International Publishing. https://doi.org/10.1007/978-3-030-01234-2_49

Chen, S., Feng, Z., Xiao, G., Chen, X., Gao, C., Zhao, M. y Yu, H. (2024). Detección de grietas en pavimento basada en el modelo swin-unet mejorado.Buildings,14(5), 1442. https://doi.org/10.3390/buildings14051442

Clark, J. (2024). Pillow (PIL Fork) Documentation: Version 10.4.0. https://pillow.readthedocs.io/en/stable/

Cruz, G., Riobó, A., Pfeifer, M. y Duarte, D. (2024). IA desde los cimientos: Desafíos y oportunidades en el contexto de América Latina y el Caribe. Banco Interamericano de Desarrollo. https://doi.org/10.18235/0013275DataScientest. (2021). Convolutional Neural Network: definición y funcionamiento. https://datascientest.com/es/convolutional-neural-network-es

Di Benedetto, A., Fiani, M. y Gujski, L. M. (2023). Arquitectura de CNN basada en U-net para la segmentación de grietas en carreteras.Infraestructuras,8(5), 90. https://doi.org/10.3390/infrastructures8050090

Dong, C., Li, L., Yan, J., Zhang, Z., Pan, H. y Catbas, F. N. (2021). Segmentación de grietas por fatiga a nivel de píxel en imágenes a gran escala de estructuras de acero mediante una red codificador-decodificador.Sensors,21(12), 4135. https://doi.org/10.3390/s21124135

Federal Highway Administration. (2014). Distress Identification Manual for the Long-Term Pavement Performance Program. (5th ed.). U.S. Department of Transportation. https://www.fhwa.dot.gov/publications/research/infrastructure/pavements/ltpp/13092/13092.pdf

Geeksforgeeks. (2025). Image Edge Detection Operators in Digital Image Processing. https://www.geeksforgeeks.org/image-edge-detection-operators-in-digital-image-processing/

Global Infrastructure Hub. (2022). AI and deep learning for identifying pavement failures in Latin American and the Caribbean. World Bank's Public-Private Infrastructure Advisory Facility (PPIAF). https://infratech.gihub.org/infratech-case-studies/ai-and-deep-learning-for-identifying-pavement-failures-in-latin-american-and-the-caribbean/

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. http://www.deeplearningbook.org

Guevara, T. (2025). La inteligencia artificial avanza desigual en Latinoamérica, buscan llevarla al sector productivo. Voz de América (VoA). https://www.vozdeamerica.com/a/ia-avanza-desigual-latinoamerica/8010993.html

Harris, C. R., Millman, K. J., van der Walt, S. J., Gommers, R., Virtanen, P., Cournapeau, D., Wieser, E., Taylor, J., Berg, S., Smith, Nueva Jersey, Kern, R., Picus, M., Hoyer, S., van Kerkwijk, M. H., Brett, M., Haldane, A., del Río, J. F., Wiebe, M., Peterson, P.,... Oliphant, T. E. (2020). Programación de matrices con NumPy.Naturaleza,585(7825), 357-362. https://doi.org/10.1038/s41586-020-2649-2

He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition.2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770-778. https://doi.org/10.1109/CVPR.2016.90

Kaveh, H., & Alhajj, R. (2024). Recent advances in crack detection technologies for structures: A survey of 2022-2023 literature. Frontiers in Built Environment,10, 1321634. https://doi.org/10.3389/fbuil.2024.1321634

IBM. (2024). ¿Qué es la segmentación de imágenes?. https://www.ibm.com/es-es/topics/image-segmentation

Kingma, D. P., & Ba, J. (2014).Adam: A method for stochastic optimization. arXiv. https://doi.org/10.48550/ARXIV.1412.6980

Lau, S. L. H., Chong, E. K. P., Yang, X., & Wang, X. (2020). Automated pavement crack segmentation using u-net-based convolutional neural network.IEEE Access,8, 114892-114899. https://doi.org/10.1109/ACCESS.2020.3003638

Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation.2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3431-3440. https://doi.org/10.1109/CVPR.2015.7298965

Madroñero Urcuqui, J. D., y Valencia López, Y. C. (2019). Metodología para la identificación automática del deterioro en pavimento flexible, por medio de fotografías aéreas tomadas desde vehículos no tripulados. Universidad del Valle. https://hdl.handle.net/10893/15476

Marín-Acevedo, E. A. (2019). Detección automática de grietas y fisuras en pavimento por medio de fotos y redes neuronales convolucionales. Universidad Católica de Oriente. https://hdl.handle.net/20.500.12516/419

Ministerio de Obras Públicas y Comunicaciones. (2019). Manual de Carreteras del Paraguay.https://apcarreteras.org.py/manual-de-carreteras-del-paraguay-rev-2019/

Nyathi, M. A., Bai, J., & Wilson, I. D. (2024). Deep learning for concrete crack detection and measurement.Metrology,4(1), 66-81. https://doi.org/10.3390/metrology4010005

Peng, Y., Chen, D. Z., & Sonka, M. (2025). U-net v2: Rethinking the skip connections of u-net for medical image segmentation.2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI), 1-5. https://doi.org/10.1109/ISBI60581.2025.10980742

Pfeifer, M., Fernandez, S., y Riobo, A. (2023). Pavimenta2: infraestructura digital al servicio de los activos viales.(BID) moviliblog. https://blogs.iadb.org/transporte/es/pavimenta2-infraestructura-digital-al-servicio-de-los-activos-viales/Riobo, A., Pfeifer, M., y Calle Jordá, A. (2024). VíaSegura: lecciones aprendidas e inteligencia artificial al servicio de la seguridad vial.(BID) Moviliblog. https://blogs.iadb.org/transporte/es/viasegura-lecciones-aprendidas-e-inteligencia-artificial-al-servicio-de-la-seguridad-vial

Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. En N. Navab, J. Hornegger, W. M. Wells, & A. F. Frangi (Eds.),Medical Image Computing and Computer-Assisted Intervention –MICCAI 2015(Vol. 9351, pp. 234-241). Springer International Publishing. https://doi.org/10.1007/978-3-319-24574-4_28

Sabouri, M., & Sepidbar, A. (2023). SUT-Crack. https://doi.org/10.17632/gsbmknrhkv.6

Soto Olguín, L. y Ramírez Villanueva, F. (2024). Segmentación de grietas superficiales en pavimento asfáltico utilizando técnicas de visión artificial. [Tesis de grado]. Universidad Nacional de Caaguazú. https://publicaciones.fctunca.edu.py/handle/123456789/128

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 15(1), 1929–1958. https://dl.acm.org/doi/abs/10.5555/2627435.2670313

Szeliski, R. (2022).Computer vision: Algorithms and applications. Springer International Publishing. https://doi.org/10.1007/978-3-030-34372-9

Tan, M., & Le, Q. (2019). EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.Proceedings of the 36th International Conference on Machine Learning. https://proceedings.mlr.press/v97/tan19a.html

Tang, Y., Qian, Y., & Yang, E. (2022). Weakly supervised convolutional neural network for pavement crack segmentation.Intelligent Transportation Infrastructure,1, liac013. https://doi.org/10.1093/iti/liac013

Tello Cifuentes, L., Marulanda, J., y Thomson, P. (2021). detección de grietas en el pavimento usando técnicas de procesamiento de imágenes y redes neuronales artificiales. Encuentro Internacional De Educación En Ingeniería. http://dx.doi.org/10.26507/ponencia.1565

Yu, Y., Xia, W., Zhao, Z., & He, B. (2024). A lightweight and high-accuracy model for pavement crack segmentation. Applied Sciences,14(24), 11632. https://doi.org/10.3390/app142411632

Zhang, Z., He, Y., Hu, D., Jin, Q., Zhou, M., Liu, Z., Chen, H., Wang, H., & Xiang, X. (2025). Algorithm for pixel-level concrete pavement crack segmentation based on an improved U-Net model.Scientific Reports,15(1), 6553. https://doi.org/10.1038/s41598-025-91352-x

Zhang, Y., & Zhang , L. (2024). Detection of Pavement Cracks by Deep Learning Models of Transformer and UNet. IEEE Transactions on Intelligent Transportation Systems, 25(11). https://doi.org/10.1109/TITS.2024.3420763

Zhang, Q., Chen , S., Wu , Y., Ji , Z., Yan , F., Huang , S., & Liu , Y. (2024a). Improved U-net network asphalt pavement crack detection method. PLoS ONE, 19(5). https://doi.org/10.1371/journal.pone.0300679

Zhang, J., Sun, S., Song, W., Li, Y., & Teng, Q. (2024b). A novel convolutional neural network for enhancing the continuity of pavement crack detection.Scientific Reports,14(1), 30376. https://doi.org/10.1038/s41598-024-81119-1Zhang, J., Xia, H., Li, P., Zhang, K., Hong, W., & Guo, R. (2024c). A pavement crack detection method via deep learning and a binocular-vision-based unmanned aerial vehicle.Applied Sciences,14(5), 1778. https://doi.org/10.3390/app14051778

Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J. (2017). Pyramid Scene Parsing Network. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, USA. https://doi.org/https://doi.org/10.1109/CVPR.2017.660

Zhou, Z., Rahman Siddiquee, M. M., Tajbakhsh, N., & Liang, J. (2018). Unet++: A nested u-net architecture for medical image segmentation. En D. Stoyanov, Z. Taylor, G. Carneiro, T. Syeda-Mahmood, A. Martel, L. Maier-Hein, J. M. R. S. Tavares, A. Bradley, J. P. Papa, V. Belagiannis, J. C. Nascimento, Z. Lu, S. Conjeti, M. Moradi, H. Greenspan, & A. Madabhushi (Eds.),Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support(Vol. 11045, pp. 3-11). Springer International Publishing. https://doi.org/10.1007/978-3-030-00889-5

Automatic Crack Segmentation in Asphalt Pavement Using U-Net Convolutional Neural Networks

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Current Issue

Browse

Information

Language

Make a Submission

Redes Sociales

Descargar_Revista

Descargar Revista

Indexado