Road Pothole Detection Using YOLOv8 with Image Augmentation

General Information

ISSN: 2301-3699 (Print); 2972-3973 (Online)
Frequency: Bimonthly
Managing Editor: Ms. Alice Loh
DOI: 10.18178/joig
Abstracting/Indexing: Scopus (Since 2021), CNKI, Google Scholar, Crossref, etc.
APC: 500 USD
Average Days to Accept: 98 days
Acceptance Rate: 19%
E-mail: editor@joig.net
Journal Metrics:
5.0

2024CiteScore

69rd percentile

Powered by

Editor-in-Chief

Dr. Branislav Vuksanovic
Deputy Head of Department, Systems Engineering Department, Military Technological College, Muscat, Oman
I am very excited to serve as the first Editor-in-Chief of the International Journal of Image and Graphics (JOIG) and hope that the publication can enrich the readers’ experience... [Read More]

What's New

2025-06-04

All papers published in Vol. 13, No. 2 have been indexed by SCOPUS.

2025-06-04

JOIG received the CiteScore 2024 with 5.0

2025-04-30

Volume 13, No. 2 has been published now.

Home > Articles > All Issues > 2024 > Volume 12, No. 4, 2024 >

JOIG 2024 Vol.12(4):417-426
doi: 10.18178/joig.12.4.417-426

Ken Gorro*, Elmo Ranolo, Lawrence Roble, and Rue Nicole Santillan

Department of Industrial Technology, College of Technology, Cebu Technological University, Carmen, Philippines
Email: ken.gorro@ctu.edu.ph (K.G); elmo.ranolo@ctu.edu.ph (E.R.); lawrence7roble@gmail.com (L.R.); ruesantillan123@gmail.com (R.N.S.)
*Corresponding author

Manuscript received May 17, 2024; revised June 24, 2024; accepted July 25, 2024; published December 16, 2024.

Abstract—Potholes are considered a vital danger to road safety. This study is going to use a novel method realized in the YOLOv8 (You Only Look Once version 8) object detection algorithm library, a well-cutting-edge algorithm, to mark the potholes in road images. Focusing on the resistance to the two types of error namely overfitting and underfitting, the study adopts a set of image augmentation operations and refines the hyperparameters, which contain weight decay and learning rate. For a highly effective hole-filling prediction model, precision annotated images of the roads with the location of potholes marked using the Visual Object Tagging Tool (VoTT) were amassed. These images where potholes are marked using bounding boxes were mined, and the collected data were used to build the state-of-the-art AI models which are fine-tuned for generalization and deployment. The YOLOv8 architecture was trained on this dataset with the assistance of the assessment metric that supplies the most efficient validation and training errors. The data set was composed of 2000 MS VoTT movement images; from these, only 20% was applied to the validation and test phase while the rest of 80% was used for training. For the YOLOv8 training, exposure bounding boxes were used, in which each sample was copied and perturbed at random, the total number of samples used as training increased to 9000. Applying 500 nodes from the computing unit Google Colab featuring High-RAM specifications helped to speed up the training process. A variety of experiments had been performed to evaluate the effectiveness of isolated techniques as well as adjust and select important hyperparameters for example weight decay, learning rate, and batch size. The optimal weight decay value came from experimentation and this included using the values 0.009, 0.001, and 32 for learning rate and batch size. The sum of all this is outstanding, and the perplexity led to an exemplary result with the loss of training 0.06 and validation 0.04, this demonstrates the effectiveness of the proposed method concerning pothole detection. This test is to show whether the model is not overfitting or underfitting.

Keywords—YOLOv8, exposure bounding box, Microsoft Visual object Tagging Tool (VoTT)

Cite: Ken Gorro, Elmo Ranolo, Lawrence Roble, and Rue Nicole Santillan, "Road Pothole Detection Using YOLOv8 with Image Augmentation," Journal of Image and Graphics, Vol. 12, No. 4, pp. 417-426, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.

附件说明

Article Metrics in Dimensions

PREVIOUS PAPER

A Novel Color Feature for the Improvement of Pigment Spot Extraction in Iris Images

NEXT PAPER

Deep Learning-Based Classification and Diagnosis of Alzheimer’s & Dementia Using Multi-scale Feature Extraction from Baseline MRI Scans

Home

Articles

Author Guide

Editor Guide

Reviewer Guide

Topics and Special Issues

journal menu