Home > Articles > All Issues > 2024 > Volume 12, No. 3, 2024 >
JOIG 2024 Vol.12(3): 292-301
doi: 10.18178/joig.12.3.292-301

Performance Improvement of YOLOv8-RTDETR Method Based Retail Product Detection

Andi Wahyu Maulana 1,2, Suryo Adhi Wibowo 1,2,*, Unang Sunarya 3, Rissa Rahmania 4, and Asep Insani 5
1. School of Electrical Engineering, Telkom University, Bandung, Indonesia
2. Center of Excellence Artificial Intelligence for Learning and Optimization, Telkom University, Bandung, Indonesia
3. School of Applied Science, Telkom University, Bandung, Indonesia
4. Computer Science Department, School of Computer Science, Bina Nusantara University, Bandung Campus, Jakarta, Indonesia
5. National Research and Innovation Agency (BRIN), Jakarta, Indonesia
Email: suryoadhiwibowo@telkomuniversity.ac.id (S.A.W.);
awahyumaulana@student.telkomuniversity.ac.id (A.W.M.); unangsunarya@telkomuniversity.ac.id (U.S.); rissa.rahmania@binus.ac.id (R.R.); asep035@brin.go.id (A.I.)
*Corresponding author

Manuscript received April 23, 2024; revised June 11, 2024; accepted June 18, 2024; published September 6, 2024

Abstract—Recently people often purchase their daily needs at retail stores. Therefore, crowds might happen due to a manual queueing system. To overcome the problem, the smart system based on object detection has been conducted using several object detection methods. This study proposed YOLOv8 combined with transformer Real-Time Detection Transformer (RT-DETR) model to enhance the method performance in detecting the detail products. The intra-Class Variation method has been used to recognize the characteristics of the products such as size, color, and variant of the product. To validate the proposed model, three different datasets have been applied that is grocery dataset that displays products one by one in the training and validation process, the RPC-dataset that has many products in one image, and the D2S dataset with products that have varying lighting and stacked products. Results showed that the proposed model outperformed compared to other models, with a mean Average Precision (mAP) of 99.5% for the grocery dataset, 99.3% RPC-dataset, and 85.5% D2S dataset, respectively.

Keywords—intra-class variation, mean Average Precision (mAP), retail product, Real-Time Detection Transformer (RT-DETR), YOLOv8

Cite: Andi Wahyu Maulana, Suryo Adhi Wibowo, Unang Sunarya, Rissa Rahmania, and Asep Insani, "Performance Improvement of YOLOv8-RTDETR Method Based Retail Product Detection," Journal of Image and Graphics, Vol. 12, No. 3, pp. 292-301, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.