What question did this study set out to answer?

This research aims to benchmark the performance of fine-tuned YOLOv8 models for real-time industrial defect detection.

June 2, 2026Open Access

Real-Time Industrial Defect Detection on Edge Hardware Using Fine-Tuned YOLOv8: A Systematic Benchmark on the NEU Surface Defect Database with Automotive & EV Battery Manufacturing Extensions

Key Points

This research aims to benchmark the performance of fine-tuned YOLOv8 models for real-time industrial defect detection.
Evaluated four YOLOv8 architectures (n/s/m/l) using 1,800 images from the NEU Surface Defect Database.
Trained models under production-realistic constraints and assessed accuracy and speed.
Introduced a new automotive manufacturing defect taxonomy for enhanced transfer learning.
YOLOv8l achieved 74.1% mAP@0.5 with 15.8ms inference time on NVIDIA T4 GPU, 43.6M parameters.
YOLOv8n reached 73.6% mAP@0.5 at 2.1ms, showing 7.5× speed increase.
Class detection rates varied significantly, with patches at 91.8% and crazing at 50.6%.

Abstract

Automated visual inspection is a cornerstone of modern smart manufacturing, yet deployment of deep learning-based defect detection systems on production lines remains constrained by the scarcity of labeled industrial data and the computational cost of edge inference without cloud connectivity. This paper presents a systematic benchmark of four fine-tuned YOLOv8 architectures (YOLOv8n/s/m/l) for real-time industrial surface defect detection on the NEU Surface Defect Database — 1,800 images across 6 steel surface defect categories. We fully train all four variants and evaluate the accuracy-efficiency trade-off under production-realistic constraints. YOLOv8l achieves the highest mAP@0.5 of 74.1% at 15.8ms inference on NVIDIA T4 GPU (43.6M parameters), while YOLOv8n achieves 73.6% at 2.1ms — a difference of only 0.5 percentage points at 7.5× the speed. Per-class analysis reveals substantial variance: patches (91.8%) and scratches (84.4%) are detected reliably across all variants, while crazing (50.6%) and rolled-in scale (55.6%) remain challenging due to their diffuse, texture-distributed morphology poorly suited to bounding box formulation. We further introduce an automotive manufacturing defect taxonomy covering body panel, weld, paint, and EV battery cell categories and outline a transfer learning pathway to automotive deployment. Our results establish a reproducible performance baseline for edge-deployed industrial defect detection and form the computer vision foundation for the ClosedMfgAI closed-loop manufacturing intelligence system.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Ezeji et al. (Sun,) studied this question.

synapsesocial.com/papers/6a1e734530b38c64201b685f https://doi.org/https://doi.org/10.5281/zenodo.20476727

Bookmark

View Full Paper