March 3, 2026Open Access

Evaluating video-based synthetic data for training lightweight models in strawberry leaf disease classification

Key Points

ResNet-18 reached a performance accuracy of 98.71%, demonstrating the effectiveness of synthetic data for disease classification.
The study employed a synthetic dataset of 1,467 images, generated from videos with varied lighting and leaf morphology.
By utilizing a feature extraction strategy, six deep learning models were trained and evaluated against 618 real-world images to assess their generalization.
Statistical analysis confirmed that the performance of these models, especially ResNet-18 and MobileNetV3-Small, remain comparable, highlighting synthetic data's stability.

Abstract

Collecting large, diverse, and well-labeled datasets remains a persistent bottleneck in agricultural computer vision. This study explores the efficacy of video-based synthetic data, generated via the diffusion-transformer model Sora, to address this scarcity for strawberry leaf disease classification. A synthetic dataset of 1,467 images was curated by extracting frames from generated videos, using structured text prompts and reference images to capture temporal variations in lighting and leaf morphology. This data was utilized to train six lightweight deep learning architectures (DenseNet-121, EfficientNet-B0, MobileNetV3-Small, ResNet-18, ShuffleNetV2, and Vision Transformer (ViT)-Tiny) using a feature extraction strategy. The models were evaluated on a held-out test set of 618 real-world images to assess synthetic-to-real generalization. ResNet-18 achieved the highest nominal performance, with accuracy, precision, recall, and F1-score all reaching 98.71%. A 5-fold stratified cross-validation further confirmed the approach’s stability with an average accuracy of 98.9%. Notably, statistical analysis using McNemar’s test revealed no significant performance difference ( p > 0.05) between ResNet-18 and the significantly lighter MobileNetV3-Small. These findings demonstrate that video-derived synthetic data can effectively bridge the domain gap, enabling the training of robust, resource-efficient models suitable for deployment on edge devices in precision agriculture.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Adnan Miski (Wed,) studied this question.

synapsesocial.com/papers/69a75d06c6e9836116a266d3 https://doi.org/https://doi.org/10.7717/peerj-cs.3521

Bookmark

View Full Paper