What question did this study set out to answer?

The aim is to develop a framework for creating realistic augmented datasets for autonomous driving, addressing the limitations of current methods.

February 5, 2026Open Access

Toward Realistic Autonomous Driving Dataset Augmentation: A Real–Virtual Fusion Approach with Inconsistency Mitigation

Key Points

The aim is to develop a framework for creating realistic augmented datasets for autonomous driving, addressing the limitations of current methods.
Proposed a real–virtual fusion framework using real-world data and a digital twin environment.
Synchronized real-world data with digital models using a robust look-up table.
Inserted virtual objects into real image backgrounds while focusing on visual consistency.
Employed illumination matching techniques to reduce discrepancies in visuals.
Conducted experiments to evaluate effectiveness.
The fusion strategy effectively bridged the reality gap between real and synthetic images.
Significant improvements in dataset quality for enhancing object recognition.
Demonstrated a cost-effective and safer alternative for dataset enrichment.
Improved generalization capabilities of perception models in autonomous driving systems.

Abstract

Autonomous driving systems rely on vast and diverse datasets for robust object recognition. However, acquiring real-world data, especially for rare and hazardous scenarios, is prohibitively expensive and risky. While purely synthetic data offers flexibility, it often suffers from a significant reality gap due to discrepancies in visual fidelity and physics. To address these challenges, this paper proposes a novel real–virtual fusion framework for efficiently generating highly realistic augmented image datasets for autonomous driving. Our methodology leverages real-world driving data from South Korea’s K-City, synchronizing it with a digital twin environment in Morai Sim (v24.R2) through a robust look-up table and fine-tuned localization approach. We then seamlessly inject diverse virtual objects (e.g., pedestrians, vehicles, traffic lights) into real image backgrounds. A critical contribution is our focus on inconsistency mitigation, employing advanced techniques such as illumination matching during virtual object injection to minimize visual discrepancies. We evaluate the proposed approach through experiments. Our results show that this real–virtual fusion strategy significantly bridges the reality gap, providing a cost-effective and safe solution for enriching autonomous driving datasets and improving the generalization capabilities of perception models.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper