Reliable simulation-based methods are essential for the verification and validation of automated driving. To meet this need, G is introduced – a metric that quantifies the simulation-to-reality gap by comparing outcomes of environment-perception system tests executed in real and simulation-based test drives. To execute the simulation-based test drives, a novel sensor-in-the-loop test environment – combining over-the-air (OTA) and direct-data-injection (DDI) stimulation for camera and radar sensors – was integrated, enabling an extensive test campaign including static and Euro-NCAP-inspired dynamic scenarios under daytime, night, and rain conditions. For the empirical analysis of G, test drives were conducted on a real proving ground and reproduced in the laboratory within the sensor-in-the-loop environment. Test drives in both real and virtual domains generated more than two hours of data, which have been published as an open dataset for the scientific community. The results comprise a direct comparison of outcomes from real-world drives and their virtual counterparts. Across the investigated scenarios, G remained predominantly positive, confirming a pessimistic bias of simulation-based test drives. OTA consistently narrowed the gap relative to DDI, substantiating the benefits of full sensor integration. The findings indicate that while tests under daytime conditions may justifiably migrate to the virtual domain, while accurate simulation-based validation under adverse weather still demands the development of improved weather-disturbance models – otherwise real-world drives remain necessary. By consolidating established perception metrics into a single, regulator-oriented indicator, the proposed metric G establishes a foundation for demonstrating simulation fidelity and aligns directly with emerging German and EU requirements for approval workflows for automated driving.
Fabio Reway (Thu,) studied this question.