What question did this study set out to answer?

This research aims to identify the factors causing performance degradation in vehicle orientation prediction when applied across different datasets.

May 16, 2026 · Open Access

Cross-Dataset Insights for Fine-Grained Vehicle Orientation Prediction

Key Points

Conducted a cross-dataset benchmark using Car Full View and Freiburg Static Cars 52 v1.1 datasets.
Employed a fixed ConvNeXt-Small predictor with varied training sources, test targets, and preprocessing strategies.
Evaluated conditions with five-fold cross-validation at the vehicle-instance level.
Identified annotation label incompatibility as the primary source of transfer error; correcting the labels reduced cross-dataset CMAE by approximately 3.5–4.5°.
Crop protocol contributed significantly to error, with mismatched train/test crops yielding CMAE of 9–12°.
Joint training on both harmonized datasets achieved the best-balanced performance: 3.77° on CFV and 5.38° on UnsupCar.
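
The vehicle-instance-level evaluation mentioned in the key points can be illustrated with a small sketch. It assumes each image carries a vehicle identifier and assigns all images of one vehicle to the same fold, so that no vehicle appears in both training and test data. The function name `instance_level_folds`, the greedy balancing rule, and the toy identifiers are assumptions for illustration, not the authors' procedure.

```python
from collections import defaultdict


def instance_level_folds(vehicle_ids, n_folds=5):
    """Return one fold index per image; images of one vehicle share a fold.

    vehicle_ids: hypothetical per-image vehicle-instance identifiers.
    """
    # Group image indices by the vehicle they show.
    groups = defaultdict(list)
    for idx, vid in enumerate(vehicle_ids):
        groups[vid].append(idx)

    # Greedy balancing: place the largest groups first into the emptiest fold.
    fold_sizes = [0] * n_folds
    fold_of = [None] * len(vehicle_ids)
    for vid in sorted(groups, key=lambda v: (-len(groups[v]), str(v))):
        target = fold_sizes.index(min(fold_sizes))
        for idx in groups[vid]:
            fold_of[idx] = target
        fold_sizes[target] += len(groups[vid])
    return fold_of


# Toy data: 10 made-up vehicles with 3 images each.
ids = [f"car_{i}" for i in range(10) for _ in range(3)]
folds = instance_level_folds(ids)
```

Splitting by image instead of by vehicle would let near-identical views of the same car land on both sides of the split, which tends to make in-domain error look better than it is.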

Abstract

Fine-grained vehicle orientation estimation is widely reported with strong in-domain accuracy, yet performance degrades substantially when models are applied across datasets; the relative contributions of visual domain shift and annotation label incompatibility to this degradation remain poorly understood. A controlled cross-dataset benchmark was conducted using two publicly available datasets, Car Full View (CFV) and Freiburg Static Cars 52 v1.1 (UnsupCar), under a fixed ConvNeXt-Small predictor while varying the training source, test target, and image preprocessing strategy. All conditions were evaluated with five-fold cross-validation at the vehicle-instance level. Annotation label incompatibility was identified as the dominant source of transfer error: correcting the angular convention mismatch in UnsupCar orientation labels reduced cross-dataset circular mean absolute error (CMAE) by approximately 3.5–4.5°. Crop protocol was a similarly large factor: train/test crop mismatch raised CMAE into the 9–12° range. Square cropping with mirrored boundary padding provided the most robust preprocessing across both in-domain and cross-dataset conditions. After label harmonization, a residual transfer gap of approximately 2° remained, with a consistent directional asymmetry favoring the UnsupCar-to-CFV transfer direction. Joint training on both harmonized datasets achieved the best-balanced performance (3.77° on CFV; 5.38° on UnsupCar). These results demonstrate that instance-level splitting, explicit label harmonization, and consistent crop definition are necessary preconditions for credible cross-dataset vehicle orientation evaluation.
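
To make the metric concrete, here is a minimal sketch of a circular mean absolute error in degrees, together with a toy label remap. The abstract does not say what the convention mismatch in the UnsupCar labels actually was, so the `harmonize` helper (a direction flip plus a zero-angle offset) and all angle values are purely hypothetical. They only show how a convention mismatch can inflate CMAE until the labels are brought into one convention.

```python
def circular_abs_error(pred_deg, true_deg):
    """Smallest absolute angular difference in degrees, in [0, 180]."""
    diff = abs(pred_deg - true_deg) % 360.0
    return min(diff, 360.0 - diff)


def cmae(preds_deg, trues_deg):
    """Circular mean absolute error over paired angle lists (degrees)."""
    errors = [circular_abs_error(p, t) for p, t in zip(preds_deg, trues_deg)]
    return sum(errors) / len(errors)


def harmonize(theta_deg, offset_deg=0.0, clockwise=False):
    """Hypothetical remap into a shared convention (assumed, not from the paper).

    Flips the rotation direction if the source labels are clockwise,
    then shifts the zero angle by offset_deg.
    """
    sign = -1.0 if clockwise else 1.0
    return (sign * theta_deg + offset_deg) % 360.0


# Made-up angles: the same three poses labeled under two conventions.
labels_a = [10.0, 90.0, 200.0]   # counter-clockwise convention
labels_b = [350.0, 270.0, 160.0]  # clockwise convention

raw_gap = cmae(labels_a, labels_b)
fixed_gap = cmae(labels_a, [harmonize(t, clockwise=True) for t in labels_b])
```

The wraparound in `circular_abs_error` means that 359° and 1° are treated as 2° apart rather than 358°, which is what makes the error circular rather than linear.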

Cite This Study

Pasaulis et al. studied this question.

synapsesocial.com/papers/6a080ae2a487c87a6a40ce4b https://doi.org/10.3390/electronics15102097