What question did this study set out to answer?

To improve 3D occupancy prediction efficiency using a bird's-eye-view representation with height awareness.

February 2, 2026Open Access

HBEVOcc: Height-Aware Bird’s-Eye-View Representation for 3D Occupancy Prediction from Multi-Camera Images

Key Points

To improve 3D occupancy prediction efficiency using a bird's-eye-view representation with height awareness.
Developed HBEVOcc, a bird's-eye-view based prediction method.
Utilized a height-aware deformable attention module to incorporate height information.
Extracted multi-camera image features and transformed them into 3D occupancy features.
Integrated a height-aware voxel loss with adaptive vertical weighting.
Achieved state-of-the-art results in mIoU and RayIoU metrics.
Reduced training memory consumption significantly.
Performed effectively on Occ3D-nuScenes and OpenOcc datasets.

Abstract

Due to the ability to perceive fine-grained 3D scenes and recognize objects of arbitrary shapes, 3D occupancy prediction plays a crucial role in vision-centric autonomous driving and robotics. However, most existing methods rely on voxel-based methods, which inevitably demand a large amount of memory and computing resources. To address this challenge and facilitate more efficient 3D occupancy prediction, we propose HBEVOcc, a Bird’s-Eye-View based method for 3D scene representation with a novel height-aware deformable attention module, which can effectively leverage latent height information within BEV framework to compensate for lack of height dimension, significantly reducing computing resource consumption while enhancing the performance. Specifically, our method first extracts multi-camera image features and lifts these 2D features into 3D BEV occupancy features via explicit and implicit view transformations. The BEV features are then further processed by a BEV feature extraction network and height-aware deformable attention module, with the final 3D occupancy prediction results obtained through a prediction head. To further enhance voxel supervision along the height axis, we introduce a height-aware voxel loss with adaptive vertical weighting. Extensive experiments on the Occ3D-nuScenes and OpenOcc dataset demonstrate that HBEVOcc can achieve state-of-the-art results in terms of both mIoU and RayIoU metrics with less training memory (even when trained on 2080Ti).

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper