What does this research mean for the field?

The proposed targetless LiDAR–camera extrinsic calibration framework achieves a better accuracy–runtime trade-off on the KITTI benchmark than a segmentation-based global optimization baseline. Novelty: novel finding. Consensus alignment: neutral.

What question did this study set out to answer?

The research aims to develop a targetless calibration method for aligning LiDAR and camera systems accurately and efficiently.

March 1, 2026 | Open Access

Targetless LiDAR–Camera Extrinsic Calibration via Class-Agnostic Boundary Mask Alignment and SPSA-Based Optimization

Key Points

Developed a calibration framework using boundary mask alignment in a shared image-plane representation.
Constructed LiDAR-camera mask pairs from image-plane depth and intensity projections.
Used bounded rotation-only global initialization to obtain robust initial pose candidates.
Refined the extrinsic parameters with SPSA (simultaneous perturbation stochastic approximation), a computationally efficient stochastic gradient approximation.
Showed a better accuracy–runtime trade-off on the KITTI benchmark than a segmentation-based global optimization baseline.
Validated stable cross-modal alignment in real-world driving tests under vibration and inter-modal timing jitter.
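
To make the image-plane projection step concrete, the following is a minimal sketch of how LiDAR returns might be rendered into sparse depth and intensity images for a candidate extrinsic pose. It assumes a simple pinhole camera without lens distortion. The function name project_lidar_to_image, the nearest-return rule, and all parameter values are illustrative assumptions, not details taken from the paper.

```python
def project_lidar_to_image(points, rotation, translation, fx, fy, cx, cy, width, height):
    """Render sparse depth and intensity images from LiDAR points.

    points: iterable of (x, y, z, intensity) tuples in the LiDAR frame.
    rotation: 3x3 nested list, translation: length-3 list (LiDAR -> camera).
    Pixels with no LiDAR return stay at 0.0.
    """
    depth = [[0.0] * width for _ in range(height)]
    intensity = [[0.0] * width for _ in range(height)]
    for x, y, z, refl in points:
        # Rigid transform into the camera frame: p_cam = R * p_lidar + t.
        xc, yc, zc = (
            rotation[i][0] * x + rotation[i][1] * y + rotation[i][2] * z + translation[i]
            for i in range(3)
        )
        if zc <= 0.0:
            continue  # point lies behind the camera
        # Pinhole projection to integer pixel coordinates.
        u = int(round(fx * xc / zc + cx))
        v = int(round(fy * yc / zc + cy))
        if 0 <= u < width and 0 <= v < height:
            # Keep the nearest return when several points hit one pixel.
            if depth[v][u] == 0.0 or zc < depth[v][u]:
                depth[v][u] = zc
                intensity[v][u] = refl
    return depth, intensity


# Toy usage: identity extrinsics and a 5x5 image (hypothetical values).
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
points = [
    (0.0, 0.0, 10.0, 0.5),  # image centre, far
    (0.0, 0.0, 5.0, 0.7),   # image centre, nearer, so it wins
    (1.0, 0.0, 10.0, 0.9),  # one pixel to the right
    (0.0, 0.0, -1.0, 0.3),  # behind the camera, skipped
]
depth, intensity = project_lidar_to_image(
    points, identity, [0.0, 0.0, 0.0],
    fx=10.0, fy=10.0, cx=2.0, cy=2.0, width=5, height=5,
)
```

The toy points are already expressed in camera-style axes (z forward). With real data such as KITTI, the rotation would also carry the axis change between the LiDAR and camera frames.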

Abstract

Targetless LiDAR–camera extrinsic calibration remains challenging due to unreliable cross-modal correspondences and sensitivity to initialization. We present a targetless extrinsic calibration framework based on class-agnostic boundary mask alignment in a shared image-plane representation. This scheme first constructs consistent LiDAR–camera mask pairs from image-plane depth and intensity projections of LiDAR data and camera images. It then obtains robust initial pose candidates through bounded rotation-only global initialization and refines them using a computationally efficient stochastic gradient approximation to estimate the optimal extrinsic parameters. Experiments on the KITTI benchmark demonstrate a superior accuracy–runtime trade-off compared with a segmentation-based global optimization baseline, while real-world driving tests confirm stable cross-modal alignment under vibration and inter-modal timing jitter.
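
The refinement stage can be illustrated with a generic SPSA loop. The sketch below is not the authors' implementation. It minimizes a toy quadratic that stands in for a mask-alignment cost over six extrinsic parameters (three rotation, three translation). The names spsa_minimize, toy_alignment_loss, and TRUE_OFFSET are hypothetical. The decay exponents 0.602 and 0.101 are the values commonly recommended for SPSA, while a, c, and A are illustrative choices.

```python
import random


def spsa_minimize(loss, theta0, iterations=1000, a=0.5, c=0.1, A=50.0,
                  alpha=0.602, gamma=0.101, seed=0):
    """Minimize loss(theta) using two loss evaluations per iteration."""
    rng = random.Random(seed)
    theta = list(theta0)
    for k in range(iterations):
        ak = a / (k + 1 + A) ** alpha   # decaying step size
        ck = c / (k + 1) ** gamma       # decaying perturbation size
        # Perturb every parameter at once with a random +/-1 vector.
        delta = [rng.choice((-1.0, 1.0)) for _ in theta]
        plus = [t + ck * d for t, d in zip(theta, delta)]
        minus = [t - ck * d for t, d in zip(theta, delta)]
        diff = loss(plus) - loss(minus)
        # Simultaneous-perturbation gradient estimate for all parameters.
        grad = [diff / (2.0 * ck * d) for d in delta]
        theta = [t - ak * g for t, g in zip(theta, grad)]
    return theta


# Hypothetical ground-truth offset: 3 rotation (rad) and 3 translation (m) terms.
TRUE_OFFSET = [0.02, -0.01, 0.03, 0.10, -0.05, 0.20]


def toy_alignment_loss(theta):
    # Stand-in for a mask-alignment cost: squared distance to the true offset.
    return sum((t - s) ** 2 for t, s in zip(theta, TRUE_OFFSET))


estimate = spsa_minimize(toy_alignment_loss, [0.0] * 6)
```

Each iteration costs two loss evaluations regardless of the number of parameters, whereas a central finite-difference gradient over six parameters would need twelve. That is the usual reason to prefer SPSA when every evaluation requires re-projecting a point cloud.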

Cite This Study

Jeong et al. studied this question.

synapsesocial.com/papers/69a3d89aec16d51705d2f89c https://doi.org/10.3390/s26051501