SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection | Synapse