What question did this study set out to answer?

To enhance Mixture-of-Experts routing efficiency by reducing computational complexity through BVH traversal on NVIDIA hardware.

April 10, 2026Open Access

SpectralAI: O(N log N) Hardware-Accelerated Expert Routing via RT Core BVH Traversal

Key Points

To enhance Mixture-of-Experts routing efficiency by reducing computational complexity through BVH traversal on NVIDIA hardware.
Replaced O(N²) matrix multiplication with O(N log N) BVH traversal.
Utilized NVIDIA RT Core for hardware acceleration.
Projected token embeddings into 3D geometric space for expert selection.
Introduced the Inception Engine for a 12-dimensional semantic representation.
Achieved 113–218× speedup in routing time.
Reduced VRAM usage by 731× on NVIDIA RTX 5070 Ti.
Validated on OLMoE-1B-7B achieving perplexity of 6.79, a 1.5% increase over baseline.
RT Core routing operates at 19.1 μs/batch with 13.4M queries/s.
Downstream HellaSwag accuracy decreased by only 1.1 percentage points.

Abstract

We present SpectralAI, a system that replaces the O(N²) matrix multiplication in Mixture-of-Experts (MoE) routing with O(N log N) Bounding Volume Hierarchy (BVH) traversal on dedicated NVIDIA RT Core hardware. Our approach projects token embeddings into 3D geometric space and uses hardware-accelerated BVH traversal for expert selection, achieving 113–218× routing speedup and 731× VRAM reduction on a single NVIDIA RTX 5070 Ti. We validate on OLMoE-1B-7B (7B parameters, 64 experts, 16 MoE layers): BVH pre-filter mode achieves perplexity 6.79 (+1.5% vs baseline), RT Core routing runs at 19.1 μs/batch with 13.4M queries/s, and downstream HellaSwag accuracy drops only 1.1 percentage points. We also introduce the Inception Engine, a nested Instance Acceleration Structure that composes four levels of 3D spaces into an effective 12-dimensional semantic representation, bypassing the hardware's native 3D limitation. To the best of our knowledge, this is the first system to repurpose GPU ray tracing cores for neural network expert routing. Package includes the paper (PDF + markdown source), validation data, and all figures.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Jordi Silvestre Lopez

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

SpectralAI: O(N log N) Hardware-Accelerated Expert Routing via RT Core BVH Traversal

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study