November 30, 2025 · Open Access

TVC: tokenized video compression with ultra-low bit rate

Key Points

Tokenized video compression achieves high perceptual quality while maintaining ultra-low bit rates.
Discrete tokens are partially masked and then compressed losslessly with a discrete checkerboard context model, which reduces transmission overhead.
The method uses the Cosmos video tokenizer to extract discrete and continuous token streams, which are coded separately and fused at the decoder.
The results point to token-based frameworks as a direction for semantics-aware video compression.

Abstract

Tokenized visual representations have shown promise in image compression, yet their extension to video remains underexplored due to the challenges posed by complex temporal dynamics and stringent bit rate constraints. In this paper, we present tokenized video compression (TVC), a token-based dual-stream framework designed to operate effectively at ultra-low bit rates. TVC leverages the Cosmos video tokenizer to extract both discrete and continuous token streams. The discrete tokens are partially masked using a strategic masking scheme and then compressed losslessly with a discrete checkerboard context model to reduce transmission overhead. The masked tokens are reconstructed by a decoder-only Transformer with spatiotemporal token prediction. In parallel, the continuous tokens are quantized and compressed using a continuous checkerboard context model, providing complementary continuous information at ultra-low bit rates. At the decoder side, the two streams are fused with a ControlNet-based multi-scale integration module, ensuring high perceptual quality alongside stable fidelity in reconstruction. Overall, this work illustrates the practicality of tokenized video compression and points to new directions for semantics-aware, token-native approaches.
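To make the checkerboard idea in the abstract concrete, the following is a minimal sketch of a generic two-pass checkerboard ordering on a single 2D grid of discrete tokens. It is not the paper's model: the abstract does not specify the architecture, and the actual context model is presumably learned and operates over spatiotemporal tokens. The function names, the majority-vote predictor, and the `p_match` parameter are all hypothetical and chosen only for illustration. The code estimates bits rather than running a real entropy coder.

```python
import math


def checkerboard_split(height, width):
    """Split grid positions into anchors (even parity) and non-anchors (odd parity)."""
    anchors, non_anchors = [], []
    for r in range(height):
        for c in range(width):
            (anchors if (r + c) % 2 == 0 else non_anchors).append((r, c))
    return anchors, non_anchors


def neighbours(pos, height, width):
    """Return the in-bounds 4-neighbours of pos; for a non-anchor these are all anchors."""
    r, c = pos
    candidates = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
    return [(y, x) for y, x in candidates if 0 <= y < height and 0 <= x < width]


def estimate_bits(tokens, vocab_size, p_match=0.5):
    """Toy bit estimate for a two-pass checkerboard code (hypothetical predictor).

    Pass 1: anchors are coded with a uniform prior over the vocabulary.
    Pass 2: each non-anchor is predicted as the most common value among its
    already-decoded anchor neighbours; the prediction is assigned probability
    p_match and the remaining mass is spread uniformly over the other symbols.
    """
    height, width = len(tokens), len(tokens[0])
    anchors, non_anchors = checkerboard_split(height, width)

    total = math.log2(vocab_size) * len(anchors)  # pass 1: no context available
    for r, c in non_anchors:  # pass 2: condition on decoded anchors
        context = [tokens[y][x] for y, x in neighbours((r, c), height, width)]
        # Sorting makes tie-breaking deterministic (smallest token id wins).
        guess = max(sorted(set(context)), key=context.count)
        if tokens[r][c] == guess:
            total += -math.log2(p_match)
        else:
            total += -math.log2((1.0 - p_match) / (vocab_size - 1))
    return total


if __name__ == "__main__":
    flat = [[7] * 4 for _ in range(4)]  # a perfectly uniform 4x4 token grid
    print(estimate_bits(flat, vocab_size=16))
```

The point of the ordering is that every non-anchor position can be decoded in parallel once the anchors are available, because its context contains only anchor positions. In this toy setting, spatially smooth token grids cost fewer bits than coding every token with a uniform prior.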

Authors

Lebin Zhou

Cihan Ruan

Nam Ling

Journals

Visual Intelligence

Institutions

University of Newcastle Australia

Santa Clara University

Institute for the Future
