March 19, 2024

Variable-rate Neural Speech Compression with Multi-scale Feature Extraction and Improved Entropy Modeling

Abstract

Speech coding is a form of data compression that aims to reduce the cost of storing and transmitting speech. Neural networks have been shown to compress speech effectively in methods based on vector quantization (VQ). However, the complexity of VQ makes it difficult to integrate into frameworks and restricts compression to a discrete set of bitrate points. This paper proposes a neural speech compression framework that achieves flexible-bitrate speech reconstruction through a compact latent representation and improved entropy estimation.

Cite This Study

Sun et al. studied this question.

synapsesocial.com/papers/68e73757b6db6435876b0905 https://doi.org/10.1109/dcc58796.2024.00102
