What type of study is this?

This is a In Vitro Study study.

August 26, 2025Open Access

Toward Non-Invasive Voice Restoration: A Deep Learning Approach Using Real-Time MRI

Key Points

Results indicate silent articulatory motion encodes enough information for voice approximation, suggesting a breakthrough in non-invasive restoration methods.
Using real-time MRI, this approach synthesizes personalized speech, bypassing acoustic input and targeting individuals with speech challenges.
The method employs a Pix2Pix conditional GAN to map segmented MRI frames to articulatory class representations, enhancing the synthesis process.
Findings may enable more accessible speech restoration options for people with congenital mutism or neurodegenerative diseases, warranting further exploration.

Abstract

Despite recent advances in brain-computer interfaces (BCIs) for speech restoration, existing systems remain invasive, costly, and inaccessible to individuals with congenital mutism or neurodegenerative disease. We present a proof-of-concept pipeline that synthesizes personalized speech directly from real-time magnetic resonance imaging (rtMRI) of the vocal tract, without requiring acoustic input. Segmented rtMRI frames are mapped to articulatory class representations using a Pix2Pix conditional GAN, which are then transformed into synthetic audio waveforms by a convolutional neural network modeling the articulatory-to-acoustic relationship. The outputs are rendered into audible form and evaluated with speaker-similarity metrics derived from Resemblyzer embeddings. While preliminary, our results suggest that even silent articulatory motion encodes sufficient information to approximate a speaker's vocal characteristics, offering a non-invasive direction for future speech restoration in individuals who have lost or never developed voice.

Read Full Paperexternally

KI fragen

Bookmark

View Full Paper

Cite This Study

Mahdi Saleh (Tue,) studied this question.

synapsesocial.com/papers/68af620aad7bf08b1eae3103 https://doi.org/https://doi.org/10.1101/2025.08.22.25334256

KI fragen

Bookmark

View Full Paper