What type of study is this?

This is a Quantitative Study study.

September 16, 2025

Enhancing Spoof Detection in Automatic Speaker Verification Using CQCC Optimization and ViT Architecture

Key Points

Enhancing spoof detection improves security features in automatic speaker verification systems, making them more reliable.
Using the ASVspoof 2019 dataset, the study achieves a test accuracy of 97% with Genetic Algorithm optimization.
CQCC extraction and multiple optimization methods ensure better feature refinement, reducing training time significantly.
Evaluation metrics like Equal Error Rate and t-DCF highlight the benefits of utilizing optimized features for real-time applications.

Abstract

Spoof detection is found to be essential for improving the security features of automatic speaker verification (ASV) systems, which are primarily used in authentication. The primary goal of this study is to enhance the performance and efficiency of spoof detection using speech samples taken from the ASVspoof 2019 dataset. The Constant Q Cepstral Coefficients (CQCC) extracted from these speech samples act as an important key feature. Feature optimization methods such as Genetic Algorithm (GA), Grey Wolf Optimizer (GWO), and Mayfly Optimizer (MO) are used to refine these features and hence enhance the model accuracy with minimal time cost. A Vision Transformer (ViT) model is then trained using each optimized feature, and the performance is evaluated by comparing the results from different optimization methods. Time analysis shows a substantial reduction in training time per epoch when the optimized features are used. The Genetic Algorithm attained the best performance, with a test accuracy of 97% and the least training time. Equal Error Rate (EER) and the Tandem Detection Cost Function (t-DCF) are used as the evaluation metrics. This study demonstrates how feature optimization helps to enhance spoof detection accuracy while reducing processing time, hence becoming an authentic solution for real-time ASV systems.

Bookmark

Cite This Study

Selin et al. (Mon,) studied this question.

synapsesocial.com/papers/68d454cb31b076d99fa5a413 https://doi.org/https://doi.org/10.53759/7669/jmc202505202

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark