Quantification of magnetic resonance spectroscopy (MRS) data using linear combination modeling (LCM) is challenging, partly due to the large number of spectral parameters to be estimated. Popular conventional LCM approaches often place soft constraints on signal amplitude ratios to improve fitting stability, at the cost of introducing bias. Meanwhile, existing deep learning (DL) methods tend to oversimplify the problem by omitting important spectral parameters, limiting their real-world utility. With these considerations in mind, we developed Q-MRS, a DL framework based on a Convolutional vision Transformer (CvT) that combines the strengths of a convolutional neural network (CNN) and a Transformer. The model was trained on a large dataset of simulated spectra and evaluated on high-quality 3T GABA-edited MEGA-PRESS data acquired from health adults in the medial parietal lobe. On simulated data with known ground-truth metabolite levels, the CvT outperformed two baseline models, a simple CNN and an Inception network. When applied to the in vivo data, Q-MRS produced fit quality and concentration estimates comparable with those of two established LCM methods, LCModel and Osprey, without imposing constraints on metabolite amplitude ratios. These results suggest that the proposed method is a promising approach for MRS analysis.
Wu et al. (Wed,) studied this question.