Key points are not available for this paper at this time.
Metabolomics studies use high-throughput analytical platforms to measure metabolites in biological samples. These mass spectrometry and/or NMR spectroscopy platforms generate complex data sets, and the analysis of such data poses many challenges, in particular the high dimensionality with relatively fewer number of samples means that sophisticated statistical models are required to analyse these data and these models come with caveats. In this review, we discuss some of these common caveats associated with most popular statistical tests and models. We present common mistakes found in metabolomics data analysis, along with recommendations on how to avoid them. The aim of this review is to raise awareness of the potential risks of misusing or abusing statistical models, and to promote good practices for reliable and reproducible metabolomics research. A new form of permutation test with emphasis on assessing the statistical significance level of the effect captured by supervised model is also proposed. • Inappropriate use of statistics is one of main cause of irreproducible scientific studies. • Common mistakes found in metabolomics studies are reviewed. • Recommendations of the proper use of common statistical tools are made. • A form of permutation test is presented. • Mind your Ps and Qs is an English idiom that means to be on your best behaviour. This is apt for metabolomics data analysis.
Xu et al. (Sat,) studied this question.