Matrix-assisted laser desorption/ ionization Mass Spectrometry (MALDI-MS) is a powerful tool to detect and characterize biomolecules, making it particularly useful in different fields and applications such as proteomics, clinical diagnostics, and biomarker discovery. MALDI data is commonly contaminated by the artefacts originated from both chemical and electrical noise. Data preprocessing is hence important to remove these artefacts and improve the accuracy and reliability of the subsequent (quantitative and qualitative) analysis. A systematic investigation of different preprocessing steps is necessary to establish an effective preprocessing pipeline. In this study, we systematically investigated the different steps including interpolation, smoothing, baseline correction, peak alignment, and peak binning, along with normalization to establish a preprocessing pipeline of MALDI spectral data. The performance of the preprocessing steps and pipeline was benchmarked by the balanced accuracy of differentiating hepatocellular carcinoma (HCC) and healthy (normal) based on MALDI spectral data of liver tissue samples. The established preprocessing pipeline improved the balanced accuracy from 61.3% to 77.6% under the patient-level cross-validation, and from 92.9% to 94.7% under spectral-level cross-validation. Our findings demonstrated that the classification performance can be greatly affected by the quality of MALDI data, which can be improved by preprocessing steps. The small difference between the patient-level and spectral-level validations from our analysis demonstrated well a satisfying performance of the classification against patient-to-patient variability with the help of our preprocessing pipeline. This study will potentially benefit the MS community. Preprocessing steps were systematically investigated according to the classification performance to separate hepatocellular carcinoma (HCC) against Normal samples. Each step was incrementally added to the preprocessing pipeline. An optimal preprocessing pipeline was established by making sure the classification performance was progressively improved through the preprocessing steps.
Building similarity graph...
Analyzing shared references across papers
Loading...
Adhikari et al. (Sun,) studied this question.
synapsesocial.com/papers/69a286600a974eb0d3c0143b — DOI: https://doi.org/10.1016/j.rechem.2026.103175
Mou Adhikari
Leibniz Institute of Photonic Technology
Oleg Ryabchykov
Helmholtz Institute Jena
Shuxia Guo
Leibniz Institute of Photonic Technology
Results in Chemistry
Friedrich Schiller University Jena
Leibniz Institute of Photonic Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: