What question did this study set out to answer?

The study aims to enhance COVID-19 classification using chest CT images through dimensionality reduction techniques and machine learning algorithms.

March 18, 2026 | Open Access

Dimensionality Reduction and Machine Learning Methods for COVID-19 Classification Using Chest CT Images

Key Points

Utilized chest CT images to identify COVID-19 patients.
Implemented three dimensionality reduction methods: PCA, UMAP, and diffusion maps.
Applied logistic regression and XGBoost for classification of extracted features.
Trained models using stratified cross-validation on the training set, and evaluated them on a patient-level held-out test set to prevent data leakage.
Evaluated model performance using imbalance-aware metrics.
The strongest model combined diffusion maps with logistic regression.
Achieved 97.35% accuracy, 92.16% sensitivity, and 98.59% specificity on the test set.
Demonstrated enhanced performance compared to existing models in recent studies.
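
The workflow in the key points above can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes scikit-learn, uses synthetic data in place of CT images, assumes several slices per patient, and uses PCA as the reduction step. All variable names, the component count, and the 25% test fraction are hypothetical choices.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import GroupShuffleSplit, StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for flattened CT slices (assumption: 40 patients, 5 slices each).
n_patients, slices_per_patient, n_features = 40, 5, 200
patient_labels = np.repeat([0, 1], n_patients // 2)  # 0 = healthy, 1 = COVID-19
patient_ids = np.repeat(np.arange(n_patients), slices_per_patient)
y = np.repeat(patient_labels, slices_per_patient)
X = rng.normal(size=(y.size, n_features)) + 0.5 * y[:, None]  # class-dependent shift

# Patient-level hold-out: every slice of a patient lands on one side of the split.
splitter = GroupShuffleSplit(n_splits=1, test_size=0.25, random_state=0)
train_idx, test_idx = next(splitter.split(X, y, groups=patient_ids))

# Dimensionality reduction followed by a linear classifier, fitted as one pipeline
# so the reduction is learned from training folds only.
model = make_pipeline(
    StandardScaler(),
    PCA(n_components=10),
    LogisticRegression(max_iter=1000),
)

# Stratified cross-validation on the training portion, scored with an
# imbalance-aware metric (balanced accuracy is an illustrative choice).
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
cv_scores = cross_val_score(
    model, X[train_idx], y[train_idx], cv=cv, scoring="balanced_accuracy"
)

# Final fit on the full training portion, then a single evaluation on the test set.
model.fit(X[train_idx], y[train_idx])
y_pred = model.predict(X[test_idx])
tn, fp, fn, tp = confusion_matrix(y[test_idx], y_pred, labels=[0, 1]).ravel()

accuracy = (tp + tn) / (tp + tn + fp + fn)
sensitivity = tp / (tp + fn)  # share of COVID-19 cases correctly flagged
specificity = tn / (tn + fp)  # share of healthy cases correctly cleared

print(f"CV balanced accuracy: {cv_scores.mean():.3f}")
print(f"accuracy={accuracy:.3f} sensitivity={sensitivity:.3f} specificity={specificity:.3f}")
```

In this sketch the cross-validation folds are stratified by class but not grouped by patient. If slices from one patient should also stay within a single fold, scikit-learn's StratifiedGroupKFold is one option.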

Abstract

During the COVID-19 pandemic, researchers have made efforts to detect COVID-19 through various methods. In the dataset used for this study, COVID-19 patients were identified using chest computed tomography (CT) images. High dimensionality is frequently an issue in machine learning image classification. Accordingly, this study implemented three dimensionality reduction methods in combination with various machine learning algorithms for improved classification. Principal component analysis (PCA), uniform manifold approximation and projection (UMAP), and diffusion maps were applied to the dataset to extract the most important features of the chest CT images. The extracted features were given as input either to logistic regression or the extreme gradient boosting (XGBoost) algorithm to perform classification. The strongest model identified from this study was diffusion maps in combination with logistic regression. This model, evaluated against existing models from similar studies in recent years, yielded strong performance for detecting COVID-19 cases using chest CT images. Our proposed model achieved 97.35% accuracy, 92.16% sensitivity, and 98.59% specificity on the held-out test set in differentiating between COVID-19-positive cases and healthy, non-COVID-19 cases. This study aimed to detect COVID-19 without the use of viral testing. Importantly, this method could assist clinicians in making an initial diagnosis, especially when viral testing is not available or timely enough for the patient’s case. This study also provides deeper insight into various dimensionality reduction methods and how compatible they are with biomedical imaging data. Models were trained using stratified cross-validation on the training set, with final performance evaluated on a held-out test set at the patient level to prevent data leakage. Additional imbalance-aware metrics were used to assess robustness given class distribution differences.
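
For readers unfamiliar with diffusion maps, the reduction step behind the strongest model can be illustrated with a short NumPy sketch. It is a textbook-style version under stated assumptions (Gaussian kernel, no density correction, median-distance bandwidth), not the authors' implementation; the function name diffusion_map and its parameters are hypothetical.

```python
import numpy as np


def diffusion_map(X, n_components=2, epsilon=None, t=1):
    """Embed the rows of X with a basic diffusion map.

    Returns the embedding and the full eigenvalue spectrum (descending).
    """
    # Pairwise squared Euclidean distances, clipped at zero for round-off.
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = np.maximum(sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T, 0.0)

    if epsilon is None:
        epsilon = np.median(sq_dists)  # heuristic bandwidth (an assumption)

    # Gaussian affinity between every pair of samples.
    K = np.exp(-sq_dists / epsilon)
    d = K.sum(axis=1)

    # Symmetric conjugate of the Markov matrix P = D^-1 K; it has the same
    # eigenvalues as P and can be handled by a symmetric eigensolver.
    S = K / np.sqrt(np.outer(d, d))
    eigvals, eigvecs = np.linalg.eigh(S)
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]

    # Convert back to right eigenvectors of P.
    psi = eigvecs / np.sqrt(d)[:, None]

    # Skip the trivial first pair (eigenvalue 1, constant eigenvector) and
    # scale each coordinate by its eigenvalue raised to the diffusion time t.
    coords = psi[:, 1:n_components + 1] * eigvals[1:n_components + 1] ** t
    return coords, eigvals


# Synthetic stand-in for flattened images: 60 samples with 50 features each.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(60, 50))
embedding, spectrum = diffusion_map(X_demo, n_components=3)
print(embedding.shape)
```

The resulting low-dimensional coordinates are the kind of features that could then be passed to logistic regression or XGBoost. This sketch embeds a single set of points only; applying it to unseen test images would need an out-of-sample extension such as the Nyström method, which this summary does not describe.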

Cite This Study

Somodi et al. studied this question.

synapsesocial.com/papers/69ba44154e9516ffd37a5fd5 https://doi.org/10.3390/electronics15061235
