Biological matrix data are essential for computational analysis, providing a structured framework to identify patterns and relationships in biological systems. Many other biological data types, including sequences, networks, and images, can be transformed into matrix representations through feature extraction and encoding. However, their high dimensionality complicates analysis, leading to increased computational complexity and the risk of overfitting, known as the curse of dimensionality. To address these challenges, we developed SIBioX, a matrix-based bioinformatics tool powered by swarm intelligence algorithms. It integrates 54 swarm intelligence methods, 5 conventional feature selection techniques, and 17 machine learning models, enabling comprehensive analysis of biological matrix data. With a user-friendly graphical interface, it supports operations such as feature normalization, selection, classification, clustering, statistical analysis, and data visualization. Additionally, it converts nonmatrix biological data, like gene and protein sequences, into matrix formats for further study. Experimental results demonstrate that SIBioX not only attains high accuracy in feature selection but also effectively reduces dimensionality, thereby streamlining bioinformatics workflows and promoting greater efficiency in biomedical research.
Yao et al. (Sun,) studied this question.