Feature selection is a fundamental problem in machine learning and data mining. Choosing the most problem-relevant features from a set of collected features is essential. In this paper, a novel method that uses correlation coefficient clustering to remove similar or redundant features is proposed. The collected features are grouped into clusters by measuring their correlation coefficient values. The most class-dependent feature in each cluster is retained, while the others in the same cluster are removed. Thus, the features that are most class-related and mutually unrelated are identified. The proposed method was applied to two datasets: the disordered protein dataset and the Arrhythmia (ARR) dataset. The experimental results show that the method is superior to other feature selection methods in speed and/or accuracy. Detailed discussions are given in the paper.
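The clustering-then-selection idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `select_features`, the greedy grouping strategy, the `threshold` value of 0.8, and the use of absolute Pearson correlation with the label as the class-dependence score are all assumptions made for illustration, since the abstract does not specify them. The sketch also assumes that no feature is constant.

```python
import numpy as np

def select_features(X, y, threshold=0.8):
    """Hypothetical sketch: group correlated features, keep one per group.

    Assumptions (not taken from the paper): greedy grouping, a fixed
    correlation threshold, and |Pearson correlation with y| as the
    class-dependence score.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n_features = X.shape[1]

    # Absolute Pearson correlation between every pair of features.
    feat_corr = np.abs(np.corrcoef(X, rowvar=False))

    # Assumed class-dependence score for each feature.
    relevance = np.array(
        [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(n_features)]
    )

    unassigned = set(range(n_features))
    selected = []

    # Visit features from most to least class-dependent. Each unassigned
    # feature seeds a cluster of the remaining features that correlate
    # strongly with it; only the seed is retained.
    for j in np.argsort(-relevance):
        j = int(j)
        if j not in unassigned:
            continue
        cluster = {k for k in unassigned if feat_corr[j, k] >= threshold}
        cluster.add(j)
        unassigned -= cluster
        selected.append(j)

    return sorted(selected)
```

Calling `select_features(X, y)` on a samples-by-features array `X` and a numeric label vector `y` returns the column indices of the retained features. Because each cluster is seeded by the most class-dependent unassigned feature, every feature that is removed is both highly correlated with a retained feature and no more class-dependent than it.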
Hui-Huang Hsu
Tamkang University
Cheng‐Wei Hsieh
Institute of Biological Chemistry, Academia Sinica
Journal of Software
Hsu et al. studied this question.
synapsesocial.com/papers/6a15745f52e78db3804e23fb — DOI: https://doi.org/10.4304/jsw.5.12.1371-1377