December 1, 2010

Feature Selection via Correlation Coefficient Clustering

Abstract

Feature selection is a fundamental problem in machine learning and data mining: from a set of collected features, one must choose those most relevant to the problem at hand. This paper proposes a novel method that uses correlation coefficient clustering to remove similar or redundant features. The collected features are grouped into clusters according to their correlation coefficient values. The most class-dependent feature in each cluster is retained, while the others in the same cluster are removed. The features that remain are therefore the most class-related and are mutually unrelated. The proposed method was applied to two datasets: the disordered protein dataset and the Arrhythmia (ARR) dataset. The experimental results show that the method is superior to other feature selection methods in speed and/or accuracy. Detailed discussions are given in the paper.
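The idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the clustering procedure, the correlation threshold, or how class dependence is measured, so everything below is an assumption made for illustration: a greedy threshold-based grouping, absolute Pearson correlation between features, and absolute Pearson correlation with the label as the class-dependence score. The function name `select_features` and the parameter `threshold` are hypothetical and are not taken from the paper.

```python
import numpy as np


def select_features(X, y, threshold=0.8):
    """Illustrative sketch only; not the authors' published algorithm.

    Assumptions: features are clustered greedily by absolute Pearson
    correlation, and class dependence is scored as the absolute Pearson
    correlation between a feature and the label. Constant columns would
    yield NaN correlations and are not handled here.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n_features = X.shape[1]

    # Absolute correlation between every pair of features (columns of X).
    feature_corr = np.abs(np.corrcoef(X, rowvar=False))

    # Assumed class-dependence score for each feature.
    relevance = np.array(
        [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(n_features)]
    )

    unassigned = set(range(n_features))
    selected = []

    # Visit features from most to least class-dependent. Each unassigned
    # feature seeds a cluster of the remaining features correlated with it,
    # so the seed is the most class-dependent member of its cluster.
    for j in map(int, np.argsort(-relevance)):
        if j not in unassigned:
            continue
        cluster = {k for k in unassigned if feature_corr[j, k] >= threshold}
        cluster.add(j)
        unassigned -= cluster
        selected.append(j)  # keep the seed, drop the rest of the cluster

    return sorted(selected)


# Small synthetic demo: columns 0/1 and columns 2/3 are near-duplicates.
rng = np.random.default_rng(0)
a = rng.normal(size=200)
b = rng.normal(size=200)
X_demo = np.column_stack([
    a,
    a + 0.01 * rng.normal(size=200),
    b,
    b + 0.01 * rng.normal(size=200),
])
y_demo = a + 0.5 * b
selected = select_features(X_demo, y_demo, threshold=0.8)
print(selected)
```

In this demo one feature from each near-duplicate pair should be kept. Visiting features in order of decreasing class dependence is a design choice of the sketch: it guarantees that the retained feature is the most class-dependent one in its cluster without a separate search step.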

Authors

Hui-Huang Hsu

Tamkang University

Cheng‐Wei Hsieh

Institute of Biological Chemistry, Academia Sinica

Journals

Journal of Software
