March 30, 2004

Minimum redundancy feature selection from microarray gene expression data

Key Points

Key points are not available for this paper at this time.

Abstract

Selecting a small subset of genes out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. Feature sets obtained through the minimum redundancy - maximum relevance framework represent broader spectrum of characteristics of phenotypes than those obtained through standard ranking methods; they are more robust, generalize well to unseen data, and lead to significantly improved classifications in extensive experiments on 5 gene expressions data sets.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

C. Ding

Georgia Institute of Technology

Hujin Peng

Ocean University of China

Actions

Institutions

University of California, Berkeley

Lawrence Berkeley National Laboratory

National Energy Research Scientific Computing Center

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Minimum redundancy feature selection from microarray gene expression data

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study