March 30, 2004

Minimum redundancy feature selection from microarray gene expression data

Key Points

Key points are not available for this paper at this time.

Abstract

Selecting a small subset of genes out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. Feature sets obtained through the minimum redundancy - maximum relevance framework represent broader spectrum of characteristics of phenotypes than those obtained through standard ranking methods; they are more robust, generalize well to unseen data, and lead to significantly improved classifications in extensive experiments on 5 gene expressions data sets.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Institutions

University of California, Berkeley

Lawrence Berkeley National Laboratory

National Energy Research Scientific Computing Center

References and Citations

Add This Paper to Your Research Feed

Any time a new paper drops it will be there.