Key points are not available for this paper at this time.
Selecting a small subset of genes out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. Feature sets obtained through the minimum redundancy - maximum relevance framework represent broader spectrum of characteristics of phenotypes than those obtained through standard ranking methods; they are more robust, generalize well to unseen data, and lead to significantly improved classifications in extensive experiments on 5 gene expressions data sets.
Building similarity graph...
Analyzing shared references across papers
Loading...
C. Ding
Georgia Institute of Technology
Hujin Peng
Ocean University of China
University of California, Berkeley
Lawrence Berkeley National Laboratory
National Energy Research Scientific Computing Center
Building similarity graph...
Analyzing shared references across papers
Loading...
Ding et al. (Tue,) studied this question.
synapsesocial.com/papers/6a1c167a69a4af5b15a95704 — DOI: https://doi.org/10.1109/csb.2003.1227396