Key points are not available for this paper at this time.
Selecting a small subset of genes out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. Feature sets obtained through the minimum redundancy - maximum relevance framework represent broader spectrum of characteristics of phenotypes than those obtained through standard ranking methods; they are more robust, generalize well to unseen data, and lead to significantly improved classifications in extensive experiments on 5 gene expressions data sets.
Building similarity graph...
Analyzing shared references across papers
Loading...
University of California, Berkeley
Lawrence Berkeley National Laboratory
National Energy Research Scientific Computing Center
Add This Paper to Your Research Feed
Any time a new paper drops it will be there.
Ding et al. (Tue,) studied this question.