Key points are not available for this paper at this time.
We propose to employ an alternative implementation of random forests, that provides unbiased variable selection in the individual classification trees. When this method is applied using subsampling without replacement, the resulting variable importance measures can be used reliably for variable selection even in situations where the potential predictor variables vary in their scale of measurement or their number of categories. The usage of both random forest algorithms and their variable importance measures in the R system for statistical computing is illustrated and documented thoroughly in an application re-analyzing data from a study on RNA editing. Therefore the suggested method can be applied straightforwardly by scientists in bioinformatics research.
Building similarity graph...
Analyzing shared references across papers
Loading...
Carolin Strobl
University of Zurich
Anne‐Laure Boulesteix
Zimmer Biomet (Netherlands)
Achim Zeileis
Universität Innsbruck
SHILAP Revista de lepidopterología
BMC Bioinformatics
Ludwig-Maximilians-Universität München
Technical University of Munich
Friedrich-Alexander-Universität Erlangen-Nürnberg
Building similarity graph...
Analyzing shared references across papers
Loading...
Strobl et al. (Thu,) studied this question.
synapsesocial.com/papers/69d94d6f9873513554835d92 — DOI: https://doi.org/10.1186/1471-2105-8-25
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: