Key points are not available for this paper at this time.
Evaluating the performance of a classification algorithm critically requires a measure of the degree to which unseen examples have been identified with their correct class labels. In practice, generalizability is frequently estimated by averaging the accuracies obtained on individual cross-validation folds. This procedure, however, is problematic in two ways. First, it does not allow for the derivation of meaningful confidence intervals. Second, it leads to an optimistic estimate when a biased classifier is tested on an imbalanced dataset. We show that both problems can be overcome by replacing the conventional point estimate of accuracy by an estimate of the posterior distribution of the balanced accuracy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Kay H. Brodersen
University of Zurich
Cheng Soon Ong
Commonwealth Scientific and Industrial Research Organisation
Klaas Ε. Stephan
University of Iowa
ETH Zurich
University of Zurich
Building similarity graph...
Analyzing shared references across papers
Loading...
Brodersen et al. (Sun,) studied this question.
synapsesocial.com/papers/69d72166ef370a38abf50cad — DOI: https://doi.org/10.1109/icpr.2010.764
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: