Los puntos clave no están disponibles para este artículo en este momento.
We consider the problem of learning linear prediction models with model misspecification bias. In such case, the collinearity among input variables may inflate the error of parameter estimation, resulting in instability of prediction results when training and test distributions do not match. In this paper we theoretically analyze this fundamental problem and propose a sample reweighting method that reduces collinearity among input variables. Our method can be seen as a pretreatment of data to improve the condition of design matrix, and it can then be combined with any standard learning method for parameter estimation and variable selection. Empirical studies on both simulation and real datasets demonstrate the effectiveness of our method in terms of more stable performance across different distributed data.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zheyan Shen
Tsinghua University
Peng Cui
Dongshin University
Tong Zhang
Rutgers, The State University of New Jersey
Tsinghua University
Zhejiang University
Hong Kong University of Science and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Shen et al. (Fri,) studied this question.
synapsesocial.com/papers/6a15652a15658026c0824cd2 — DOI: https://doi.org/10.1609/aaai.v34i04.6024