What question did this study set out to answer?

This research aims to understand out-of-distribution generalization by framing it within distribution shifts and causal perspectives.

April 25, 2026

Counterfactual Risk Minimization for Out-of-Distribution Generalization

Key Points

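The study frames out-of-distribution (OOD) generalization as a distribution-shift problem and examines it from two complementary causal perspectives: a generative causal view of data generation and an anti-causal view of model learning.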
Developed a novel three-dimensional coordinate system to represent distribution shifts.
Created a new multidomain visual recognition dataset called CONA.
Evaluated Counterfactual Risk Minimization against several state-of-the-art competitors on four benchmark datasets.
According to the authors, the results affirm the superiority of Counterfactual Risk Minimization over the compared methods.
CONA was introduced to facilitate further exploration of OOD generalization.
The results also shed light on potential future directions for OOD generalization research.

Abstract

The out-of-distribution (OOD) property in data is regarded as one of the main challenges hindering the generalization ability of machine learning algorithms. However, the underlying causes of this property remain an open question that has yet to be fully understood. In this paper, we seek to enhance our understanding of the OOD phenomenon by framing it as a problem of distribution shift and addressing it through two complementary causal perspectives. The first is a generative causal view that elucidates the data generation process. We introduce a novel three-dimensional coordinate system to represent three fundamental distribution shifts, illustrating their role in various OOD generalization problems. The second is an anti-causal view that focuses on the model learning process. We develop an effective approach dubbed Counterfactual Risk Minimization (CRM) to address arbitrary distribution shifts in a unified framework. Additionally, we introduce a new multidomain visual recognition dataset called CONA to facilitate further exploration of OOD generalization. We evaluate CRM alongside several state-of-the-art competitors on four benchmark datasets under the three distribution shifts. The results not only affirm CRM's superiority but also shed light on potential future directions.
