July 11, 2024 · Open Access

Matching-Based Policy Learning

Abstract

Treatment heterogeneity is ubiquitous in many areas, motivating practitioners to search for the optimal policy that maximizes the expected outcome based on individualized characteristics. However, most existing policy learning methods rely on weighting-based approaches, which may suffer from high instability in observational studies. To enhance the robustness of the estimated policy, we propose a matching-based estimator of the policy improvement upon a randomized baseline. After correcting the conditional bias, we learn the optimal policy by maximizing the estimate over a policy class. We derive a non-asymptotic high probability bound for the regret of the learned policy and show that the convergence rate is almost 1/n. The competitive finite sample performance of the proposed method is demonstrated in extensive simulation studies and a real data application.
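To make the idea in the abstract concrete, here is a minimal sketch, not the paper's estimator. It assumes a binary treatment, a baseline that assigns each arm with probability 1/2, nearest-neighbour matching on Euclidean distance, and a policy class of one-dimensional threshold rules. The bias-correction step mentioned in the abstract is omitted. All function names (`matched_effects`, `advantage`, `best_threshold`) are hypothetical and chosen for illustration only. Under these assumptions, the improvement of a policy π over the coin-flip baseline equals E[(2π(X) − 1)(Y(1) − Y(0))] / 2, and the unobserved potential outcome is imputed from matched units in the opposite arm.

```python
import numpy as np

def matched_effects(X, A, Y, M=1):
    """Impute each unit's missing potential outcome as the mean outcome of
    its M nearest opposite-arm neighbours and return Y1_hat - Y0_hat.
    Illustrative only: no bias correction is applied."""
    Y = np.asarray(Y, dtype=float)
    A = np.asarray(A)
    X = np.asarray(X, dtype=float).reshape(len(Y), -1)
    tau = np.empty(len(Y))
    for i in range(len(Y)):
        pool = np.flatnonzero(A != A[i])               # units in the other arm
        dist = np.linalg.norm(X[pool] - X[i], axis=1)  # Euclidean distances
        nearest = pool[np.argsort(dist)[:M]]
        y_missing = Y[nearest].mean()
        tau[i] = Y[i] - y_missing if A[i] == 1 else y_missing - Y[i]
    return tau

def advantage(policy, tau):
    """Estimated improvement of a 0/1 policy over a coin-flip baseline."""
    return float(np.mean((2 * np.asarray(policy) - 1) * tau) / 2)

def best_threshold(x, tau, grid):
    """Return the c in grid that maximises the estimated advantage
    of the threshold rule 1{x > c}."""
    x = np.asarray(x, dtype=float)
    scores = [advantage((x > c).astype(float), tau) for c in grid]
    return grid[int(np.argmax(scores))]
```

A call such as `best_threshold(x, matched_effects(x, A, Y), grid)` then plays the role of "maximizing the estimate over a policy class", with the grid search standing in for whatever optimizer a richer policy class would need.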

Li et al. (2024) studied this question.

synapsesocial.com/papers/68e60ad1b6db64358759e3b7 https://doi.org/10.48550/arxiv.2407.08468
