Estimation of optimal dynamic anticoagulation regimes from observational data: a regret‐based approach