CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Synapse