Action Robust Reinforcement Learning via Optimal Adversary Aware Policy Optimization | Synapse