Extremum-Seeking Action Selection for Accelerating Policy Optimization