Guided Policy Optimization under Partial Observability | Synapse