GFlowNet Training by Policy Gradients | Synapse