Policy-Based Bayesian Active Causal Discovery with Deep Reinforcement Learning | Synapse