Information-directed policy sampling for episodic Bayesian Markov decision processes | Synapse