SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets | Synapse