Compositional shield synthesis for safe reinforcement learning in partial observability | Synapse