From T-Mazes to Labyrinths: Learning from Model-Based Feedback | Synapse