Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPs | Synapse