Partially Observable Reinforcement Learning with Memory Traces | Synapse