What Effects the Generalization in Visual Reinforcement Learning: Policy Consistency with Truncated Return Prediction