Off-Policy Reinforcement Learning for H_ Control Design | Synapse