Deep reinforecement learning based optimal defense for cyber-physical system in presence of unknown cyber-attack | Synapse