Deep reinforcement learning for autonomous control of hole-doped Hubbard clusters: A comparative study | Synapse