Pre-Training Transformers as Energy-Based Cloze Models | Synapse