Deduplicating Training Data Makes Language Models Better | Synapse