Early Convolutions Help Transformers See Better | Synapse