From RNNs to BERT: A Review of Neural Models for Sequence Learning | Synapse