LSTMs Can Learn Syntax-Sensitive Dependencies Well, But Modeling Structure Makes Them Better | Synapse