We introduce a new attention mechanism for both syntax and semantics.An implicit deep learning model is used to jointly learn Q-K-V embeddings and contextual token representations designed to simultaneously capturesyntactic co-occurrence and semantic alignment.
Gary Nan Tie (Thu,) studied this question.