BP-Transformer: Modelling Long-Range Context via Binary Partitioning | Synapse