Lower bound on the information loss incurred by compressing a depth-2 feed-forward layer of a transformer into a single fully connected layer | Synapse