Exploiting Attention Sparsity for Dual Context-Length Regimes | Synapse