Hierarchical Gated Delta Memory: Attention-Free Language Modeling with Constant Inference-Time Memory | Synapse