The Two-Layer Efficiency Stack: FlashAttention (Operation Efficiency) and Power Metric (Allocation Efficiency) as Independent, Compounding Layers of the Inference Compute Stack | Synapse