Block-structured matrix reordering for efficient SDDMM on tensor cores | Synapse