Scaling Laws for Transformers on Low-Dimensional Data: A Statistical and Approximation Theory Perspective | Synapse