CSSA: A Cross-Modal Spatial–Semantic Alignment Framework for Remote Sensing Image Captioning | Synapse