LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | Synapse