ABC-Eval: Benchmarking Large Language Models on Symbolic Music Understanding and Instruction Following | Synapse