When large language models are reliable for judging empathic communication | Synapse