Are LLM-based Evaluators Confusing NLG Quality Criteria? | Synapse