How to Evaluate AI Beyond Fluency: Grounding, Answerability, and Reliability | Synapse