Contextualized Evaluations: Judging Language Model Responses to Underspecified Queries | Synapse