Benchmarking LLM-as-a-Judge Models for 5W1H Extraction Evaluation | Synapse