DeepSeek vs ChatGPT vs Claude: benchmarking large language models for clinical diagnosis using a novel ICD-10-CM-based evaluation framework