Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Synapse