Multi-video summarization with vision-language models and hybrid optimization | Synapse