VoCo-LLaMA: Towards Vision Compression with Large Language Models | Synapse