Visual-and-Language multimodal fusion for sweeping robot navigation based on CNN and GRU | Synapse