A multi modal fusion framework combining CNN-based image recognition and BERT-based NLP for intelligent retrieval and matching of English teaching resources | Synapse