Video skimming and characterization through the combination of image and language understanding | Synapse