Automated artificial intelligence (AI) models appear to be able to detect alcohol imagery in videos at large scale with high accuracy and in near real time. Of the three AI models tested, ZSL-LLaVA achieved the best balance between accuracy and speed. Offering a cost- and time-efficient alternative to labour-intensive manual coding, ZSL-LLaVA could be used to monitor alcohol-related visual content in videos across diverse media platforms.
Pararath et al. studied this question.