LLM-based visual inspection framework with two paths: text-only inference and vision-language inference. A lightweight GUI enables one-click automation. Models are evaluated on ripeness, freshness, and authenticity tasks.
Yin et al. (Thu,) studied this question.