Los puntos clave no están disponibles para este artículo en este momento.
Various benchmarks have been proposed to test linguistic understanding in pre-trained vision \ we provide concrete insights into how CLIP processes negation on the VALSE existence task; and we highlight inherent limitations in the VALSE dataset as a benchmark for linguistic understanding.
Quantmeyer et al. (Mon,) studied this question.