Los puntos clave no están disponibles para este artículo en este momento.
Language is increasingly being used to define rich visual recognition problems with supporting image collections sourced from the web. Structured prediction models are used in these tasks to take advantage of correlations between co-occurring labels and visual input but risk inadvertently encoding social biases found in web corpora.
Zhao et al. (Sun,) studied this question.