Order-Free RNN With Visual Attention for Multi-Label Classification

Key Points

Key points are not available for this paper at this time.

Abstract

We propose a recurrent neural network (RNN) based model for image multi-label classification. Our model uniquely integrates and learning of visual attention and Long Short Term Memory (LSTM) layers, which jointly learns the labels of interest and their co-occurrences, while the associated image regions are visually attended. Different from existing approaches utilize either model in their network architectures, training of our model does not require pre-defined label orders. Moreover, a robust inference process is introduced so that prediction errors would not propagate and thus affect the performance. Our experiments on NUS-WISE and MS-COCO datasets confirm the design of our network and its effectiveness in solving multi-label classification problems.

Mark Helpful

Bookmark

Relay

View Full Paper

Mark Helpful

Bookmark

Relay

View Full Paper

Cite This Study

Chen et al. (Fri,) studied this question.

synapsesocial.com/papers/69db57d474ec163421835b6c https://doi.org/https://doi.org/10.1609/aaai.v32i1.12230

Also Consider

Synapse has enriched 2 closely related papers on similar clinical questions. Consider them for comparative context:

Also Consider

Synapse has enriched 2 closely related papers on similar clinical questions. Consider them for comparative context: