Key points are not available for this paper at this time.
Cross-modal hashing is an important approach for multimodal data management and application. Existing unsupervised cross-modal hashing algorithms mainly rely on data features in pre-trained models to mine their similarity relationships. However, their optimization objectives are based on the static metric between the original uni-modal features, without further exploring data correlations during the training. In addition, most of them mainly focus on association mining and alignment among pairwise instances in continuous space but ignore the latent structural correlations contained in the semantic hashing space. In this paper, we propose an unsupervised hash learning framework ASSPH to solve the above problems. Firstly, we propose an adaptive learning scheme, with limited data and training batches, to enrich semantic correlations of unlabeled instances during the training process and meanwhile to ensure a smooth convergence of the training process. Secondly, we present an asymmetric structural semantic representation learning scheme. We introduce structural semantic metrics based on graph adjacency relations and meanwhile align the inter- and intra-modal semantics in the hash space with an asymmetric binary optimization process. Finally, we conduct extensive experiments to validate the enhancements of our work in comparison with existing works.
Building similarity graph...
Analyzing shared references across papers
Loading...
Liang Li
University of Electronic Science and Technology of China
Baihua Zheng
Singapore Management University
Weiwei Sun
Experimental Center of Forestry in North China
Fudan University
Singapore Management University
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Mon,) studied this question.
synapsesocial.com/papers/6a1d8c4b43708a372d5e5a69 — DOI: https://doi.org/10.1145/3503161.3548431