What type of study is this?

September 10, 2025Open Access

Robust Supervised Deep Discrete Hashing for Cross-Modal Retrieval

Key Points

Robust Supervised Deep Discrete Hashing enhances cross-modal retrieval efficiency by generating discriminative hash codes.
Extensive experiments on four benchmark datasets show significant performance improvement over state-of-the-art methods.
This approach integrates a convolutional neural network and a multilayer perceptron for learning modality-specific features.
Using a non-redundant feature selection strategy addresses the challenge of optimal feature selection in deep cross-modal hashing.

Abstract

The exponential growth of multi-modal data in the real world poses significant challenges to efficient retrieval, and traditional single-modal methods are no longer suitable for the growth of multi-modal data. To address this issue, hashing retrieval methods play an important role in cross-modal retrieval tasks when referring to a large amount of multi-modal data. However, effectively embedding multi-modal data into a common low-dimensional Hamming space remains challenging. A critical issue is that feature redundancies in existing methods lead to suboptimal hash codes, severely degrading retrieval performance; yet, selecting optimal features remains an open problem in deep cross-modal hashing. In this paper, we propose an end-to-end approach, named Robust Supervised Deep Discrete Hashing (RSDDH), which can accomplish feature learning and hashing learning simultaneously. RSDDH has a hybrid deep architecture consisting of a convolutional neural network and a multilayer perceptron adaptively learning modality-specific representations. Moreover, it utilizes a non-redundant feature selection strategy to select optimal features for generating discriminative hash codes. Furthermore, it employs a direct discrete hashing scheme (SVDDH) to solve the binary constraint optimization problem without relaxation, fully preserving the intrinsic properties of hash codes. Additionally, RSDDH employs inter-modal and intra-modal consistency preservation strategies to reduce the gap between modalities and improve the discriminability of learned Hamming space. Extensive experiments on four benchmark datasets demonstrate that RSDDH significantly outperforms state-of-the-art cross-modal hashing methods.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Xiwei Dong

Fei Wu

Jintao Zhai

Journals

Technologies

Actions

Institutions

Wuhan University

Nanjing University of Posts and Telecommunications

Qufu Normal University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Robust Supervised Deep Discrete Hashing for Cross-Modal Retrieval

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study