July 10, 2024 | Open Access

Masked Graph Transformer for Large-Scale Recommendation

Abstract

Graph Transformers have garnered significant attention for learning graph-structured data, thanks to their superb ability to capture long-range dependencies among nodes. However, the quadratic space and time complexity hinders the scalability of Graph Transformers, particularly for large-scale recommendation. Here we propose an efficient Masked Graph Transformer, named MGFormer, capable of capturing all-pair interactions among nodes with a linear complexity. To achieve this, we treat all user/item nodes as independent tokens, enhance them with positional embeddings, and feed them into a kernelized attention module. Additionally, we incorporate learnable relative degree information to appropriately reweigh the attentions. Experimental results show the superior performance of our MGFormer, even with a single attention layer.
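
The linear-complexity claim rests on a property of kernelized attention: once the softmax is replaced by a positive feature map, the key-value product can be computed once and reused for every query, so the n × n attention matrix is never formed. The sketch below illustrates that general idea only and is not MGFormer's implementation. The feature map (elu(x) + 1), the function names, and the tensor shapes are assumptions chosen for illustration; the positional embeddings and the learnable degree-based reweighting described in the abstract are omitted.

```python
import numpy as np


def feature_map(x):
    # Positive feature map elu(x) + 1 (an assumption; the paper may use another kernel).
    # np.minimum keeps exp() from overflowing on large positive inputs.
    return np.where(x > 0, x + 1.0, np.exp(np.minimum(x, 0.0)))


def linear_attention(Q, K, V):
    """Kernelized attention in O(n * d * d_v) time, without an n x n matrix."""
    phi_q = feature_map(Q)            # (n, d)
    phi_k = feature_map(K)            # (n, d)
    kv = phi_k.T @ V                  # (d, d_v), shared by all queries
    z = phi_k.sum(axis=0)             # (d,), normalizer statistics
    numerator = phi_q @ kv            # (n, d_v)
    denominator = phi_q @ z           # (n,)
    return numerator / denominator[:, None]


def quadratic_attention(Q, K, V):
    """Reference version that builds the full n x n attention matrix."""
    phi_q = feature_map(Q)
    phi_k = feature_map(K)
    scores = phi_q @ phi_k.T                          # (n, n)
    weights = scores / scores.sum(axis=1, keepdims=True)
    return weights @ V
```

Both functions compute the same output; the first only reorders the matrix products, which is why its memory and time grow linearly with the number of user/item tokens rather than quadratically.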

Cite this study

Chen et al. (2024) studied this question.

synapsesocial.com/papers/68e60be9b6db64358759ecc0 — DOI: https://doi.org/10.1145/3626772.3657971

Authors

Huiyuan Chen

University of California, Riverside

Zhe Xu

East China University of Science and Technology

Chin‐Chia Michael Yeh

Visa (United Kingdom)

Institutions

University of Illinois Urbana-Champaign

Visa (United States)
