We propose a low-rank adaptation method for training privacy-preserving vision transformer (ViT) models that efficiently freezes pre-trained ViT model weights. In the proposed method, trainable rank decomposition matrices are injected into each layer of the ViT architecture, and moreover, the patch embedding layer is not frozen, unlike in the case of the conventional low-rank adaptation methods. The proposed method allows us not only to reduce the number of trainable parameters but to also maintain almost the same accuracy as that of full-time tuning.
Building similarity graph...
Analyzing shared references across papers
Loading...
Han Yu Lin
National Taiwan Ocean University
Shoko Imaizumi
Chiba University
Hitoshi Kiya
Tokyo Metropolitan University
Building similarity graph...
Analyzing shared references across papers
Loading...
Lin et al. (Wed,) studied this question.
synapsesocial.com/papers/68ef858cc6a308ba0635548c — DOI: https://doi.org/10.48550/arxiv.2507.11943
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: