What question did this study set out to answer?

The aim is to explore methods for protecting financial data privacy while improving model accuracy through federated learning.

April 1, 2026

Privacy protection and intelligent modeling of financial data based on federated learning

Key Points

The aim is to explore methods for protecting financial data privacy while improving model accuracy through federated learning.
Proposed a privacy-preserving modeling framework combining federated learning and gradient boosting decision tree.
Analyzed the information exchange during training in federated GBDT to identify risks.
Developed two privacy protection schemes using encryption technology.
The proposed method reduces model accuracy loss significantly.
Ensures effective label privacy during the modeling process.

Abstract

This article explores financial data privacy protection and intelligent modeling methods based on federated learning (FL). With the widespread application of artificial intelligence (AI) technology in the financial field, data silos and privacy breaches have become increasingly prominent, seriously restricting model performance and data collaboration efficiency. In response to this challenge, this article proposes a privacy preserving modeling framework that integrates FL and gradient boosting decision tree (GBDT), fully leveraging the advantages of GBDT in fitting ability and prediction accuracy, while utilizing FL's characteristics in breaking data silos to achieve efficient and secure joint modeling. This article delves into the information exchange mechanism during the training process of federated GBDT and identifies the potential privacy leakage risks that label information may pose in gradient exchange. To address this issue, two privacy protection schemes have been designed: in the presence of semi trusted third parties, a basic scheme is constructed using semi homomorphic encryption technology; Introducing threshold semi homomorphic encryption mechanism to enhance privacy protection in scenarios without any trusted third party. The results indicate that the proposed method significantly reduces the loss of model accuracy while effectively ensuring label privacy.

Bookmark

Cite This Study

Xinjun Mao (Sun,) studied this question.

synapsesocial.com/papers/69ccb6fd16edfba7beb88c2c https://doi.org/https://doi.org/10.1049/icp.2026.0300

Bookmark