With the prevalence of Internet AI technology, financial fraud becomes an imperative problem, especially in the context of machine learning. Technologies such as deep learning and natural language processing provide effective tools for detecting fraud with the guidance of financial statements, improving the efficiency and accuracy of data analysis, and helping to ensure financial safety. In this study, we propose a sophisticated representation learning method to detect financial statement fraud by tracking the detailed changes in the company’s Management Discussion and Analysis (MD&A) documents over time. Unlike traditional word frequency methods, we align paragraphs between consecutive disclosures based on their similarity at the paragraph level and categorize them into three types: added, deleted, and matched. Next, we create multivariate change trajectory representations based on fraud-related word categories. Finally, we use these word-level change trajectories to design a fraud detection model and compare it with several traditional models as well as the latest Time-Series Foundation Models. Experiments on 24 years of financial report data, from 1995 to 2019, show that our representation learning method significantly improves the performance of financial statement fraud detection across 11 different machine learning models, consistently outperforming traditional word frequency methods. Our method opens a new paradigm for feature engineering in financial statement fraud detection. Our code can be found at https://github.com/LittelStudent/Financial-Statement-Fraud-Detection-ParaEmb-FraudW2V.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yue Yu
Ministry of Natural Resources
Zhen Wu
Institute of Information Engineering
Yanni Han
Institute of Information Engineering
ACM Transactions on Internet Technology
Fordham University
Institute of Information Engineering
Building similarity graph...
Analyzing shared references across papers
Loading...
Yu et al. (Tue,) studied this question.
synapsesocial.com/papers/698d6e925be6419ac0d5453c — DOI: https://doi.org/10.1145/3796514