Arbeitspapier

Deep Learning application for fraud detection in financial statements

Financial statement fraud is an area of significant consternation for potential investors, auditing companies, and state regulators. Intelligent systems facilitate detecting financial statement fraud and assist the decision-making of relevant stakeholders. Previous research detected instances in which financial statements have been fraudulently misrepresented in managerial comments. The paper aims to investigate whether it is possible to develop an enhanced system for detecting financial fraud through the combination of information sourced from financial ratios and managerial comments within corporate annual reports. We employ a hierarchical attention network (HAN) with a long short-term memory (LSTM) encoder to extract text features from the Management Discussion and Analysis (MD&A) section of annual reports. The model is designed to offer two distinct features. First, it reflects the structured hierarchy of documents, which previous models were unable to capture. Second, the model embodies two different attention mechanisms at the word and sentence level, which allows content to be differentiated in terms of its importance in the process of constructing the document representation. As a result of its architecture, the model captures both content and context of managerial comments, which serve as supplementary predictors to financial ratios in the detection of fraudulent reporting. Additionally, the model provides interpretable indicators denoted as “red-flag” sentences, which assist stakeholders in their process of determining whether further investigation of a specific annual report is required. Empirical results demonstrate that textual features of MD&A sections extracted by HAN yield promising classification results and substantially reinforce financial ratios.

Language
Englisch

Bibliographic citation
Series: IRTG 1792 Discussion Paper ; No. 2020-007

Classification
Wirtschaft
Mathematical and Quantitative Methods: General
Subject
fraud detection
financial statements
deep learning
text analytics

Event
Geistige Schöpfung
(who)
Craja, Patricia
Kim, Alisa
Lessmann, Stefan
Event
Veröffentlichung
(who)
Humboldt-Universität zu Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series"
(where)
Berlin
(when)
2020

Handle
Last update
10.03.2025, 11:45 AM CET

Data provider

This object is provided by:
ZBW - Deutsche Zentralbibliothek für Wirtschaftswissenschaften - Leibniz-Informationszentrum Wirtschaft. If you have any questions about the object, please contact the data provider.

Object type

  • Arbeitspapier

Associated

  • Craja, Patricia
  • Kim, Alisa
  • Lessmann, Stefan
  • Humboldt-Universität zu Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series"

Time of origin

  • 2020

Other Objects (12)