Deep Learning for Effective Classification and Information Extraction of Financial Documents

Valentin-Adrian Serbanescu, Maruf A. Dhali*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

The financial and accounting sectors are encountering increased demands to effectively manage large volumes of documents in today’s digital environment. Meeting this demand is crucial for accurate archiving, maintaining efficiency and competitiveness, and ensuring operational excellence in the industry. This study proposes and analyzes machine learning-based pipelines to effectively classify and extract information from scanned and photographed financial documents, such as invoices, receipts, bank statements, etc. It also addresses the challenges associated with financial document processing using deep learning techniques. This research explores several models, including LeNet5, VGG19, and MobileNetV2 for document classification and RoBERTa, LayoutLMv3, and GraphDoc for information extraction. The models are trained and tested on financial documents from previously available benchmark datasets and a new dataset with financial documents in Romanian. Results show MobileNetV2 excels in classification tasks (with accuracies of 99.24% with data augmentation and 93.33% without augmentation), while RoBERTa and LayoutLMv3 lead in extraction tasks (with F1-scores of 0.7761 and 0.7426, respectively). Despite the challenges posed by the imbalanced dataset and cross-language documents, the proposed pipeline shows potential for automating the processing of financial documents in the relevant sectors.
Original languageEnglish
Title of host publicationProceedings of the 14th International Conference on Pattern Recognition Applications and Methods
EditorsModesto Castrillon-Santana, Maria De Marsico, Ana Fred
Place of PublicationPorto, Portugal
PublisherSciTePress
Pages749-756
Number of pages7
Volume1
ISBN (Electronic)978-989-758-730-6
DOIs
Publication statusPublished - 2025

Publication series

Name
ISSN (Electronic)2184-4313

Keywords

  • Deep Learning
  • Classification
  • Financial Documents
  • Optical Character Recognition
  • CNN
  • RoBERTa
  • LayoutLMv3
  • GraphDoc
  • Information Extraction
  • Document Processing

Fingerprint

Dive into the research topics of 'Deep Learning for Effective Classification and Information Extraction of Financial Documents'. Together they form a unique fingerprint.

Cite this