CNN-LSTM-Based Bilingual Receipt Information Extraction Using Template-Based Data Generation

Conference proceedings article

Authors/Editors

Strategic Research Themes

Big Data Analytics (Digital Transformation)

Publication Details

Author list: Santitham Prom-on, Phoramint Chotwarutkit, Poonyawee Wongwisetsuk, Jaturon Harnsomburana

Publication year: 2024

Start page: 1

End page: 5

Number of pages: 5

URL: https://ieeexplore.ieee.org/document/10770740

Languages: English-United States (EN-US)

View on publisher site

Abstract

This paper discusses the development of a bilingual receipt information extraction system using a CNN-LSTM model with data augmentation techniques. The system targets the extraction of essential information such as company names, dates, and total amounts from receipts containing both Thai and English text. To address the limited availability of annotated data, synthetic receipt samples were generated from initial templates, creating a diverse training dataset. The model’s performance was evaluated on both the generated dataset and the SROIE 2019 dataset, achieving high accuracy across all tested information classes. While the CNN effectively extracts features, the LSTM processes these features for accurate information extraction. Future work aims to incorporate transformer-based models to enhance the system’s contextual understanding and generalization capabilities. This research highlights the effectiveness of combining CNNs and LSTMs in handling complex, multilingual datasets for practical applications in information extraction.

Keywords

CNN, data augmentation, Digital image processing, LSTM