CNN-LSTM-Based Bilingual Receipt Information Extraction Using Template-Based Data Generation

Conference proceedings article


Publication Details

Author list: Santitham Prom-on, Phoramint Chotwarutkit, Poonyawee Wongwisetsuk, Jaturon Harnsomburana

Publication year: 2024

Start page: 1

End page: 5

Number of pages: 5

URL: https://ieeexplore.ieee.org/document/10770740

Languages: English-United States (EN-US)




Abstract

This paper presents a bilingual receipt information extraction system built on a CNN-LSTM model and trained with data augmentation techniques. The system extracts essential fields, such as company names, dates, and total amounts, from receipts containing both Thai and English text. To address the limited availability of annotated data, synthetic receipt samples were generated from initial templates, creating a diverse training dataset. The model was evaluated on both the generated dataset and the SROIE 2019 dataset and achieved high accuracy across all tested information classes. The CNN extracts features, and the LSTM processes those features to extract the target information. Future work aims to incorporate transformer-based models to enhance the system’s contextual understanding and generalization capabilities. This research highlights the effectiveness of combining CNNs and LSTMs in handling complex, multilingual datasets for practical applications in information extraction.
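
The template-based generation step described in the abstract can be illustrated with a minimal, text-only sketch. It is not the authors' pipeline: the template strings, company names, value ranges, and function names (`generate_receipt`, `generate_dataset`) are all hypothetical, and the paper's generator may well render receipt images rather than plain text. The sketch only shows the general idea of filling labeled slots in a small set of Thai and English templates to obtain annotated samples.

```python
import random

# Hypothetical receipt templates (assumption, not from the paper).
# {company}, {date}, and {total} are the slots to be filled.
TEMPLATES = [
    "{company}\nDate: {date}\nTOTAL {total}",
    "{company}\nวันที่ {date}\nรวมทั้งสิ้น {total} บาท",
]

# Hypothetical company names in English and Thai (assumption).
COMPANIES = ["Siam Coffee Co., Ltd.", "บริษัท ตัวอย่าง จำกัด"]


def generate_receipt(rng):
    """Fill one randomly chosen template; return (text, labels)."""
    fields = {
        "company": rng.choice(COMPANIES),
        "date": "{:02d}/{:02d}/{}".format(
            rng.randint(1, 28), rng.randint(1, 12), rng.randint(2019, 2024)
        ),
        "total": "{:.2f}".format(rng.uniform(20, 5000)),
    }
    text = rng.choice(TEMPLATES).format(**fields)
    # The slot values double as ground-truth labels for the sample.
    return text, fields


def generate_dataset(n, seed=0):
    """Generate n labeled synthetic receipts, reproducibly for a given seed."""
    rng = random.Random(seed)
    return [generate_receipt(rng) for _ in range(n)]


if __name__ == "__main__":
    for text, labels in generate_dataset(2, seed=1):
        print(text)
        print(labels)
        print("---")
```

Because every sample is produced from known slot values, the annotations come for free, which is the property that makes template-based generation useful when labeled receipts are scarce.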


Keywords

CNN, data augmentation, Digital image processing, LSTM


Last updated on 2025-19-08 at 12:00