Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey
Journal article
Authors/Editors
Strategic Research Themes
Publication Details
Author list: Zhang W.; Wang Y.; Song Y.; Wei V.J.; Tian Y.; Qi Y.; Chan J.H.; Wong R.C.; Yang H.
Publisher: Institute of Electrical and Electronics Engineers
Publication year: 2024
Journal: IEEE Transactions on Knowledge and Data Engineering (1041-4347)
Volume number: 36
Issue number: 11
Start page: 6699
End page: 20
Number of pages: -6678
ISSN: 1041-4347
eISSN: 1558-2191
Languages: English-Great Britain (EN-GB)
Abstract
The emergence of natural language processing has revolutionized the way users interact with tabular data, enabling a shift from traditional query languages and manual plotting to more intuitive, language-based interfaces. The rise of large language models (LLMs) such as ChatGPT and its successors has further advanced this field, opening new avenues for natural language processing techniques. This survey presents a comprehensive overview of natural language interfaces for tabular data querying and visualization, which allow users to interact with data using natural language queries. We introduce the fundamental concepts and techniques underlying these interfaces with a particular emphasis on semantic parsing, the key technology facilitating the translation from natural language to SQL queries or data visualization commands. We then delve into the recent advancements in Text-to-SQL and Text-to-Vis problems from the perspectives of datasets, methodologies, metrics, and system designs. This includes a deep dive into the influence of LLMs, highlighting their strengths, limitations, and potential for future improvements. Through this survey, we aim to provide a roadmap for researchers and practitioners interested in developing and applying natural language interfaces for data interaction in the era of large language models. IEEE
Keywords
Large language models, natural language interface, semantic parsing, text -to-SQL, text-to-visualization