Talk to Your Data: A Survey on Natural Language Interfaces for Data Visualization in the Age of LLMs
The Hong Kong University of Science and Technology (Guangzhou)
Data Science and Analytics Thrust
PhD Qualifying Examination
By Mr LUO Tianqi
Abstract
The rapid evolution of Natural Language Interfaces for Visualization (NL2VIS) has significantly transformed how users interact with data, culminating in the integration of Large Language Models (LLMs) in recent years. This survey presents the first comprehensive review of NL2VIS systems in the LLM era, covering the methodological evolution from rule-based and neural approaches to pre-trained and large-scale generative models. We propose a unified pipeline framework that spans three operational spaces—Data Space, Visualization Space, and Narration Space—capturing the full spectrum of system capabilities from query interpretation to storytelling. Our review systematically categorizes over 100 publications and 50 systems, analyzes key challenges such as semantic ambiguity, evaluation inconsistency, and limited dialogue support, and surveys existing benchmarks and evaluation frameworks. We also highlight emerging research directions, such as multi-agent pipelines for complex visualization tasks, step-wise reasoning for ambiguity resolution, and automatic data synthesis for robust model training. This survey aims to serve as a foundational reference for researchers and practitioners, outlining a roadmap toward more interpretable, conversational, and user-centered NL2VIS systems powered by LLMs.
PQE Committee
Chair of Committee: Prof. WANG Wei
Prime Supervisor: Prof. LUO Yuyu
Co-Supervisor: Prof. TANG Nan
Examiner: Prof. YANG Weikai
Date
11 June 2025
Time
15:00:00 - 16:00:00
Venue
E1-148 (HKUST-GZ)