论文开题审查

Neural Information Retrievaland Beyond

The Hong Kong University of Science and Technology (Guangzhou)

数据科学与分析学域

PhD Thesis Proposal Examination

By Mr. ZHOU Jiawei

摘要

For decades, information retrieval (IR) has been fundamental in helping people access knowledge. As data volumes grow beyond human capacity, retrieval systems have become indispensable.

However, we are now witnessing a fundamental shift. With the emergence of large language models (LLMs) such as ChatGPT, Claude, and DeepSeek, retrieval is no longer designed solely for human users. Increasingly, LLM-based AI systems act as autonomous agents, performing tasks on behalf of people. These systems depend on retrieval not just to deliver results to users, but to gather and ground information for reasoning, content generation, and decision-making. This has given rise to a new paradigm: retrieval-augmented generation (RAG), where retrieval serves as the memory and knowledge interface for AI.

This thesis investigates retrieval for both human users and AI systems. I begin by examining retrieval systems tailored for human-facing applications, with an emphasis on pre-training, transparency, multi-modal data. Building on this foundation, I analyze the emerging gap between user-oriented retrieval and AI-oriented retrieval, highlighting the limitations of current methods when integrated into LLM pipelines. Finally,I propose to explore how retrieval systems can be explicitly redesigned to support AIagents—improving factuality, adaptability, and efficiency in end-to-end tasks.

TPE Committee

Chair of Committee: Prof. CHU Xiaowen

Prime Supervisor: Prof. CHEN Lei

Co-Supervisor: Prof. TSUNG Fugee

Examiner: Prof. LIANG Yuxuan

日期

10 June 2025

时间

09:00:00 - 10:00:00

地点

E1-147 (HKUST-GZ)