TOWARDS THE ENHANCEMENT OF LARGE LANGUAGE MODELS

论文开题审查

The Hong Kong University of Science and Technology (Guangzhou)

数据科学与分析学域

PhD Thesis Proposal Examination

By Mr. Yuxin JIANG

摘要

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. However, the inherent complexity of LLMs poses significant challenges in effectively enhancing their capabilities. The intricate architectures and the vast amount of data required for training add layers of difficulty to optimizing these models. To systematically address these challenges, we categorize the LLM ecosystem into three critical stages within its development pipeline: construction; utilization and augmentation; and evaluation. This framework not only highlights the key areas for innovation but also underscores the importance of interdisciplinary collaboration in overcoming the complexities associated with LLMs. To this end, we introduce our proposed methods in different stages and present the experimental results obtained. Eventually, we explore the prevailing challenges and outline the most encouraging paths for future research.

TPE Committee

Chairperson: Prof Xiaowen CHU

Prime Supervisor: Prof Wei WANG

Co-Supervisor: Prof Jiaqiang HUANG

Examiner: Prof Wenjia WANG

日期

12 June 2024

时间

14:50:00 - 16:05:00

地点

E1-149