TOWARDS THE ENHANCEMENT OF LARGE LANGUAGE MODELS

Thesis Proposal Examination

The Hong Kong University of Science and Technology (Guangzhou)

Data Science and Analytics Thrust

PhD Thesis Proposal Examination

By Mr. Yuxin JIANG

Abstract

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. However, the inherent complexity of LLMs poses significant challenges in effectively enhancing their capabilities. The intricate architectures and the vast amount of data required for training add layers of difficulty to optimizing these models. To systematically address these challenges, we categorize the LLM ecosystem into three critical stages within its development pipeline: construction; utilization and augmentation; and evaluation. This framework not only highlights the key areas for innovation but also underscores the importance of interdisciplinary collaboration in overcoming the complexities associated with LLMs. To this end, we introduce our proposed methods in different stages and present the experimental results obtained. Eventually, we explore the prevailing challenges and outline the most encouraging paths for future research.

TPE Committee

Chairperson: Prof Xiaowen CHU

Prime Supervisor: Prof Wei WANG

Co-Supervisor: Prof Jiaqiang HUANG

Examiner: Prof Wenjia WANG

Date

12 June 2024

Time

14:50:00 - 16:05:00

Location

E1-149