PhD Qualifying-Exam

Towards a Large Gesture Language Model: Applications and Challenges

The Hong Kong University of Science and Technology (Guangzhou)

Data Science and Analytics Thrust

PhD Qualifying Examination

By Mr. Minghui QIU

Abstract

The widespread success of Large Language Models has garnered significant attention across various domains. In a similar vein, gesture language, as a form of human communication, holds immense potential to enhance our daily lives. This survey aims to explore the concept of constructing a large gesture language model. Broadly, the survey is divided into two parts.

The first part entails a comprehensive review of the gesture recognition system, encompassing its applications, technological advancements in sensing modalities, and implementation. For instance, the review highlights the applications of gesture recognition in fields such as rehabilitation, prosthesis control, human-machine interface, and sign language recognition, underscoring the potential benefits of a large gesture language model in these areas. Additionally, it examines the advancements in sensing modalities for wearable devices, including electrical, mechanical, acoustical and optical, which are crucial for accurately interpreting a wide range of gestures. This analysis further underscores the substantial potential for developing a large gesture language model while also shedding light on the associated data challenges.

In the second part, the survey recognizes the opportunities on wearable devices and successful technologies, and proposes to address the data challenges through two key steps. Firstly, it suggests expanding the data volume with gamified crowdsourcing to ensure a diverse and comprehensive dataset for training the large gesture language model. Secondly, the survey advocates resolving the heterogeneity problems through transfer learning, multimodal co-learning, and multitask learning. These approaches are identified as crucial for enhancing data quality, improving model accuracy, and addressing issues related to diverse and nuanced gestures.

PQE Committee

Chairperson: Prof. Nan TANG

Prime Supervisor: Prof Kaishun WU

Co-Supervisor: Prof Mingming FAN

Examiner: Prof Yanlin ZHANG

Date

05 June 2024

Time

09:50:00 - 11:05:00

Location

E1-147