DSA学域研讨会

Empowering Large Language Models with Graph Learning Techniques

Large Language Models have become transformative tools at the forefront of artificial general intelligence. As a group specializing in computation and learning on graphs, we explore how graph-based methodologies can augment LLM capabilities and advance the AGI pipeline. In this talk, I will mainly present two recent works.

First, I will discuss our work on developing more expressive learnable position encodings for LLMs by integrating positional encodings with token representations. These positional encodings follow the principles of rotational equivariance rooted in graph and geometric learning. Our method enables LLMs to better handle arithmetic tasks and long-range reasoning, areas where traditional sequential encodings often struggle to capture complex token relationships.

Second, I will introduce our research on Knowledge Graph-based Retrieval-Augmented Generation. This work investigates the trade-off between retrieval effectiveness and computational efficiency. We demonstrate that a lightweight model, employing encoded directional structural distances and parallel triple scoring, achieves superior accuracy and speed. The size of the retrieved subgraphs can be flexibly adjusted to align with the query's requirements and the downstream LLM's capabilities.

If time permits, I will also briefly highlight our investigations into state-space models, including challenges like recency bias and oversmoothing in the Mamba model. Relevant publications include:

Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding, Zhu et al., 2024 (to appear in arxiv)

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation, Li et al., arxiv 2024

Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing, Wang et al., 2024 (to appear in arxiv)

Pan Li

助理教授 

Georgia Tech

Pan Li is an assistant professor at Georgia Tech. ECE department as an assistant professor and an affiliated assistant professor at Purdue CS. Pan's research interest lies broadly in the area of machine learning and computation with graph data.  Pan Li's work has been recognized by several awards including NSF Early Career Award, several industry research awards from Meta, JPMorgan, Sony, Amazon, the Best Paper award at the Learning on Graph Conference 2022 and the Best Paper award nomination at the Web Conference 2021.

日期

08 January 2025

时间

11:00:00 - 12:00:00

地点

香港科技大学(广州)W4-2楼-202室

Join Link

Zoom Meeting ID:
914 0056 4983


Passcode: dsat

主办方

数据科学与分析学域

联系邮箱

dsat@hkust-gz.edu.cn