A Survey of Learning-based Multi-agent Navigation

博士资格考试

The Hong Kong University of Science and Technology (Guangzhou)

数据科学与分析学域

PhD Qualifying Examination

By Miss HUANG Yunjie

摘要

Multi-Agent Path Finding (MAPF) asks a team of robots to reach individual goals without collisions—an operation that underpins warehouse fulfilment, traffic coordination, and swarm robotics. Classical search planners such as A*, Conflict-Based Search, and their bounded-optimal extensions guarantee short paths, yet their running time and memory grow steeply with agent density, map size, and partial observability. As fleets scale into the hundreds, these costs hinder real-time deployment and motivate alternative approaches.

Reinforcement learning (RL) offers a complementary route: by training decentralised policies from experience, agents can react on-the-fly instead of exhaustively enumerating the joint state space. Recent work enriches RL with imitation, heuristic guidance, graph neural reasoning, and hierarchical abstractions, extending learned navigation from small grids to continuous, dynamics-aware domains. At the same time, GPU-based simulators such as Isaac Gym, ROS-integrated Gazebo, and lightweight benchmarks like VMAS and POGEMA provide the large-batch roll-outs needed to refine and test these methods under realistic sensing and actuation constraints.

This survey organises the field along four themes—classical planners, RL founda tions, learning-centric MAPF algorithms, and evaluation tool-chains—highlighting key advances, open safety and energy challenges, and the prospects for hybrid solutions that blend RL adaptability with search guarantees.

PQE Committee

Chair of Committee: Prof. LUO Qiong

Prime Supervisor: Prof. LI Lei

Co-Supervisor: Prof. LI Haoang

Examiner: Prof. LIANG Yuxuan

日期

09 June 2025

时间

17:00:00 - 18:00:00

地点

E1-147 (HKUST-GZ)

Join Link

Zoom Meeting ID:
981 0920 8770

Tencent Meeting ID:
dsa2025