Scalable Self-Improving Foundation Agents: From Agent Generation to Environmental Scaling and Long-Horizon Evolution
The Hong Kong University of Science and Technology (Guangzhou)
Data Science and Analytics Thrust
PhD Qualifying Examination
By Mr. ZHANG, Jiayi
Abstract
Foundation agents extend large language models from passive text generators into systems that plan, call tools, coordinate modules, maintain memory, and act in external environments. Yet most current agents remain static: prompts and workflows are hand-engineered, evaluation is concentrated on fixed benchmarks, and improvement mechanisms are short-horizon or task-specific. This survey argues that scalable self-improving foundation agents require progress along three coupled stages. Agent generation makes prompts, workflows, sub-agents, reasoning structures, and tool-use policies automatically constructible from execution feedback. Environmental scaling provides diverse, executable, and verifiable feedback beyond fixed benchmark instances, by generating heterogeneous environments and verifiable interaction worlds rather than only more rollouts. Long-horizon evolution turns accumulated histories into structured state that can revise memory, reward, policy, and the improvement mechanism itself. We review recent progress along these three stages, identify open challenges in feedback reliability, environment validity, long-horizon credit assignment, reward hacking, memory drift, and governance, and outline a dissertation agenda that connects agent generation, environmental scaling, and long-horizon evolution into a cumulative research program toward generalizable foundation agents.
PQE Committee
- Chair: Prof. CHU, Xiaowen
- Prime Supervisor: Prof. LUO, Yuyu
- Co-Supervisor: Prof. LIU, Bang
- Examiner: Prof. TANG, Jing
Date
09 June 2026
Time
16:00:00 - 17:00:00
Location
E1-149, HKUST(GZ)
Join Link
Zoom Meeting ID: 933 7726 8600
Passcode: dsa2026