PhD Qualifying-Exam

Scalable Self-Improving Foundation Agents: From Agent Generation to Environmental Scaling and Long-Horizon Evolution

The Hong Kong University of Science and Technology (Guangzhou)

Data Science and Analytics Thrust

PhD Qualifying Examination

By Mr. ZHANG, Jiayi

Abstract

Foundation agents extend large language models from passive text generators into systems that plan, call tools, coordinate modules, maintain memory, and act in external environments. Yet most current agents remain static: prompts and workflows are hand-engineered, evaluation is concentrated on fixed benchmarks, and improvement mechanisms are short-horizon or task-specific. This survey argues that scalable self-improving foundation agents require progress along three coupled stages. Agent generation makes prompts, workflows, sub-agents, reasoning structures, and tool-use policies automatically constructible from execution feedback. Environmental scaling provides diverse, executable, and verifiable feedback beyond fixed benchmark instances, by generating heterogeneous environments and verifiable interaction worlds rather than only more rollouts. Long-horizon evolution turns accumulated histories into structured state that can revise memory, reward, policy, and the improvement mechanism itself. We review recent progress along these three stages, identify open challenges in feedback reliability, environment validity, long-horizon credit assignment, reward hacking, memory drift, and governance, and outline a dissertation agenda that connects agent generation, environmental scaling, and long-horizon evolution into a cumulative research program toward generalizable foundation agents.

PQE Committee

  • Chair: Prof. CHU, Xiaowen
  • Prime Supervisor: Prof. LUO, Yuyu
  • Co-Supervisor: Prof. LIU, Bang
  • Examiner: Prof. TANG, Jing

Date

09 June 2026

Time

16:00:00 - 17:00:00

Location

E1-149, HKUST(GZ)

Join Link

Zoom Meeting ID:
933 7726 8600


Passcode: dsa2026