A Survey of Semantic SVG Generation for IntelligentVisual Authoring
The Hong Kong University of Science and Technology (Guangzhou)
Data Science and Analytics Thrust
PhD Qualifying Examination
By Mr. XU, Jing
Abstract
Scalable Vector Graphics (SVG) serve simultaneously as rendered images and editable source code. This dual nature makes SVG a promising representation for visual content authoring, because visual artifacts can be generated, recovered, and manipulated as structured documents with source-level structure, not fixed pixels alone. Such a representation enables fine-grained editing, component reuse, animation control, and seamless integration with downstream design and authoring work-flows. However, producing a visually plausible SVG is usually insufficient. A generated SVG may consist of thousands of anonymous paths, losing editable text and object identity, or becoming fragile even under a small local edit. In other words, it may be visually adequate yet structurally opaque, making downstream manipulation difficult. This limitation highlights a recurring appearance–structure gap in authoring-oriented SVG generation, where rendered similarity underdetermines source utility. When downstream workflows require editing, reuse, verification, animation, or data binding, methods must consider meaningful objects, reusable components, editable primitives, and stable structural relations in addition to visual appearance. This survey reviews SVG generation methods that are relevant to authoring workflows through the lens of this gap. We cover methods that create or recover SVGs from text prompts, raster images, sketches, and examples. To enable systematic comparison, we organize prior work with a faceted taxonomy rather than a strict hierarchy: entry condition or task, operational source unit, generation mechanism, feedback or evaluation signal, and authoring affordance or output contract. Here,
semantic refers to operation-relevant source structure, not only object-category labels. Based on
this review, we identify four directions for advancing authoring-oriented semantic SVG generation
when downstream workflows require editing or reuse: render–DOM–language grounding, structure-aware training objectives and rewards, workflow-level benchmarks, and robust SVG recovery from
real-world visual inputs.
PQE Committee
Chair: Prof. YU, Xu Jeffrey
Prime Supervisor: Prof. YANG, Weikai
Co-Supervisor: Prof. LUO, Yuyu
Examiner: Prof. TANG, Guoming
Date
09 June 2026
Time
16:00:00 - 17:00:00
Location
E1-147, HKUST(GZ)
Event Organizer
Data Science and Analytics Thrust
dsarpg@hkust-gz.edu.cn