博士资格考试

Computational Methods for Spatial Transcriptomics: A Modeling-Oriented Survey

The Hong Kong University of Science and Technology (Guangzhou)

数据科学与分析学域

PhD Qualifying Examination

By Ms. FENG, Junning

摘要

Spatial transcriptomics (ST) enables gene-expression profiling in situ, restoring the spatial context that tissue dissociation irreversibly discards: a cell’s anatomical position, its local neighborhood composition, and the contact-dependent signals that collectively shape its transcriptional state. This capability has driven the development of diverse computational tasks, including spatial domain identification, spatially variable gene detection, cell-type deconvolution, and microenvironmental niche analysis. However, translating these high-dimensional data into robust biological insights presents significant computational challenges. ST datasets are inherently sparse, spatially autocorrelated, often multimodal, and subject to strong platform-specific biases, typically in the absence of definitive ground truth.

This survey reviews computational methods for spatial transcriptomics from a modeling-oriented perspective, spanning all major computational task families within a unified analytical framework. Methods are evaluated according to the biological or statistical objects they formalize and the assumptions required for their interpretability. This perspective surfaces cross-task methodological tensions, including conflicts between spatial-dependence modeling and independent-sample assumptions or between representational flexibility and biological interpretability, and provides principled criteria for method selection across different data and hypothesis regimes. The review covers spatial domain and variable gene detection, cell-type deconvolution, ligand-receptor communication and niche modeling, representation learning and foundation models, spatial gene program extraction, and counterfactual perturbation modeling.

Through this framework, we identify a conceptual progression in the field, moving from the descriptive mapping of tissue architecture through the learning of reusable, context-aware representations to predictive, perturbation-oriented modeling, and characterize the methodological trade-offs that accompany each transition. Against this progression, five structural gaps stand out: the separation between spatial domain inference and gene-level explanation; the lack of gene-level representations that encode niche-conditioned function; the difficulty of validating transferable spatial representations across organs, platforms, resolutions, and disease states; the absence of stable spatial gene program vocabularies for cross-cohort comparison; and the limited evidence base for uncertainty-aware spatial perturbation modeling. Closing these gaps requires an analytical stance that foregrounds modeling assumptions and evidential limits alongside algorithmic performance, a reorientation that motivates the framework employed throughout this survey.

PQE Committee

  • Chair: Prof. WANG, Wei
  • Prime Supervisor: Prof. ZHANG, Yanlin
  • Co-Supervisor: Prof. CHU, Xiaowen
  • Examiner: Prof. LI, Lei

日期

10 June 2026

时间

17:00:00 - 18:00:00

地点

E1-150, HKUST(GZ)