Cohesive Subgraph Analysis in Complex Graphs: Efficient Algorithms for Dynamic, Temporal, and Distributed Scenarios
The Hong Kong University of Science and Technology (Guangzhou)
数据科学与分析学域
PhD Thesis Examination
By Mr. Anxin TIAN
摘要
Cohesive subgraph analysis serves as a fundamental tool for understanding the organization and evolution of complex networks. While extensive research has been conducted on static and undirected networks, the challenges of handling dynamic directed graphs, temporal bipartite networks, and distributed scenarios remain largely unexplored. This thesis addresses these challenges by developing efficient algorithms and novel indexing techniques for three critical problems: maximal D-truss maintenance in dynamic directed graphs, temporal core analysis in bipartite networks and distributed D-truss computation on large directed graphs.
For the dynamic D-truss problem, we first establish theoretical bounds on the structural changes of D-truss under edge updates, providing the foundation for efficient maintenance strategies. We propose an order-based D-Index structure that effectively captures the hierarchical relationships among edges based on their cycle and flow triangle counts. Building upon this index, we develop batch-update algorithms that significantly reduce the maintenance cost by processing multiple updates simultaneously. Our experimental results on real-world networks demonstrate that the proposed solution achieves up to 10x speedup compared to state-of-the-art approaches while maintaining optimal space complexity.
To address temporal bipartite core analysis, we introduce a novel temporal containment property for (𝛼, 𝛽)-cores and formalize it through a DAG-based hierarchy. Based on this theoretical framework, we design a superior-optimized index structure that achieves both space efficiency and query performance guarantees. The index supports efficient retrieval of temporal cores within arbitrary time windows and can be dynamically maintained with logarithmic update complexity. Extensive experiments on eight real-world temporal bipartite networks verify the effectiveness of our approach, showing consistent query time under 100ms even for graphs with millions of edges.
Though the D-truss decomposition is effective for revealing the cycle-flow relationships in directed graphs, its single-machine solutions are far from scalable for real-world large graphs. For proposing efficient distributed solutions for D-truss decomposition, first, we introduce a converge-based algorithm that computes trussness pairs iteratively to a fixed point. Then, we utilize the peel idea and propose a batch-peel algorithm, which spares the extra overhead. For addressing the bottleneck of communication cost, we propose triangle-related acceleration and the type-aware balanced partitioner. Finally, we present the stratified local-peel method that reduces considerable communication overhead.
Our proposed methods advance the existing state-of-the-art solutions in dynamic, temporal, and distributed scenarios, offering both theoretical foundations and practical tools for real-world applications.
TEC
Chairperson: Prof Mark Nicholas GRIMSHAW-AAGAARD
Prime Supervisor: Prof Lei CHEN
Co-Supervisor: Prof Can YANG
Examiners:
Prof Lei LI
Prof Wenjia WANG
Prof Xiaofang ZHOU
Prof Zhiguo GONG
日期
25 July 2025
时间
15:30:00 - 17:30:00
地点
E4-201, HKUST(GZ)
Join Link
Zoom Meeting ID: 99824505391
Passcode: dsa2025
主办方
数据科学与分析学域
联系邮箱
dsarpg@hkust-gz.edu.cn