Data-Centric Visual Analytics for Machine Learning
摘要
Machine learning has achieved great success across various applications. However, it is still challenging for users to build high-performance machine learning models due to the need to analyze large-scale training samples and understand the relationships between samples and models. This process is very time-consuming and expertise-demanding. Visual analytics offers a promising solution by tightly integrating machine learning with interactive visualization techniques, which loop humans directly into the analysis process. Along this line, my research includes the following three topics: 1) For scenarios with limited annotated samples, we develop FSLDiagnotor that combines an automatic sample recommendation method and interactive visualization to help users examine and adjust annotated samples and boost model performance; 2) For scenarios that the samples are annotated but with poor quality, we develop Reweighter to allow users examine both the reweighting relationships and reweighting results and then make informed adjustments. 3) For scenarios that the sample distributions are changing after deploying the model, we develop DriftVis to facilitate the analysis of distribution changes and timely update the training samples. The effectiveness of our methods is demonstrated by both quantitative evaluation and case studies.
演讲者简介
杨维铠
Tsinghua University
Weikai Yang is a 5th-year Ph.D. candidate at Tsinghua University. His research interests lie in integrating machine learning into visual analytics. In particular, he focuses on lowering the barriers to help practitioners better explore large-scale data and build high-performance machine learning models in real-world applications. Currently, he has published 12 papers (9 CCF-A papers) and 1 book (Springer, second author). Among these, he is the first author of 5 CCF-A papers published in top-tier journals and conferences, such as IEEE TVCG and IEEE VIS.
日期
2024年4月16日
时间
14:30:00 - 15:30:00
地点
香港科技大学(广州)W2-2F-201
Join Link
Zoom Meeting ID: 846 9646 3368
Passcode: dsacma
主办方
数据科学与分析学域 Computational Media and Arts Thrust
联系邮箱
dsat@hkust-gz.edu.cn cmat@hkust-gz.edu.cn