Task-Oriented Learning fromPositive-Unlabeled Data: Techniquesand Applications
The Hong Kong University of Science and Technology (Guangzhou)
Data Science and Analytics Thrust
PhD Thesis Proposal Examination
By Ms. Kexin SHI
Abstract
Positive-Unlabeled (PU) learning is a prevalent approach across various domains where only a subset of instances is labeled as positive, while the rest remain unlabeled. This thesis proposal explores the integration of PU learning in specific applications within recommender systems and bioinformatics. In recommender systems, PU learning faces challenges, including false negatives in model optimization for implicit collaborative filtering and the filter bubble effect. Conversely, in bioinformatics, the focus lies on large-scale gene or protein screening tasks, which is crucial for guiding wet-lab experiments effectively. This proposal outlines preliminary research in both fields, introducing innovative methodologies such as PDNS and Hard-BPR for addressing the false negative issue in recommender systems, alongside PractiCPP for facilitating computational screening of peptides in bioinformatics. Looking ahead, the focus will be on tackling the challenges associated with filter bubbles in recommender systems.
TPE Committee
Chair of Committee: Prof. Xiaowen CHU
Prime Supervisor: Prof. Wenjia WANG
Co-Supervisor: Prof. Xinzhou GUO
Examiner: Prof. Lei LI
Date
28 November 2024
Time
10:00:00 - 11:00:00
Location
E3-105