Task-Oriented Learning fromPositive-Unlabeled Data: Techniquesand Applications
The Hong Kong University of Science and Technology (Guangzhou)
Data Science and Analytics Thrust
PhD Thesis Proposal Examination
By Ms. Kexin SHI
Abstract
Positive-Unlabeled (PU) learning is a prevalent approach across various domains where
only a subset of instances is labeled as positive, while the rest remain unlabeled. This
thesis proposal explores the integration of PU learning in specific applications within
recommender systems and bioinformatics. In recommender systems, PU learning faces
challenges, including false negatives in model optimization for implicit collaborative filtering and the filter bubble effect. Conversely, in bioinformatics, the focus lies on large-scale
gene or protein screening tasks, which is crucial for guiding wet-lab experiments effectively. This proposal outlines preliminary research in both fields, introducing innovative
methodologies such as PDNS and Hard-BPR for addressing the false negative issue in
recommender systems, alongside PractiCPP for facilitating computational screening of
peptides in bioinformatics. Looking ahead, the focus will be on tackling the challenges
associated with filter bubbles in recommender systems.
TPE Committee
Chair of Committee: Prof. Xiaowen CHU
Prime Supervisor: Prof. Wenjia WANG
Co-Supervisor: Prof. Xinzhou GUO
Examiner: Prof. Lei LI
Date
28 November 2024
Time
10:00:00 - 11:00:00
Location
E3-105