Thesis Proposal Examination

Task-Oriented Learning from Positive-Unlabeled Data: Techniques and Applications

The Hong Kong University of Science and Technology (Guangzhou)

Data Science and Analytics Thrust

PhD Thesis Proposal Examination

By Ms. Kexin SHI

Abstract

Positive-Unlabeled (PU) learning is a prevalent approach across various domains where
only a subset of instances is labeled as positive, while the rest remain unlabeled. This
thesis proposal explores the integration of PU learning in specific applications within
recommender systems and bioinformatics. In recommender systems, PU learning faces
challenges including false negatives in model optimization for implicit collaborative
filtering and the filter bubble effect. In bioinformatics, the focus lies on large-scale
gene or protein screening tasks, which are crucial for guiding wet-lab experiments
effectively. This proposal outlines preliminary research in both fields, introducing innovative
methodologies such as PDNS and Hard-BPR for addressing the false negative issue in
recommender systems, alongside PractiCPP for facilitating computational screening of
peptides in bioinformatics. Looking ahead, the focus will be on tackling the challenges
associated with filter bubbles in recommender systems.

TPE Committee

Chair of Committee: Prof. Xiaowen CHU

Prime Supervisor: Prof. Wenjia WANG

Co-Supervisor: Prof. Xinzhou GUO

Examiner: Prof. Lei LI

Date

28 November 2024

Time

10:00 - 11:00

Location

E3-105