Algorithms for Combinatorial and Corrupted Bandits, and Beyond

DSA学域研讨会

Algorithms for Combinatorial and Corrupted Bandits, and Beyond

摘要

The multi-armed bandit problem (MAB) is a popular model for online decision making that elucidates the dilemma between exploration and exploitation. In this talk, I will discuss several generalizations of the standard MAB problem. Firstly, the prevalence of online recommender systems motivates us to revisit the cascading bandit model, an instance of a combinatorial bandit model. The second topic concerns stochastic bandits with adversarially corrupted observations. For both scenarios, we provide analysis of efficient algorithms and show that they almost match their lower bounds. This talk will also discuss some of my current and future works including risk-aware bandits, bandits applied to wireless communications, and gradient bandit algorithms.

演讲者简介

钟梓昕

Postdoctoral Fellow

University of Alberta

Zixin Zhong was born in China in 1995. She is a postdoctoral fellow at the Department of Computing Science of the University of Alberta and is supervised by Prof. Csaba Szepesvári. Dr. Zhong received her PhD degree from the Department of Mathematics of the National University of Singapore (NUS) in October 2021 and won the Louis Chen Hsiao Yun Best Dissertation Prize, which is awarded annually to the NUS student with the best Ph.D. thesis in mathematics and its applications. Zixin was privileged to be supervised by Prof. Vincent Y. F. Tan and Prof. Wang Chi Cheung during her PhD study and she worked with them as a research fellow between June 2021 and July 2022.

Dr. Zhong’s research interests are in reinforcement learning, online machine learning and, in particular, multi-armed bandits. Her work has been presented at top machine learning (ML) conferences including ICML and AISTATS, and in top ML journals such as the Journal of Machine Learning Research (JMLR). She also serves as a reviewer for several conferences and journals including (in alphabetical order) AISTATS, ICLR, ICML, NeurIPS, TIT, TSP, and TMLR. She was recognized as a top reviewer for NeurIPS 2022.

日期

24 July 2023

时间

10:30:00 - 12:00:00

地点

线上

Join Link

Zoom Meeting ID:
881 3813 3624

Passcode: dsat

主办方

数据科学与分析学域

联系邮箱

dsat@hkust-gz.edu.cn