About me

Hi, I’m Hao Qin, a forth-year PhD student at The University of Arizona in the program of GIDP-STATS. I am fortunately advised by Prof. Chicheng Zhang. Before that, I received my Bachelor degree in Applied Mathematics from Shandong University and Master degree in Data Science from The University of Wisconsin-Madison.

Research Interests

I am interested in the theoretical analysis of the reinforcement learning algorithms including multi-armed bandit algorithms and MDPs. Also, I am interested in finding the reward to align with the properse of training. Currently, I am working on the following directions:
  • Multi-armed bandits
  • Markov Decision Processing
  • Reward Modeling

Publications

KL-MS
Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards.
Hao Qin, Kwang-Sung Jun, Chicheng Zhang
Conference on Neural Information Processing Systems (NeurIPS) 2023
arXiv | Code

News