Picture for Hanchi Huang

Hanchi Huang

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

Add code
Aug 24, 2023
Viaarxiv icon

Curriculum-based Asymmetric Multi-task Reinforcement Learning

Add code
Nov 07, 2022
Viaarxiv icon