Yan Dai

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Oct 04, 2024 (via arXiv)

Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks

Aug 29, 2024 (via arXiv)

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

Feb 11, 2024 (via arXiv)

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Feb 02, 2024 (via arXiv)

The Crucial Role of Normalization in Sharpness-Aware Minimization

May 24, 2023 (via arXiv)

Refined Regret for Adversarial MDPs with Linear Function Approximation

Jan 30, 2023 (via arXiv)

Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning

Jan 25, 2023 (via arXiv)

Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Jun 30, 2022 (via arXiv)

Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback

May 26, 2022 (via arXiv)

Variance-Aware Sparse Linear Bandits

May 26, 2022 (via arXiv)