Picture for Joongkyu Lee

Joongkyu Lee

Nonstationary Generalized Linear Bandits with Discounted Online Mirror Descent

Add code
May 25, 2026
Viaarxiv icon

Optimal Design for Multinomial Logit Model with Applications to Best Assortment Identification

Add code
May 25, 2026
Viaarxiv icon

Multi-Step Likelihood-Ratio Correction for Reinforcement Learning with Verifiable Rewards

Add code
May 20, 2026
Viaarxiv icon

Block-Sphere Vector Quantization

Add code
May 19, 2026
Viaarxiv icon

Demystifying Linear MDPs and Novel Dynamics Aggregation Framework

Add code
Oct 31, 2024
Figure 1 for Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Figure 2 for Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Figure 3 for Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Figure 4 for Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Viaarxiv icon

Nearly Minimax Optimal Regret for Multinomial Logistic Bandit

Add code
May 16, 2024
Figure 1 for Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Figure 2 for Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Figure 3 for Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Viaarxiv icon

Learning Uncertainty-Aware Temporally-Extended Actions

Add code
Feb 08, 2024
Figure 1 for Learning Uncertainty-Aware Temporally-Extended Actions
Figure 2 for Learning Uncertainty-Aware Temporally-Extended Actions
Figure 3 for Learning Uncertainty-Aware Temporally-Extended Actions
Figure 4 for Learning Uncertainty-Aware Temporally-Extended Actions
Viaarxiv icon