Dabeen Lee

Algorithm for Contextual Queueing Bandits with Rate-Optimal Queue Length Regret

Jun 08, 2026

Learning Weakly Communicating Average-Reward CMDPs: Strong Duality and Improved Regret

May 12, 2026

Primal-Dual Policy Optimization for Linear CMDPs with Adversarial Losses

May 12, 2026

Chebyshev Center-Based Direction Selection for Multi-Objective Optimization and Training PINNs

May 11, 2026

Logistic Bandits with $\tilde{O}(\sqrt{dT})$ Regret without Context Diversity Assumptions

Apr 24, 2026

Near-Optimal Primal-Dual Algorithm for Learning Linear Mixture CMDPs with Adversarial Rewards

Mar 29, 2026

Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits

Feb 02, 2026

Queue Length Regret Bounds for Contextual Queueing Bandits

Jan 27, 2026

An Optimistic Algorithm for Online CMDPs with Anytime Adversarial Constraints

May 28, 2025

Neural Logistic Bandits

May 04, 2025