Dabeen Lee

Chebyshev Center-Based Direction Selection for Multi-Objective Optimization and Training PINNs

May 11, 2026
Via arXiv

Logistic Bandits with $\tilde{O}(\sqrt{dT})$ Regret without Context Diversity Assumptions

Apr 24, 2026
Via arXiv

Near-Optimal Primal-Dual Algorithm for Learning Linear Mixture CMDPs with Adversarial Rewards

Mar 29, 2026
Via arXiv

Learning to Route and Schedule LLMs from User Retrials via Contextual Queueing Bandits

Feb 02, 2026
Via arXiv

Queue Length Regret Bounds for Contextual Queueing Bandits

Jan 27, 2026
Via arXiv

An Optimistic Algorithm for online CMDPS with Anytime Adversarial Constraints

May 28, 2025
Via arXiv

Neural Logistic Bandits

May 04, 2025
Via arXiv

Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism

Oct 14, 2024
Via arXiv

Provably Efficient Infinite-Horizon Average-Reward Reinforcement Learning with Linear Function Approximation

Sep 16, 2024
Via arXiv

Reinforcement Learning for Infinite-Horizon Average-Reward MDPs with Multinomial Logistic Function Approximation

Jun 19, 2024
Via arXiv