Picture for John C. S. Lui

John C. S. Lui

Fusing Reward and Dueling Feedback in Stochastic Bandits

Add code
Apr 22, 2025
Viaarxiv icon

Provable Zero-Shot Generalization in Offline Reinforcement Learning

Add code
Mar 11, 2025
Viaarxiv icon

Online Clustering of Dueling Bandits

Add code
Feb 04, 2025
Viaarxiv icon

Large Language Model-Enhanced Multi-Armed Bandits

Add code
Feb 03, 2025
Viaarxiv icon

Offline Learning for Combinatorial Multi-armed Bandits

Add code
Jan 31, 2025
Viaarxiv icon

Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification

Add code
Jan 03, 2025
Viaarxiv icon

Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts

Add code
Jan 01, 2025
Figure 1 for Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Figure 2 for Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Figure 3 for Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Figure 4 for Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Viaarxiv icon

Unifying KV Cache Compression for Large Language Models with LeanKV

Add code
Dec 04, 2024
Figure 1 for Unifying KV Cache Compression for Large Language Models with LeanKV
Figure 2 for Unifying KV Cache Compression for Large Language Models with LeanKV
Figure 3 for Unifying KV Cache Compression for Large Language Models with LeanKV
Figure 4 for Unifying KV Cache Compression for Large Language Models with LeanKV
Viaarxiv icon

Combinatorial Logistic Bandits

Add code
Oct 22, 2024
Figure 1 for Combinatorial Logistic Bandits
Figure 2 for Combinatorial Logistic Bandits
Figure 3 for Combinatorial Logistic Bandits
Figure 4 for Combinatorial Logistic Bandits
Viaarxiv icon

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds

Add code
Aug 16, 2024
Viaarxiv icon