John C. S. Lui

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond

Jun 03, 2024
Via arXiv

Cost-Effective Online Multi-LLM Selection with Versatile Reward Models

May 26, 2024
Via arXiv

FedConPE: Efficient Federated Conversational Bandits with Heterogeneous Clients

May 05, 2024
Via arXiv

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits

Mar 15, 2024
Via arXiv

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Feb 26, 2024
Via arXiv

Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes

Feb 06, 2024
Via arXiv

Adversarial Attacks on Cooperative Multi-agent Bandits

Nov 03, 2023
Via arXiv

Online Clustering of Bandits with Misspecified User Models

Oct 10, 2023
Via arXiv

Online Corrupted User Detection and Regret Minimization

Oct 10, 2023
Via arXiv

Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs

Aug 08, 2023
Via arXiv