Picture for Flint Xiaofeng Fan

Flint Xiaofeng Fan

An LLM-based Recommender System Environment

Add code
Jun 01, 2024
Viaarxiv icon

CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

Add code
Mar 29, 2024
Viaarxiv icon

Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence

Add code
Jan 07, 2024
Viaarxiv icon

Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning

Add code
Jun 28, 2023
Viaarxiv icon

FedHQL: Federated Heterogeneous Q-Learning

Add code
Jan 26, 2023
Viaarxiv icon

Federated Neural Bandit

Add code
May 28, 2022
Figure 1 for Federated Neural Bandit
Figure 2 for Federated Neural Bandit
Figure 3 for Federated Neural Bandit
Viaarxiv icon

Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

Add code
Oct 26, 2021
Figure 1 for Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee
Figure 2 for Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee
Figure 3 for Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee
Figure 4 for Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee
Viaarxiv icon