Sheng Yue

Executable Agentic Memory for GUI Agent

May 12, 2026

AdamO: A Collapse-Suppressed Optimizer for Offline RL

May 03, 2026

FORLER: Federated Offline Reinforcement Learning with Q-Ensemble and Actor Rectification

Feb 02, 2026

Context Learning for Multi-Agent Discussion

Feb 02, 2026

Less is More: Clustered Cross-Covariance Control for Offline RL

Jan 28, 2026

AugFL: Augmenting Federated Learning with Pretrained Models

Mar 04, 2025

Momentum-Based Federated Reinforcement Learning with Interaction and Communication Efficiency

May 29, 2024

OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning

May 29, 2024

Federated Offline Policy Optimization with Dual Regularization

May 29, 2024

How to Leverage Diverse Demonstrations in Offline Imitation Learning

May 29, 2024