Picture for Yanxiao Zhao

Yanxiao Zhao

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

Add code
Aug 19, 2025
Viaarxiv icon

CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning

Add code
Feb 17, 2025
Viaarxiv icon

Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency

Add code
Mar 12, 2024
Figure 1 for Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Figure 2 for Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Figure 3 for Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Figure 4 for Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Viaarxiv icon

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Add code
Feb 05, 2024
Figure 1 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Figure 2 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Figure 3 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Figure 4 for Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Viaarxiv icon