Picture for Guozheng Ma

Guozheng Ma

Beyond One-Size-Fits-All: Diagnosis-Driven Online Reinforcement Learning with Offline Priors

Add code
Jun 24, 2026
Viaarxiv icon

What Makes Value Learning Efficient in Residual Reinforcement Learning?

Add code
Feb 11, 2026
Viaarxiv icon

Language-based Trial and Error Falls Behind in the Era of Experience

Add code
Jan 29, 2026
Viaarxiv icon

Towards Reliable Medical LLMs: Benchmarking and Enhancing Confidence Estimation of Large Language Models in Medical Consultation

Add code
Jan 22, 2026
Viaarxiv icon

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Add code
May 30, 2025
Figure 1 for Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Figure 2 for Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Figure 3 for Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Figure 4 for Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Viaarxiv icon

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning

Add code
Apr 24, 2025
Viaarxiv icon

Faster and Better 3D Splatting via Group Training

Add code
Dec 10, 2024
Figure 1 for Faster and Better 3D Splatting via Group Training
Figure 2 for Faster and Better 3D Splatting via Group Training
Figure 3 for Faster and Better 3D Splatting via Group Training
Figure 4 for Faster and Better 3D Splatting via Group Training
Viaarxiv icon

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

Add code
Feb 22, 2024
Viaarxiv icon

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

Add code
Oct 11, 2023
Figure 1 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Figure 2 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Figure 3 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Figure 4 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Viaarxiv icon

Are Large Language Models Really Robust to Word-Level Perturbations?

Add code
Sep 27, 2023
Figure 1 for Are Large Language Models Really Robust to Word-Level Perturbations?
Figure 2 for Are Large Language Models Really Robust to Word-Level Perturbations?
Figure 3 for Are Large Language Models Really Robust to Word-Level Perturbations?
Figure 4 for Are Large Language Models Really Robust to Word-Level Perturbations?
Viaarxiv icon