Picture for Qi Gu

Qi Gu

$V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts

Add code
Mar 11, 2026
Viaarxiv icon

TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training

Add code
Mar 02, 2026
Viaarxiv icon

AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

Add code
Feb 11, 2026
Viaarxiv icon

Learning to Self-Verify Makes Language Models Better Reasoners

Add code
Feb 07, 2026
Viaarxiv icon

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Add code
Feb 06, 2026
Viaarxiv icon

$V_0$: A Generalist Value Model for Any Policy at State Zero

Add code
Feb 03, 2026
Viaarxiv icon

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Add code
Feb 03, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Add code
Sep 30, 2025
Viaarxiv icon

System-level Simulation of Reconfigurable Intelligent Surface assisted Wireless Communications System

Add code
Jun 29, 2022
Figure 1 for System-level Simulation of Reconfigurable Intelligent Surface assisted Wireless Communications System
Figure 2 for System-level Simulation of Reconfigurable Intelligent Surface assisted Wireless Communications System
Figure 3 for System-level Simulation of Reconfigurable Intelligent Surface assisted Wireless Communications System
Figure 4 for System-level Simulation of Reconfigurable Intelligent Surface assisted Wireless Communications System
Viaarxiv icon