Cheng Qian

May

ISACL: Internal State Analyzer for Copyrighted Training Data Leakage

Aug 25, 2025
Via arXiv

UserBench: An Interactive Gym Environment for User-Centric Agents

Jul 29, 2025
Via arXiv

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Jul 28, 2025
Via arXiv

Atomic Reasoning for Scientific Table Claim Verification

Jun 08, 2025
Via arXiv

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

May 27, 2025
Via arXiv

ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges

May 21, 2025
Via arXiv

RM-R1: Reward Modeling as Reasoning

May 05, 2025
Via arXiv

OTC: Optimal Tool Calls via Reinforcement Learning

Apr 21, 2025
Via arXiv

ToolRL: Reward is All Tool Learning Needs

Apr 16, 2025
Via arXiv

Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization

Apr 09, 2025
Via arXiv