Picture for Cheng Qian

Cheng Qian

May

Agentic Reasoning for Large Language Models

Add code
Jan 18, 2026
Viaarxiv icon

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Add code
Jan 17, 2026
Viaarxiv icon

Current Agents Fail to Leverage World Model as Tool for Foresight

Add code
Jan 08, 2026
Viaarxiv icon

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Add code
Dec 21, 2025
Viaarxiv icon

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Add code
Dec 18, 2025
Viaarxiv icon

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering

Add code
Nov 17, 2025
Figure 1 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 2 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 3 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 4 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Viaarxiv icon

Self-Improving LLM Agents at Test-Time

Add code
Oct 09, 2025
Figure 1 for Self-Improving LLM Agents at Test-Time
Figure 2 for Self-Improving LLM Agents at Test-Time
Figure 3 for Self-Improving LLM Agents at Test-Time
Figure 4 for Self-Improving LLM Agents at Test-Time
Viaarxiv icon

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 2 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 3 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 4 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Viaarxiv icon

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

Add code
Sep 11, 2025
Figure 1 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 2 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 3 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 4 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Viaarxiv icon

ISACL: Internal State Analyzer for Copyrighted Training Data Leakage

Add code
Aug 25, 2025
Figure 1 for ISACL: Internal State Analyzer for Copyrighted Training Data Leakage
Figure 2 for ISACL: Internal State Analyzer for Copyrighted Training Data Leakage
Figure 3 for ISACL: Internal State Analyzer for Copyrighted Training Data Leakage
Figure 4 for ISACL: Internal State Analyzer for Copyrighted Training Data Leakage
Viaarxiv icon