Cheng Qian

QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression

Mar 22, 2026
Via arXiv

How Far Can Unsupervised RLVR Scale LLM Training?

Mar 09, 2026
Via arXiv

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Feb 24, 2026
Via arXiv

Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs

Feb 07, 2026
Via arXiv

Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks

Feb 05, 2026
Via arXiv

Agentic Reasoning for Large Language Models

Jan 18, 2026
Via arXiv

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Jan 17, 2026
Via arXiv

Current Agents Fail to Leverage World Model as Tool for Foresight

Jan 08, 2026
Via arXiv

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Dec 21, 2025
Via arXiv

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Dec 18, 2025
Via arXiv