Chengwei Qin

Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning

Jun 11, 2026
Via arXiv

The Illusion of Multi-Agent Advantage

Jun 11, 2026
Via arXiv

Notes2Skills: From Lab Notebooks to Certainty-Aware Scientific Agent Skills

Jun 10, 2026
Via arXiv

LoopMoE: Unifying Iterative Computation with Mixture-of-Experts for Language Modeling

Jun 03, 2026
Via arXiv

DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization

May 29, 2026
Via arXiv

Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems

Mar 23, 2026
Via arXiv

CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks?

Mar 12, 2026
Via arXiv

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

Mar 12, 2026
Via arXiv

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

Mar 03, 2026
Via arXiv

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

Feb 05, 2026
Via arXiv