Picture for Yu Cheng

Yu Cheng

One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement

Add code
Apr 28, 2026
Viaarxiv icon

Learning to Evolve: A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization

Add code
Apr 22, 2026
Viaarxiv icon

ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

Add code
Apr 21, 2026
Viaarxiv icon

TEMPO: Scaling Test-time Training for Large Reasoning Models

Add code
Apr 21, 2026
Viaarxiv icon

CoTEvol: Self-Evolving Chain-of-Thoughts for Data Synthesis in Mathematical Reasoning

Add code
Apr 16, 2026
Viaarxiv icon

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Add code
Apr 15, 2026
Viaarxiv icon

RIRF: Reasoning Image Restoration Framework

Add code
Apr 10, 2026
Viaarxiv icon

HiFloat4 Format for Language Model Pre-training on Ascend NPUs

Add code
Apr 09, 2026
Viaarxiv icon

Face-D(^2)CL: Multi-Domain Synergistic Representation with Dual Continual Learning for Facial DeepFake Detection

Add code
Apr 09, 2026
Viaarxiv icon

Memory Intelligence Agent

Add code
Apr 07, 2026
Viaarxiv icon