Le Sun

Verifiable Environments Are LEGO Bricks: Recursive Composition for Reasoning Generalization

Jun 10, 2026
Via arXiv

The Meta-Agent Challenge: Are Current Agents Capable of Autonomous Agent Development?

Jun 03, 2026
Via arXiv

QDS-SNN: Energy-efficient Quantum Deeply-Supervised Spiking Neural Network Algorithm for Traffic Sign Recognition

Jun 03, 2026
Via arXiv

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

May 29, 2026
Via arXiv

Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination

May 29, 2026
Via arXiv

LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents

May 28, 2026
Via arXiv

MetaphorVU: Towards Metaphorical Video Understanding

May 25, 2026
Via arXiv

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

May 19, 2026
Via arXiv

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

May 14, 2026
Via arXiv

All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG

Apr 22, 2026
Via arXiv