Picture for Dongsheng Li

Dongsheng Li

National University of Defense Technology, Changsha, China

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Add code
Mar 25, 2026
Viaarxiv icon

SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling

Add code
Mar 24, 2026
Viaarxiv icon

Understanding Reasoning in LLMs through Strategic Information Allocation under Uncertainty

Add code
Mar 16, 2026
Viaarxiv icon

Improving Diffusion Planners by Self-Supervised Action Gating with Energies

Add code
Mar 03, 2026
Viaarxiv icon

State-Action Inpainting Diffuser for Continuous Control with Delay

Add code
Mar 02, 2026
Viaarxiv icon

Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer

Add code
Mar 01, 2026
Viaarxiv icon

Reasoning-Driven Multimodal LLM for Domain Generalization

Add code
Feb 27, 2026
Viaarxiv icon

pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation

Add code
Feb 26, 2026
Viaarxiv icon

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Add code
Feb 26, 2026
Viaarxiv icon

Sparrow: Text-Anchored Window Attention with Visual-Semantic Glimpsing for Speculative Decoding in Video LLMs

Add code
Feb 17, 2026
Viaarxiv icon