Song Han

University of Connecticut

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Apr 27, 2026
Via arXiv

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation

Apr 14, 2026
Via arXiv

Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM

Apr 08, 2026
Via arXiv

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Apr 08, 2026
Via arXiv

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Apr 06, 2026
Via arXiv

Adaptive Block-Scaled Data Types

Mar 30, 2026
Via arXiv

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Mar 12, 2026
Via arXiv

Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs

Feb 19, 2026
Via arXiv

ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

Feb 12, 2026
Via arXiv

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Feb 03, 2026
Via arXiv