Picture for Kai Han

Kai Han

and Other Contributors

DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data

Add code
Jun 02, 2026
Viaarxiv icon

FGRPO: Federated GRPO with Adaptive Aggregation on Non-IID Data

Add code
Jun 02, 2026
Viaarxiv icon

Scaling Parallel Sequence Models to Foundation-Scale Vision Encoders

Add code
May 30, 2026
Viaarxiv icon

iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

Add code
May 18, 2026
Viaarxiv icon

Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing

Add code
May 07, 2026
Viaarxiv icon

Sculpt4D: Generating 4D Shapes via Sparse-Attention Diffusion Transformers

Add code
Apr 23, 2026
Viaarxiv icon

Mask Is What DLLM Needs: A Masked Data Training Paradigm for Diffusion LLMs

Add code
Mar 16, 2026
Viaarxiv icon

MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning

Add code
Mar 14, 2026
Viaarxiv icon

Speed3R: Sparse Feed-forward 3D Reconstruction Models

Add code
Mar 09, 2026
Viaarxiv icon