Hengjie Cao

The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training

Mar 11, 2026
Via arXiv

Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers

Feb 13, 2026
Via arXiv

SD-MoE: Spectral Decomposition for Effective Expert Specialization

Feb 13, 2026
Via arXiv

Dispelling the Curse of Singularities in Neural Network Optimizations

Feb 01, 2026
Via arXiv