Picture for Mengyi Chen

Mengyi Chen

The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training

Add code
Mar 11, 2026
Viaarxiv icon

SD-MoE: Spectral Decomposition for Effective Expert Specialization

Add code
Feb 13, 2026
Viaarxiv icon

Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers

Add code
Feb 13, 2026
Viaarxiv icon

Dispelling the Curse of Singularities in Neural Network Optimizations

Add code
Feb 01, 2026
Viaarxiv icon

Scalable learning of macroscopic stochastic dynamics

Add code
Nov 17, 2025
Viaarxiv icon

Learning Macroscopic Dynamics from Partial Microscopic Observations

Add code
Oct 31, 2024
Viaarxiv icon

BAMBOO: a predictive and transferable machine learning force field framework for liquid electrolyte development

Add code
Apr 12, 2024
Viaarxiv icon