Zheng Zhang

Muon+: Towards Better Muon via One Additional Normalization Step

Feb 26, 2026
Via arXiv

Decoupling Vision and Language: Codebook Anchored Visual Adaptation

Feb 23, 2026
Via arXiv

Powering Up Zeroth-Order Training via Subspace Gradient Orthogonalization

Feb 19, 2026
Via arXiv

MirrorLA: Reflecting Feature Map for Vision Linear Attention

Feb 04, 2026
Via arXiv

Causal Graph Spatial-Temporal Autoencoder for Reliable and Interpretable Process Monitoring

Feb 03, 2026
Via arXiv

Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient

Feb 03, 2026
Via arXiv

STILL: Selecting Tokens for Intra-Layer Hybrid Attention to Linearize LLMs

Feb 02, 2026
Via arXiv

Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning

Feb 02, 2026
Via arXiv

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Feb 02, 2026
Via arXiv

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026
Via arXiv