Huan Li

Variance-Adaptive Muon: Accelerating LLM Pretraining with NSR-Modulated and Variance-Scaled Momentum

Jan 21, 2026
Via arXiv

Convergence Rate Analysis of the AdamW-Style Shampoo: Unifying One-sided and Two-Sided Preconditioning

Jan 12, 2026
Via arXiv

SafeLoad: Efficient Admission Control Framework for Identifying Memory-Overloading Queries in Cloud Data Warehouses

Jan 05, 2026
Via arXiv

Analyzing the Mechanism of Attention Collapse in VGGT from a Dynamics Perspective

Dec 25, 2025
Via arXiv

CTkvr: KV Cache Retrieval for Long-Context LLMs via Centroid then Token Indexing

Dec 17, 2025
Via arXiv

MovSemCL: Movement-Semantics Contrastive Learning for Trajectory Similarity

Nov 15, 2025
Via arXiv

QuiZSF: An efficient data-model interaction framework for zero-shot time-series forecasting

Aug 09, 2025
Via arXiv

On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm

May 17, 2025
Via arXiv

CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning

Apr 24, 2025
Via arXiv

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models

Apr 24, 2025
Via arXiv