
Huy Nguyen

FIBER: A Differentially Private Optimizer with Filter-Aware Innovation Bias Correction

May 05, 2026

On Bayesian Softmax-Gated Mixture-of-Experts Models

Apr 22, 2026

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Apr 14, 2026

Adaptive Stopping for Multi-Turn LLM Reasoning

Apr 01, 2026

Rethinking Multinomial Logistic Mixture of Experts with Sigmoid Gating Function

Feb 01, 2026

A Statistical Theory of Gated Attention through the Lens of Hierarchical Mixture of Experts

Feb 01, 2026

Improving Minimax Estimation Rates for Contaminated Mixture of Multinomial Logistic Experts via Expert Heterogeneity

Jan 31, 2026

Cite-While-You-Generate: Training-Free Evidence Attribution for Multimodal Clinical Summarization

Jan 23, 2026

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026

DoRAN: Stabilizing Weight-Decomposed Low-Rank Adaptation via Noise Injection and Auxiliary Networks

Oct 05, 2025