Picture for Huy Nguyen

Huy Nguyen

DoRAN: Stabilizing Weight-Decomposed Low-Rank Adaptation via Noise Injection and Auxiliary Networks

Add code
Oct 05, 2025
Viaarxiv icon

HoRA: Cross-Head Low-Rank Adaptation with Joint Hypernetworks

Add code
Oct 05, 2025
Viaarxiv icon

AG-VPReID.VIR: Bridging Aerial and Ground Platforms for Video-based Visible-Infrared Person Re-ID

Add code
Jul 24, 2025
Viaarxiv icon

On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts

Add code
May 24, 2025
Viaarxiv icon

CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition

Add code
May 19, 2025
Figure 1 for CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition
Figure 2 for CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition
Figure 3 for CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition
Figure 4 for CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition
Viaarxiv icon

On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating

Add code
May 16, 2025
Viaarxiv icon

AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification

Add code
Mar 11, 2025
Viaarxiv icon

Convergence Rates for Softmax Gating Mixture of Experts

Add code
Mar 05, 2025
Figure 1 for Convergence Rates for Softmax Gating Mixture of Experts
Figure 2 for Convergence Rates for Softmax Gating Mixture of Experts
Figure 3 for Convergence Rates for Softmax Gating Mixture of Experts
Figure 4 for Convergence Rates for Softmax Gating Mixture of Experts
Viaarxiv icon

On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation

Add code
Feb 05, 2025
Figure 1 for On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation
Figure 2 for On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation
Figure 3 for On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation
Figure 4 for On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation
Viaarxiv icon

RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts

Add code
Feb 05, 2025
Viaarxiv icon