Nhat Ho

On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts

May 24, 2025

CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition

May 19, 2025

Model Selection for Gaussian-gated Gaussian Mixture of Experts Using Dendrograms of Mixing Measures

May 19, 2025

On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating

May 16, 2025

Convergence Rates for Softmax Gating Mixture of Experts

Mar 05, 2025

MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification

Feb 11, 2025

RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts

Feb 05, 2025

On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation

Feb 05, 2025

Adaptive Prompt: Unlocking the Power of Visual Prompt Tuning

Jan 31, 2025

Lightspeed Geometric Dataset Distance via Sliced Optimal Transport

Jan 31, 2025