Chaoxiang Cai

Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model

Jul 02, 2025

Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model

Jun 28, 2024