Quang Pham

CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition

May 19, 2025

On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating

May 16, 2025

Sequence Transferability and Task Order Selection in Continual Learning

Feb 10, 2025

LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models

Nov 01, 2024

Class-incremental Learning for Time Series: Benchmark and Evaluation

Feb 19, 2024

CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition

Feb 04, 2024

HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

Dec 12, 2023

On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation

Nov 18, 2023

Adaptive-saturated RNN: Remember more with less instability

Apr 24, 2023

Continual Learning: Fast and Slow

Sep 06, 2022