Picture for Mingcan Xiang

Mingcan Xiang

Sparsity-Controllable Dynamic Top-p MoE for Large Foundation Model Pre-training

Add code
Dec 16, 2025
Viaarxiv icon

Understanding and Alleviating Memory Consumption in RLHF for LLMs

Add code
Oct 21, 2024
Figure 1 for Understanding and Alleviating Memory Consumption in RLHF for LLMs
Figure 2 for Understanding and Alleviating Memory Consumption in RLHF for LLMs
Figure 3 for Understanding and Alleviating Memory Consumption in RLHF for LLMs
Viaarxiv icon

AdapMTL: Adaptive Pruning Framework for Multitask Learning Model

Add code
Aug 07, 2024
Viaarxiv icon