Picture for Zhiqing Kui

Zhiqing Kui

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

End-to-end Adaptive Distributed Training on PaddlePaddle

Add code
Dec 06, 2021
Figure 1 for End-to-end Adaptive Distributed Training on PaddlePaddle
Figure 2 for End-to-end Adaptive Distributed Training on PaddlePaddle
Figure 3 for End-to-end Adaptive Distributed Training on PaddlePaddle
Figure 4 for End-to-end Adaptive Distributed Training on PaddlePaddle
Viaarxiv icon