Picture for Baihui Liu

Baihui Liu

ParaDySe: A Parallel-Strategy Switching Framework for Dynamic Sequence Lengths in Transformer

Add code
Nov 17, 2025
Viaarxiv icon

A Survey on Memory-Efficient Large-Scale Model Training in AI for Science

Add code
Jan 21, 2025
Figure 1 for A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Figure 2 for A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Figure 3 for A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Figure 4 for A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Viaarxiv icon