Picture for Guangming Tan

Guangming Tan

CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training

Add code
May 06, 2026
Viaarxiv icon

TACO: Efficient Communication Compression of Intermediate Tensors for Scalable Tensor-Parallel LLM Training

Add code
Apr 27, 2026
Viaarxiv icon

Research Paradigm of Materials Science Tetrahedra with Artificial Intelligence

Add code
Mar 14, 2026
Viaarxiv icon

MatRIS: Toward Reliable and Efficient Pretrained Machine Learning Interatomic Potentials

Add code
Mar 05, 2026
Viaarxiv icon

Exploring Landscapes for Better Minima along Valleys

Add code
Oct 31, 2025
Viaarxiv icon

FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs

Add code
Dec 30, 2024
Viaarxiv icon

I/O Lower Bounds for Auto-tuning of Convolutions in CNNs

Add code
Dec 31, 2020
Figure 1 for I/O Lower Bounds for Auto-tuning of Convolutions in CNNs
Figure 2 for I/O Lower Bounds for Auto-tuning of Convolutions in CNNs
Figure 3 for I/O Lower Bounds for Auto-tuning of Convolutions in CNNs
Figure 4 for I/O Lower Bounds for Auto-tuning of Convolutions in CNNs
Viaarxiv icon