Picture for Wencong Xiao

Wencong Xiao

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

Add code
Jun 07, 2024
Viaarxiv icon

Llumnix: Dynamic Scheduling for Large Language Model Serving

Add code
Jun 05, 2024
Viaarxiv icon

FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving

Add code
Aug 14, 2023
Figure 1 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Figure 2 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Figure 3 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Figure 4 for FusionAD: Multi-modality Fusion for Prediction and Planning Tasks of Autonomous Driving
Viaarxiv icon

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs

Add code
Jan 01, 2023
Figure 1 for MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs
Figure 2 for MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs
Figure 3 for MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs
Figure 4 for MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs
Viaarxiv icon

Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training

Add code
Dec 16, 2020
Figure 1 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Figure 2 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Figure 3 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Figure 4 for Focusing More on Conflicts with Mis-Predictions Helps Language Pre-Training
Viaarxiv icon

Balanced Sparsity for Efficient DNN Inference on GPU

Add code
Nov 02, 2018
Figure 1 for Balanced Sparsity for Efficient DNN Inference on GPU
Figure 2 for Balanced Sparsity for Efficient DNN Inference on GPU
Figure 3 for Balanced Sparsity for Efficient DNN Inference on GPU
Figure 4 for Balanced Sparsity for Efficient DNN Inference on GPU
Viaarxiv icon