Alert button
Picture for Shengen Yan

Shengen Yan

Alert button

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Add code
Bookmark button
Alert button
Apr 08, 2024
Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Sergey Yekhanin, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

Viaarxiv icon

Evaluating Quantized Large Language Models

Add code
Bookmark button
Alert button
Feb 28, 2024
Shiyao Li, Xuefei Ning, Luning Wang, Tengxuan Liu, Xiangsheng Shi, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

Viaarxiv icon

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

Add code
Bookmark button
Alert button
Feb 06, 2024
Tao Yuan, Xuefei Ning, Dong Zhou, Zhijie Yang, Shiyao Li, Minghui Zhuang, Zheyue Tan, Zhuyu Yao, Dahua Lin, Boxun Li, Guohao Dai, Shengen Yan, Yu Wang

Viaarxiv icon

A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs

Add code
Bookmark button
Alert button
Jan 10, 2022
Ruofan Liang, Bingsheng He, Shengen Yan, Peng Sun

Figure 1 for A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs
Figure 2 for A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs
Figure 3 for A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs
Figure 4 for A Simulation Platform for Multi-tenant Machine Learning Services on Thousands of GPUs
Viaarxiv icon

Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters

Add code
Bookmark button
Alert button
Sep 06, 2021
Qinghao Hu, Peng Sun, Shengen Yan, Yonggang Wen, Tianwei Zhang

Figure 1 for Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters
Figure 2 for Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters
Figure 3 for Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters
Figure 4 for Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters
Viaarxiv icon

Deep Image: Scaling up Image Recognition

Add code
Bookmark button
Alert button
Jul 06, 2015
Ren Wu, Shengen Yan, Yi Shan, Qingqing Dang, Gang Sun

Figure 1 for Deep Image: Scaling up Image Recognition
Figure 2 for Deep Image: Scaling up Image Recognition
Figure 3 for Deep Image: Scaling up Image Recognition
Figure 4 for Deep Image: Scaling up Image Recognition
Viaarxiv icon