Alert button
Picture for Jianfei Chen

Jianfei Chen

Alert button

Accelerating Transformer Pre-Training with 2:4 Sparsity

Add code
Bookmark button
Alert button
Apr 02, 2024
Yuezhou Hu, Kang Zhao, Weiyu Huang, Jianfei Chen, Jun Zhu

Viaarxiv icon

Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization

Add code
Bookmark button
Alert button
Mar 19, 2024
Haocheng Xi, Yuxiang Chen, Kang Zhao, Kaijun Zheng, Jianfei Chen, Jun Zhu

Figure 1 for Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization
Figure 2 for Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization
Figure 3 for Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization
Figure 4 for Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization
Viaarxiv icon

Efficient Backpropagation with Variance-Controlled Adaptive Sampling

Add code
Bookmark button
Alert button
Feb 27, 2024
Ziteng Wang, Jianfei Chen, Jun Zhu

Viaarxiv icon

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Add code
Bookmark button
Alert button
Feb 26, 2024
Tianjiao Luo, Tim Pearce, Huayu Chen, Jianfei Chen, Jun Zhu

Viaarxiv icon

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics

Add code
Bookmark button
Alert button
Oct 28, 2023
Kaiwen Zheng, Cheng Lu, Jianfei Chen, Jun Zhu

Viaarxiv icon

Investigating Uncertainty Calibration of Aligned Language Models under the Multiple-Choice Setting

Add code
Bookmark button
Alert button
Oct 18, 2023
Guande He, Peng Cui, Jianfei Chen, Wenbo Hu, Jun Zhu

Viaarxiv icon

Memory Efficient Optimizers with 4-bit States

Add code
Bookmark button
Alert button
Sep 06, 2023
Bingrui Li, Jianfei Chen, Jun Zhu

Figure 1 for Memory Efficient Optimizers with 4-bit States
Figure 2 for Memory Efficient Optimizers with 4-bit States
Figure 3 for Memory Efficient Optimizers with 4-bit States
Figure 4 for Memory Efficient Optimizers with 4-bit States
Viaarxiv icon

Training Transformers with 4-bit Integers

Add code
Bookmark button
Alert button
Jun 22, 2023
Haocheng Xi, Changhao Li, Jianfei Chen, Jun Zhu

Figure 1 for Training Transformers with 4-bit Integers
Figure 2 for Training Transformers with 4-bit Integers
Figure 3 for Training Transformers with 4-bit Integers
Figure 4 for Training Transformers with 4-bit Integers
Viaarxiv icon