Jianfei Chen

Diffusion Bridge Implicit Models

May 24, 2024
Via arXiv

SparseDM: Toward Sparse Efficient Diffusion Models

Apr 16, 2024
Via arXiv

Accelerating Transformer Pre-Training with 2:4 Sparsity

Apr 02, 2024
Via arXiv

Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization

Mar 19, 2024
Via arXiv

Efficient Backpropagation with Variance-Controlled Adaptive Sampling

Feb 27, 2024
Via arXiv

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Feb 26, 2024
Via arXiv

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics

Oct 28, 2023
Via arXiv

Investigating Uncertainty Calibration of Aligned Language Models under the Multiple-Choice Setting

Oct 18, 2023
Via arXiv

Memory Efficient Optimizers with 4-bit States

Sep 06, 2023
Via arXiv

Training Transformers with 4-bit Integers

Jun 22, 2023
Via arXiv