Picture for Jinghan Yao

Jinghan Yao

DK

MAC-Attention: a Match-Amend-Complete Scheme for Fast and Accurate Attention Computation

Add code
Mar 31, 2026
Viaarxiv icon

Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer

Add code
Aug 30, 2024
Figure 1 for Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer
Figure 2 for Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer
Figure 3 for Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer
Figure 4 for Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer
Viaarxiv icon

Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference

Add code
Jan 17, 2024
Figure 1 for Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference
Figure 2 for Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference
Figure 3 for Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference
Figure 4 for Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference
Viaarxiv icon

Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

Add code
May 24, 2023
Figure 1 for Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference
Figure 2 for Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference
Figure 3 for Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference
Figure 4 for Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference
Viaarxiv icon

SOFT: Softmax-free Transformer with Linear Complexity

Add code
Oct 29, 2021
Figure 1 for SOFT: Softmax-free Transformer with Linear Complexity
Figure 2 for SOFT: Softmax-free Transformer with Linear Complexity
Figure 3 for SOFT: Softmax-free Transformer with Linear Complexity
Figure 4 for SOFT: Softmax-free Transformer with Linear Complexity
Viaarxiv icon

Single Pixel Reconstruction for One-stage Instance Segmentation

Add code
May 17, 2019
Figure 1 for Single Pixel Reconstruction for One-stage Instance Segmentation
Figure 2 for Single Pixel Reconstruction for One-stage Instance Segmentation
Figure 3 for Single Pixel Reconstruction for One-stage Instance Segmentation
Figure 4 for Single Pixel Reconstruction for One-stage Instance Segmentation
Viaarxiv icon