Fuwei Yang

Partial Convolution Meets Visual Attention

Mar 05, 2025

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE

Feb 10, 2025

FTP: A Fine-grained Token-wise Pruner for Large Language Models via Token Routing

Dec 16, 2024