Picture for Kefeng Zhang

Kefeng Zhang

C2T: A Classifier-Based Tree Construction Method in Speculative Decoding

Add code
Feb 19, 2025
Figure 1 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Figure 2 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Figure 3 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Figure 4 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Viaarxiv icon

MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures

Add code
Feb 19, 2025
Figure 1 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Figure 2 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Figure 3 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Figure 4 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Viaarxiv icon

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Add code
Dec 04, 2024
Figure 1 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Figure 2 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Figure 3 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Figure 4 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Viaarxiv icon

EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference

Add code
Oct 16, 2024
Figure 1 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Figure 2 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Figure 3 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Figure 4 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Viaarxiv icon

Optimizing AD Pruning of Sponsored Search with Reinforcement Learning

Add code
Aug 05, 2020
Figure 1 for Optimizing AD Pruning of Sponsored Search with Reinforcement Learning
Figure 2 for Optimizing AD Pruning of Sponsored Search with Reinforcement Learning
Figure 3 for Optimizing AD Pruning of Sponsored Search with Reinforcement Learning
Figure 4 for Optimizing AD Pruning of Sponsored Search with Reinforcement Learning
Viaarxiv icon