Picture for Jianchao Tan

Jianchao Tan

MONA: Muon Optimizer with Nesterov Acceleration for Scalable Language Model Training

Add code
May 26, 2026
Viaarxiv icon

FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control

Add code
Apr 21, 2026
Viaarxiv icon

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention

Add code
Apr 15, 2026
Viaarxiv icon

AsyncTLS: Efficient Generative LLM Inference with Asynchronous Two-level Sparse Attention

Add code
Apr 09, 2026
Viaarxiv icon

AFA-LoRA: Enabling Non-Linear Adaptations in LoRA with Activation Function Annealing

Add code
Dec 27, 2025
Viaarxiv icon

Accelerate Speculative Decoding with Sparse Computation in Verification

Add code
Dec 26, 2025
Viaarxiv icon

C2T: A Classifier-Based Tree Construction Method in Speculative Decoding

Add code
Feb 19, 2025
Figure 1 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Figure 2 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Figure 3 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Figure 4 for C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Viaarxiv icon

MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures

Add code
Feb 19, 2025
Figure 1 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Figure 2 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Figure 3 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Figure 4 for MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Viaarxiv icon

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Add code
Dec 04, 2024
Figure 1 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Figure 2 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Figure 3 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Figure 4 for PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Viaarxiv icon

EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference

Add code
Oct 16, 2024
Figure 1 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Figure 2 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Figure 3 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Figure 4 for EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference
Viaarxiv icon