Picture for Zhenxu Tian

Zhenxu Tian

Where Matters More Than What: Decoding-aligned KV Cache Compression via Position-aware Pseudo Queries

Add code
Mar 12, 2026
Viaarxiv icon

LongFlow: Efficient KV Cache Compression for Reasoning M

Add code
Mar 12, 2026
Viaarxiv icon

Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification

Add code
May 19, 2025
Figure 1 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Figure 2 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Figure 3 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Figure 4 for Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Viaarxiv icon