Picture for Zimeng Wu

Zimeng Wu

Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference

Add code
Jan 19, 2026
Viaarxiv icon