Picture for Yingqi Fan

Yingqi Fan

PowerOPD: Stabilizing On-Policy Distillation with Bounded Power Transformation

Add code
Jun 15, 2026
Viaarxiv icon

AdaSR: Adaptive Streaming Reasoning with Hierarchical Relative Policy Optimization

Add code
Jun 12, 2026
Viaarxiv icon

CompRank: Efficient LLM Reranking via Token-Level Compression and Decoding-Free Scoring

Add code
Jun 10, 2026
Viaarxiv icon

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity

Add code
Jun 09, 2026
Viaarxiv icon

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

Add code
May 30, 2026
Viaarxiv icon

What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models

Add code
Feb 28, 2026
Viaarxiv icon

HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit

Add code
Feb 27, 2026
Viaarxiv icon

On-Policy Supervised Fine-Tuning for Efficient Reasoning

Add code
Feb 13, 2026
Viaarxiv icon

ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention

Add code
Feb 07, 2026
Viaarxiv icon

SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling

Add code
Jun 04, 2025
Viaarxiv icon