Picture for Anhao Zhao

Anhao Zhao

PowerOPD: Stabilizing On-Policy Distillation with Bounded Power Transformation

Add code
Jun 15, 2026
Viaarxiv icon

AdaSR: Adaptive Streaming Reasoning with Hierarchical Relative Policy Optimization

Add code
Jun 12, 2026
Viaarxiv icon

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity

Add code
Jun 09, 2026
Viaarxiv icon

Escaping the KL Agreement Trap in On-Policy Distillation

Add code
Jun 08, 2026
Viaarxiv icon

Beyond FLOPs: Benchmarking Real Inference Acceleration of LLM Pruning under a GEMM-Centric Taxonomy

Add code
Jun 08, 2026
Viaarxiv icon

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

Add code
May 30, 2026
Viaarxiv icon

What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models

Add code
Feb 28, 2026
Viaarxiv icon

On-Policy Supervised Fine-Tuning for Efficient Reasoning

Add code
Feb 13, 2026
Viaarxiv icon

ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention

Add code
Feb 07, 2026
Viaarxiv icon

From LLMs to LRMs: Rethinking Pruning for Reasoning-Centric Models

Add code
Jan 26, 2026
Viaarxiv icon