Anhao Zhao

What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models

Feb 28, 2026

On-Policy Supervised Fine-Tuning for Efficient Reasoning

Feb 13, 2026

ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention

Feb 07, 2026

From LLMs to LRMs: Rethinking Pruning for Reasoning-Centric Models

Jan 26, 2026

SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling

Jun 04, 2025

LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding

May 22, 2025

Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

May 22, 2025

Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism

Jul 24, 2024