Qianhao Yuan

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

May 19, 2026
Via arXiv

DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation

Feb 26, 2026
Via arXiv

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Apr 01, 2025
Via arXiv

SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency

Feb 04, 2025
Via arXiv