Zijun Geng

Fine-R1: Make Multi-modal LLMs Excel in Fine-Grained Visual Recognition by Chain-of-Thought Reasoning

Feb 07, 2026
Via arXiv

Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models

Jan 25, 2025
Via arXiv