Hongyin Zhao

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models

May 27, 2025

Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models

Oct 21, 2024

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

Mar 14, 2024