Picture for Yufei Zhan

Yufei Zhan

VFaith: Do Large Multimodal Models Really Reason on Seen Images Rather than Previous Memories?

Add code
Jun 13, 2025
Viaarxiv icon

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models

Add code
May 27, 2025
Viaarxiv icon

Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models

Add code
Oct 21, 2024
Figure 1 for Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Figure 2 for Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Figure 3 for Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Figure 4 for Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Viaarxiv icon

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

Add code
Mar 14, 2024
Figure 1 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 2 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 3 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 4 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Viaarxiv icon

Mitigating Hallucination in Visual Language Models with Visual Supervision

Add code
Nov 27, 2023
Figure 1 for Mitigating Hallucination in Visual Language Models with Visual Supervision
Figure 2 for Mitigating Hallucination in Visual Language Models with Visual Supervision
Figure 3 for Mitigating Hallucination in Visual Language Models with Visual Supervision
Figure 4 for Mitigating Hallucination in Visual Language Models with Visual Supervision
Viaarxiv icon

Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models

Add code
Nov 27, 2023
Figure 1 for Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Figure 2 for Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Figure 3 for Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Figure 4 for Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Viaarxiv icon