Picture for Youngjae Yu

Youngjae Yu

What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?

Add code
Oct 02, 2025
Viaarxiv icon

Zero-shot Multimodal Document Retrieval via Cross-modal Question Generation

Add code
Aug 23, 2025
Viaarxiv icon

InfoCausalQA:Can Models Perform Non-explicit Causal Reasoning Based on Infographic?

Add code
Aug 08, 2025
Viaarxiv icon

Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers

Add code
Jun 16, 2025
Viaarxiv icon

Are Any-to-Any Models More Consistent Across Modality Transfers Than Specialists?

Add code
May 30, 2025
Viaarxiv icon

Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making

Add code
May 26, 2025
Viaarxiv icon

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Add code
May 24, 2025
Viaarxiv icon

MAVL: A Multilingual Audio-Video Lyrics Dataset for Animated Song Translation

Add code
May 24, 2025
Viaarxiv icon

DUSK: Do Not Unlearn Shared Knowledge

Add code
May 21, 2025
Viaarxiv icon

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Add code
May 17, 2025
Viaarxiv icon