Picture for Yunzhuo Hao

Yunzhuo Hao

CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs

Add code
May 30, 2025
Viaarxiv icon

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Add code
May 13, 2025
Viaarxiv icon

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Add code
May 12, 2025
Viaarxiv icon

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Add code
Apr 23, 2025
Viaarxiv icon

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Add code
Apr 08, 2025
Viaarxiv icon

Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark

Add code
Jan 09, 2025
Viaarxiv icon

Exploring Backdoor Vulnerabilities of Chat Models

Add code
Apr 03, 2024
Viaarxiv icon