SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

May 22, 2025


View paper on arXiv
