Picture for Kaituo Feng

Kaituo Feng

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Add code
Jun 11, 2025
Viaarxiv icon

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Add code
May 27, 2025
Viaarxiv icon

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Add code
May 22, 2025
Viaarxiv icon

Video-R1: Reinforcing Video Reasoning in MLLMs

Add code
Mar 27, 2025
Viaarxiv icon

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Add code
Dec 03, 2024
Figure 1 for AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
Figure 2 for AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
Figure 3 for AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
Figure 4 for AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
Viaarxiv icon

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?

Add code
Oct 02, 2024
Viaarxiv icon

Keypoint-based Progressive Chain-of-Thought Distillation for LLMs

Add code
May 25, 2024
Viaarxiv icon

On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving

Add code
Mar 02, 2024
Viaarxiv icon

Learning to Generate Parameters of ConvNets for Unseen Image Data

Add code
Oct 24, 2023
Viaarxiv icon

Shared Growth of Graph Neural Networks via Free-direction Knowledge Distillation

Add code
Jul 08, 2023
Viaarxiv icon