Jianxiang He

VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

Aug 09, 2025
Via arXiv

Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding

Mar 17, 2025
Via arXiv

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Dec 16, 2024
Via arXiv