Picture for Xin Wan

Xin Wan

TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning

Add code
Nov 07, 2025
Viaarxiv icon

TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding

Add code
Apr 02, 2025
Figure 1 for TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding
Figure 2 for TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding
Figure 3 for TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding
Figure 4 for TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding
Viaarxiv icon

MammothModa: Multi-Modal Large Language Model

Add code
Jun 26, 2024
Viaarxiv icon