Picture for Yongdong Luo

Yongdong Luo

Event-Anchored Frame Selection for Effective Long-Video Understanding

Add code
Mar 01, 2026
Viaarxiv icon

Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding

Add code
Feb 28, 2026
Viaarxiv icon

QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension

Add code
Mar 11, 2025
Figure 1 for QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension
Figure 2 for QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension
Figure 3 for QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension
Figure 4 for QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension
Viaarxiv icon

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

Add code
Nov 20, 2024
Figure 1 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 2 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 3 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 4 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Viaarxiv icon

Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization

Add code
Apr 17, 2024
Figure 1 for Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization
Figure 2 for Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization
Figure 3 for Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization
Figure 4 for Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization
Viaarxiv icon

A Unified Framework for 3D Point Cloud Visual Grounding

Add code
Aug 23, 2023
Figure 1 for A Unified Framework for 3D Point Cloud Visual Grounding
Figure 2 for A Unified Framework for 3D Point Cloud Visual Grounding
Figure 3 for A Unified Framework for 3D Point Cloud Visual Grounding
Figure 4 for A Unified Framework for 3D Point Cloud Visual Grounding
Viaarxiv icon