Picture for Tuyen Tran

Tuyen Tran

Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos

Add code
Aug 10, 2025
Viaarxiv icon

MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation

Add code
Apr 04, 2025
Viaarxiv icon

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Add code
Sep 09, 2024
Figure 1 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Figure 2 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Figure 3 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Figure 4 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Viaarxiv icon

Unified Framework with Consistency across Modalities for Human Activity Recognition

Add code
Sep 04, 2024
Figure 1 for Unified Framework with Consistency across Modalities for Human Activity Recognition
Figure 2 for Unified Framework with Consistency across Modalities for Human Activity Recognition
Figure 3 for Unified Framework with Consistency across Modalities for Human Activity Recognition
Figure 4 for Unified Framework with Consistency across Modalities for Human Activity Recognition
Viaarxiv icon

The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation

Add code
Aug 22, 2024
Viaarxiv icon