Picture for Shoubin Yu

Shoubin Yu

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

Add code
May 29, 2024
Figure 1 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Figure 2 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Figure 3 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Figure 4 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Viaarxiv icon

RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives

Add code
May 28, 2024
Viaarxiv icon

STAR: A Benchmark for Situated Reasoning in Real-World Videos

Add code
May 15, 2024
Figure 1 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 2 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 3 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Figure 4 for STAR: A Benchmark for Situated Reasoning in Real-World Videos
Viaarxiv icon

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Add code
Feb 08, 2024
Viaarxiv icon

A Simple LLM Framework for Long-Range Video Question-Answering

Add code
Dec 28, 2023
Viaarxiv icon

Self-Chained Image-Language Model for Video Localization and Question Answering

Add code
May 11, 2023
Figure 1 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 2 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 3 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 4 for Self-Chained Image-Language Model for Video Localization and Question Answering
Viaarxiv icon

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Add code
Dec 08, 2021
Figure 1 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Figure 2 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Figure 3 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Figure 4 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Viaarxiv icon