Picture for Shoubin Yu

Shoubin Yu

STAR: A Benchmark for Situated Reasoning in Real-World Videos

Add code
May 15, 2024
Viaarxiv icon

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Add code
Feb 08, 2024
Viaarxiv icon

A Simple LLM Framework for Long-Range Video Question-Answering

Add code
Dec 28, 2023
Viaarxiv icon

Self-Chained Image-Language Model for Video Localization and Question Answering

Add code
May 11, 2023
Figure 1 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 2 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 3 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 4 for Self-Chained Image-Language Model for Video Localization and Question Answering
Viaarxiv icon

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Add code
Dec 08, 2021
Figure 1 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Figure 2 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Figure 3 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Figure 4 for Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Viaarxiv icon