Limin Wang

VFIMamba: Video Frame Interpolation with State Space Models

Jul 02, 2024

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

Jun 27, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 13, 2024

OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 12, 2024

Open-Vocabulary Spatio-Temporal Action Detection

May 17, 2024

Accelerating Image Generation with Sub-path Linear Approximation Model

Apr 23, 2024

Sparse Global Matching for Video Frame Interpolation with Large Motion

Apr 15, 2024

SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos

Apr 06, 2024

Dual DETRs for Multi-Label Temporal Action Detection

Mar 31, 2024

Multiple Object Tracking as ID Prediction

Mar 25, 2024