Picture for Mengmeng Xu

Mengmeng Xu

Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation

Add code
Nov 27, 2022
Figure 1 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 2 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 3 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Figure 4 for Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation
Viaarxiv icon

Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization

Add code
Nov 18, 2022
Figure 1 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Figure 2 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Figure 3 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Figure 4 for Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
Viaarxiv icon

Negative Frames Matter in Egocentric Visual Query 2D Localization

Add code
Aug 03, 2022
Figure 1 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Figure 2 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Figure 3 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Figure 4 for Negative Frames Matter in Egocentric Visual Query 2D Localization
Viaarxiv icon

ETAD: A Unified Framework for Efficient Temporal Action Detection

Add code
May 14, 2022
Figure 1 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Figure 2 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Figure 3 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Figure 4 for ETAD: A Unified Framework for Efficient Temporal Action Detection
Viaarxiv icon

Contrastive Language-Action Pre-training for Temporal Localization

Add code
Apr 26, 2022
Figure 1 for Contrastive Language-Action Pre-training for Temporal Localization
Figure 2 for Contrastive Language-Action Pre-training for Temporal Localization
Figure 3 for Contrastive Language-Action Pre-training for Temporal Localization
Figure 4 for Contrastive Language-Action Pre-training for Temporal Localization
Viaarxiv icon

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Add code
Mar 03, 2022
Figure 1 for SegTAD: Precise Temporal Action Detection via Semantic Segmentation
Figure 2 for SegTAD: Precise Temporal Action Detection via Semantic Segmentation
Figure 3 for SegTAD: Precise Temporal Action Detection via Semantic Segmentation
Figure 4 for SegTAD: Precise Temporal Action Detection via Semantic Segmentation
Viaarxiv icon

Relation-aware Video Reading Comprehension for Temporal Language Grounding

Add code
Oct 18, 2021
Figure 1 for Relation-aware Video Reading Comprehension for Temporal Language Grounding
Figure 2 for Relation-aware Video Reading Comprehension for Temporal Language Grounding
Figure 3 for Relation-aware Video Reading Comprehension for Temporal Language Grounding
Figure 4 for Relation-aware Video Reading Comprehension for Temporal Language Grounding
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Oct 13, 2021
Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon

Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization

Add code
Mar 30, 2021
Figure 1 for Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
Figure 2 for Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
Figure 3 for Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
Figure 4 for Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
Viaarxiv icon

Boundary-sensitive Pre-training for Temporal Localization in Videos

Add code
Nov 24, 2020
Figure 1 for Boundary-sensitive Pre-training for Temporal Localization in Videos
Figure 2 for Boundary-sensitive Pre-training for Temporal Localization in Videos
Figure 3 for Boundary-sensitive Pre-training for Temporal Localization in Videos
Figure 4 for Boundary-sensitive Pre-training for Temporal Localization in Videos
Viaarxiv icon