A2d


GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation

Add code
Jun 18, 2024
Figure 1 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Figure 2 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Figure 3 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Figure 4 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Viaarxiv icon

MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities

Add code
May 27, 2024
Viaarxiv icon

Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

Add code
Mar 03, 2024
Viaarxiv icon

Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation

Add code
Sep 21, 2023
Viaarxiv icon

OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation

Add code
Jul 18, 2023
Figure 1 for OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Figure 2 for OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Figure 3 for OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Figure 4 for OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Viaarxiv icon

LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation

Add code
Jun 14, 2023
Figure 1 for LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
Figure 2 for LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
Figure 3 for LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
Figure 4 for LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
Viaarxiv icon

A2D: Anywhere Anytime Drumming

Add code
Apr 04, 2023
Figure 1 for A2D: Anywhere Anytime Drumming
Figure 2 for A2D: Anywhere Anytime Drumming
Figure 3 for A2D: Anywhere Anytime Drumming
Figure 4 for A2D: Anywhere Anytime Drumming
Viaarxiv icon

Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation

Add code
Jun 08, 2022
Figure 1 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Figure 2 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Figure 3 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Figure 4 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Viaarxiv icon

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation

Add code
Apr 06, 2022
Figure 1 for Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
Figure 2 for Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
Figure 3 for Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
Figure 4 for Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
Viaarxiv icon

Local-Global Context Aware Transformer for Language-Guided Video Segmentation

Add code
Mar 18, 2022
Figure 1 for Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Figure 2 for Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Figure 3 for Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Figure 4 for Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Viaarxiv icon