Picture for Shih-Fu Chang

Shih-Fu Chang

Columbia University

Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval

Add code
Jun 05, 2022
Figure 1 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 2 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 3 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 4 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Viaarxiv icon

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

Add code
May 29, 2022
Figure 1 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Figure 2 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Figure 3 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Figure 4 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Viaarxiv icon

Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting

Add code
Apr 16, 2022
Figure 1 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 2 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 3 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 4 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Viaarxiv icon

Fine-Grained Visual Entailment

Add code
Mar 29, 2022
Figure 1 for Fine-Grained Visual Entailment
Figure 2 for Fine-Grained Visual Entailment
Figure 3 for Fine-Grained Visual Entailment
Viaarxiv icon

Few-Shot Object Detection with Fully Cross-Transformer

Add code
Mar 28, 2022
Figure 1 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 2 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 3 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 4 for Few-Shot Object Detection with Fully Cross-Transformer
Viaarxiv icon

Learning To Recognize Procedural Activities with Distant Supervision

Add code
Jan 26, 2022
Figure 1 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 2 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 3 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 4 for Learning To Recognize Procedural Activities with Distant Supervision
Viaarxiv icon

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks

Add code
Jan 15, 2022
Figure 1 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 2 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 3 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 4 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Viaarxiv icon

CLIP-Event: Connecting Text and Images with Event Structures

Add code
Jan 13, 2022
Figure 1 for CLIP-Event: Connecting Text and Images with Event Structures
Figure 2 for CLIP-Event: Connecting Text and Images with Event Structures
Figure 3 for CLIP-Event: Connecting Text and Images with Event Structures
Figure 4 for CLIP-Event: Connecting Text and Images with Event Structures
Viaarxiv icon

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

Add code
Dec 20, 2021
Figure 1 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Figure 2 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Figure 3 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Figure 4 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Viaarxiv icon

Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks

Add code
Dec 17, 2021
Figure 1 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Figure 2 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Figure 3 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Figure 4 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Viaarxiv icon