Alert button
Picture for Shih-Fu Chang

Shih-Fu Chang

Alert button

Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting

Add code
Bookmark button
Alert button
Apr 16, 2022
Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Rama Chellappa, Shih-Fu Chang

Figure 1 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 2 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 3 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 4 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Viaarxiv icon

Fine-Grained Visual Entailment

Add code
Bookmark button
Alert button
Mar 29, 2022
Christopher Thomas, Yipeng Zhang, Shih-Fu Chang

Figure 1 for Fine-Grained Visual Entailment
Figure 2 for Fine-Grained Visual Entailment
Figure 3 for Fine-Grained Visual Entailment
Viaarxiv icon

Few-Shot Object Detection with Fully Cross-Transformer

Add code
Bookmark button
Alert button
Mar 28, 2022
Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Shih-Fu Chang

Figure 1 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 2 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 3 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 4 for Few-Shot Object Detection with Fully Cross-Transformer
Viaarxiv icon

Learning To Recognize Procedural Activities with Distant Supervision

Add code
Bookmark button
Alert button
Jan 26, 2022
Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani

Figure 1 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 2 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 3 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 4 for Learning To Recognize Procedural Activities with Distant Supervision
Viaarxiv icon

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks

Add code
Bookmark button
Alert button
Jan 15, 2022
Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan

Figure 1 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 2 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 3 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Figure 4 for CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Viaarxiv icon

CLIP-Event: Connecting Text and Images with Event Structures

Add code
Bookmark button
Alert button
Jan 13, 2022
Manling Li, Ruochen Xu, Shuohang Wang, Luowei Zhou, Xudong Lin, Chenguang Zhu, Michael Zeng, Heng Ji, Shih-Fu Chang

Figure 1 for CLIP-Event: Connecting Text and Images with Event Structures
Figure 2 for CLIP-Event: Connecting Text and Images with Event Structures
Figure 3 for CLIP-Event: Connecting Text and Images with Event Structures
Figure 4 for CLIP-Event: Connecting Text and Images with Event Structures
Viaarxiv icon

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

Add code
Bookmark button
Alert button
Dec 20, 2021
Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander Schwing, Heng Ji

Figure 1 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Figure 2 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Figure 3 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Figure 4 for MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Viaarxiv icon

Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks

Add code
Bookmark button
Alert button
Dec 17, 2021
Guangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma, Shih-Fu Chang

Figure 1 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Figure 2 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Figure 3 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Figure 4 for Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Viaarxiv icon

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

Add code
Bookmark button
Alert button
Dec 16, 2021
Zhecan Wang, Haoxuan You, Liunian Harold Li, Alireza Zareian, Suji Park, Yiqing Liang, Kai-Wei Chang, Shih-Fu Chang

Figure 1 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Figure 2 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Figure 3 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Figure 4 for SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Viaarxiv icon

PreViTS: Contrastive Pretraining with Video Tracking Supervision

Add code
Bookmark button
Alert button
Dec 01, 2021
Brian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik

Figure 1 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 2 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 3 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 4 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Viaarxiv icon