Alert button
Picture for Xudong Lin

Xudong Lin

Alert button

Supervised Masked Knowledge Distillation for Few-Shot Transformers

Add code
Bookmark button
Alert button
Mar 29, 2023
Han Lin, Guangxing Han, Jiawei Ma, Shiyuan Huang, Xudong Lin, Shih-Fu Chang

Figure 1 for Supervised Masked Knowledge Distillation for Few-Shot Transformers
Figure 2 for Supervised Masked Knowledge Distillation for Few-Shot Transformers
Figure 3 for Supervised Masked Knowledge Distillation for Few-Shot Transformers
Figure 4 for Supervised Masked Knowledge Distillation for Few-Shot Transformers
Viaarxiv icon

In Defense of Structural Symbolic Representation for Video Event-Relation Prediction

Add code
Bookmark button
Alert button
Jan 06, 2023
Andrew Lu, Xudong Lin, Yulei Niu, Shih-Fu Chang

Figure 1 for In Defense of Structural Symbolic Representation for Video Event-Relation Prediction
Figure 2 for In Defense of Structural Symbolic Representation for Video Event-Relation Prediction
Figure 3 for In Defense of Structural Symbolic Representation for Video Event-Relation Prediction
Figure 4 for In Defense of Structural Symbolic Representation for Video Event-Relation Prediction
Viaarxiv icon

TempCLR: Temporal Alignment Representation with Contrastive Learning

Add code
Bookmark button
Alert button
Dec 28, 2022
Yuncong Yang, Jiawei Ma, Shiyuan Huang, Long Chen, Xudong Lin, Guangxing Han, Shih-Fu Chang

Figure 1 for TempCLR: Temporal Alignment Representation with Contrastive Learning
Figure 2 for TempCLR: Temporal Alignment Representation with Contrastive Learning
Figure 3 for TempCLR: Temporal Alignment Representation with Contrastive Learning
Figure 4 for TempCLR: Temporal Alignment Representation with Contrastive Learning
Viaarxiv icon

Video Event Extraction via Tracking Visual States of Arguments

Add code
Bookmark button
Alert button
Nov 05, 2022
Guang Yang, Manling Li, Jiajie Zhang, Xudong Lin, Shih-Fu Chang, Heng Ji

Figure 1 for Video Event Extraction via Tracking Visual States of Arguments
Figure 2 for Video Event Extraction via Tracking Visual States of Arguments
Figure 3 for Video Event Extraction via Tracking Visual States of Arguments
Figure 4 for Video Event Extraction via Tracking Visual States of Arguments
Viaarxiv icon

Learning to Decompose Visual Features with Latent Textual Prompts

Add code
Bookmark button
Alert button
Oct 09, 2022
Feng Wang, Manling Li, Xudong Lin, Hairong Lv, Alexander G. Schwing, Heng Ji

Figure 1 for Learning to Decompose Visual Features with Latent Textual Prompts
Figure 2 for Learning to Decompose Visual Features with Latent Textual Prompts
Figure 3 for Learning to Decompose Visual Features with Latent Textual Prompts
Figure 4 for Learning to Decompose Visual Features with Latent Textual Prompts
Viaarxiv icon

Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World

Add code
Bookmark button
Alert button
Jun 14, 2022
Hammad A. Ayyubi, Christopher Thomas, Lovish Chum, Rahul Lokesh, Yulei Niu, Xudong Lin, Long Chen, Jaywon Koo, Sounak Ray, Shih-Fu Chang

Figure 1 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Figure 2 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Figure 3 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Figure 4 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Viaarxiv icon

Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval

Add code
Bookmark button
Alert button
Jun 05, 2022
Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu Chang

Figure 1 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 2 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 3 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 4 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Viaarxiv icon

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

Add code
Bookmark button
Alert button
May 29, 2022
Zhenhailong Wang, Manling Li, Ruochen Xu, Luowei Zhou, Jie Lei, Xudong Lin, Shuohang Wang, Ziyi Yang, Chenguang Zhu, Derek Hoiem, Shih-Fu Chang, Mohit Bansal, Heng Ji

Figure 1 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Figure 2 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Figure 3 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Figure 4 for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
Viaarxiv icon

Revitalize Region Feature for Democratizing Video-Language Pre-training

Add code
Bookmark button
Alert button
Mar 19, 2022
Guanyu Cai, Yixiao Ge, Alex Jinpeng Wang, Rui Yan, Xudong Lin, Ying Shan, Lianghua He, Xiaohu Qie, Jianping Wu, Mike Zheng Shou

Figure 1 for Revitalize Region Feature for Democratizing Video-Language Pre-training
Figure 2 for Revitalize Region Feature for Democratizing Video-Language Pre-training
Figure 3 for Revitalize Region Feature for Democratizing Video-Language Pre-training
Figure 4 for Revitalize Region Feature for Democratizing Video-Language Pre-training
Viaarxiv icon