Alert button
Picture for Xiang Yu

Xiang Yu

Alert button

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!

Add code
Bookmark button
Alert button
Jun 06, 2023
Zaid Khan, Vijay Kumar BG, Samuel Schulter, Xiang Yu, Yun Fu, Manmohan Chandraker

Figure 1 for Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Figure 2 for Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Figure 3 for Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Figure 4 for Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Viaarxiv icon

Selective Structured State-Spaces for Long-Form Video Understanding

Add code
Bookmark button
Alert button
Mar 25, 2023
Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid

Figure 1 for Selective Structured State-Spaces for Long-Form Video Understanding
Figure 2 for Selective Structured State-Spaces for Long-Form Video Understanding
Figure 3 for Selective Structured State-Spaces for Long-Form Video Understanding
Figure 4 for Selective Structured State-Spaces for Long-Form Video Understanding
Viaarxiv icon

EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification

Add code
Bookmark button
Alert button
Aug 03, 2022
Xiang Yu, Zhe Geng, Xiaohua Huang, Qinglu Wang, Daiyin Zhu

Figure 1 for EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Figure 2 for EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Figure 3 for EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Figure 4 for EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Viaarxiv icon

Single-Stream Multi-Level Alignment for Vision-Language Pretraining

Add code
Bookmark button
Alert button
Mar 30, 2022
Zaid Khan, Vijay Kumar BG, Xiang Yu, Samuel Schulter, Manmohan Chandraker, Yun Fu

Figure 1 for Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Figure 2 for Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Figure 3 for Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Figure 4 for Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Viaarxiv icon

Controllable Dynamic Multi-Task Architectures

Add code
Bookmark button
Alert button
Mar 28, 2022
Dripta S. Raychaudhuri, Yumin Suh, Samuel Schulter, Xiang Yu, Masoud Faraki, Amit K. Roy-Chowdhury, Manmohan Chandraker

Figure 1 for Controllable Dynamic Multi-Task Architectures
Figure 2 for Controllable Dynamic Multi-Task Architectures
Figure 3 for Controllable Dynamic Multi-Task Architectures
Figure 4 for Controllable Dynamic Multi-Task Architectures
Viaarxiv icon

GraphCoCo: Graph Complementary Contrastive Learning

Add code
Bookmark button
Alert button
Mar 24, 2022
Jiawei Sun, Junchi Yan, Chentao Wu, Yue Ding, Ruoxin Chen, Xiang Yu, Xinyu Lu, Jie Li

Figure 1 for GraphCoCo: Graph Complementary Contrastive Learning
Figure 2 for GraphCoCo: Graph Complementary Contrastive Learning
Figure 3 for GraphCoCo: Graph Complementary Contrastive Learning
Figure 4 for GraphCoCo: Graph Complementary Contrastive Learning
Viaarxiv icon

On Generalizing Beyond Domains in Cross-Domain Continual Learning

Add code
Bookmark button
Alert button
Mar 08, 2022
Christian Simon, Masoud Faraki, Yi-Hsuan Tsai, Xiang Yu, Samuel Schulter, Yumin Suh, Mehrtash Harandi, Manmohan Chandraker

Figure 1 for On Generalizing Beyond Domains in Cross-Domain Continual Learning
Figure 2 for On Generalizing Beyond Domains in Cross-Domain Continual Learning
Figure 3 for On Generalizing Beyond Domains in Cross-Domain Continual Learning
Figure 4 for On Generalizing Beyond Domains in Cross-Domain Continual Learning
Viaarxiv icon

Learning Cross-modal Contrastive Features for Video Domain Adaptation

Add code
Bookmark button
Alert button
Aug 26, 2021
Donghyun Kim, Yi-Hsuan Tsai, Bingbing Zhuang, Xiang Yu, Stan Sclaroff, Kate Saenko, Manmohan Chandraker

Figure 1 for Learning Cross-modal Contrastive Features for Video Domain Adaptation
Figure 2 for Learning Cross-modal Contrastive Features for Video Domain Adaptation
Figure 3 for Learning Cross-modal Contrastive Features for Video Domain Adaptation
Figure 4 for Learning Cross-modal Contrastive Features for Video Domain Adaptation
Viaarxiv icon

Cross-Domain Similarity Learning for Face Recognition in Unseen Domains

Add code
Bookmark button
Alert button
Mar 12, 2021
Masoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker

Figure 1 for Cross-Domain Similarity Learning for Face Recognition in Unseen Domains
Figure 2 for Cross-Domain Similarity Learning for Face Recognition in Unseen Domains
Figure 3 for Cross-Domain Similarity Learning for Face Recognition in Unseen Domains
Figure 4 for Cross-Domain Similarity Learning for Face Recognition in Unseen Domains
Viaarxiv icon

Adversarial Attacks and Defenses in Physiological Computing: A Systematic Review

Add code
Bookmark button
Alert button
Feb 11, 2021
Dongrui Wu, Weili Fang, Yi Zhang, Liuqing Yang, Xiaodong Xu, Hanbin Luo, Xiang Yu

Figure 1 for Adversarial Attacks and Defenses in Physiological Computing: A Systematic Review
Figure 2 for Adversarial Attacks and Defenses in Physiological Computing: A Systematic Review
Figure 3 for Adversarial Attacks and Defenses in Physiological Computing: A Systematic Review
Figure 4 for Adversarial Attacks and Defenses in Physiological Computing: A Systematic Review
Viaarxiv icon