Xi Yin

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

Apr 20, 2022
Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

Apr 07, 2022
Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh

Proactive Image Manipulation Detection

Mar 31, 2022
Vishal Asnani, Xi Yin, Tal Hassner, Sijia Liu, Xiaoming Liu

Reverse Engineering of Generative Models: Inferring Model Hyperparameters from Generated Images

Jun 15, 2021
Vishal Asnani, Xi Yin, Tal Hassner, Xiaoming Liu

KGSynNet: A Novel Entity Synonyms Discovery Framework with Knowledge Graph

Apr 01, 2021
Yiying Yang, Xi Yin, Haiqin Yang, Xingjian Fei, Hao Peng, Kaijie Zhou, Kunfeng Lai, Jianping Shen

A Multiplexed Network for End-to-End, Multilingual OCR

Mar 29, 2021
Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

Dec 14, 2020
Vitor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption

Dec 08, 2020
Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florencio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo

VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training

Sep 28, 2020
Xiaowei Hu, Xi Yin, Kevin Lin, Lijuan Wang, Lei Zhang, Jianfeng Gao, Zicheng Liu
