Alert button
Picture for Xin Eric Wang

Xin Eric Wang

Alert button

LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation

Add code
Bookmark button
Alert button
May 18, 2023
Yujie Lu, Xianjun Yang, Xiujun Li, Xin Eric Wang, William Yang Wang

Figure 1 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Figure 2 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Figure 3 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Figure 4 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Viaarxiv icon

Discriminative Diffusion Models as Few-shot Vision and Language Learners

Add code
Bookmark button
Alert button
May 18, 2023
Xuehai He, Weixi Feng, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang

Figure 1 for Discriminative Diffusion Models as Few-shot Vision and Language Learners
Figure 2 for Discriminative Diffusion Models as Few-shot Vision and Language Learners
Figure 3 for Discriminative Diffusion Models as Few-shot Vision and Language Learners
Figure 4 for Discriminative Diffusion Models as Few-shot Vision and Language Learners
Viaarxiv icon

Multimodal Procedural Planning via Dual Text-Image Prompting

Add code
Bookmark button
Alert button
May 02, 2023
Yujie Lu, Pan Lu, Zhiyu Chen, Wanrong Zhu, Xin Eric Wang, William Yang Wang

Figure 1 for Multimodal Procedural Planning via Dual Text-Image Prompting
Figure 2 for Multimodal Procedural Planning via Dual Text-Image Prompting
Figure 3 for Multimodal Procedural Planning via Dual Text-Image Prompting
Figure 4 for Multimodal Procedural Planning via Dual Text-Image Prompting
Viaarxiv icon

Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment

Add code
Bookmark button
Alert button
May 02, 2023
Zhen Zhang, Jialu Wang, Xin Eric Wang

Figure 1 for Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
Figure 2 for Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
Figure 3 for Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
Figure 4 for Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
Viaarxiv icon

Multimodal Graph Transformer for Multimodal Question Answering

Add code
Bookmark button
Alert button
Apr 30, 2023
Xuehai He, Xin Eric Wang

Figure 1 for Multimodal Graph Transformer for Multimodal Question Answering
Figure 2 for Multimodal Graph Transformer for Multimodal Question Answering
Figure 3 for Multimodal Graph Transformer for Multimodal Question Answering
Figure 4 for Multimodal Graph Transformer for Multimodal Question Answering
Viaarxiv icon

ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation

Add code
Bookmark button
Alert button
Jan 30, 2023
Kaiwen Zhou, Kaizhi Zheng, Connor Pryor, Yilin Shen, Hongxia Jin, Lise Getoor, Xin Eric Wang

Figure 1 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Figure 2 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Figure 3 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Figure 4 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Viaarxiv icon

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis

Add code
Bookmark button
Alert button
Dec 09, 2022
Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang

Figure 1 for Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Figure 2 for Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Figure 3 for Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Figure 4 for Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Viaarxiv icon

Navigation as the Attacker Wishes? Towards Building Byzantine-Robust Embodied Agents under Federated Learning

Add code
Bookmark button
Alert button
Dec 02, 2022
Yunchao Zhang, Zonglin Di, Kaiwen Zhou, Cihang Xie, Xin Eric Wang

Figure 1 for Navigation as the Attacker Wishes? Towards Building Byzantine-Robust Embodied Agents under Federated Learning
Figure 2 for Navigation as the Attacker Wishes? Towards Building Byzantine-Robust Embodied Agents under Federated Learning
Figure 3 for Navigation as the Attacker Wishes? Towards Building Byzantine-Robust Embodied Agents under Federated Learning
Figure 4 for Navigation as the Attacker Wishes? Towards Building Byzantine-Robust Embodied Agents under Federated Learning
Viaarxiv icon

ComCLIP: Training-Free Compositional Image and Text Matching

Add code
Bookmark button
Alert button
Nov 25, 2022
Kenan Jiang, Xuehai He, Ruize Xu, Xin Eric Wang

Figure 1 for ComCLIP: Training-Free Compositional Image and Text Matching
Figure 2 for ComCLIP: Training-Free Compositional Image and Text Matching
Figure 3 for ComCLIP: Training-Free Compositional Image and Text Matching
Figure 4 for ComCLIP: Training-Free Compositional Image and Text Matching
Viaarxiv icon

CPL: Counterfactual Prompt Learning for Vision and Language Models

Add code
Bookmark button
Alert button
Oct 19, 2022
Xuehai He, Diji Yang, Weixi Feng, Tsu-Jui Fu, Arjun Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang

Figure 1 for CPL: Counterfactual Prompt Learning for Vision and Language Models
Figure 2 for CPL: Counterfactual Prompt Learning for Vision and Language Models
Figure 3 for CPL: Counterfactual Prompt Learning for Vision and Language Models
Figure 4 for CPL: Counterfactual Prompt Learning for Vision and Language Models
Viaarxiv icon