Alert button
Picture for Yusheng Xie

Yusheng Xie

Alert button

FairRAG: Fair Human Generation via Fair Retrieval Augmentation

Add code
Bookmark button
Alert button
Apr 05, 2024
Robik Shrestha, Yang Zou, Qiuyu Chen, Zhiheng Li, Yusheng Xie, Siqi Deng

Figure 1 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Figure 2 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Figure 3 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Figure 4 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Viaarxiv icon

On the Scalability of Diffusion-based Text-to-Image Generation

Add code
Bookmark button
Alert button
Apr 03, 2024
Hao Li, Yang Zou, Ying Wang, Orchid Majumder, Yusheng Xie, R. Manmatha, Ashwin Swaminathan, Zhuowen Tu, Stefano Ermon, Stefano Soatto

Viaarxiv icon

MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

Add code
Bookmark button
Alert button
Mar 05, 2024
Hossein Aboutalebi, Hwanjun Song, Yusheng Xie, Arshit Gupta, Justin Sun, Hang Su, Igor Shalyminov, Nikolaos Pappas, Siffi Singh, Saab Mansour

Figure 1 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 2 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 3 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 4 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Viaarxiv icon

Multiple-Question Multiple-Answer Text-VQA

Add code
Bookmark button
Alert button
Nov 15, 2023
Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan

Viaarxiv icon

SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation

Add code
Bookmark button
Alert button
Feb 07, 2023
Yash Patel, Yusheng Xie, Yi Zhu, Srikar Appalaraju, R. Manmatha

Figure 1 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Figure 2 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Figure 3 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Figure 4 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Viaarxiv icon

AIM: Adapting Image Models for Efficient Video Action Recognition

Add code
Bookmark button
Alert button
Feb 06, 2023
Taojiannan Yang, Yi Zhu, Yusheng Xie, Aston Zhang, Chen Chen, Mu Li

Figure 1 for AIM: Adapting Image Models for Efficient Video Action Recognition
Figure 2 for AIM: Adapting Image Models for Efficient Video Action Recognition
Figure 3 for AIM: Adapting Image Models for Efficient Video Action Recognition
Figure 4 for AIM: Adapting Image Models for Efficient Video Action Recognition
Viaarxiv icon

Towards Differential Relational Privacy and its use in Question Answering

Add code
Bookmark button
Alert button
Mar 30, 2022
Simone Bombari, Alessandro Achille, Zijian Wang, Yu-Xiang Wang, Yusheng Xie, Kunwar Yashraj Singh, Srikar Appalaraju, Vijay Mahadevan, Stefano Soatto

Figure 1 for Towards Differential Relational Privacy and its use in Question Answering
Figure 2 for Towards Differential Relational Privacy and its use in Question Answering
Figure 3 for Towards Differential Relational Privacy and its use in Question Answering
Figure 4 for Towards Differential Relational Privacy and its use in Question Answering
Viaarxiv icon

LaTr: Layout-Aware Transformer for Scene-Text VQA

Add code
Bookmark button
Alert button
Dec 24, 2021
Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha

Figure 1 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 2 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 3 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Figure 4 for LaTr: Layout-Aware Transformer for Scene-Text VQA
Viaarxiv icon

TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation

Add code
Bookmark button
Alert button
Oct 29, 2021
Haoyu Ma, Liangjian Chen, Deying Kong, Zhe Wang, Xingwei Liu, Hao Tang, Xiangyi Yan, Yusheng Xie, Shih-Yao Lin, Xiaohui Xie

Figure 1 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Figure 2 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Figure 3 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Figure 4 for TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation
Viaarxiv icon