Picture for Yusheng Xie

Yusheng Xie

Efficient Scaling of Diffusion Transformers for Text-to-Image Generation

Add code
Dec 16, 2024
Figure 1 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 2 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 3 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 4 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Viaarxiv icon

ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models

Add code
Aug 16, 2024
Figure 1 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Figure 2 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Figure 3 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Figure 4 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Viaarxiv icon

Diffusion Soup: Model Merging for Text-to-Image Diffusion Models

Add code
Jun 12, 2024
Figure 1 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Figure 2 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Figure 3 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Figure 4 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Viaarxiv icon

FairRAG: Fair Human Generation via Fair Retrieval Augmentation

Add code
Apr 05, 2024
Figure 1 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Figure 2 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Figure 3 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Figure 4 for FairRAG: Fair Human Generation via Fair Retrieval Augmentation
Viaarxiv icon

On the Scalability of Diffusion-based Text-to-Image Generation

Add code
Apr 03, 2024
Viaarxiv icon

MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

Add code
Mar 05, 2024
Figure 1 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 2 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 3 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Figure 4 for MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Viaarxiv icon

Multiple-Question Multiple-Answer Text-VQA

Add code
Nov 15, 2023
Figure 1 for Multiple-Question Multiple-Answer Text-VQA
Figure 2 for Multiple-Question Multiple-Answer Text-VQA
Figure 3 for Multiple-Question Multiple-Answer Text-VQA
Figure 4 for Multiple-Question Multiple-Answer Text-VQA
Viaarxiv icon

SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation

Add code
Feb 07, 2023
Figure 1 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Figure 2 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Figure 3 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Figure 4 for SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation
Viaarxiv icon

AIM: Adapting Image Models for Efficient Video Action Recognition

Add code
Feb 06, 2023
Figure 1 for AIM: Adapting Image Models for Efficient Video Action Recognition
Figure 2 for AIM: Adapting Image Models for Efficient Video Action Recognition
Figure 3 for AIM: Adapting Image Models for Efficient Video Action Recognition
Figure 4 for AIM: Adapting Image Models for Efficient Video Action Recognition
Viaarxiv icon

Towards Differential Relational Privacy and its use in Question Answering

Add code
Mar 30, 2022
Figure 1 for Towards Differential Relational Privacy and its use in Question Answering
Figure 2 for Towards Differential Relational Privacy and its use in Question Answering
Figure 3 for Towards Differential Relational Privacy and its use in Question Answering
Figure 4 for Towards Differential Relational Privacy and its use in Question Answering
Viaarxiv icon