Picture for Yongxin Yang

Yongxin Yang

Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities

Add code
Apr 01, 2025
Viaarxiv icon

Generating Compositional Scenes via Text-to-image RGBA Instance Generation

Add code
Nov 16, 2024
Figure 1 for Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Figure 2 for Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Figure 3 for Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Figure 4 for Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Viaarxiv icon

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Add code
Apr 03, 2024
Figure 1 for MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Figure 2 for MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Figure 3 for MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Figure 4 for MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Viaarxiv icon

Safety Fine-Tuning at No Cost: A Baseline for Vision Large Language Models

Add code
Feb 03, 2024
Viaarxiv icon

SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields

Add code
Dec 26, 2023
Viaarxiv icon

Optimisation-Based Multi-Modal Semantic Image Editing

Add code
Nov 28, 2023
Figure 1 for Optimisation-Based Multi-Modal Semantic Image Editing
Figure 2 for Optimisation-Based Multi-Modal Semantic Image Editing
Figure 3 for Optimisation-Based Multi-Modal Semantic Image Editing
Figure 4 for Optimisation-Based Multi-Modal Semantic Image Editing
Viaarxiv icon

ChiroDiff: Modelling chirographic data with Diffusion Models

Add code
Apr 07, 2023
Figure 1 for ChiroDiff: Modelling chirographic data with Diffusion Models
Figure 2 for ChiroDiff: Modelling chirographic data with Diffusion Models
Figure 3 for ChiroDiff: Modelling chirographic data with Diffusion Models
Figure 4 for ChiroDiff: Modelling chirographic data with Diffusion Models
Viaarxiv icon

Learning to Name Classes for Vision and Language Models

Add code
Apr 04, 2023
Viaarxiv icon

Region Proposal Network Pre-Training Helps Label-Efficient Object Detection

Add code
Nov 16, 2022
Viaarxiv icon

ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization

Add code
Oct 17, 2022
Figure 1 for ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization
Figure 2 for ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization
Figure 3 for ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization
Figure 4 for ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization
Viaarxiv icon