Picture for Wanggui He

Wanggui He

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

Add code
Jul 11, 2024
Viaarxiv icon

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Add code
Jun 11, 2024
Figure 1 for MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
Figure 2 for MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
Figure 3 for MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
Figure 4 for MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
Viaarxiv icon

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

Add code
Apr 22, 2024
Viaarxiv icon

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Add code
Nov 23, 2023
Viaarxiv icon

Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training

Add code
Apr 19, 2021
Figure 1 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Figure 2 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Figure 3 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Figure 4 for Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Viaarxiv icon