Picture for Weifeng Chen

Weifeng Chen

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Viaarxiv icon

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

Add code
Apr 23, 2024
Figure 1 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Figure 2 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Figure 3 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Figure 4 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Viaarxiv icon

Magic Clothing: Controllable Garment-Driven Image Synthesis

Add code
Apr 15, 2024
Figure 1 for Magic Clothing: Controllable Garment-Driven Image Synthesis
Figure 2 for Magic Clothing: Controllable Garment-Driven Image Synthesis
Figure 3 for Magic Clothing: Controllable Garment-Driven Image Synthesis
Figure 4 for Magic Clothing: Controllable Garment-Driven Image Synthesis
Viaarxiv icon

OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Add code
Mar 07, 2024
Figure 1 for OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Figure 2 for OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Figure 3 for OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Figure 4 for OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Viaarxiv icon

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Add code
Feb 09, 2024
Viaarxiv icon

DiffusionGPT: LLM-Driven Text-to-Image Generation System

Add code
Jan 18, 2024
Viaarxiv icon

AffordanceLLM: Grounding Affordance from Vision Language Models

Add code
Jan 12, 2024
Viaarxiv icon

Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search

Add code
Nov 15, 2023
Viaarxiv icon

Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models

Add code
May 23, 2023
Figure 1 for Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Figure 2 for Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Figure 3 for Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Figure 4 for Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Viaarxiv icon

MSRL: Distributed Reinforcement Learning with Dataflow Fragments

Add code
Oct 03, 2022
Figure 1 for MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Figure 2 for MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Figure 3 for MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Figure 4 for MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Viaarxiv icon