Hefeng Wu

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

Apr 23, 2024
Via arXiv

DiffusionGPT: LLM-Driven Text-to-Image Generation System

Jan 18, 2024
Via arXiv

Dual-View Data Hallucination with Semantic Relation Guidance for Few-Shot Image Recognition

Jan 13, 2024
Via arXiv

SQLNet: Scale-Modulated Query and Localization Network for Few-Shot Class-Agnostic Counting

Nov 16, 2023
Via arXiv

Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search

Nov 15, 2023
Via arXiv

SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction

Oct 10, 2023
Via arXiv

Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation

Sep 23, 2023
Via arXiv

Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models

May 23, 2023
Via arXiv

Multi-object Video Generation from Single Frame Layouts

May 06, 2023
Via arXiv

Category-Adaptive Label Discovery and Noise Rejection for Multi-label Image Recognition with Partial Positive Labels

Nov 15, 2022
Via arXiv