Zhipeng Hu

LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation

Jun 30, 2024
Via arXiv

Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization

Jun 24, 2024
Via arXiv

XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Feb 20, 2024
Via arXiv

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Jan 23, 2024
Via arXiv

Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation

Jan 02, 2024
Via arXiv

Text-Guided 3D Face Synthesis -- From Generation to Editing

Dec 01, 2023
Via arXiv

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model

Oct 03, 2023
Via arXiv

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior

Aug 25, 2023
Via arXiv

Structure-CLIP: Enhance Multi-modal Language Representations with Structure Knowledge

May 06, 2023
Via arXiv

TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles

Apr 01, 2023
Via arXiv