Picture for Haoran Cheng

Haoran Cheng

SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization

Add code
Apr 20, 2025
Figure 1 for SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization
Figure 2 for SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization
Figure 3 for SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization
Figure 4 for SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization
Viaarxiv icon

Discriminator-Free Direct Preference Optimization for Video Diffusion

Add code
Apr 11, 2025
Viaarxiv icon

GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators

Add code
Dec 20, 2024
Viaarxiv icon

VP-MEL: Visual Prompts Guided Multimodal Entity Linking

Add code
Dec 10, 2024
Figure 1 for VP-MEL: Visual Prompts Guided Multimodal Entity Linking
Figure 2 for VP-MEL: Visual Prompts Guided Multimodal Entity Linking
Figure 3 for VP-MEL: Visual Prompts Guided Multimodal Entity Linking
Figure 4 for VP-MEL: Visual Prompts Guided Multimodal Entity Linking
Viaarxiv icon

Searching Priors Makes Text-to-Video Synthesis Better

Add code
Jun 05, 2024
Figure 1 for Searching Priors Makes Text-to-Video Synthesis Better
Figure 2 for Searching Priors Makes Text-to-Video Synthesis Better
Figure 3 for Searching Priors Makes Text-to-Video Synthesis Better
Figure 4 for Searching Priors Makes Text-to-Video Synthesis Better
Viaarxiv icon

EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation

Add code
Feb 02, 2024
Figure 1 for EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation
Figure 2 for EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation
Figure 3 for EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation
Figure 4 for EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation
Viaarxiv icon

Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving

Add code
Dec 19, 2023
Figure 1 for Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving
Figure 2 for Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving
Figure 3 for Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving
Figure 4 for Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving
Viaarxiv icon

Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

Add code
Nov 29, 2023
Figure 1 for Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Figure 2 for Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Figure 3 for Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Figure 4 for Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Viaarxiv icon

DHOT-GM: Robust Graph Matching Using A Differentiable Hierarchical Optimal Transport Framework

Add code
Oct 18, 2023
Figure 1 for DHOT-GM: Robust Graph Matching Using A Differentiable Hierarchical Optimal Transport Framework
Figure 2 for DHOT-GM: Robust Graph Matching Using A Differentiable Hierarchical Optimal Transport Framework
Figure 3 for DHOT-GM: Robust Graph Matching Using A Differentiable Hierarchical Optimal Transport Framework
Figure 4 for DHOT-GM: Robust Graph Matching Using A Differentiable Hierarchical Optimal Transport Framework
Viaarxiv icon

MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

Add code
Aug 18, 2023
Figure 1 for MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection
Figure 2 for MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection
Figure 3 for MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection
Figure 4 for MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection
Viaarxiv icon