Picture for Chaoyi Wang

Chaoyi Wang

Preview WB-DH: Towards Whole Body Digital Human Bench for the Generation of Whole-body Talking Avatar Videos

Add code
Aug 12, 2025
Viaarxiv icon

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Add code
Jul 02, 2025
Viaarxiv icon

Towards Film-Making Production Dialogue, Narration, Monologue Adaptive Moving Dubbing Benchmarks

Add code
Apr 30, 2025
Viaarxiv icon

OCC-MLLM-CoT-Alpha: Towards Multi-stage Occlusion Recognition Based on Large Language Models via 3D-Aware Supervision and Chain-of-Thoughts Guidance

Add code
Apr 07, 2025
Figure 1 for OCC-MLLM-CoT-Alpha: Towards Multi-stage Occlusion Recognition Based on Large Language Models via 3D-Aware Supervision and Chain-of-Thoughts Guidance
Figure 2 for OCC-MLLM-CoT-Alpha: Towards Multi-stage Occlusion Recognition Based on Large Language Models via 3D-Aware Supervision and Chain-of-Thoughts Guidance
Figure 3 for OCC-MLLM-CoT-Alpha: Towards Multi-stage Occlusion Recognition Based on Large Language Models via 3D-Aware Supervision and Chain-of-Thoughts Guidance
Viaarxiv icon

PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Add code
Mar 09, 2025
Viaarxiv icon

Bilateral Guided Radiance Field Processing

Add code
Jun 01, 2024
Figure 1 for Bilateral Guided Radiance Field Processing
Figure 2 for Bilateral Guided Radiance Field Processing
Figure 3 for Bilateral Guided Radiance Field Processing
Figure 4 for Bilateral Guided Radiance Field Processing
Viaarxiv icon

Video Generation with Consistency Tuning

Add code
Mar 11, 2024
Figure 1 for Video Generation with Consistency Tuning
Figure 2 for Video Generation with Consistency Tuning
Figure 3 for Video Generation with Consistency Tuning
Viaarxiv icon