Picture for Xiang Yin

Xiang Yin

How far have we gone in Generative Image Restoration? A study on its capability, limitations and evaluation practices

Add code
Mar 05, 2026
Viaarxiv icon

Position: Evaluation of Visual Processing Should Be Human-Centered, Not Metric-Centered

Add code
Feb 28, 2026
Viaarxiv icon

ALIVE: Animate Your World with Lifelike Audio-Video Generation

Add code
Feb 09, 2026
Viaarxiv icon

RRT*former: Environment-Aware Sampling-Based Motion Planning using Transformer

Add code
Nov 19, 2025
Viaarxiv icon

Bridging Perception and Planning: Towards End-to-End Planning for Signal Temporal Logic Tasks

Add code
Sep 16, 2025
Viaarxiv icon

UniTalker: Conversational Speech-Visual Synthesis

Add code
Aug 06, 2025
Viaarxiv icon

Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis

Add code
May 19, 2025
Viaarxiv icon

Online Synthesis of Control Barrier Functions with Local Occupancy Grid Maps for Safe Navigation in Unknown Environments

Add code
May 17, 2025
Viaarxiv icon

Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Add code
Feb 26, 2025
Figure 1 for Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Figure 2 for Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Figure 3 for Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Figure 4 for Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Viaarxiv icon

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation

Add code
Feb 10, 2025
Figure 1 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 2 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 3 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 4 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Viaarxiv icon