Picture for Yuxuan Wang

Yuxuan Wang

Sherman

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Add code
Jul 03, 2025
Viaarxiv icon

From Experts to a Generalist: Toward General Whole-Body Control for Humanoid Robots

Add code
Jun 15, 2025
Viaarxiv icon

RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control

Add code
Jun 15, 2025
Viaarxiv icon

DragNeXt: Rethinking Drag-Based Image Editing

Add code
Jun 09, 2025
Viaarxiv icon

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Add code
Jun 04, 2025
Viaarxiv icon

Discrete Markov Bridge

Add code
May 26, 2025
Viaarxiv icon

Towards Reliable Large Audio Language Model

Add code
May 25, 2025
Viaarxiv icon

CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training

Add code
May 23, 2025
Viaarxiv icon

AudioMorphix: Training-free audio editing with diffusion probabilistic models

Add code
May 21, 2025
Viaarxiv icon

Leveraging Large Language Models for Command Injection Vulnerability Analysis in Python: An Empirical Study on Popular Open-Source Projects

Add code
May 21, 2025
Viaarxiv icon