Yuxuan Wang

Sherman

Discrete Markov Bridge

May 26, 2025

Towards Reliable Large Audio Language Model

May 25, 2025

CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training

May 23, 2025

AudioMorphix: Training-free audio editing with diffusion probabilistic models

May 21, 2025

Leveraging Large Language Models for Command Injection Vulnerability Analysis in Python: An Empirical Study on Popular Open-Source Projects

May 21, 2025

Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image

May 20, 2025

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

May 19, 2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

May 19, 2025

JAEGER: Dual-Level Humanoid Whole-Body Controller

May 10, 2025

Probing and Inducing Combinational Creativity in Vision-Language Models

Apr 17, 2025