Picture for Zhuo Chen

Zhuo Chen

refer to the report for detailed contributions

KnowCoder-V2: Deep Knowledge Analysis

Add code
Jun 07, 2025
Viaarxiv icon

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Add code
Jun 04, 2025
Viaarxiv icon

Towards Reliable Large Audio Language Model

Add code
May 25, 2025
Viaarxiv icon

Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning

Add code
May 22, 2025
Viaarxiv icon

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Add code
May 22, 2025
Viaarxiv icon

AudioMorphix: Training-free audio editing with diffusion probabilistic models

Add code
May 21, 2025
Viaarxiv icon

FreeMesh: Boosting Mesh Generation with Coordinates Merging

Add code
May 19, 2025
Viaarxiv icon

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Add code
May 19, 2025
Viaarxiv icon

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Add code
May 07, 2025
Viaarxiv icon

Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts

Add code
Apr 18, 2025
Viaarxiv icon