Picture for Zhuo Chen

Zhuo Chen

refer to the report for detailed contributions

KnowCoder-V2: Deep Knowledge Analysis

Add code
Jun 07, 2025
Figure 1 for KnowCoder-V2: Deep Knowledge Analysis
Figure 2 for KnowCoder-V2: Deep Knowledge Analysis
Figure 3 for KnowCoder-V2: Deep Knowledge Analysis
Figure 4 for KnowCoder-V2: Deep Knowledge Analysis
Viaarxiv icon

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Add code
Jun 04, 2025
Viaarxiv icon

Towards Reliable Large Audio Language Model

Add code
May 25, 2025
Viaarxiv icon

Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning

Add code
May 22, 2025
Figure 1 for Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning
Figure 2 for Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning
Figure 3 for Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning
Figure 4 for Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning
Viaarxiv icon

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Add code
May 22, 2025
Figure 1 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Figure 2 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Figure 3 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Figure 4 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Viaarxiv icon

AudioMorphix: Training-free audio editing with diffusion probabilistic models

Add code
May 21, 2025
Figure 1 for AudioMorphix: Training-free audio editing with diffusion probabilistic models
Figure 2 for AudioMorphix: Training-free audio editing with diffusion probabilistic models
Figure 3 for AudioMorphix: Training-free audio editing with diffusion probabilistic models
Figure 4 for AudioMorphix: Training-free audio editing with diffusion probabilistic models
Viaarxiv icon

FreeMesh: Boosting Mesh Generation with Coordinates Merging

Add code
May 19, 2025
Figure 1 for FreeMesh: Boosting Mesh Generation with Coordinates Merging
Figure 2 for FreeMesh: Boosting Mesh Generation with Coordinates Merging
Figure 3 for FreeMesh: Boosting Mesh Generation with Coordinates Merging
Figure 4 for FreeMesh: Boosting Mesh Generation with Coordinates Merging
Viaarxiv icon

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Add code
May 19, 2025
Figure 1 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Figure 2 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Figure 3 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Figure 4 for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Viaarxiv icon

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Add code
May 07, 2025
Viaarxiv icon

Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts

Add code
Apr 18, 2025
Figure 1 for Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts
Figure 2 for Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts
Figure 3 for Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts
Figure 4 for Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts
Viaarxiv icon