Picture for Junbo Zhang

Junbo Zhang

JoyAgent-JDGenie: Technical Report on the GAIA

Add code
Oct 01, 2025
Viaarxiv icon

MiDashengLM: Efficient Audio Understanding with General Audio Captions

Add code
Aug 06, 2025
Figure 1 for MiDashengLM: Efficient Audio Understanding with General Audio Captions
Figure 2 for MiDashengLM: Efficient Audio Understanding with General Audio Captions
Figure 3 for MiDashengLM: Efficient Audio Understanding with General Audio Captions
Figure 4 for MiDashengLM: Efficient Audio Understanding with General Audio Captions
Viaarxiv icon

Unified Vision-Language-Action Model

Add code
Jun 24, 2025
Figure 1 for Unified Vision-Language-Action Model
Figure 2 for Unified Vision-Language-Action Model
Figure 3 for Unified Vision-Language-Action Model
Figure 4 for Unified Vision-Language-Action Model
Viaarxiv icon

Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders

Add code
Jun 13, 2025
Viaarxiv icon

GLAP: General contrastive audio-text pretraining across domains and languages

Add code
Jun 12, 2025
Viaarxiv icon

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

Add code
May 22, 2025
Viaarxiv icon

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Add code
Mar 17, 2025
Figure 1 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Figure 2 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Figure 3 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Figure 4 for Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
Viaarxiv icon

Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data

Add code
Feb 28, 2025
Figure 1 for Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data
Figure 2 for Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data
Figure 3 for Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data
Figure 4 for Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data
Viaarxiv icon

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Add code
Feb 27, 2025
Viaarxiv icon

The ICME 2025 Audio Encoder Capability Challenge

Add code
Jan 25, 2025
Viaarxiv icon