Junbo Zhang

Unified Vision-Language-Action Model

Jun 24, 2025
Via arXiv

Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders

Jun 13, 2025
Via arXiv

GLAP: General contrastive audio-text pretraining across domains and languages

Jun 12, 2025
Via arXiv

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

May 22, 2025
Via arXiv

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Mar 17, 2025
Via arXiv

Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data

Feb 28, 2025
Via arXiv

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Feb 27, 2025
Via arXiv

The ICME 2025 Audio Encoder Capability Challenge

Jan 25, 2025
Via arXiv

UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method

Sep 08, 2024
Via arXiv

Personalized Federated Continual Learning via Multi-granularity Prompt

Jun 27, 2024
Via arXiv