Picture for Junbo Zhang

Junbo Zhang

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Add code
Mar 17, 2025
Viaarxiv icon

Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data

Add code
Feb 28, 2025
Viaarxiv icon

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Add code
Feb 27, 2025
Viaarxiv icon

The ICME 2025 Audio Encoder Capability Challenge

Add code
Jan 25, 2025
Viaarxiv icon

UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method

Add code
Sep 08, 2024
Figure 1 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Figure 2 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Figure 3 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Figure 4 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Viaarxiv icon

Personalized Federated Continual Learning via Multi-granularity Prompt

Add code
Jun 27, 2024
Figure 1 for Personalized Federated Continual Learning via Multi-granularity Prompt
Figure 2 for Personalized Federated Continual Learning via Multi-granularity Prompt
Figure 3 for Personalized Federated Continual Learning via Multi-granularity Prompt
Figure 4 for Personalized Federated Continual Learning via Multi-granularity Prompt
Viaarxiv icon

Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding

Add code
Jun 19, 2024
Viaarxiv icon

Bridging Language Gaps in Audio-Text Retrieval

Add code
Jun 11, 2024
Figure 1 for Bridging Language Gaps in Audio-Text Retrieval
Figure 2 for Bridging Language Gaps in Audio-Text Retrieval
Figure 3 for Bridging Language Gaps in Audio-Text Retrieval
Figure 4 for Bridging Language Gaps in Audio-Text Retrieval
Viaarxiv icon

Scaling up masked audio encoder learning for general audio classification

Add code
Jun 11, 2024
Figure 1 for Scaling up masked audio encoder learning for general audio classification
Figure 2 for Scaling up masked audio encoder learning for general audio classification
Figure 3 for Scaling up masked audio encoder learning for general audio classification
Figure 4 for Scaling up masked audio encoder learning for general audio classification
Viaarxiv icon

Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

Add code
Mar 12, 2024
Figure 1 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Figure 2 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Figure 3 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Figure 4 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Viaarxiv icon