Picture for Jun Du

Jun Du

M3SD: Multi-modal, Multi-scenario and Multi-language Speaker Diarization Dataset

Add code
Jun 17, 2025
Viaarxiv icon

Exploring Speaker Diarization with Mixture of Experts

Add code
Jun 17, 2025
Viaarxiv icon

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge

Add code
May 12, 2025
Viaarxiv icon

Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration

Add code
Apr 17, 2025
Viaarxiv icon

Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition

Add code
Feb 10, 2025
Viaarxiv icon

Latent Swap Joint Diffusion for Long-Form Audio Generation

Add code
Feb 07, 2025
Figure 1 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Figure 2 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Figure 3 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Figure 4 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Viaarxiv icon

PaMMA-Net: Plasmas magnetic measurement evolution based on data-driven incremental accumulative prediction

Add code
Jan 23, 2025
Viaarxiv icon

Skeleton and Font Generation Network for Zero-shot Chinese Character Generation

Add code
Jan 14, 2025
Viaarxiv icon

Learned Data Compression: Challenges and Opportunities for the Future

Add code
Dec 14, 2024
Figure 1 for Learned Data Compression: Challenges and Opportunities for the Future
Figure 2 for Learned Data Compression: Challenges and Opportunities for the Future
Figure 3 for Learned Data Compression: Challenges and Opportunities for the Future
Figure 4 for Learned Data Compression: Challenges and Opportunities for the Future
Viaarxiv icon

RFL: Simplifying Chemical Structure Recognition with Ring-Free Language

Add code
Dec 10, 2024
Figure 1 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Figure 2 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Figure 3 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Figure 4 for RFL: Simplifying Chemical Structure Recognition with Ring-Free Language
Viaarxiv icon