Picture for Yong Xu

Yong Xu

HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models

Add code
Sep 30, 2024
Figure 1 for HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
Figure 2 for HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
Figure 3 for HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
Figure 4 for HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
Viaarxiv icon

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Add code
Sep 17, 2024
Viaarxiv icon

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Add code
Sep 01, 2024
Figure 1 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 2 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 3 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Figure 4 for LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
Viaarxiv icon

Advancing Multi-talker ASR Performance with Large Language Models

Add code
Aug 30, 2024
Figure 1 for Advancing Multi-talker ASR Performance with Large Language Models
Figure 2 for Advancing Multi-talker ASR Performance with Large Language Models
Figure 3 for Advancing Multi-talker ASR Performance with Large Language Models
Figure 4 for Advancing Multi-talker ASR Performance with Large Language Models
Viaarxiv icon

Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning

Add code
Aug 18, 2024
Figure 1 for Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning
Figure 2 for Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning
Figure 3 for Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning
Figure 4 for Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning
Viaarxiv icon

OpenCity: Open Spatio-Temporal Foundation Models for Traffic Prediction

Add code
Aug 16, 2024
Viaarxiv icon

Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning

Add code
Jul 13, 2024
Figure 1 for Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning
Figure 2 for Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning
Figure 3 for Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning
Figure 4 for Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Figure 1 for Text-Queried Target Sound Event Localization
Figure 2 for Text-Queried Target Sound Event Localization
Figure 3 for Text-Queried Target Sound Event Localization
Figure 4 for Text-Queried Target Sound Event Localization
Viaarxiv icon

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment

Add code
Jun 17, 2024
Figure 1 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Figure 2 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Figure 3 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Figure 4 for Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
Viaarxiv icon

FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction

Add code
May 28, 2024
Figure 1 for FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction
Figure 2 for FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction
Figure 3 for FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction
Figure 4 for FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction
Viaarxiv icon