Picture for Guangzhi Sun

Guangzhi Sun

SOT Triggered Neural Clustering for Speaker Attributed ASR

Add code
Jul 02, 2024
Figure 1 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Figure 2 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Figure 3 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Figure 4 for SOT Triggered Neural Clustering for Speaker Attributed ASR
Viaarxiv icon

SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR

Add code
Jun 28, 2024
Viaarxiv icon

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

Add code
Jun 22, 2024
Viaarxiv icon

Can Large Language Models Understand Spatial Audio?

Add code
Jun 12, 2024
Viaarxiv icon

Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

Add code
Jun 01, 2024
Viaarxiv icon

Bayesian WeakS-to-Strong from Text Classification to Generation

Add code
May 24, 2024
Viaarxiv icon

CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models

Add code
May 22, 2024
Viaarxiv icon

Matching domain experts by training from scratch on domain knowledge

Add code
May 15, 2024
Figure 1 for Matching domain experts by training from scratch on domain knowledge
Figure 2 for Matching domain experts by training from scratch on domain knowledge
Figure 3 for Matching domain experts by training from scratch on domain knowledge
Figure 4 for Matching domain experts by training from scratch on domain knowledge
Viaarxiv icon

M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset

Add code
Mar 21, 2024
Figure 1 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Figure 2 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Figure 3 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Figure 4 for M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Viaarxiv icon

Large language models surpass human experts in predicting neuroscience results

Add code
Mar 14, 2024
Figure 1 for Large language models surpass human experts in predicting neuroscience results
Figure 2 for Large language models surpass human experts in predicting neuroscience results
Figure 3 for Large language models surpass human experts in predicting neuroscience results
Figure 4 for Large language models surpass human experts in predicting neuroscience results
Viaarxiv icon