Helen Meng

Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models

Jul 18, 2024
Via arXiv

Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder

Jul 15, 2024
Via arXiv

Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System

Jul 13, 2024
Via arXiv

Autoregressive Speech Synthesis without Vector Quantization

Jul 11, 2024
Via arXiv

Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation

Jul 08, 2024
Via arXiv

Purple-teaming LLMs with Adversarial Defender Training

Jul 01, 2024
Via arXiv

Seamless Language Expansion: Enhancing Multilingual Mastery in Self-Supervised Models

Jun 20, 2024
Via arXiv

Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers

Jun 16, 2024
Via arXiv

UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner

Jun 14, 2024
Via arXiv

Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition

Jun 14, 2024
Via arXiv