Picture for Aswin Sivaraman

Aswin Sivaraman

Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection

Add code
Jun 13, 2024
Figure 1 for Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Figure 2 for Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Figure 3 for Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Figure 4 for Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Viaarxiv icon

The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement

Add code
Nov 14, 2022
Figure 1 for The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Figure 2 for The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Figure 3 for The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Figure 4 for The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement
Viaarxiv icon

Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training

Add code
Oct 20, 2021
Figure 1 for Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Figure 2 for Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Viaarxiv icon

Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection

Add code
May 08, 2021
Figure 1 for Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Figure 2 for Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Figure 3 for Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection
Viaarxiv icon

Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification

Add code
Apr 05, 2021
Figure 1 for Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Figure 2 for Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Figure 3 for Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
Viaarxiv icon

Self-Supervised Learning for Personalized Speech Enhancement

Add code
Apr 05, 2021
Figure 1 for Self-Supervised Learning for Personalized Speech Enhancement
Figure 2 for Self-Supervised Learning for Personalized Speech Enhancement
Figure 3 for Self-Supervised Learning for Personalized Speech Enhancement
Figure 4 for Self-Supervised Learning for Personalized Speech Enhancement
Viaarxiv icon

Detecting Extraneous Content in Podcasts

Add code
Mar 03, 2021
Figure 1 for Detecting Extraneous Content in Podcasts
Figure 2 for Detecting Extraneous Content in Podcasts
Figure 3 for Detecting Extraneous Content in Podcasts
Figure 4 for Detecting Extraneous Content in Podcasts
Viaarxiv icon

Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement

Add code
Nov 06, 2020
Figure 1 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Figure 2 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Figure 3 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Figure 4 for Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Viaarxiv icon

Sparse Mixture of Local Experts for Efficient Speech Enhancement

Add code
May 16, 2020
Figure 1 for Sparse Mixture of Local Experts for Efficient Speech Enhancement
Figure 2 for Sparse Mixture of Local Experts for Efficient Speech Enhancement
Viaarxiv icon

Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances

Add code
Feb 03, 2019
Figure 1 for Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances
Figure 2 for Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances
Figure 3 for Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances
Figure 4 for Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances
Viaarxiv icon