Picture for Sundararajan Srinivasan

Sundararajan Srinivasan

Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization

Add code
Jun 26, 2024
Figure 1 for Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
Figure 2 for Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
Figure 3 for Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
Figure 4 for Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
Viaarxiv icon

AG-LSEC: Audio Grounded Lexical Speaker Error Correction

Add code
Jun 25, 2024
Viaarxiv icon

SpeechVerse: A Large-scale Generalizable Audio Language Model

Add code
May 14, 2024
Figure 1 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 2 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 3 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 4 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Viaarxiv icon

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Add code
May 14, 2024
Viaarxiv icon

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Add code
Nov 01, 2023
Viaarxiv icon

Speaker Diarization of Scripted Audiovisual Content

Add code
Aug 04, 2023
Viaarxiv icon

Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction

Add code
Jun 15, 2023
Figure 1 for Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
Figure 2 for Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
Figure 3 for Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
Figure 4 for Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
Viaarxiv icon

Device Directedness with Contextual Cues for Spoken Dialog Systems

Add code
Nov 23, 2022
Figure 1 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Figure 2 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Figure 3 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Figure 4 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Viaarxiv icon

Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech

Add code
Dec 10, 2021
Figure 1 for Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Figure 2 for Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Figure 3 for Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Figure 4 for Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Viaarxiv icon

Representation learning through cross-modal conditional teacher-student training for speech emotion recognition

Add code
Nov 30, 2021
Figure 1 for Representation learning through cross-modal conditional teacher-student training for speech emotion recognition
Figure 2 for Representation learning through cross-modal conditional teacher-student training for speech emotion recognition
Figure 3 for Representation learning through cross-modal conditional teacher-student training for speech emotion recognition
Viaarxiv icon