Rohit Paturi

Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization

Jun 26, 2024

AG-LSEC: Audio Grounded Lexical Speaker Error Correction

Jun 25, 2024

SpeechVerse: A Large-scale Generalizable Audio Language Model

May 14, 2024

Generalized zero-shot audio-to-intent classification

Nov 04, 2023

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Nov 01, 2023

Speaker Diarization of Scripted Audiovisual Content

Aug 04, 2023

Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction

Jun 15, 2023

Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech

Dec 10, 2021