Sriram Ganapathy

Spoken Language Understanding on Unseen Tasks With In-Context Learning

May 12, 2025
Via arXiv

LLM supervised Pre-training for Multimodal Emotion Recognition in Conversations

Jan 20, 2025

Uncovering the role of semantic and acoustic cues in normal and dichotic listening

Nov 18, 2024

Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Sep 17, 2024

Leveraging Content and Acoustic Representations for Efficient Speech Emotion Recognition

Sep 09, 2024

STAB: Speech Tokenizer Assessment Benchmark

Sep 04, 2024

Improving Self-supervised Pre-training using Accent-Specific Codebooks

Jul 04, 2024

Towards the Next Frontier in Speech Representation Learning Using Disentanglement

Jul 02, 2024

The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments

Jun 13, 2024

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

Jan 23, 2024