Picture for Sriram Ganapathy

Sriram Ganapathy

STAB: Speech Tokenizer Assessment Benchmark

Add code
Sep 04, 2024
Figure 1 for STAB: Speech Tokenizer Assessment Benchmark
Figure 2 for STAB: Speech Tokenizer Assessment Benchmark
Figure 3 for STAB: Speech Tokenizer Assessment Benchmark
Figure 4 for STAB: Speech Tokenizer Assessment Benchmark
Viaarxiv icon

Improving Self-supervised Pre-training using Accent-Specific Codebooks

Add code
Jul 04, 2024
Figure 1 for Improving Self-supervised Pre-training using Accent-Specific Codebooks
Figure 2 for Improving Self-supervised Pre-training using Accent-Specific Codebooks
Figure 3 for Improving Self-supervised Pre-training using Accent-Specific Codebooks
Figure 4 for Improving Self-supervised Pre-training using Accent-Specific Codebooks
Viaarxiv icon

Towards the Next Frontier in Speech Representation Learning Using Disentanglement

Add code
Jul 02, 2024
Figure 1 for Towards the Next Frontier in Speech Representation Learning Using Disentanglement
Figure 2 for Towards the Next Frontier in Speech Representation Learning Using Disentanglement
Figure 3 for Towards the Next Frontier in Speech Representation Learning Using Disentanglement
Figure 4 for Towards the Next Frontier in Speech Representation Learning Using Disentanglement
Viaarxiv icon

The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments

Add code
Jun 13, 2024
Figure 1 for The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments
Figure 2 for The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments
Figure 3 for The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments
Figure 4 for The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments
Viaarxiv icon

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

Add code
Jan 23, 2024
Figure 1 for Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization
Figure 2 for Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization
Figure 3 for Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization
Figure 4 for Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization
Viaarxiv icon

Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement

Add code
Jan 09, 2024
Figure 1 for Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Figure 2 for Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Figure 3 for Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Figure 4 for Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Viaarxiv icon

LLM Augmented LLMs: Expanding Capabilities through Composition

Add code
Jan 04, 2024
Figure 1 for LLM Augmented LLMs: Expanding Capabilities through Composition
Figure 2 for LLM Augmented LLMs: Expanding Capabilities through Composition
Figure 3 for LLM Augmented LLMs: Expanding Capabilities through Composition
Figure 4 for LLM Augmented LLMs: Expanding Capabilities through Composition
Viaarxiv icon

Summary of the DISPLACE Challenge 2023 -- DIarization of SPeaker and LAnguage in Conversational Environments

Add code
Nov 23, 2023
Viaarxiv icon

Self-Influence Guided Data Reweighting for Language Model Pre-training

Add code
Nov 02, 2023
Figure 1 for Self-Influence Guided Data Reweighting for Language Model Pre-training
Figure 2 for Self-Influence Guided Data Reweighting for Language Model Pre-training
Figure 3 for Self-Influence Guided Data Reweighting for Language Model Pre-training
Figure 4 for Self-Influence Guided Data Reweighting for Language Model Pre-training
Viaarxiv icon

Accented Speech Recognition With Accent-specific Codebooks

Add code
Oct 27, 2023
Figure 1 for Accented Speech Recognition With Accent-specific Codebooks
Figure 2 for Accented Speech Recognition With Accent-specific Codebooks
Figure 3 for Accented Speech Recognition With Accent-specific Codebooks
Figure 4 for Accented Speech Recognition With Accent-specific Codebooks
Viaarxiv icon