Picture for Shikhar Bharadwaj

Shikhar Bharadwaj

The CMU-AIST submission for the ICME 2025 Audio Encoder Challenge

Add code
Jan 22, 2026
Viaarxiv icon

PRiSM: Benchmarking Phone Realization in Speech Models

Add code
Jan 20, 2026
Viaarxiv icon

OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder

Add code
Jul 18, 2025
Figure 1 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Figure 2 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Figure 3 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Figure 4 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Viaarxiv icon

Context-Driven Dynamic Pruning for Large Speech Foundation Models

Add code
May 24, 2025
Figure 1 for Context-Driven Dynamic Pruning for Large Speech Foundation Models
Figure 2 for Context-Driven Dynamic Pruning for Large Speech Foundation Models
Figure 3 for Context-Driven Dynamic Pruning for Large Speech Foundation Models
Figure 4 for Context-Driven Dynamic Pruning for Large Speech Foundation Models
Viaarxiv icon

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems

Add code
Mar 11, 2025
Figure 1 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 2 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 3 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 4 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Viaarxiv icon

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Add code
Feb 21, 2025
Figure 1 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Figure 2 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Figure 3 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Figure 4 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Viaarxiv icon

STAB: Speech Tokenizer Assessment Benchmark

Add code
Sep 04, 2024
Figure 1 for STAB: Speech Tokenizer Assessment Benchmark
Figure 2 for STAB: Speech Tokenizer Assessment Benchmark
Figure 3 for STAB: Speech Tokenizer Assessment Benchmark
Figure 4 for STAB: Speech Tokenizer Assessment Benchmark
Viaarxiv icon

IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

Add code
Apr 25, 2024
Viaarxiv icon

Multimodal Modeling For Spoken Language Identification

Add code
Sep 19, 2023
Figure 1 for Multimodal Modeling For Spoken Language Identification
Figure 2 for Multimodal Modeling For Spoken Language Identification
Figure 3 for Multimodal Modeling For Spoken Language Identification
Figure 4 for Multimodal Modeling For Spoken Language Identification
Viaarxiv icon

MASR: Metadata Aware Speech Representation

Add code
Jul 20, 2023
Viaarxiv icon