Boris Ginsburg

Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR

Sep 02, 2024

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks

Aug 23, 2024

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

Jul 29, 2024

Schrödinger Bridge for Generative Speech Enhancement

Jul 22, 2024

Romanization Encoding For Multilingual ASR

Jul 05, 2024

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations

Jul 03, 2024

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

Jun 28, 2024

BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5

Jun 28, 2024

DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

Jun 27, 2024

Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment

Jun 25, 2024