Picture for Oleksii Hrinchuk

Oleksii Hrinchuk

Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation

Add code
Sep 09, 2024
Viaarxiv icon

BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5

Add code
Jun 28, 2024
Viaarxiv icon

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

Add code
Jun 28, 2024
Viaarxiv icon

SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation

Add code
Oct 13, 2023
Figure 1 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Figure 2 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Figure 3 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Figure 4 for SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Viaarxiv icon

Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

Add code
May 19, 2023
Figure 1 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Figure 2 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Figure 3 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Figure 4 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Viaarxiv icon

Leveraging Synthetic Targets for Machine Translation

Add code
May 07, 2023
Figure 1 for Leveraging Synthetic Targets for Machine Translation
Figure 2 for Leveraging Synthetic Targets for Machine Translation
Figure 3 for Leveraging Synthetic Targets for Machine Translation
Figure 4 for Leveraging Synthetic Targets for Machine Translation
Viaarxiv icon

Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation

Add code
Jun 02, 2022
Figure 1 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 2 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 3 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 4 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Viaarxiv icon

NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21

Add code
Nov 16, 2021
Figure 1 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Figure 2 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Figure 3 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Figure 4 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Viaarxiv icon

Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition

Add code
Apr 05, 2021
Figure 1 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Figure 2 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Figure 3 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Figure 4 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Viaarxiv icon

Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model

Add code
Oct 23, 2019
Figure 1 for Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model
Figure 2 for Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model
Figure 3 for Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model
Figure 4 for Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model
Viaarxiv icon