Arun Narayanan

Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping

Jun 04, 2024

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Feb 27, 2024

A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

Sep 14, 2022

Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments

May 17, 2022

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

May 06, 2022

Mask scalar prediction for improving robust automatic speech recognition

Apr 26, 2022

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Apr 18, 2022

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Apr 13, 2022

A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation

Nov 18, 2021

SNRi Target Training for Joint Speech Enhancement and Recognition

Nov 01, 2021