Alert button

"speech recognition": models, code, and papers
Alert button

Preliminary Study on SSCF-derived Polar Coordinate for ASR

Nov 30, 2022
Sotheara Leang, Eric Castelli, Dominique Vaufreydaz, Sethserey Sam

Figure 1 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Figure 2 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Figure 3 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Figure 4 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Viaarxiv icon

BUT Opensat 2019 Speech Recognition System

Jan 30, 2020
Martin Karafiát, Murali Karthick Baskar, Igor Szöke, Hari Krishna Vydana, Karel Veselý, Jan "Honza'' Černocký

Figure 1 for BUT Opensat 2019 Speech Recognition System
Figure 2 for BUT Opensat 2019 Speech Recognition System
Figure 3 for BUT Opensat 2019 Speech Recognition System
Figure 4 for BUT Opensat 2019 Speech Recognition System
Viaarxiv icon

Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition

Oct 11, 2021
Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu

Figure 1 for Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Figure 2 for Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Figure 3 for Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Figure 4 for Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition
Viaarxiv icon

Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model

Add code
Bookmark button
Alert button
Jan 06, 2022
Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu

Figure 1 for Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Figure 2 for Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Figure 3 for Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Figure 4 for Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Viaarxiv icon

Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation

Nov 29, 2022
Stefan Braun, Erik McDermott, Roger Hsiao

Figure 1 for Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation
Figure 2 for Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation
Figure 3 for Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation
Figure 4 for Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation
Viaarxiv icon

Evaluating and reducing the distance between synthetic and real speech distributions

Add code
Bookmark button
Alert button
Nov 29, 2022
Christoph Minixhofer, Ondřej Klejch, Peter Bell

Figure 1 for Evaluating and reducing the distance between synthetic and real speech distributions
Figure 2 for Evaluating and reducing the distance between synthetic and real speech distributions
Viaarxiv icon

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition

Nov 03, 2020
Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer

Figure 1 for Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition
Figure 2 for Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition
Figure 3 for Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition
Figure 4 for Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition
Viaarxiv icon

Learning to Rank Microphones for Distant Speech Recognition

Add code
Bookmark button
Alert button
Apr 06, 2021
Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini

Figure 1 for Learning to Rank Microphones for Distant Speech Recognition
Figure 2 for Learning to Rank Microphones for Distant Speech Recognition
Figure 3 for Learning to Rank Microphones for Distant Speech Recognition
Figure 4 for Learning to Rank Microphones for Distant Speech Recognition
Viaarxiv icon

Privacy-Preserving Speech Representation Learning using Vector Quantization

Mar 15, 2022
Pierre Champion, Denis Jouvet, Anthony Larcher

Figure 1 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Figure 2 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Figure 3 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Figure 4 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Viaarxiv icon

Adversarial Attacks and Defenses for Speech Recognition Systems

Add code
Bookmark button
Alert button
Mar 31, 2021
Piotr Żelasko, Sonal Joshi, Yiwen Shao, Jesus Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur

Figure 1 for Adversarial Attacks and Defenses for Speech Recognition Systems
Viaarxiv icon