"speech recognition": models, code, and papers

Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

Dec 21, 2020
Shoma Ishida


Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Oct 31, 2019
Dongwei Jiang, Xiaoning Lei, Wubo Li, Ne Luo, Yuxuan Hu, Wei Zou, Xiangang Li


An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

Mar 16, 2021
Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur


English Broadcast News Speech Recognition by Humans and Machines

Apr 30, 2019
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltan Tuske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko


Improving Speech-to-Speech Translation Through Unlabeled Text

Oct 26, 2022
Xuan-Phi Nguyen, Sravya Popuri, Changhan Wang, Yun Tang, Ilia Kulikov, Hongyu Gong


Privacy-Preserving Speech Representation Learning using Vector Quantization

Mar 15, 2022
Pierre Champion, Denis Jouvet, Anthony Larcher


Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

Dec 14, 2021
Keqi Deng, Songjun Cao, Yike Zhang, Long Ma


Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation

Oct 27, 2022
Marvin Lavechin, Marianne Métais, Hadrien Titeux, Alodie Boissonnet, Jade Copet, Morgane Rivière, Elika Bergelson, Alejandrina Cristia, Emmanuel Dupoux, Hervé Bredin


Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Jun 16, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori


End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English

Oct 26, 2022
Abhinav Goyal, Anupam Singh, Nikesh Garera
