Alert button

"speech": models, code, and papers
Alert button

Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech

Mar 03, 2021
Joerg Schmalenstroeer, Jens Heitkaemper, Joerg Ullmann, Reinhold Haeb-Umbach

Figure 1 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Figure 2 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Figure 3 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Figure 4 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Viaarxiv icon

Mixtures of Deep Neural Experts for Automated Speech Scoring

Jun 23, 2021
Sara Papi, Edmondo Trentin, Roberto Gretter, Marco Matassoni, Daniele Falavigna

Figure 1 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Figure 2 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Figure 3 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Viaarxiv icon

Large-vocabulary Audio-visual Speech Recognition in Noisy Environments

Sep 10, 2021
Wentao Yu, Steffen Zeiler, Dorothea Kolossa

Figure 1 for Large-vocabulary Audio-visual Speech Recognition in Noisy Environments
Figure 2 for Large-vocabulary Audio-visual Speech Recognition in Noisy Environments
Figure 3 for Large-vocabulary Audio-visual Speech Recognition in Noisy Environments
Figure 4 for Large-vocabulary Audio-visual Speech Recognition in Noisy Environments
Viaarxiv icon

Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition

May 14, 2021
Bhargav Pulugundla, Yang Gao, Brian King, Gokce Keskin, Harish Mallidi, Minhua Wu, Jasha Droppo, Roland Maas

Figure 1 for Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition
Figure 2 for Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition
Figure 3 for Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition
Figure 4 for Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition
Viaarxiv icon

Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning

May 27, 2022
Xiliang Zhu, Shayna Gardiner, David Rossouw, Tere Roldán, Simon Corston-Oliver

Figure 1 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Figure 2 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Figure 3 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Figure 4 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Viaarxiv icon

The MSXF TTS System for ICASSP 2022 ADD Challenge

Jan 27, 2022
Chunyong Yang, Pengfei Liu, Yanli Chen, Hongbin Wang, Min Liu

Figure 1 for The MSXF TTS System for ICASSP 2022 ADD Challenge
Viaarxiv icon

Continuous Speech Recognition using EEG and Video

Dec 24, 2019
Gautam Krishna, Mason Carnahan, Co Tran, Ahmed H Tewfik

Figure 1 for Continuous Speech Recognition using EEG and Video
Figure 2 for Continuous Speech Recognition using EEG and Video
Figure 3 for Continuous Speech Recognition using EEG and Video
Figure 4 for Continuous Speech Recognition using EEG and Video
Viaarxiv icon

Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?

Oct 29, 2019
Bhavya Ghai, Buvana Ramanan, Klaus Mueller

Figure 1 for Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
Figure 2 for Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
Viaarxiv icon

Recent improvements of ASR models in the face of adversarial attacks

Add code
Bookmark button
Alert button
Apr 04, 2022
Raphael Olivier, Bhiksha Raj

Figure 1 for Recent improvements of ASR models in the face of adversarial attacks
Figure 2 for Recent improvements of ASR models in the face of adversarial attacks
Figure 3 for Recent improvements of ASR models in the face of adversarial attacks
Figure 4 for Recent improvements of ASR models in the face of adversarial attacks
Viaarxiv icon

Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries

May 20, 2021
Sukhdeep S. Sodhi, Ellie Ka-In Chio, Ambarish Jash, Santiago Ontañón, Ajit Apte, Ankit Kumar, Ayooluwakunmi Jeje, Dima Kuzmin, Harry Fung, Heng-Tze Cheng, Jon Effrat, Tarush Bali, Nitin Jindal, Pei Cao, Sarvjeet Singh, Senqiang Zhou, Tameen Khan, Amol Wankhede, Moustafa Alzantot, Allen Wu, Tushar Chandra

Figure 1 for Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Figure 2 for Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Figure 3 for Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Figure 4 for Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Viaarxiv icon