"speech": models, code, and papers

Bayesian Neural Network Language Modeling for Speech Recognition

Aug 28, 2022
Boyang Xue, Shoukang Hu, Junhao Xu, Mengzhe Geng, Xunying Liu, Helen Meng

Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints

Mar 03, 2023
Paul Magron, Tuomas Virtanen

How to Leverage DNN-based speech enhancement for multi-channel speaker verification?

Oct 17, 2022
Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf

Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition

Jun 01, 2022
Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura

Improving Speech Recognition on Noisy Speech via Speech Enhancement with Multi-Discriminators CycleGAN

Dec 12, 2021
Chia-Yu Li, Ngoc Thang Vu

Continuous descriptor-based control for deep audio synthesis

Feb 27, 2023
Ninon Devis, Nils Demerlé, Sarah Nabi, David Genova, Philippe Esling

Textless Speech-to-Speech Translation on Real Data

Dec 15, 2021
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Pino, Jiatao Gu, Wei-Ning Hsu

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems

Apr 11, 2022
Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas, Hong-Kwang J. Kuo, Brian Kingsbury

Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks

Nov 03, 2022
Zitha Sasindran, Harsha Yelchuri, Supreeth Rao, T. V. Prabhakar

ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement

Jun 27, 2022
Ishan Chatterjee, Maruchi Kim, Vivek Jayaram, Shyamnath Gollakota, Ira Kemelmacher-Shlizerman, Shwetak Patel, Steven M. Seitz
