Alert button

"speech": models, code, and papers
Alert button

Towards Energy-Efficient, Low-Latency and Accurate Spiking LSTMs

Oct 23, 2022
Gourav Datta, Haoqin Deng, Robert Aviles, Peter A. Beerel

Figure 1 for Towards Energy-Efficient, Low-Latency and Accurate Spiking LSTMs
Figure 2 for Towards Energy-Efficient, Low-Latency and Accurate Spiking LSTMs
Figure 3 for Towards Energy-Efficient, Low-Latency and Accurate Spiking LSTMs
Figure 4 for Towards Energy-Efficient, Low-Latency and Accurate Spiking LSTMs
Viaarxiv icon

Improving spatial cues for hearables using a parameterized binaural CDR estimator

Jul 17, 2022
Reza Ghanavi, Craig Jin

Figure 1 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Figure 2 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Figure 3 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Figure 4 for Improving spatial cues for hearables using a parameterized binaural CDR estimator
Viaarxiv icon

A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition

Aug 15, 2020
Sitong Zhou, Homayoon Beigi

Figure 1 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Figure 2 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Figure 3 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Figure 4 for A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition
Viaarxiv icon

Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies

Add code
Bookmark button
Alert button
Sep 20, 2022
Junkai Wu, Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis

Figure 1 for Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies
Figure 2 for Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies
Figure 3 for Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies
Figure 4 for Meta-Learning for Adaptive Filters with Higher-Order Frequency Dependencies
Viaarxiv icon

Generalizing in the Real World with Representation Learning

Add code
Bookmark button
Alert button
Oct 18, 2022
Tegan Maharaj

Figure 1 for Generalizing in the Real World with Representation Learning
Figure 2 for Generalizing in the Real World with Representation Learning
Figure 3 for Generalizing in the Real World with Representation Learning
Figure 4 for Generalizing in the Real World with Representation Learning
Viaarxiv icon

Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors

Feb 27, 2021
Manuel Sam Ribeiro, Joanne Cleland, Aciel Eshky, Korin Richmond, Steve Renals

Figure 1 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Figure 2 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Figure 3 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Figure 4 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Viaarxiv icon

Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings

Add code
Bookmark button
Alert button
Jul 29, 2021
Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent

Figure 1 for Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings
Figure 2 for Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings
Figure 3 for Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings
Figure 4 for Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings
Viaarxiv icon

A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music

Add code
Bookmark button
Alert button
Mar 04, 2021
Hanbin Bae, Jae-Sung Bae, Young-Sun Joo, Young-Ik Kim, Hoon-Young Cho

Figure 1 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 2 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 3 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 4 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Viaarxiv icon

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Add code
Bookmark button
Alert button
Sep 22, 2020
Yerbolat Khassanov, Saida Mussakhojayeva, Almas Mirzakhmetov, Alen Adiyev, Mukhamet Nurpeiissov, Huseyin Atakan Varol

Figure 1 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 2 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 3 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 4 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Viaarxiv icon

VocBench: A Neural Vocoder Benchmark for Speech Synthesis

Add code
Bookmark button
Alert button
Dec 06, 2021
Ehab A. AlBadawy, Andrew Gibiansky, Qing He, Jilong Wu, Ming-Ching Chang, Siwei Lyu

Figure 1 for VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Figure 2 for VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Figure 3 for VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Viaarxiv icon