Alert button

"speech": models, code, and papers
Alert button

Self-Attention Generative Adversarial Network for Speech Enhancement

Add code
Bookmark button
Alert button
Oct 18, 2020
Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins

Figure 1 for Self-Attention Generative Adversarial Network for Speech Enhancement
Figure 2 for Self-Attention Generative Adversarial Network for Speech Enhancement
Figure 3 for Self-Attention Generative Adversarial Network for Speech Enhancement
Figure 4 for Self-Attention Generative Adversarial Network for Speech Enhancement
Viaarxiv icon

Learning Audio Representations with MLPs

Mar 16, 2022
Mashrur M. Morshed, Ahmad Omar Ahsan, Hasan Mahmud, Md. Kamrul Hasan

Figure 1 for Learning Audio Representations with MLPs
Figure 2 for Learning Audio Representations with MLPs
Figure 3 for Learning Audio Representations with MLPs
Figure 4 for Learning Audio Representations with MLPs
Viaarxiv icon

Curriculum Pre-training for End-to-End Speech Translation

Add code
Bookmark button
Alert button
Apr 21, 2020
Chengyi Wang, Yu Wu, Shujie Liu, Ming Zhou, Zhenglu Yang

Figure 1 for Curriculum Pre-training for End-to-End Speech Translation
Figure 2 for Curriculum Pre-training for End-to-End Speech Translation
Figure 3 for Curriculum Pre-training for End-to-End Speech Translation
Figure 4 for Curriculum Pre-training for End-to-End Speech Translation
Viaarxiv icon

Learning to Mediate Disparities Towards Pragmatic Communication

Add code
Bookmark button
Alert button
Mar 25, 2022
Yuwei Bao, Sayan Ghosh, Joyce Chai

Figure 1 for Learning to Mediate Disparities Towards Pragmatic Communication
Figure 2 for Learning to Mediate Disparities Towards Pragmatic Communication
Figure 3 for Learning to Mediate Disparities Towards Pragmatic Communication
Figure 4 for Learning to Mediate Disparities Towards Pragmatic Communication
Viaarxiv icon

Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model

Jul 26, 2021
Quandong Wang, Junnan Wu, Zhao Yan, Sichong Qian, Liyong Guo, Lichun Fan, Weiji Zhuang, Peng Gao, Yujun Wang

Figure 1 for Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Figure 2 for Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Figure 3 for Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Figure 4 for Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Viaarxiv icon

The USYD-JD Speech Translation System for IWSLT 2021

Add code
Bookmark button
Alert button
Jul 24, 2021
Liang Ding, Di Wu, Dacheng Tao

Figure 1 for The USYD-JD Speech Translation System for IWSLT 2021
Figure 2 for The USYD-JD Speech Translation System for IWSLT 2021
Figure 3 for The USYD-JD Speech Translation System for IWSLT 2021
Figure 4 for The USYD-JD Speech Translation System for IWSLT 2021
Viaarxiv icon

HaGRID - HAnd Gesture Recognition Image Dataset

Add code
Bookmark button
Alert button
Jun 16, 2022
Alexander Kapitanov, Andrew Makhlyarchuk, Karina Kvanchiani

Figure 1 for HaGRID - HAnd Gesture Recognition Image Dataset
Figure 2 for HaGRID - HAnd Gesture Recognition Image Dataset
Figure 3 for HaGRID - HAnd Gesture Recognition Image Dataset
Figure 4 for HaGRID - HAnd Gesture Recognition Image Dataset
Viaarxiv icon

Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS

Add code
Bookmark button
Alert button
Jun 18, 2021
Xiaochun An, Frank K. Soong, Lei Xie

Figure 1 for Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS
Figure 2 for Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS
Figure 3 for Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS
Figure 4 for Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS
Viaarxiv icon

Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder

Nov 06, 2020
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe

Figure 1 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Figure 2 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Figure 3 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Figure 4 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Viaarxiv icon

Spoken Speech Enhancement using EEG

Oct 29, 2019
Gautam Krishna, Yan Han, Co Tran, Mason Carnahan, Ahmed H Tewfik

Figure 1 for Spoken Speech Enhancement using EEG
Figure 2 for Spoken Speech Enhancement using EEG
Figure 3 for Spoken Speech Enhancement using EEG
Figure 4 for Spoken Speech Enhancement using EEG
Viaarxiv icon