"speech": models, code, and papers

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Apr 20, 2021
Yuzi Yan, Xu Tan, Bohan Li, Tao Qin, Sheng Zhao, Yuan Shen, Tie-Yan Liu

Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech

Aug 29, 2021
Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu

Distillation-Resistant Watermarking for Model Protection in NLP

Oct 07, 2022
Xuandong Zhao, Lei Li, Yu-Xiang Wang

K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables

Oct 11, 2021
Jounghee Kim, Pilsung Kang

The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022

Sep 23, 2022
Qutang Cai, Guoqiang Hong, Zhijian Ye, Ximin Li, Haizhou Li

Leveraging Pre-trained Language Model for Speech Sentiment Analysis

Jun 11, 2021
Suwon Shon, Pablo Brusco, Jing Pan, Kyu J. Han, Shinji Watanabe

It's a long way! Layer-wise Relevance Propagation for Echo State Networks applied to Earth System Variability

Oct 18, 2022
Marco Landt-Hayen, Peer Kröger, Martin Claus, Willi Rath

Attention-Based Keyword Localisation in Speech using Visual Grounding

Jun 23, 2021
Kayode Olaleye, Herman Kamper

Neural-FST Class Language Model for End-to-End Speech Recognition

Jan 31, 2022
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer

Dürfen Maschinen denken (können)? Warum Künstliche Intelligenz eine Ethik braucht. (Are Machines Allowed to (be able to) Think? Why Artificial Intelligence Needs Ethics)

Aug 15, 2022
Karsten Wendland
