Alert button

"speech": models, code, and papers
Alert button

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting

Add code
Bookmark button
Alert button
Jun 30, 2022
Hyeon-Kyeong Shin, Hyewon Han, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang

Figure 1 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Figure 2 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Figure 3 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Figure 4 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Viaarxiv icon

Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation

Add code
Bookmark button
Alert button
Nov 02, 2020
Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier

Figure 1 for Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Figure 2 for Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Figure 3 for Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Figure 4 for Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation
Viaarxiv icon

Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability

Add code
Bookmark button
Alert button
Apr 03, 2021
Rui Liu, Berrak Sisman, Haizhou Li

Figure 1 for Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
Figure 2 for Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
Figure 3 for Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
Figure 4 for Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
Viaarxiv icon

AudioVisual Speech Synthesis: A brief literature review

Add code
Bookmark button
Alert button
Feb 18, 2021
Efthymios Georgiou, Athanasios Katsamanis

Viaarxiv icon

Learning with Limited Samples -- Meta-Learning and Applications to Communication Systems

Oct 03, 2022
Lisha Chen, Sharu Theresa Jose, Ivana Nikoloska, Sangwoo Park, Tianyi Chen, Osvaldo Simeone

Figure 1 for Learning with Limited Samples -- Meta-Learning and Applications to Communication Systems
Figure 2 for Learning with Limited Samples -- Meta-Learning and Applications to Communication Systems
Figure 3 for Learning with Limited Samples -- Meta-Learning and Applications to Communication Systems
Figure 4 for Learning with Limited Samples -- Meta-Learning and Applications to Communication Systems
Viaarxiv icon

A Database for Research on Detection and Enhancement of Speech Transmitted over HF links

Add code
Bookmark button
Alert button
Jun 16, 2021
Jens Heitkaemper, Joerg Schmalenstroeer, Joerg Ullmann, Valentin Ion, Reinhold Haeb-Umbach

Figure 1 for A Database for Research on Detection and Enhancement of Speech Transmitted over HF links
Figure 2 for A Database for Research on Detection and Enhancement of Speech Transmitted over HF links
Figure 3 for A Database for Research on Detection and Enhancement of Speech Transmitted over HF links
Figure 4 for A Database for Research on Detection and Enhancement of Speech Transmitted over HF links
Viaarxiv icon

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task

Add code
Bookmark button
Alert button
Jul 12, 2021
Yun Tang, Juan Pino, Xian Li, Changhan Wang, Dmitriy Genzel

Figure 1 for Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Figure 2 for Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Figure 3 for Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Figure 4 for Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task
Viaarxiv icon

Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition

Add code
Bookmark button
Alert button
Jul 01, 2021
Qiujia Li, Chao Zhang, Philip C. Woodland

Figure 1 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Figure 2 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Figure 3 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Figure 4 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Viaarxiv icon

Adversarial and Safely Scaled Question Generation

Oct 17, 2022
Sreehari Sankar, Zhihang Dong

Figure 1 for Adversarial and Safely Scaled Question Generation
Figure 2 for Adversarial and Safely Scaled Question Generation
Figure 3 for Adversarial and Safely Scaled Question Generation
Figure 4 for Adversarial and Safely Scaled Question Generation
Viaarxiv icon

Continuous Pseudo-Labeling from the Start

Add code
Bookmark button
Alert button
Oct 17, 2022
Dan Berrebbi, Ronan Collobert, Samy Bengio, Navdeep Jaitly, Tatiana Likhomanenko

Figure 1 for Continuous Pseudo-Labeling from the Start
Figure 2 for Continuous Pseudo-Labeling from the Start
Figure 3 for Continuous Pseudo-Labeling from the Start
Figure 4 for Continuous Pseudo-Labeling from the Start
Viaarxiv icon