"speech": models, code, and papers

Automatic Speech Recognition of Low-Resource Languages Based on Chukchi

Oct 11, 2022
Anastasia Safonova, Tatiana Yudina, Emil Nadimanov, Cydnie Davenport

Visually Grounded Keyword Detection and Localisation for Low-Resource Languages

Feb 01, 2023
Kayode Kolawole Olaleye

Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition

May 13, 2022
Zengrui Jin, Mengzhe Geng, Jiajun Deng, Tianzi Wang, Shujie Hu, Guinan Li, Xunying Liu

An Overview on Language Models: Recent Developments and Outlook

Mar 10, 2023
Chengwei Wei, Yun-Cheng Wang, Bin Wang, C.-C. Jay Kuo

Automatic Generation of Multiple-Choice Questions

Mar 25, 2023
Cheng Zhang

Multi-channel target speech enhancement based on ERB-scaled spatial coherence features

Jul 17, 2022
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition

Jul 13, 2022
Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro

Chain-based Discriminative Autoencoders for Speech Recognition

Mar 28, 2022
Hung-Shin Lee, Pin-Tuan Huang, Yao-Fei Cheng, Hsin-Min Wang

Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition

Mar 07, 2023
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak

Improving Self-Supervised Learning for Audio Representations by Feature Diversity and Decorrelation

Mar 07, 2023
Bac Nguyen, Stefan Uhlich, Fabien Cardinaux
