"speech": models, code, and papers

Data-augmented cross-lingual synthesis in a teacher-student framework

Mar 31, 2022
Marcel de Korte, Jaebok Kim, Aki Kunikoshi, Adaeze Adigwe, Esther Klabbers


Federated Acoustic Modeling For Automatic Speech Recognition

Feb 08, 2021
Xiaodong Cui, Songtao Lu, Brian Kingsbury


Compact Graph Architecture for Speech Emotion Recognition

Aug 06, 2020
A. Shirian, T. Guha


Toward Zero Oracle Word Error Rate on the Switchboard Benchmark

Jun 13, 2022
Arlo Faria, Adam Janin, Korbinian Riedhammer, Sidhi Adkoli


Noise-robust voice conversion with domain adversarial training

Jan 26, 2022
Hongqiang Du, Lei Xie, Haizhou Li


A Neural-Network Framework for the Design of Individualised Hearing-Loss Compensation

Jul 14, 2022
Fotios Drakopoulos, Sarah Verhulst


Multiclass ASMA vs Targeted PGD Attack in Image Segmentation

Aug 03, 2022
Johnson Vo, Jiabao Xie, Sahil Patel


Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT

May 15, 2022
Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu


A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images

Feb 16, 2021
Yongwan Lim, Asterios Toutios, Yannick Bliesener, Ye Tian, Sajan Goud Lingala, Colin Vaz, Tanner Sorensen, Miran Oh, Sarah Harper, Weiyi Chen, Yoonjeong Lee, Johannes Töger, Mairym Lloréns Montesserin, Caitlin Smith, Bianca Godinez, Louis Goldstein, Dani Byrd, Krishna S. Nayak, Shrikanth S. Narayanan


UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

Jan 19, 2021
Chengyi Wang, Yu Wu, Yao Qian, Kenichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang
