Alert button

"speech": models, code, and papers
Alert button

CopyPaste: An Augmentation Method for Speech Emotion Recognition

Add code
Bookmark button
Alert button
Oct 27, 2020
Raghavendra Pappagari, Jesús Villalba, Piotr Żelasko, Laureano Moro-Velazquez, Najim Dehak

Figure 1 for CopyPaste: An Augmentation Method for Speech Emotion Recognition
Figure 2 for CopyPaste: An Augmentation Method for Speech Emotion Recognition
Figure 3 for CopyPaste: An Augmentation Method for Speech Emotion Recognition
Figure 4 for CopyPaste: An Augmentation Method for Speech Emotion Recognition
Viaarxiv icon

Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation

Add code
Bookmark button
Alert button
Mar 08, 2021
Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu

Figure 1 for Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Figure 2 for Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Figure 3 for Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Figure 4 for Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Viaarxiv icon

Towards Learning Universal Audio Representations

Add code
Bookmark button
Alert button
Nov 23, 2021
Luyu Wang, Pauline Luc, Yan Wu, Adria Recasens, Lucas Smaira, Andrew Brock, Andrew Jaegle, Jean-Baptiste Alayrac, Sander Dieleman, Joao Carreira, Aaron van den Oord

Figure 1 for Towards Learning Universal Audio Representations
Figure 2 for Towards Learning Universal Audio Representations
Figure 3 for Towards Learning Universal Audio Representations
Figure 4 for Towards Learning Universal Audio Representations
Viaarxiv icon

Contextualized Translation of Automatically Segmented Speech

Add code
Bookmark button
Alert button
Aug 05, 2020
Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Mauro Cettolo, Marco Turchi

Figure 1 for Contextualized Translation of Automatically Segmented Speech
Figure 2 for Contextualized Translation of Automatically Segmented Speech
Figure 3 for Contextualized Translation of Automatically Segmented Speech
Figure 4 for Contextualized Translation of Automatically Segmented Speech
Viaarxiv icon

Unified and Multilingual Author Profiling for Detecting Haters

Add code
Bookmark button
Alert button
Sep 19, 2021
Ipek Baris Schlicht, Angel Felipe Magnossão de Paula

Figure 1 for Unified and Multilingual Author Profiling for Detecting Haters
Figure 2 for Unified and Multilingual Author Profiling for Detecting Haters
Figure 3 for Unified and Multilingual Author Profiling for Detecting Haters
Figure 4 for Unified and Multilingual Author Profiling for Detecting Haters
Viaarxiv icon

Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method

Add code
Bookmark button
Alert button
Jul 15, 2021
Candy Olivia Mawalim, Masashi Unoki

Figure 1 for Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method
Figure 2 for Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method
Figure 3 for Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method
Figure 4 for Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method
Viaarxiv icon

Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference

Oct 18, 2021
Atsuo Hiroe

Figure 1 for Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference
Figure 2 for Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference
Figure 3 for Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference
Figure 4 for Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference
Viaarxiv icon

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Oct 31, 2019
Dongwei Jiang, Xiaoning Lei, Wubo Li, Ne Luo, Yuxuan Hu, Wei Zou, Xiangang Li

Figure 1 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 2 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 3 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 4 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Viaarxiv icon

Video-Driven Speech Reconstruction using Generative Adversarial Networks

Jun 14, 2019
Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Video-Driven Speech Reconstruction using Generative Adversarial Networks
Figure 2 for Video-Driven Speech Reconstruction using Generative Adversarial Networks
Figure 3 for Video-Driven Speech Reconstruction using Generative Adversarial Networks
Figure 4 for Video-Driven Speech Reconstruction using Generative Adversarial Networks
Viaarxiv icon

Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck

Add code
Bookmark button
Alert button
Aug 19, 2019
Shuang Ma, Daniel McDuff, Yale Song

Figure 1 for Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Figure 2 for Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Figure 3 for Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Figure 4 for Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Viaarxiv icon