Alert button

"speech": models, code, and papers
Alert button

Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese

Mar 24, 2021
Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Figure 1 for Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese
Figure 2 for Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese
Figure 3 for Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese
Figure 4 for Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese
Viaarxiv icon

Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis

Add code
Bookmark button
Alert button
Dec 30, 2020
Jose A. Gonzalez-Lopez, Miriam Gonzalez-Atienza, Alejandro Gomez-Alanis, Jose L. Perez-Cordoba, Phil D. Green

Figure 1 for Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis
Figure 2 for Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis
Figure 3 for Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis
Viaarxiv icon

Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion

Add code
Bookmark button
Alert button
Apr 12, 2022
Weida Liang, Lantian Li, Wenqiang Du, Dong Wang

Figure 1 for Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion
Figure 2 for Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion
Figure 3 for Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion
Figure 4 for Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion
Viaarxiv icon

Improving speaker de-identification with functional data analysis of f0 trajectories

Add code
Bookmark button
Alert button
Mar 31, 2022
Lauri Tavi, Tomi Kinnunen, Rosa González Hautamäki

Figure 1 for Improving speaker de-identification with functional data analysis of f0 trajectories
Figure 2 for Improving speaker de-identification with functional data analysis of f0 trajectories
Figure 3 for Improving speaker de-identification with functional data analysis of f0 trajectories
Figure 4 for Improving speaker de-identification with functional data analysis of f0 trajectories
Viaarxiv icon

TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain

Mar 18, 2021
Kai Wang, Bengbeng He, Wei-Ping Zhu

Figure 1 for TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Figure 2 for TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Figure 3 for TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Figure 4 for TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Viaarxiv icon

The Multilingual TEDx Corpus for Speech Recognition and Translation

Add code
Bookmark button
Alert button
Feb 02, 2021
Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post

Figure 1 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Figure 2 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Figure 3 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Figure 4 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Viaarxiv icon

Almost Unsupervised Text to Speech and Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 13, 2019
Yi Ren, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

Figure 1 for Almost Unsupervised Text to Speech and Automatic Speech Recognition
Figure 2 for Almost Unsupervised Text to Speech and Automatic Speech Recognition
Figure 3 for Almost Unsupervised Text to Speech and Automatic Speech Recognition
Figure 4 for Almost Unsupervised Text to Speech and Automatic Speech Recognition
Viaarxiv icon

A New 27 Class Sign Language Dataset Collected from 173 Individuals

Mar 08, 2022
Arda Mavi, Zeynep Dikle

Figure 1 for A New 27 Class Sign Language Dataset Collected from 173 Individuals
Viaarxiv icon

Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU

May 06, 2021
Dengfeng Ke, Jinsong Zhang, Yanlu Xie, Yanyan Xu, Binghuai Lin

Figure 1 for Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU
Figure 2 for Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU
Figure 3 for Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU
Figure 4 for Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU
Viaarxiv icon

Disentangled Speaker Representation Learning via Mutual Information Minimization

Add code
Bookmark button
Alert button
Aug 17, 2022
Sung Hwan Mun, Min Hyun Han, Minchan Kim, Dongjune Lee, Nam Soo Kim

Figure 1 for Disentangled Speaker Representation Learning via Mutual Information Minimization
Figure 2 for Disentangled Speaker Representation Learning via Mutual Information Minimization
Figure 3 for Disentangled Speaker Representation Learning via Mutual Information Minimization
Figure 4 for Disentangled Speaker Representation Learning via Mutual Information Minimization
Viaarxiv icon