"speech": models, code, and papers

USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering

Feb 01, 2024
Zhong-Qiu Wang

Via arXiv

Experimental Study: Enhancing Voice Spoofing Detection Models with wav2vec 2.0

Feb 27, 2024
Taein Kang, Soyul Han, Sunmook Choi, Jaejin Seo, Sanghyeok Chung, Seungeun Lee, Seungsang Oh, Il-Youp Kwak

Via arXiv

PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model

Feb 22, 2024
Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda

Via arXiv

A Case Study on Filtering for End-to-End Speech Translation

Feb 02, 2024
Md Mahfuz Ibn Alam, Antonios Anastasopoulos

Via arXiv

Sigma-lognormal modeling of speech

Jan 27, 2024
C. Carmona-Duarte, M. A. Ferrer, R. Plamondon, A. Gomez-Rodellar, P. Gomez-Vilda

Via arXiv

An Analysis of the Variance of Diffusion-based Speech Enhancement

Feb 01, 2024
Bunlong Lay, Timo Gerkmann

Via arXiv

Adversarial speech for voice privacy protection from Personalized Speech generation

Jan 22, 2024
Shihao Chen, Liping Chen, Jie Zhang, KongAik Lee, Zhenhua Ling, Lirong Dai

Via arXiv

Scaling Up Adaptive Filter Optimizers

Mar 01, 2024
Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis

Via arXiv

An Attention Long Short-Term Memory based system for automatic classification of speech intelligibility

Feb 05, 2024
Miguel Fernández-Díaz, Ascensión Gallardo-Antolín

Via arXiv

Probing Critical Learning Dynamics of PLMs for Hate Speech Detection

Feb 03, 2024
Sarah Masud, Mohammad Aflah Khan, Vikram Goyal, Md Shad Akhtar, Tanmoy Chakraborty

Via arXiv