Alert button

"speech": models, code, and papers
Alert button

AttentionStitch: How Attention Solves the Speech Editing Problem

Add code
Bookmark button
Alert button
Mar 05, 2024
Antonios Alexos, Pierre Baldi

Figure 1 for AttentionStitch: How Attention Solves the Speech Editing Problem
Viaarxiv icon

Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism

Add code
Bookmark button
Alert button
Mar 07, 2024
Xiaoyu Tang, Yixin Lin, Ting Dang, Yuanfang Zhang, Jintao Cheng

Figure 1 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Figure 2 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Figure 3 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Figure 4 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Viaarxiv icon

Chinese Offensive Language Detection:Current Status and Future Directions

Mar 29, 2024
Yunze Xiao, Houda Bouamor, Wajdi Zaghouani

Viaarxiv icon

NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps

Apr 02, 2024
Kristina Gligoric, Myra Cheng, Lucia Zheng, Esin Durmus, Dan Jurafsky

Viaarxiv icon

Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks

Add code
Bookmark button
Alert button
Mar 08, 2024
Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor

Figure 1 for Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
Figure 2 for Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
Figure 3 for Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
Figure 4 for Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
Viaarxiv icon

A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content Detection

Apr 02, 2024
Ivo Verhoeven, Pushkar Mishra, Rahel Beloch, Helen Yannakoudakis, Ekaterina Shutova

Viaarxiv icon

Voice EHR: Introducing Multimodal Audio Data for Health

Apr 02, 2024
James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Yen Minh Lam, Nguyen Thi Thu Hang, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song, Emily Ricotta, David Clifton, C. Louise Thwaites, Yael Bensoussan, Bradford Wood

Viaarxiv icon

EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech

Mar 13, 2024
Ziqi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu

Figure 1 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Figure 2 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Figure 3 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Figure 4 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Viaarxiv icon

Towards a Fully Interpretable and More Scalable RSA Model for Metaphor Understanding

Apr 03, 2024
Gaia Carenini, Luca Bischetti, Walter Schaeken, Valentina Bambini

Viaarxiv icon

MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages

Apr 02, 2024
Daryna Dementieva, Nikolay Babakov, Alexander Panchenko

Viaarxiv icon