Alert button

"speech": models, code, and papers
Alert button

Audio-Visual Neural Syntax Acquisition

Add code
Bookmark button
Alert button
Oct 11, 2023
Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David Cox, David Harwath, Yang Zhang, Karen Livescu, James Glass

Figure 1 for Audio-Visual Neural Syntax Acquisition
Figure 2 for Audio-Visual Neural Syntax Acquisition
Figure 3 for Audio-Visual Neural Syntax Acquisition
Figure 4 for Audio-Visual Neural Syntax Acquisition
Viaarxiv icon

MyVoice: Arabic Speech Resource Collaboration Platform

Jul 23, 2023
Yousseif Elshahawy, Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

Figure 1 for MyVoice: Arabic Speech Resource Collaboration Platform
Figure 2 for MyVoice: Arabic Speech Resource Collaboration Platform
Figure 3 for MyVoice: Arabic Speech Resource Collaboration Platform
Figure 4 for MyVoice: Arabic Speech Resource Collaboration Platform
Viaarxiv icon

Causality Guided Disentanglement for Cross-Platform Hate Speech Detection

Aug 10, 2023
Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

Figure 1 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Figure 2 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Figure 3 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Figure 4 for Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Viaarxiv icon

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model

Aug 11, 2023
Fan Zhang, Naye Ji, Fuxing Gao, Siyuan Zhao, Zhaohan Wang, Shunman Li

Figure 1 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Figure 2 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Figure 3 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Figure 4 for Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Viaarxiv icon

Audio-visual video-to-speech synthesis with synthesized input audio

Add code
Bookmark button
Alert button
Jul 31, 2023
Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

Figure 1 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 2 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 3 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 4 for Audio-visual video-to-speech synthesis with synthesized input audio
Viaarxiv icon

SGGNet$^2$: Speech-Scene Graph Grounding Network for Speech-guided Navigation

Jul 14, 2023
Dohyun Kim, Yeseung Kim, Jaehwi Jang, Minjae Song, Woojin Choi, Daehyung Park

Figure 1 for SGGNet$^2$: Speech-Scene Graph Grounding Network for Speech-guided Navigation
Figure 2 for SGGNet$^2$: Speech-Scene Graph Grounding Network for Speech-guided Navigation
Figure 3 for SGGNet$^2$: Speech-Scene Graph Grounding Network for Speech-guided Navigation
Figure 4 for SGGNet$^2$: Speech-Scene Graph Grounding Network for Speech-guided Navigation
Viaarxiv icon

Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection

Aug 31, 2023
Fatma Elsafoury

Figure 1 for Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Viaarxiv icon

BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer

Add code
Bookmark button
Alert button
Nov 06, 2023
Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Ekramul Islam, Faisal Ahamed Khan, Labib Imam Chowdhury, MD. Motahar Mahtab, Nazifa Nuha Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Ruhul Amin, Nabeel Mohammed

Viaarxiv icon

Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model

Jul 29, 2023
S. Rijal, R. Neupane, S. P. Mainali, S. K. Regmi, S. Maharjan

Figure 1 for Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Figure 2 for Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Figure 3 for Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Figure 4 for Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
Viaarxiv icon

Indonesian Automatic Speech Recognition with XLSR-53

Aug 20, 2023
Panji Arisaputra, Amalia Zahra

Figure 1 for Indonesian Automatic Speech Recognition with XLSR-53
Figure 2 for Indonesian Automatic Speech Recognition with XLSR-53
Figure 3 for Indonesian Automatic Speech Recognition with XLSR-53
Figure 4 for Indonesian Automatic Speech Recognition with XLSR-53
Viaarxiv icon