Alert button

"speech": models, code, and papers
Alert button

MOSPC: MOS Prediction Based on Pairwise Comparison

Jun 18, 2023
Kexin Wang, Yunlong Zhao, Qianqian Dong, Tom Ko, Mingxuan Wang

Figure 1 for MOSPC: MOS Prediction Based on Pairwise Comparison
Figure 2 for MOSPC: MOS Prediction Based on Pairwise Comparison
Figure 3 for MOSPC: MOS Prediction Based on Pairwise Comparison
Figure 4 for MOSPC: MOS Prediction Based on Pairwise Comparison
Viaarxiv icon

Leveraging Large Text Corpora for End-to-End Speech Summarization

Add code
Bookmark button
Alert button
Mar 02, 2023
Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Tomohiro Tanaka, Atsunori Ogawa, Marc Delcroix, Ryo Masumura

Figure 1 for Leveraging Large Text Corpora for End-to-End Speech Summarization
Figure 2 for Leveraging Large Text Corpora for End-to-End Speech Summarization
Figure 3 for Leveraging Large Text Corpora for End-to-End Speech Summarization
Figure 4 for Leveraging Large Text Corpora for End-to-End Speech Summarization
Viaarxiv icon

SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks

Add code
Bookmark button
Alert button
Mar 01, 2023
Kai-Wei Chang, Yu-Kai Wang, Hua Shen, Iu-thing Kang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee

Figure 1 for SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks
Figure 2 for SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks
Figure 3 for SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks
Figure 4 for SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks
Viaarxiv icon

Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition

Add code
Bookmark button
Alert button
Feb 18, 2023
Xie Chen, Ziyang Ma, Changli Tang, Yujin Wang, Zhisheng Zheng

Figure 1 for Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition
Figure 2 for Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition
Figure 3 for Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition
Figure 4 for Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition
Viaarxiv icon

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Feb 27, 2023
Yoohwan Kwon, Soo-Whan Chung

Figure 1 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Figure 2 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Figure 3 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Figure 4 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Viaarxiv icon

Identity Construction in a Misogynist Incels Forum

Add code
Bookmark button
Alert button
Jun 30, 2023
Michael Miller Yoder, Chloe Perry, David West Brown, Kathleen M. Carley, Meredith Pruden

Figure 1 for Identity Construction in a Misogynist Incels Forum
Figure 2 for Identity Construction in a Misogynist Incels Forum
Figure 3 for Identity Construction in a Misogynist Incels Forum
Figure 4 for Identity Construction in a Misogynist Incels Forum
Viaarxiv icon

Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network

Mar 13, 2023
Cong Han, Nima Mesgarani

Figure 1 for Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network
Figure 2 for Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network
Figure 3 for Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network
Figure 4 for Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network
Viaarxiv icon

Towards spoken dialect identification of Irish

Add code
Bookmark button
Alert button
Jul 14, 2023
Liam Lonergan, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide

Figure 1 for Towards spoken dialect identification of Irish
Figure 2 for Towards spoken dialect identification of Irish
Figure 3 for Towards spoken dialect identification of Irish
Figure 4 for Towards spoken dialect identification of Irish
Viaarxiv icon

Developmental Bootstrapping of AIs

Aug 11, 2023
Mark Stefik, Robert Price

Figure 1 for Developmental Bootstrapping of AIs
Figure 2 for Developmental Bootstrapping of AIs
Figure 3 for Developmental Bootstrapping of AIs
Figure 4 for Developmental Bootstrapping of AIs
Viaarxiv icon

UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models

Add code
Bookmark button
Alert button
Jul 29, 2023
Sen Fang, Bowen Gao, Yangjian Wu, Jingwen Cai, Teik Toe Teoh

Figure 1 for UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Figure 2 for UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Figure 3 for UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Figure 4 for UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Viaarxiv icon