Alert button

"speech": models, code, and papers
Alert button

SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts

Add code
Bookmark button
Alert button
Jun 19, 2023
Haibin Wu, Kai-Wei Chang, Yuan-Kuei Wu, Hung-yi Lee

Figure 1 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Figure 2 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Figure 3 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Figure 4 for SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts
Viaarxiv icon

Generative Speech Recognition Error Correction with Large Language Models

Add code
Bookmark button
Alert button
Sep 27, 2023
Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke

Figure 1 for Generative Speech Recognition Error Correction with Large Language Models
Figure 2 for Generative Speech Recognition Error Correction with Large Language Models
Figure 3 for Generative Speech Recognition Error Correction with Large Language Models
Figure 4 for Generative Speech Recognition Error Correction with Large Language Models
Viaarxiv icon

MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning

Add code
Bookmark button
Alert button
Jun 22, 2023
Mohammad Reza Hasanabadi Majid Behdad Davood Gharavian

Viaarxiv icon

Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

Jul 05, 2023
Hongmin Cai, Xiaoke Huang, Zhengliang Liu, Wenxiong Liao, Haixing Dai, Zihao Wu, Dajiang Zhu, Hui Ren, Quanzheng Li, Tianming Liu, Xiang Li

Figure 1 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Figure 2 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Figure 3 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Figure 4 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Viaarxiv icon

Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders

May 25, 2023
Nina R Benway, Yashish M Siriwardena, Jonathan L Preston, Elaine Hitchcock, Tara McAllister, Carol Espy-Wilson

Figure 1 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Figure 2 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Figure 3 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Figure 4 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Viaarxiv icon

Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

Add code
Bookmark button
Alert button
Aug 03, 2023
Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu

Figure 1 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Figure 2 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Figure 3 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Figure 4 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Viaarxiv icon

Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks

Add code
Bookmark button
Alert button
Jul 31, 2023
João A. Leite, Carolina Scarton, Diego F. Silva

Figure 1 for Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks
Figure 2 for Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks
Figure 3 for Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks
Figure 4 for Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks
Viaarxiv icon

NarrativePlay: Interactive Narrative Understanding

Add code
Bookmark button
Alert button
Oct 02, 2023
Runcong Zhao, Wenjia Zhang, Jiazheng Li, Lixing Zhu, Yanran Li, Yulan He, Lin Gui

Figure 1 for NarrativePlay: Interactive Narrative Understanding
Figure 2 for NarrativePlay: Interactive Narrative Understanding
Figure 3 for NarrativePlay: Interactive Narrative Understanding
Figure 4 for NarrativePlay: Interactive Narrative Understanding
Viaarxiv icon

InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations

Add code
Bookmark button
Alert button
Oct 09, 2023
Nils Feldhus, Qianli Wang, Tatiana Anikina, Sahil Chopra, Cennet Oguz, Sebastian Möller

Figure 1 for InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Figure 2 for InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Figure 3 for InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Figure 4 for InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Viaarxiv icon

An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes

Add code
Bookmark button
Alert button
Jul 24, 2023
Enric Gusó, Joanna Luberadzka, Martí Baig, Umut Sayin Saraç, Xavier Serra

Figure 1 for An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes
Figure 2 for An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes
Figure 3 for An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes
Figure 4 for An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes
Viaarxiv icon