"speech": models, code, and papers

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty

Feb 27, 2023
Rongzhi Gu, Shi-Xiong Zhang, Dong Yu

Via arXiv

New Challenges for Content Privacy in Speech and Audio

Jan 21, 2023
Jennifer Williams, Karla Pizzi, Shuvayanti Das, Paul-Gauthier Noe

Via arXiv

A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation

Apr 02, 2023
Bo-Kyeong Kim, Jaemin Kang, Daeun Seo, Hancheol Park, Shinkook Choi, Hyungshin Kim, Sungsu Lim

Via arXiv

Cost-effective Models for Detecting Depression from Speech

Feb 18, 2023
Mashrura Tasnim, Jekaterina Novikova

Via arXiv

Detecting post-stroke aphasia using EEG-based neural envelope tracking of natural speech

Mar 14, 2023
Pieter De Clercq, Jill Kries, Ramtin Mehraram, Jonas Vanthornhout, Tom Francart, Maaike Vandermosten

Via arXiv

Cognitive performance in open-plan office acoustic simulations: Effects of room acoustics and semantics but not spatial separation of sound sources

Jun 13, 2023
Manuj Yadav, Markus Georgi, Larissa Leist, Maria Klatte, Sabine J. Schlittmeier, Janina Fels

Via arXiv

Enhancing conversational quality in language learning chatbots: An evaluation of GPT4 for ASR error correction

Jul 19, 2023
Long Mai, Julie Carson-Berndsen

Via arXiv

UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model

Jun 01, 2023
Anastasiia Iashchenko, Pavel Andreev, Ivan Shchekotov, Nicholas Babaev, Dmitry Vetrov

Via arXiv

Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training

May 22, 2023
Jianfeng He, Julian Salazar, Kaisheng Yao, Haoqi Li, Jinglun Cai

Via arXiv

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

Dec 21, 2022
Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi

Via arXiv