Alert button

"speech": models, code, and papers
Alert button

DWFormer: Dynamic Window transFormer for Speech Emotion Recognition

Mar 03, 2023
Shuaiqi Chen, Xiaofen Xing, Weibin Zhang, Weidong Chen, Xiangmin Xu

Figure 1 for DWFormer: Dynamic Window transFormer for Speech Emotion Recognition
Figure 2 for DWFormer: Dynamic Window transFormer for Speech Emotion Recognition
Figure 3 for DWFormer: Dynamic Window transFormer for Speech Emotion Recognition
Figure 4 for DWFormer: Dynamic Window transFormer for Speech Emotion Recognition
Viaarxiv icon

Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices

Jul 14, 2023
Zitha Sasindran, Harsha Yelchuri, T. V. Prabhakar

Figure 1 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Figure 2 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Figure 3 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Figure 4 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Viaarxiv icon

Visually grounded few-shot word learning in low-resource settings

Jun 21, 2023
Leanne Nortje, Dan Oneata, Herman Kamper

Figure 1 for Visually grounded few-shot word learning in low-resource settings
Figure 2 for Visually grounded few-shot word learning in low-resource settings
Figure 3 for Visually grounded few-shot word learning in low-resource settings
Figure 4 for Visually grounded few-shot word learning in low-resource settings
Viaarxiv icon

Improving Meeting Inclusiveness using Speech Interruption Analysis

Apr 02, 2023
Szu-Wei Fu, Yaran Fan, Yasaman Hosseinkashi, Jayant Gupchup, Ross Cutler

Figure 1 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Figure 2 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Figure 3 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Figure 4 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Viaarxiv icon

A processing framework to access large quantities of whispered speech found in ASMR

Mar 13, 2023
Pablo Perez Zarazaga, Gustav Eje Henter, Zofia Malisz

Figure 1 for A processing framework to access large quantities of whispered speech found in ASMR
Figure 2 for A processing framework to access large quantities of whispered speech found in ASMR
Figure 3 for A processing framework to access large quantities of whispered speech found in ASMR
Figure 4 for A processing framework to access large quantities of whispered speech found in ASMR
Viaarxiv icon

Improving RNN-Transducers with Acoustic LookAhead

Jul 11, 2023
Vinit S. Unni, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi

Figure 1 for Improving RNN-Transducers with Acoustic LookAhead
Figure 2 for Improving RNN-Transducers with Acoustic LookAhead
Figure 3 for Improving RNN-Transducers with Acoustic LookAhead
Figure 4 for Improving RNN-Transducers with Acoustic LookAhead
Viaarxiv icon

Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning

Mar 07, 2023
Zhaoxi Mu, Xinyu Yang, Wenjing Zhu

Figure 1 for Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Figure 2 for Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Figure 3 for Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Figure 4 for Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Viaarxiv icon

Diffusion Posterior Sampling for Informed Single-Channel Dereverberation

Jun 21, 2023
Jean-Marie Lemercier, Simon Welker, Timo Gerkmann

Figure 1 for Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Figure 2 for Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Viaarxiv icon

PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement

Feb 16, 2023
Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Figure 1 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Figure 2 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Figure 3 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Viaarxiv icon

Speech Enhancement for Virtual Meetings on Cellular Networks

Feb 02, 2023
Hojeong Lee, Minseon Gwak, Kawon Lee, Minjeong Kim, Joseph Konan, Ojas Bhargave

Figure 1 for Speech Enhancement for Virtual Meetings on Cellular Networks
Figure 2 for Speech Enhancement for Virtual Meetings on Cellular Networks
Figure 3 for Speech Enhancement for Virtual Meetings on Cellular Networks
Figure 4 for Speech Enhancement for Virtual Meetings on Cellular Networks
Viaarxiv icon