"speech": models, code, and papers

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

Oct 27, 2023
Jiamin Xie, John H. L. Hansen

Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures

Sep 14, 2023
Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong

Guided Flows for Generative Modeling and Decision Making

Nov 22, 2023
Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen

A Text-to-Text Model for Multilingual Offensive Language Identification

Dec 06, 2023
Tharindu Ranasinghe, Marcos Zampieri

Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis

Oct 09, 2023
Jianqiao Lu, Wenyong Huang, Nianzu Zheng, Xingshan Zeng, Yu Ting Yeung, Xiao Chen

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

Oct 02, 2023
Roi Benita, Michael Elad, Joseph Keshet

Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation

Oct 04, 2023
Aman Khullar, Daniel Nkemelu, Cuong V. Nguyen, Michael L. Best

Test-Time Training for Speech

Sep 28, 2023
Sri Harsha Dumpala, Chandramouli Sastry, Sageev Oore

Summary of the DISPLACE Challenge 2023 -- DIarization of SPeaker and LAnguage in Conversational Environments

Nov 21, 2023
Shikha Baghel, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, Sriram Ganapathy

Energy-Based Models For Speech Synthesis

Oct 19, 2023
Wanli Sun, Zehai Tu, Anton Ragni
