"speech": models, code, and papers

RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System

Apr 14, 2022
Muhammed Zahid Ozturk, Chenshu Wu, Beibei Wang, Min Wu, K. J. Ray Liu

Via arXiv

A variational autoencoder-based nonnegative matrix factorisation model for deep dictionary learning

Jan 18, 2023
Hong-Bo Xie, Caoyuan Li, Shuliang Wang, Richard Yi Da Xu, Kerrie Mengersen

Via arXiv

TRILLsson: Distilled Universal Paralinguistic Speech Representations

Mar 01, 2022
Joel Shor, Subhashini Venugopalan

Via arXiv

CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing

Feb 21, 2022
Tao Wang, Jiangyan Yi, Ruibo Fu, Jianhua Tao, Zhengqi Wen

Via arXiv

Subword Dictionary Learning and Segmentation Techniques for Automatic Speech Recognition in Tamil and Kannada

Jul 27, 2022
Madhavaraj A, Bharathi Pilar, Ramakrishnan A G

Via arXiv

Speech Pre-training with Acoustic Piece

Apr 07, 2022
Shuo Ren, Shujie Liu, Yu Wu, Long Zhou, Furu Wei

Via arXiv

HateCheckHIn: Evaluating Hindi Hate Speech Detection Models

Apr 30, 2022
Mithun Das, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

Via arXiv

Computing Optimal Location of Microphone for Improved Speech Recognition

Mar 24, 2022
Karan Nathwani, Bhavya Dixit, Sunil Kumar Kopparapu

Via arXiv

Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech

Sep 07, 2022
Huu-Tien Dang, Thi-Hai-Yen Vuong, Xuan-Hieu Phan

Via arXiv

Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches

Jul 07, 2022
Tusarkanta Dalai, Tapas Kumar Mishra, Pankaj K Sa

Via arXiv