"speech": models, code, and papers

Emotion Recognition In Persian Speech Using Deep Neural Networks

Apr 28, 2022
Ali Yazdani, Hossein Simchi, Yaser Shekofteh

Via arXiv

Auditory Model based Phase-Aware Bayesian Spectral Amplitude Estimator for Single-Channel Speech Enhancement

Feb 10, 2022
Suman Samui, Indrajit Chakrabarti, Soumya K. Ghosh

Via arXiv

Training Speech Enhancement Systems with Noisy Speech Datasets

May 26, 2021
Koichi Saito, Stefan Uhlich, Giorgio Fabbro, Yuki Mitsufuji

Via arXiv

Macro-block dropout for improved regularization in training end-to-end speech recognition models

Dec 29, 2022
Chanwoo Kim, Sathish Indurti, Jinhwan Park, Wonyong Sung

Via arXiv

SA: Sliding attack for synthetic speech detection with resistance to clipping and self-splicing

Aug 27, 2022
Deng JiaCheng, Dong Li, Yan Diqun, Wang Rangding, Zeng Jiaming

Via arXiv

Effect of noise suppression losses on speech distortion and ASR performance

Nov 23, 2021
Sebastian Braun, Hannes Gamper

Via arXiv

Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation

May 04, 2021
Alfredo Esquivel Jaramillo, Jesper Kjær Nielsen, Mads Græsbøll Christensen

Via arXiv

Hate Speech Classifiers Learn Human-Like Social Stereotypes

Oct 28, 2021
Aida Mostafazadeh Davani, Mohammad Atari, Brendan Kennedy, Morteza Dehghani

Via arXiv

Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques

Jan 26, 2022
Tu Anh Dinh, Danni Liu, Jan Niehues

Via arXiv

End-to-end model for named entity recognition from speech without paired training data

Apr 02, 2022
Salima Mdhaffar, Jarod Duret, Titouan Parcollet, Yannick Estève

Via arXiv