Alert button

"speech": models, code, and papers
Alert button

DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language

Dec 28, 2020
Md. Rezaul Karim, Sumon Kanti Dey, Bharathi Raja Chakravarthi

Figure 1 for DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language
Figure 2 for DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language
Figure 3 for DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language
Figure 4 for DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language
Viaarxiv icon

Estimating articulatory movements in speech production with transformer networks

Apr 11, 2021
Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh

Figure 1 for Estimating articulatory movements in speech production with transformer networks
Figure 2 for Estimating articulatory movements in speech production with transformer networks
Figure 3 for Estimating articulatory movements in speech production with transformer networks
Figure 4 for Estimating articulatory movements in speech production with transformer networks
Viaarxiv icon

Sequence-level self-learning with multiple hypotheses

Dec 10, 2021
Kenichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur, Robert Gmyr, Sefik Emre Eskimez, Jinyu Li, Michael Zeng

Figure 1 for Sequence-level self-learning with multiple hypotheses
Figure 2 for Sequence-level self-learning with multiple hypotheses
Figure 3 for Sequence-level self-learning with multiple hypotheses
Figure 4 for Sequence-level self-learning with multiple hypotheses
Viaarxiv icon

Improving CTC-based ASR Models with Gated Interlayer Collaboration

May 25, 2022
Yuting Yang, Yuke Li, Binbin Du

Figure 1 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Figure 2 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Figure 3 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Figure 4 for Improving CTC-based ASR Models with Gated Interlayer Collaboration
Viaarxiv icon

Gesticulator: A framework for semantically-aware speech-driven gesture generation

Jan 25, 2020
Taras Kucherenko, Patrik Jonell, Sanne van Waveren, Gustav Eje Henter, Simon Alexanderson, Iolanda Leite, Hedvig Kjellström

Figure 1 for Gesticulator: A framework for semantically-aware speech-driven gesture generation
Figure 2 for Gesticulator: A framework for semantically-aware speech-driven gesture generation
Figure 3 for Gesticulator: A framework for semantically-aware speech-driven gesture generation
Figure 4 for Gesticulator: A framework for semantically-aware speech-driven gesture generation
Viaarxiv icon

Alzheimer's Dementia Recognition Using Acoustic, Lexical, Disfluency and Speech Pause Features Robust to Noisy Inputs

Jun 29, 2021
Morteza Rohanian, Julian Hough, Matthew Purver

Figure 1 for Alzheimer's Dementia Recognition Using Acoustic, Lexical, Disfluency and Speech Pause Features Robust to Noisy Inputs
Figure 2 for Alzheimer's Dementia Recognition Using Acoustic, Lexical, Disfluency and Speech Pause Features Robust to Noisy Inputs
Figure 3 for Alzheimer's Dementia Recognition Using Acoustic, Lexical, Disfluency and Speech Pause Features Robust to Noisy Inputs
Viaarxiv icon

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization

Nov 29, 2021
Brian Yan, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Siddharth Dalmia, Dan Berrebbi, Chao Weng, Shinji Watanabe, Dong Yu

Figure 1 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 2 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 3 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 4 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Viaarxiv icon

Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition

Oct 26, 2020
Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Figure 1 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Figure 2 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Figure 3 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Figure 4 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Viaarxiv icon

Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data

Oct 14, 2021
Haitong Zhang, Yue Lin

Figure 1 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Figure 2 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Figure 3 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Figure 4 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Viaarxiv icon

indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages

Mar 31, 2022
Anirudh Gupta, Neeraj Chhimwal, Ankur Dhuriya, Rishabh Gaur, Priyanshi Shah, Harveen Singh Chadha, Vivek Raghavan

Figure 1 for indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages
Figure 2 for indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages
Figure 3 for indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages
Viaarxiv icon