Alert button

"speech": models, code, and papers
Alert button

Improved Speech Representations with Multi-Target Autoregressive Predictive Coding

Apr 11, 2020
Yu-An Chung, James Glass

Figure 1 for Improved Speech Representations with Multi-Target Autoregressive Predictive Coding
Figure 2 for Improved Speech Representations with Multi-Target Autoregressive Predictive Coding
Figure 3 for Improved Speech Representations with Multi-Target Autoregressive Predictive Coding
Figure 4 for Improved Speech Representations with Multi-Target Autoregressive Predictive Coding
Viaarxiv icon

Towards Fluent Translations from Disfluent Speech

Nov 07, 2018
Elizabeth Salesky, Susanne Burger, Jan Niehues, Alex Waibel

Figure 1 for Towards Fluent Translations from Disfluent Speech
Figure 2 for Towards Fluent Translations from Disfluent Speech
Figure 3 for Towards Fluent Translations from Disfluent Speech
Figure 4 for Towards Fluent Translations from Disfluent Speech
Viaarxiv icon

Sequence-level self-learning with multiple hypotheses

Dec 10, 2021
Kenichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur, Robert Gmyr, Sefik Emre Eskimez, Jinyu Li, Michael Zeng

Figure 1 for Sequence-level self-learning with multiple hypotheses
Figure 2 for Sequence-level self-learning with multiple hypotheses
Figure 3 for Sequence-level self-learning with multiple hypotheses
Figure 4 for Sequence-level self-learning with multiple hypotheses
Viaarxiv icon

Understanding and Detecting Dangerous Speech in Social Media

Add code
Bookmark button
Alert button
May 04, 2020
Ali Alshehri, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

Figure 1 for Understanding and Detecting Dangerous Speech in Social Media
Figure 2 for Understanding and Detecting Dangerous Speech in Social Media
Figure 3 for Understanding and Detecting Dangerous Speech in Social Media
Figure 4 for Understanding and Detecting Dangerous Speech in Social Media
Viaarxiv icon

Examining a hate speech corpus for hate speech detection and popularity prediction

Add code
Bookmark button
Alert button
May 12, 2018
Filip Klubička, Raquel Fernández

Figure 1 for Examining a hate speech corpus for hate speech detection and popularity prediction
Figure 2 for Examining a hate speech corpus for hate speech detection and popularity prediction
Figure 3 for Examining a hate speech corpus for hate speech detection and popularity prediction
Figure 4 for Examining a hate speech corpus for hate speech detection and popularity prediction
Viaarxiv icon

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization

Nov 29, 2021
Brian Yan, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Siddharth Dalmia, Dan Berrebbi, Chao Weng, Shinji Watanabe, Dong Yu

Figure 1 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 2 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 3 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 4 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Viaarxiv icon

ViDA-MAN: Visual Dialog with Digital Humans

Oct 26, 2021
Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei Zhang, Xiaodong He, Tao Mei

Figure 1 for ViDA-MAN: Visual Dialog with Digital Humans
Figure 2 for ViDA-MAN: Visual Dialog with Digital Humans
Viaarxiv icon

Does Visual Self-Supervision Improve Learning of Speech Representations?

May 04, 2020
Abhinav Shukla, Stavros Petridis, Maja Pantic

Figure 1 for Does Visual Self-Supervision Improve Learning of Speech Representations?
Figure 2 for Does Visual Self-Supervision Improve Learning of Speech Representations?
Figure 3 for Does Visual Self-Supervision Improve Learning of Speech Representations?
Figure 4 for Does Visual Self-Supervision Improve Learning of Speech Representations?
Viaarxiv icon

NIESR: Nuisance Invariant End-to-end Speech Recognition

Add code
Bookmark button
Alert button
Jul 07, 2019
I-Hung Hsu, Ayush Jaiswal, Premkumar Natarajan

Figure 1 for NIESR: Nuisance Invariant End-to-end Speech Recognition
Figure 2 for NIESR: Nuisance Invariant End-to-end Speech Recognition
Figure 3 for NIESR: Nuisance Invariant End-to-end Speech Recognition
Figure 4 for NIESR: Nuisance Invariant End-to-end Speech Recognition
Viaarxiv icon

Semi-Supervised Speech Recognition via Graph-based Temporal Classification

Oct 29, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux

Figure 1 for Semi-Supervised Speech Recognition via Graph-based Temporal Classification
Figure 2 for Semi-Supervised Speech Recognition via Graph-based Temporal Classification
Figure 3 for Semi-Supervised Speech Recognition via Graph-based Temporal Classification
Figure 4 for Semi-Supervised Speech Recognition via Graph-based Temporal Classification
Viaarxiv icon