Alert button

"speech": models, code, and papers
Alert button

Multi-Modal Pre-Training for Automated Speech Recognition

Oct 12, 2021
David M. Chan, Shalini Ghosh, Debmalya Chakrabarty, Björn Hoffmeister

Figure 1 for Multi-Modal Pre-Training for Automated Speech Recognition
Figure 2 for Multi-Modal Pre-Training for Automated Speech Recognition
Figure 3 for Multi-Modal Pre-Training for Automated Speech Recognition
Figure 4 for Multi-Modal Pre-Training for Automated Speech Recognition
Viaarxiv icon

Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement

Sep 03, 2022
Siddarth Ravichandran, Ondřej Texler, Dimitar Dinev, Hyun Jae Kang

Figure 1 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Figure 2 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Figure 3 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Figure 4 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Viaarxiv icon

The THUEE System Description for the IARPA OpenASR21 Challenge

Jun 29, 2022
Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guan-Bo Wang, Guoguo Chen, Wei-Qiang Zhang

Figure 1 for The THUEE System Description for the IARPA OpenASR21 Challenge
Figure 2 for The THUEE System Description for the IARPA OpenASR21 Challenge
Figure 3 for The THUEE System Description for the IARPA OpenASR21 Challenge
Figure 4 for The THUEE System Description for the IARPA OpenASR21 Challenge
Viaarxiv icon

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Dec 18, 2020
Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee

Figure 1 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Figure 2 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Figure 3 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Figure 4 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Viaarxiv icon

Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech

Sep 22, 2021
Ida Szubert, Omri Abend, Nathan Schneider, Samuel Gibbon, Sharon Goldwater, Mark Steedman

Figure 1 for Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech
Figure 2 for Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech
Figure 3 for Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech
Figure 4 for Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech
Viaarxiv icon

A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery

Jun 29, 2022
Werner van der Merwe, Herman Kamper, Johan du Preez

Figure 1 for A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
Figure 2 for A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
Figure 3 for A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
Figure 4 for A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
Viaarxiv icon

The Use of Voice Source Features for Sung Speech Recognition

Feb 23, 2021
Gerardo Roa Dabike, Jon Barker

Figure 1 for The Use of Voice Source Features for Sung Speech Recognition
Figure 2 for The Use of Voice Source Features for Sung Speech Recognition
Figure 3 for The Use of Voice Source Features for Sung Speech Recognition
Figure 4 for The Use of Voice Source Features for Sung Speech Recognition
Viaarxiv icon

The Volctrans Neural Speech Translation System for IWSLT 2021

May 16, 2021
Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li

Figure 1 for The Volctrans Neural Speech Translation System for IWSLT 2021
Figure 2 for The Volctrans Neural Speech Translation System for IWSLT 2021
Figure 3 for The Volctrans Neural Speech Translation System for IWSLT 2021
Figure 4 for The Volctrans Neural Speech Translation System for IWSLT 2021
Viaarxiv icon

Attention-based Contextual Language Model Adaptation for Speech Recognition

Jun 02, 2021
Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe

Figure 1 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Figure 2 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Figure 3 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Figure 4 for Attention-based Contextual Language Model Adaptation for Speech Recognition
Viaarxiv icon

Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems

Jul 09, 2021
Jesús Villalba, Sonal Joshi, Piotr Żelasko, Najim Dehak

Figure 1 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Figure 2 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Figure 3 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Figure 4 for Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems
Viaarxiv icon