Alert button

"speech recognition": models, code, and papers
Alert button

A Streaming End-to-End Framework For Spoken Language Understanding

May 20, 2021
Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen

Figure 1 for A Streaming End-to-End Framework For Spoken Language Understanding
Figure 2 for A Streaming End-to-End Framework For Spoken Language Understanding
Figure 3 for A Streaming End-to-End Framework For Spoken Language Understanding
Figure 4 for A Streaming End-to-End Framework For Spoken Language Understanding
Viaarxiv icon

Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs

Add code
Bookmark button
Alert button
Apr 07, 2021
Sujeong Cha, Wangrui Hou, Hyun Jung, My Phung, Michael Picheny, Hong-Kwang Kuo, Samuel Thomas, Edmilson Morais

Figure 1 for Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs
Figure 2 for Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs
Figure 3 for Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs
Figure 4 for Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs
Viaarxiv icon

Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding

Feb 12, 2021
Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke

Figure 1 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Figure 2 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Figure 3 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Viaarxiv icon

Content-Aware Speaker Embeddings for Speaker Diarisation

Add code
Bookmark button
Alert button
Feb 12, 2021
G. Sun, D. Liu, C. Zhang, P. C. Woodland

Figure 1 for Content-Aware Speaker Embeddings for Speaker Diarisation
Figure 2 for Content-Aware Speaker Embeddings for Speaker Diarisation
Figure 3 for Content-Aware Speaker Embeddings for Speaker Diarisation
Figure 4 for Content-Aware Speaker Embeddings for Speaker Diarisation
Viaarxiv icon

Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems

Apr 11, 2022
Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas, Hong-Kwang J. Kuo, Brian Kingsbury

Figure 1 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Figure 2 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Figure 3 for Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Viaarxiv icon

DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization

Add code
Bookmark button
Alert button
Dec 11, 2020
Shaoshi Ling, Yuzong Liu

Figure 1 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Figure 2 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Figure 3 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Figure 4 for DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Viaarxiv icon

Nanopore Base Calling on the Edge

Add code
Bookmark button
Alert button
Nov 09, 2020
Peter Perešíni, Vladimír Boža, Broňa Brejová, Tomáš Vinař

Figure 1 for Nanopore Base Calling on the Edge
Figure 2 for Nanopore Base Calling on the Edge
Figure 3 for Nanopore Base Calling on the Edge
Figure 4 for Nanopore Base Calling on the Edge
Viaarxiv icon

Goal-driven text descriptions for images

Add code
Bookmark button
Alert button
Aug 28, 2021
Ruotian Luo

Figure 1 for Goal-driven text descriptions for images
Figure 2 for Goal-driven text descriptions for images
Figure 3 for Goal-driven text descriptions for images
Figure 4 for Goal-driven text descriptions for images
Viaarxiv icon

Predicting Entity Popularity to Improve Spoken Entity Recognition by Virtual Assistants

May 26, 2020
Christophe Van Gysel, Manos Tsagkias, Ernest Pusateri, Ilya Oparin

Figure 1 for Predicting Entity Popularity to Improve Spoken Entity Recognition by Virtual Assistants
Figure 2 for Predicting Entity Popularity to Improve Spoken Entity Recognition by Virtual Assistants
Figure 3 for Predicting Entity Popularity to Improve Spoken Entity Recognition by Virtual Assistants
Viaarxiv icon

How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition

Add code
Bookmark button
Alert button
Nov 24, 2021
Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang

Figure 1 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Figure 2 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Figure 3 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Figure 4 for How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Viaarxiv icon