Alert button

"speech recognition": models, code, and papers
Alert button

SNRi Target Training for Joint Speech Enhancement and Recognition

Add code
Bookmark button
Alert button
Nov 01, 2021
Yuma Koizumi, Shigeki Karita, Arun Narayanan, Sankaran Panchapagesan, Michiel Bacchiani

Figure 1 for SNRi Target Training for Joint Speech Enhancement and Recognition
Figure 2 for SNRi Target Training for Joint Speech Enhancement and Recognition
Figure 3 for SNRi Target Training for Joint Speech Enhancement and Recognition
Figure 4 for SNRi Target Training for Joint Speech Enhancement and Recognition
Viaarxiv icon

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Add code
Bookmark button
Alert button
Sep 08, 2022
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

Figure 1 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 2 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 3 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 4 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Viaarxiv icon

Generalizing in the Real World with Representation Learning

Add code
Bookmark button
Alert button
Oct 18, 2022
Tegan Maharaj

Figure 1 for Generalizing in the Real World with Representation Learning
Figure 2 for Generalizing in the Real World with Representation Learning
Figure 3 for Generalizing in the Real World with Representation Learning
Figure 4 for Generalizing in the Real World with Representation Learning
Viaarxiv icon

Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation

Jul 04, 2021
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi

Figure 1 for Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation
Figure 2 for Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation
Viaarxiv icon

Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition

Oct 20, 2017
Zhehuai Chen, Jasha Droppo, Jinyu Li, Wayne Xiong

Figure 1 for Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Figure 2 for Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Figure 3 for Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Figure 4 for Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Viaarxiv icon

Out-of-Distribution Representation Learning for Time Series Classification

Add code
Bookmark button
Alert button
Sep 26, 2022
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xing Xie

Figure 1 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 2 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 3 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 4 for Out-of-Distribution Representation Learning for Time Series Classification
Viaarxiv icon

Model Blending for Text Classification

Aug 05, 2022
Ramit Pahwa

Figure 1 for Model Blending for Text Classification
Figure 2 for Model Blending for Text Classification
Figure 3 for Model Blending for Text Classification
Figure 4 for Model Blending for Text Classification
Viaarxiv icon

Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing

Add code
Bookmark button
Alert button
Feb 05, 2023
Han He, Jinho D. Choi

Figure 1 for Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing
Figure 2 for Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing
Figure 3 for Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing
Figure 4 for Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing
Viaarxiv icon

Streaming automatic speech recognition with the transformer model

Jan 09, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux

Figure 1 for Streaming automatic speech recognition with the transformer model
Figure 2 for Streaming automatic speech recognition with the transformer model
Figure 3 for Streaming automatic speech recognition with the transformer model
Viaarxiv icon