"speech recognition": models, code, and papers

Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding

Apr 11, 2022
Sanjana Sankar, Denis Beautemps, Thomas Hueber

Adversarial Attacks on ASR Systems: An Overview

Aug 03, 2022
Xiao Zhang, Hao Tan, Xuan Huang, Denghui Zhang, Keke Tang, Zhaoquan Gu

Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition

Jan 22, 2016
Amr Bakry, Ahmed Elgammal

Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning

May 27, 2022
Xiliang Zhu, Shayna Gardiner, David Rossouw, Tere Roldán, Simon Corston-Oliver

English Conversational Telephone Speech Recognition by Humans and Machines

Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall

Articulatory information and Multiview Features for Large Vocabulary Continuous Speech Recognition

Feb 16, 2018
Vikramjit Mitra, Wen Wang, Chris Bartels, Horacio Franco, Dimitra Vergyri

Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model

Oct 23, 2019
Oleksii Hrinchuk, Mariya Popova, Boris Ginsburg

UserLibri: A Dataset for ASR Personalization Using Only Text

Jul 02, 2022
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey

Fast and Robust Unsupervised Contextual Biasing for Speech Recognition

May 04, 2020
Young Mo Kang, Yingbo Zhou

Improving speech recognition models with small samples for air traffic control systems

Feb 16, 2021
Yi Lin, Qin Li, Bo Yang, Zhen Yan, Huachun Tan, Zhengmao Chen
