Alert button

"speech recognition": models, code, and papers
Alert button

Building Intelligent Autonomous Navigation Agents

Add code
Bookmark button
Alert button
Jun 25, 2021
Devendra Singh Chaplot

Figure 1 for Building Intelligent Autonomous Navigation Agents
Figure 2 for Building Intelligent Autonomous Navigation Agents
Figure 3 for Building Intelligent Autonomous Navigation Agents
Figure 4 for Building Intelligent Autonomous Navigation Agents
Viaarxiv icon

Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks

Add code
Bookmark button
Alert button
May 02, 2021
Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze, Shinji Watanabe

Figure 1 for Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Figure 2 for Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Figure 3 for Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Figure 4 for Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Viaarxiv icon

Relative Positional Encoding for Speech Recognition and Direct Translation

May 20, 2020
Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alexander Waibel

Figure 1 for Relative Positional Encoding for Speech Recognition and Direct Translation
Figure 2 for Relative Positional Encoding for Speech Recognition and Direct Translation
Figure 3 for Relative Positional Encoding for Speech Recognition and Direct Translation
Figure 4 for Relative Positional Encoding for Speech Recognition and Direct Translation
Viaarxiv icon

Investigations on Phoneme-Based End-To-End Speech Recognition

Add code
Bookmark button
Alert button
May 19, 2020
Albert Zeyer, Wei Zhou, Thomas Ng, Ralf Schlüter, Hermann Ney

Figure 1 for Investigations on Phoneme-Based End-To-End Speech Recognition
Figure 2 for Investigations on Phoneme-Based End-To-End Speech Recognition
Figure 3 for Investigations on Phoneme-Based End-To-End Speech Recognition
Figure 4 for Investigations on Phoneme-Based End-To-End Speech Recognition
Viaarxiv icon

Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays

Add code
Bookmark button
Alert button
Mar 30, 2021
Shanzheng Guan, Shupei Liu, Junqi Chen, Wenbo Zhu, Shengqiang Li, Xu Tan, Ziye Yang, Menglong Xu, Yijiang Chen, Jianyu Wang, Xiao-Lei Zhang

Figure 1 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Figure 2 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Figure 3 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Figure 4 for Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays
Viaarxiv icon

Improved Meta-learning training for Speaker Verification

Mar 29, 2021
Yafeng Chen, Wu Guo, Bin Gu

Figure 1 for Improved Meta-learning training for Speaker Verification
Figure 2 for Improved Meta-learning training for Speaker Verification
Figure 3 for Improved Meta-learning training for Speaker Verification
Figure 4 for Improved Meta-learning training for Speaker Verification
Viaarxiv icon

Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application

Apr 29, 2021
Abhijit Suresh, Jennifer Jacobs, Vivian Lai, Chenhao Tan, Wayne Ward, James H. Martin, Tamara Sumner

Figure 1 for Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application
Figure 2 for Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application
Figure 3 for Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application
Figure 4 for Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application
Viaarxiv icon

Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification

Jun 15, 2022
Jingyu Li, Yusheng Tian, Tan Lee

Figure 1 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Figure 2 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Figure 3 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Figure 4 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Viaarxiv icon

CAT: CRF-based ASR Toolkit

Add code
Bookmark button
Alert button
Nov 20, 2019
Keyu An, Hongyu Xiang, Zhijian Ou

Figure 1 for CAT: CRF-based ASR Toolkit
Figure 2 for CAT: CRF-based ASR Toolkit
Figure 3 for CAT: CRF-based ASR Toolkit
Figure 4 for CAT: CRF-based ASR Toolkit
Viaarxiv icon

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge

Jan 29, 2021
Martin Kocour, Guillermo Cámbara, Jordi Luque, David Bonet, Mireia Farrús, Martin Karafiát, Karel Veselý, Jan ''Honza'' Ĉernocký

Figure 1 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Figure 2 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Figure 3 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Figure 4 for BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge
Viaarxiv icon