"speech": models, code, and papers

Continuous speech separation: dataset and analysis

Jan 30, 2020
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li

Via arXiv

Direct speech-to-speech translation with a sequence-to-sequence model

Apr 12, 2019
Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu

Via arXiv

Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition

Feb 14, 2021
Priyabrata Karmakar, Shyh Wei Teng, Guojun Lu

Via arXiv

Detection of Doctored Speech: Towards an End-to-End Parametric Learn-able Filter Approach

Jun 27, 2022
Rohit Arora

Via arXiv

TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices

Aug 11, 2020
Alexander Wong, Mahmoud Famouri, Maya Pavlova, Siddharth Surana

Via arXiv

TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation

Mar 31, 2021
Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu

Via arXiv

Representation Learning For Speech Recognition Using Feedback Based Relevance Weighting

Feb 15, 2021
Purvi Agrawal, Sriram Ganapathy

Via arXiv

Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning

May 27, 2022
Xiliang Zhu, Shayna Gardiner, David Rossouw, Tere Roldán, Simon Corston-Oliver

Via arXiv

Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition

Sep 03, 2021
Guangzhi Sun, Chao Zhang, Philip C. Woodland

Via arXiv

Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers

Jun 24, 2022
Josh Belanich, Krishna Somandepalli, Brian Eoff, Brendan Jou

Via arXiv