Alert button

"speech": models, code, and papers
Alert button

Exploring Retraining-Free Speech Recognition for Intra-sentential Code-Switching

Aug 27, 2021
Zhen Huang, Xiaodan Zhuang, Daben Liu, Xiaoqiang Xiao, Yuchen Zhang, Sabato Marco Siniscalchi

Figure 1 for Exploring Retraining-Free Speech Recognition for Intra-sentential Code-Switching
Figure 2 for Exploring Retraining-Free Speech Recognition for Intra-sentential Code-Switching
Figure 3 for Exploring Retraining-Free Speech Recognition for Intra-sentential Code-Switching
Viaarxiv icon

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

Mar 29, 2021
Chengdong Liang, Menglong Xu, Xiao-Lei Zhang

Viaarxiv icon

Super-Human Performance in Online Low-latency Recognition of Conversational Speech

Oct 22, 2020
Thai-Son Nguyen, Sebastian Stueker, Alex Waibel

Figure 1 for Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Figure 2 for Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Figure 3 for Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Figure 4 for Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Viaarxiv icon

You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection

Dec 16, 2020
Prateek Chaudhry, Matthew Lease

Figure 1 for You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection
Figure 2 for You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection
Figure 3 for You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection
Viaarxiv icon

Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information

Apr 13, 2022
Alexander Blatt, Martin Kocour, Karel Veselý, Igor Szöke, Dietrich Klakow

Figure 1 for Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information
Figure 2 for Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information
Figure 3 for Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information
Figure 4 for Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information
Viaarxiv icon

Style Transfer of Audio Effects with Differentiable Signal Processing

Add code
Bookmark button
Alert button
Jul 18, 2022
Christian J. Steinmetz, Nicholas J. Bryan, Joshua D. Reiss

Figure 1 for Style Transfer of Audio Effects with Differentiable Signal Processing
Figure 2 for Style Transfer of Audio Effects with Differentiable Signal Processing
Figure 3 for Style Transfer of Audio Effects with Differentiable Signal Processing
Figure 4 for Style Transfer of Audio Effects with Differentiable Signal Processing
Viaarxiv icon

Multimodal Speech Recognition with Unstructured Audio Masking

Oct 16, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

Figure 1 for Multimodal Speech Recognition with Unstructured Audio Masking
Figure 2 for Multimodal Speech Recognition with Unstructured Audio Masking
Figure 3 for Multimodal Speech Recognition with Unstructured Audio Masking
Figure 4 for Multimodal Speech Recognition with Unstructured Audio Masking
Viaarxiv icon

Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning

Add code
Bookmark button
Alert button
Jul 19, 2021
Rohith Aralikatti, Anton Ratnarajah, Zhenyu Tang, Dinesh Manocha

Figure 1 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Figure 2 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Figure 3 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Figure 4 for Improving Reverberant Speech Separation with Multi-stage Training and Curriculum Learning
Viaarxiv icon

A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems

Aug 17, 2021
Xiaoqiang Wang, Yanqing Liu, Sheng Zhao, Jinyu Li

Figure 1 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 2 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 3 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Figure 4 for A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems
Viaarxiv icon

Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation

Add code
Bookmark button
Alert button
Apr 13, 2021
Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe

Figure 1 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Figure 2 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Figure 3 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Figure 4 for Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
Viaarxiv icon