"speech": models, code, and papers

PolyHope: Two-Level Hope Speech Detection from Tweets

Nov 03, 2022
Fazlourrahman Balouchzahi, Grigori Sidorov, Alexander Gelbukh

Training Autoregressive Speech Recognition Models with Limited in-domain Supervision

Oct 27, 2022
Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover

Speech Enhancement with Fullband-Subband Cross-Attention Network

Nov 10, 2022
Jun Chen, Wei Rao, Zilin Wang, Zhiyong Wu, Yannan Wang, Tao Yu, Shidong Shang, Helen Meng

CB-Conformer: Contextual biasing Conformer for biased word recognition

Apr 19, 2023
Yaoxun Xu, Baiji Liu, Qiaochu Huang, Xingchen Song, Zhiyong Wu, Shiyin Kang, Helen Meng

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Feb 16, 2023
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran

Towards Disentangled Speech Representations

Aug 28, 2022
Cal Peyser, Ronny Huang, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho

Model-based estimation of in-car-communication feedback applied to speech zone detection

Oct 07, 2022
Kaspar Müller, Simon Doclo, Jan Østergaard, Tobias Wolff

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Apr 12, 2023
Zhiyuan Zhao, Lijun Wu, Chuanxin Tang, Dacheng Yin, Yucheng Zhao, Chong Luo

RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis

Dec 15, 2022
Shinhyeok Oh, HyeongRae Noh, Yoonseok Hong, Insoo Oh

Assessing the impact of contextual information in hate speech detection

Oct 05, 2022
Juan Manuel Pérez, Franco Luque, Demian Zayat, Martín Kondratzky, Agustín Moro, Pablo Serrati, Joaquín Zajac, Paula Miguel, Natalia Debandi, Agustín Gravano, Viviana Cotik
