Alert button

"speech": models, code, and papers
Alert button

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

Jun 01, 2023
Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie

Figure 1 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Figure 2 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Figure 3 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Figure 4 for Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Viaarxiv icon

An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention

Jun 09, 2023
Junyu Wang

Figure 1 for An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Figure 2 for An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Figure 3 for An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Figure 4 for An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention
Viaarxiv icon

Turkish Native Language Identification

Aug 04, 2023
Ahmet Yavuz Uluslu, Gerold Schneider

Figure 1 for Turkish Native Language Identification
Figure 2 for Turkish Native Language Identification
Figure 3 for Turkish Native Language Identification
Figure 4 for Turkish Native Language Identification
Viaarxiv icon

Rhythm Modeling for Voice Conversion

Add code
Bookmark button
Alert button
Jul 12, 2023
Benjamin van Niekerk, Marc-André Carbonneau, Herman Kamper

Figure 1 for Rhythm Modeling for Voice Conversion
Figure 2 for Rhythm Modeling for Voice Conversion
Figure 3 for Rhythm Modeling for Voice Conversion
Figure 4 for Rhythm Modeling for Voice Conversion
Viaarxiv icon

NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning

Jun 21, 2023
Kamer Ali Yuksel, Thiago Ferreira, Golara Javadi, Mohamed El-Badrashiny, Ahmet Gunduz

Figure 1 for NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning
Figure 2 for NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning
Viaarxiv icon

Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Power Analysis and Sample Size Estimation

Aug 22, 2023
Hamzeh Ghasemzadeh, Robert E. Hillman, Daryush D. Mehta

Figure 1 for Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Power Analysis and Sample Size Estimation
Figure 2 for Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Power Analysis and Sample Size Estimation
Figure 3 for Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Power Analysis and Sample Size Estimation
Figure 4 for Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Power Analysis and Sample Size Estimation
Viaarxiv icon

GNCformer Enhanced Self-attention for Automatic Speech Recognition

May 22, 2023
J. Li, Z. Duan, S. Li, X. Yu, G. Yang

Figure 1 for GNCformer Enhanced Self-attention for Automatic Speech Recognition
Figure 2 for GNCformer Enhanced Self-attention for Automatic Speech Recognition
Figure 3 for GNCformer Enhanced Self-attention for Automatic Speech Recognition
Figure 4 for GNCformer Enhanced Self-attention for Automatic Speech Recognition
Viaarxiv icon

Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts

Add code
Bookmark button
Alert button
Aug 28, 2023
Thanh Thi Nguyen, Campbell Wilson, Janis Dalins

Figure 1 for Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts
Figure 2 for Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts
Figure 3 for Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts
Figure 4 for Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts
Viaarxiv icon

Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency

Jun 07, 2023
Shigeki Karita, Richard Sproat, Haruko Ishikawa

Figure 1 for Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency
Figure 2 for Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency
Figure 3 for Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency
Viaarxiv icon

AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis

May 08, 2023
Hendric Voß, Stefan Kopp

Figure 1 for AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis
Figure 2 for AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis
Figure 3 for AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis
Figure 4 for AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis
Viaarxiv icon