"speech": models, code, and papers

Creating Speech-to-Speech Corpus from Dubbed Series

Mar 07, 2022
Massa Baali, Wassim El-Hajj, Ahmed Ali

Automatic Speech Recognition of Low-Resource Languages Based on Chukchi

Oct 11, 2022
Anastasia Safonova, Tatiana Yudina, Emil Nadimanov, Cydnie Davenport

Relating the fundamental frequency of speech with EEG using a dilated convolutional network

Jul 05, 2022
Corentin Puffay, Jana Van Canneyt, Jonas Vanthornhout, Hugo Van hamme, Tom Francart

ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement

Mar 19, 2023
Chaojian Li, Wenwan Chen, Jiayi Yuan, Yingyan Lin, Ashutosh Sabharwal

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

May 16, 2022
Alëna Aksënova, Zhehuai Chen, Chung-Cheng Chiu, Daan van Esch, Pavel Golik, Wei Han, Levi King, Bhuvana Ramabhadran, Andrew Rosenberg, Suzan Schwartz, Gary Wang

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

May 20, 2022
Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang

Progressive Knowledge Distillation: Building Ensembles for Efficient Inference

Feb 20, 2023
Don Kurian Dennis, Abhishek Shetty, Anish Sevekari, Kazuhito Koishida, Virginia Smith

Towards Measuring and Scoring Speaker Diarization Fairness

Feb 20, 2023
Yannis Tevissen, Jérôme Boudy, Gérard Chollet, Frédéric Petitpont

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Mar 01, 2023
Rao Ma, Mark J F Gales, Kate Knill, Mengjie Qian

Hey Dona! Can you help me with student course registration?

Mar 21, 2023
Vishesh Kalvakurthi, Aparna S. Varde, John Jenq
