Alert button

"speech": models, code, and papers
Alert button

End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics

Nov 07, 2022
Eda Okur, Saurav Sahay, Roddy Fuentes Alba, Lama Nachman

Figure 1 for End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Figure 2 for End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Figure 3 for End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Figure 4 for End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Viaarxiv icon

A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding

Nov 04, 2021
Yingzhi Wang, Abdelmoumene Boumadane, Abdelwahab Heba

Figure 1 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Figure 2 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Figure 3 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Figure 4 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Viaarxiv icon

Statistical Analysis of Perspective Scores on Hate Speech Detection

Jun 22, 2021
Hadi Mansourifar, Dana Alsagheer, Weidong Shi, Lan Ni, Yan Huang

Figure 1 for Statistical Analysis of Perspective Scores on Hate Speech Detection
Figure 2 for Statistical Analysis of Perspective Scores on Hate Speech Detection
Figure 3 for Statistical Analysis of Perspective Scores on Hate Speech Detection
Figure 4 for Statistical Analysis of Perspective Scores on Hate Speech Detection
Viaarxiv icon

Distilling a Pretrained Language Model to a Multilingual ASR Model

Jun 25, 2022
Kwanghee Choi, Hyung-Min Park

Figure 1 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 2 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 3 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 4 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Viaarxiv icon

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Oct 27, 2022
Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang

Figure 1 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 2 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 3 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 4 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Viaarxiv icon

Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation

Oct 24, 2022
Tsz Kin Lam, Eva Hasler, Felix Hieber

Figure 1 for Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation
Figure 2 for Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation
Figure 3 for Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation
Figure 4 for Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation
Viaarxiv icon

Stimulus-Informed Generalized Canonical Correlation Analysis of Stimulus-Following Brain Responses

Oct 24, 2022
Simon Geirnaert, Tom Francart, Alexander Bertrand

Figure 1 for Stimulus-Informed Generalized Canonical Correlation Analysis of Stimulus-Following Brain Responses
Figure 2 for Stimulus-Informed Generalized Canonical Correlation Analysis of Stimulus-Following Brain Responses
Viaarxiv icon

Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation

Mar 04, 2022
Dávid Javorský, Dominik Macháček, Ondřej Bojar

Figure 1 for Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation
Figure 2 for Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation
Figure 3 for Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation
Figure 4 for Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation
Viaarxiv icon

Multi-accent Speech Separation with One Shot Learning

Jun 28, 2021
Kuan-Po Huang, Yuan-Kuei Wu, Hung-yi Lee

Figure 1 for Multi-accent Speech Separation with One Shot Learning
Figure 2 for Multi-accent Speech Separation with One Shot Learning
Figure 3 for Multi-accent Speech Separation with One Shot Learning
Figure 4 for Multi-accent Speech Separation with One Shot Learning
Viaarxiv icon

Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding

Jul 21, 2022
Bagus Tris Atmaja, Zanjabila, Akira Sasou

Figure 1 for Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding
Figure 2 for Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding
Figure 3 for Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding
Figure 4 for Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding
Viaarxiv icon