Alert button

"speech": models, code, and papers
Alert button

How "open" are the conversations with open-domain chatbots? A proposal for Speech Event based evaluation

Nov 24, 2022
A. Seza Doğruöz, Gabriel Skantze

Figure 1 for How "open" are the conversations with open-domain chatbots? A proposal for Speech Event based evaluation
Viaarxiv icon

Cross-Modal Mutual Learning for Cued Speech Recognition

Dec 02, 2022
Lei Liu, Li Liu

Figure 1 for Cross-Modal Mutual Learning for Cued Speech Recognition
Figure 2 for Cross-Modal Mutual Learning for Cued Speech Recognition
Figure 3 for Cross-Modal Mutual Learning for Cued Speech Recognition
Figure 4 for Cross-Modal Mutual Learning for Cued Speech Recognition
Viaarxiv icon

Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses

Add code
Bookmark button
Alert button
Nov 29, 2022
Yang Ai, Zhen-Hua Ling

Figure 1 for Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses
Figure 2 for Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses
Figure 3 for Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses
Figure 4 for Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses
Viaarxiv icon

Toxic comments reduce the activity of volunteer editors on Wikipedia

Add code
Bookmark button
Alert button
Apr 26, 2023
Ivan Smirnov, Camelia Oprea, Markus Strohmaier

Figure 1 for Toxic comments reduce the activity of volunteer editors on Wikipedia
Figure 2 for Toxic comments reduce the activity of volunteer editors on Wikipedia
Figure 3 for Toxic comments reduce the activity of volunteer editors on Wikipedia
Figure 4 for Toxic comments reduce the activity of volunteer editors on Wikipedia
Viaarxiv icon

An Adapter based Multi-label Pre-training for Speech Separation and Enhancement

Nov 11, 2022
Tianrui Wang, Xie Chen, Zhuo Chen, Shu Yu, Weibin Zhu

Figure 1 for An Adapter based Multi-label Pre-training for Speech Separation and Enhancement
Figure 2 for An Adapter based Multi-label Pre-training for Speech Separation and Enhancement
Figure 3 for An Adapter based Multi-label Pre-training for Speech Separation and Enhancement
Figure 4 for An Adapter based Multi-label Pre-training for Speech Separation and Enhancement
Viaarxiv icon

Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

Add code
Bookmark button
Alert button
Oct 20, 2022
Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu

Figure 1 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 2 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 3 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 4 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Viaarxiv icon

Improving End-to-End SLU performance with Prosodic Attention and Distillation

May 14, 2023
Shangeth Rajaa

Figure 1 for Improving End-to-End SLU performance with Prosodic Attention and Distillation
Figure 2 for Improving End-to-End SLU performance with Prosodic Attention and Distillation
Figure 3 for Improving End-to-End SLU performance with Prosodic Attention and Distillation
Figure 4 for Improving End-to-End SLU performance with Prosodic Attention and Distillation
Viaarxiv icon

Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data

May 26, 2023
Aryan Patil, Varad Patwardhan, Abhishek Phaltankar, Gauri Takawane, Raviraj Joshi

Figure 1 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Figure 2 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Figure 3 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Figure 4 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Viaarxiv icon

EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance

Add code
Bookmark button
Alert button
Nov 17, 2022
Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Figure 1 for EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Figure 2 for EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Figure 3 for EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Figure 4 for EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Viaarxiv icon

Joint unsupervised and supervised learning for context-aware language identification

Mar 29, 2023
Jinseok Park, Hyung Yong Kim, Jihwan Park, Byeong-Yeol Kim, Shukjae Choi, Yunkyu Lim

Figure 1 for Joint unsupervised and supervised learning for context-aware language identification
Figure 2 for Joint unsupervised and supervised learning for context-aware language identification
Figure 3 for Joint unsupervised and supervised learning for context-aware language identification
Figure 4 for Joint unsupervised and supervised learning for context-aware language identification
Viaarxiv icon