"speech": models, code, and papers

Cold Diffusion for Speech Enhancement

Nov 04, 2022
Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding

May 23, 2023
Tian-Hao Zhang, Hai-Bo Qin, Zhi-Hao Lai, Song-Lu Chen, Qi Liu, Feng Chen, Xinyuan Qian, Xu-Cheng Yin

How Hate Speech Varies by Target Identity: A Computational Analysis

Oct 19, 2022
Michael Miller Yoder, Lynnette Hui Xian Ng, David West Brown, Kathleen M. Carley

Data-Efficient French Language Modeling with CamemBERTa

Jun 02, 2023
Wissam Antoun, Benoît Sagot, Djamé Seddah

NLPositionality: Characterizing Design Biases of Datasets and Models

Jun 02, 2023
Sebastin Santy, Jenny T. Liang, Ronan Le Bras, Katharina Reinecke, Maarten Sap

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration

Nov 04, 2022
Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann

Any-speaker Adaptive Text-To-Speech Synthesis with Diffusion Models

Nov 17, 2022
Minki Kang, Dongchan Min, Sung Ju Hwang

A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

Nov 04, 2022
Jian Xue, Peidong Wang, Jinyu Li, Eric Sun

On the robustness of non-intrusive speech quality model by adversarial examples

Nov 11, 2022
Hsin-Yi Lin, Huan-Hsin Tseng, Yu Tsao

Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert

Mar 29, 2023
Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li
