"speech": models, code, and papers

Chain of Explanation: New Prompting Method to Generate Higher Quality Natural Language Explanation for Implicit Hate Speech

Sep 11, 2022
Fan Huang, Haewoon Kwak, Jisun An

Via arXiv

A comparative study between linear and nonlinear speech prediction

Mar 31, 2022
Marcos Faundez-Zanuy, Enric Monte, Francesc Vallverdú

Via arXiv

Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language

May 19, 2022
Martin Malmsten, Chris Haffenden, Love Börjeson

Via arXiv

Emotional Prosody Control for Speech Generation

Nov 07, 2021
Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi

Via arXiv

WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment

Apr 22, 2022
Lin Yao, Jianfei Song, Ruizhuo Xu, Yingfang Yang, Zijian Chen, Yafeng Deng

Via arXiv

Exploring Attention Map Reuse for Efficient Transformer Neural Networks

Jan 29, 2023
Kyuhong Shim, Jungwook Choi, Wonyong Sung

Via arXiv

Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning

Apr 08, 2022
Eesung Kim, Jae-Jin Jeon, Hyeji Seo, Hoon Kim

Via arXiv

Tragic and Comical Networks. Clustering Dramatic Genres According to Structural Properties

Feb 16, 2023
Szemes Botond, Vida Bence

Via arXiv

GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis

Jan 31, 2023
Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, JinZheng He, Zhou Zhao

Via arXiv

A benchmark for toxic comment classification on Civil Comments dataset

Jan 26, 2023
Corentin Duchene, Henri Jamet, Pierre Guillaume, Reda Dehak

Via arXiv