"speech": models, code, and papers

SNR-based teachers-student technique for speech enhancement

May 29, 2020
Xiang Hao, Xiangdong Su, Zhiyu Wang, Qiang Zhang, Huali Xu, Guanglai Gao

Via arXiv

Optimizing Speech Recognition For The Edge

Sep 26, 2019
Yuan Shangguan, Jian Li, Liang Qiao, Raziel Alvarez, Ian McGraw

Via arXiv

Detecting Distrust Towards the Skills of a Virtual Assistant Using Speech

Jul 30, 2020
Leonardo Pepino, Pablo Riera, Lara Gauder, Agustín Gravano, Luciana Ferrer

Via arXiv

Fixed-MAML for Few Shot Classification in Multilingual Speech Emotion Recognition

Jan 05, 2021
Anugunj Naman, Liliana Mancini

Via arXiv

Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models

Oct 01, 2020
Thai Binh Nguyen, Quang Minh Nguyen, Thi Thu Hien Nguyen, Quoc Truong Do, Chi Mai Luong

Via arXiv

Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus

Oct 06, 2020
Michel Plüss, Lukas Neukom, Manfred Vogel

Via arXiv

Dataset Condensation via Efficient Synthetic-Data Parameterization

May 30, 2022
Jang-Hyun Kim, Jinuk Kim, Seong Joon Oh, Sangdoo Yun, Hwanjun Song, Joonhyun Jeong, Jung-Woo Ha, Hyun Oh Song

Via arXiv

Audio Self-supervised Learning: A Survey

Mar 02, 2022
Shuo Liu, Adria Mallol-Ragolta, Emilia Parada-Cabeleiro, Kun Qian, Xin Jing, Alexander Kathan, Bin Hu, Bjoern W. Schuller

Via arXiv

Textual Backdoor Attacks with Iterative Trigger Injection

May 25, 2022
Jun Yan, Vansh Gupta, Xiang Ren

Via arXiv

Key-Sparse Transformer with Cascaded Cross-Attention Block for Multimodal Speech Emotion Recognition

Jun 22, 2021
Weidong Chen, Xiaofeng Xing, Xiangmin Xu, Jichen Yang, Jianxin Pang

Via arXiv