Alert button

"speech": models, code, and papers
Alert button

Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting

Oct 26, 2022
Yuxuan Du, Ruohua Zhou

Figure 1 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Figure 2 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Figure 3 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Figure 4 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Viaarxiv icon

StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles

Add code
Bookmark button
Alert button
Jan 03, 2023
Yifeng Ma, Suzhen Wang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Zhidong Deng, Xin Yu

Figure 1 for StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Figure 2 for StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Figure 3 for StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Figure 4 for StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
Viaarxiv icon

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

Add code
Bookmark button
Alert button
Dec 16, 2022
Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Petr Motlicek, Alexei V. Ivanov, Aravind Ganapathiraju

Figure 1 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 2 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 3 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 4 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Viaarxiv icon

Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking

Add code
Bookmark button
Alert button
Apr 19, 2022
Jinghui Xu, Jiangshan Zhang, Jifeng Zhu, Yong Yang

Figure 1 for Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking
Figure 2 for Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking
Figure 3 for Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking
Figure 4 for Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking
Viaarxiv icon

Handling Bias in Toxic Speech Detection: A Survey

Add code
Bookmark button
Alert button
Feb 02, 2022
Tanmay Garg, Sarah Masud, Tharun Suresh, Tanmoy Chakraborty

Figure 1 for Handling Bias in Toxic Speech Detection: A Survey
Figure 2 for Handling Bias in Toxic Speech Detection: A Survey
Figure 3 for Handling Bias in Toxic Speech Detection: A Survey
Figure 4 for Handling Bias in Toxic Speech Detection: A Survey
Viaarxiv icon

Predicting Knowledge Gain for MOOC Video Consumption

Add code
Bookmark button
Alert button
Dec 13, 2022
Christian Otto, Markos Stamatakis, Anett Hoppe, Ralph Ewerth

Viaarxiv icon

Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters

Jun 19, 2021
Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh

Figure 1 for Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters
Figure 2 for Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters
Figure 3 for Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters
Figure 4 for Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters
Viaarxiv icon

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

Add code
Bookmark button
Alert button
Feb 07, 2022
Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli

Figure 1 for data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Figure 2 for data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Figure 3 for data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Figure 4 for data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Viaarxiv icon

Are word boundaries useful for unsupervised language learning?

Add code
Bookmark button
Alert button
Oct 06, 2022
Tu Anh Nguyen, Maureen de Seyssel, Robin Algayres, Patricia Roze, Ewan Dunbar, Emmanuel Dupoux

Figure 1 for Are word boundaries useful for unsupervised language learning?
Figure 2 for Are word boundaries useful for unsupervised language learning?
Figure 3 for Are word boundaries useful for unsupervised language learning?
Figure 4 for Are word boundaries useful for unsupervised language learning?
Viaarxiv icon

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

Add code
Bookmark button
Alert button
Nov 19, 2021
Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli

Figure 1 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Figure 2 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Figure 3 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Figure 4 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Viaarxiv icon