Alert button

"speech": models, code, and papers
Alert button

VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion

Add code
Bookmark button
Alert button
Feb 18, 2022
Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng

Figure 1 for VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion
Figure 2 for VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion
Figure 3 for VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion
Figure 4 for VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion
Viaarxiv icon

Metric Learning for User-defined Keyword Spotting

Nov 01, 2022
Jaemin Jung, Youkyum Kim, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Youngjoon Jang, Joon Son Chung

Figure 1 for Metric Learning for User-defined Keyword Spotting
Figure 2 for Metric Learning for User-defined Keyword Spotting
Figure 3 for Metric Learning for User-defined Keyword Spotting
Figure 4 for Metric Learning for User-defined Keyword Spotting
Viaarxiv icon

Multi-Dialect Arabic Speech Recognition

Dec 25, 2021
Abbas Raza Ali

Figure 1 for Multi-Dialect Arabic Speech Recognition
Figure 2 for Multi-Dialect Arabic Speech Recognition
Figure 3 for Multi-Dialect Arabic Speech Recognition
Figure 4 for Multi-Dialect Arabic Speech Recognition
Viaarxiv icon

BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts

Add code
Bookmark button
Alert button
Jun 01, 2022
Nauros Romim, Mosahed Ahmed, Md. Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin

Figure 1 for BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts
Figure 2 for BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts
Figure 3 for BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts
Figure 4 for BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts
Viaarxiv icon

Efficient Use of Large Pre-Trained Models for Low Resource ASR

Add code
Bookmark button
Alert button
Oct 26, 2022
Peter Vieting, Christoph Lüscher, Julian Dierkes, Ralf Schlüter, Hermann Ney

Figure 1 for Efficient Use of Large Pre-Trained Models for Low Resource ASR
Figure 2 for Efficient Use of Large Pre-Trained Models for Low Resource ASR
Figure 3 for Efficient Use of Large Pre-Trained Models for Low Resource ASR
Figure 4 for Efficient Use of Large Pre-Trained Models for Low Resource ASR
Viaarxiv icon

Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting

Oct 26, 2022
Yuxuan Du, Ruohua Zhou

Figure 1 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Figure 2 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Figure 3 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Figure 4 for Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting
Viaarxiv icon

Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Copora with Morphological Annotations

Dec 17, 2022
Mustafa Jarrar, Fadi A Zaraket, Tymaa Hammouda, Daanish Masood Alavi, Martin Waahlisch

Figure 1 for Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Copora with Morphological Annotations
Figure 2 for Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Copora with Morphological Annotations
Figure 3 for Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Copora with Morphological Annotations
Figure 4 for Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Copora with Morphological Annotations
Viaarxiv icon

Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis

Apr 07, 2022
Yutian Wang, Yuankun Xie, Kun Zhao, Hui Wang, Qin Zhang

Figure 1 for Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis
Figure 2 for Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis
Figure 3 for Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis
Figure 4 for Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis
Viaarxiv icon

Membership Inference Attacks Against Self-supervised Speech Models

Add code
Bookmark button
Alert button
Nov 09, 2021
Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee

Figure 1 for Membership Inference Attacks Against Self-supervised Speech Models
Figure 2 for Membership Inference Attacks Against Self-supervised Speech Models
Figure 3 for Membership Inference Attacks Against Self-supervised Speech Models
Figure 4 for Membership Inference Attacks Against Self-supervised Speech Models
Viaarxiv icon

Counter Hate Speech in Social Media: A Survey

Feb 21, 2022
Dana Alsagheer, Hadi Mansourifar, Weidong Shi

Viaarxiv icon