Alert button

"speech": models, code, and papers
Alert button

Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition

Add code
Bookmark button
Alert button
Jul 01, 2021
Qiujia Li, Chao Zhang, Philip C. Woodland

Figure 1 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Figure 2 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Figure 3 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Figure 4 for Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition
Viaarxiv icon

Computing with Hypervectors for Efficient Speaker Identification

Aug 28, 2022
Ping-Chen Huang, Denis Kleyko, Jan M. Rabaey, Bruno A. Olshausen, Pentti Kanerva

Figure 1 for Computing with Hypervectors for Efficient Speaker Identification
Figure 2 for Computing with Hypervectors for Efficient Speaker Identification
Figure 3 for Computing with Hypervectors for Efficient Speaker Identification
Figure 4 for Computing with Hypervectors for Efficient Speaker Identification
Viaarxiv icon

Revisiting IPA-based Cross-lingual Text-to-speech

Add code
Bookmark button
Alert button
Oct 18, 2021
Haitong Zhang, Haoyue Zhan, Yang Zhang, Xinyuan Yu, Yue Lin

Figure 1 for Revisiting IPA-based Cross-lingual Text-to-speech
Figure 2 for Revisiting IPA-based Cross-lingual Text-to-speech
Figure 3 for Revisiting IPA-based Cross-lingual Text-to-speech
Figure 4 for Revisiting IPA-based Cross-lingual Text-to-speech
Viaarxiv icon

Data Augmentation for Low-Resource Quechua ASR Improvement

Add code
Bookmark button
Alert button
Jul 14, 2022
Rodolfo Zevallos, Nuria Bel, Guillermo Cámbara, Mireia Farrús, Jordi Luque

Figure 1 for Data Augmentation for Low-Resource Quechua ASR Improvement
Figure 2 for Data Augmentation for Low-Resource Quechua ASR Improvement
Figure 3 for Data Augmentation for Low-Resource Quechua ASR Improvement
Figure 4 for Data Augmentation for Low-Resource Quechua ASR Improvement
Viaarxiv icon

Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker Verification

Add code
Bookmark button
Alert button
Dec 27, 2021
Joon-Young Yang, Joon-Hyuk Chang

Figure 1 for Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker Verification
Figure 2 for Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker Verification
Figure 3 for Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker Verification
Figure 4 for Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker Verification
Viaarxiv icon

Learning ASR pathways: A sparse multilingual ASR model

Sep 13, 2022
Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli

Figure 1 for Learning ASR pathways: A sparse multilingual ASR model
Figure 2 for Learning ASR pathways: A sparse multilingual ASR model
Figure 3 for Learning ASR pathways: A sparse multilingual ASR model
Figure 4 for Learning ASR pathways: A sparse multilingual ASR model
Viaarxiv icon

Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion

Add code
Bookmark button
Alert button
Aug 13, 2020
Dipjyoti Paul, Muhammed PV Shifas, Yannis Pantazis, Yannis Stylianou

Figure 1 for Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
Figure 2 for Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
Figure 3 for Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
Figure 4 for Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
Viaarxiv icon

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Add code
Bookmark button
Alert button
Dec 18, 2020
Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee

Figure 1 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Figure 2 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Figure 3 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Figure 4 for HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Viaarxiv icon

Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis

Add code
Bookmark button
Alert button
Sep 08, 2021
Songxiang Liu, Shan Yang, Dan Su, Dong Yu

Figure 1 for Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis
Figure 2 for Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis
Figure 3 for Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis
Figure 4 for Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis
Viaarxiv icon

MM-ALT: A Multimodal Automatic Lyric Transcription System

Add code
Bookmark button
Alert button
Jul 13, 2022
Xiangming Gu, Longshen Ou, Danielle Ong, Ye Wang

Figure 1 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Figure 2 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Figure 3 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Figure 4 for MM-ALT: A Multimodal Automatic Lyric Transcription System
Viaarxiv icon