Alert button

"speech": models, code, and papers
Alert button

Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network

Add code
Bookmark button
Alert button
Sep 22, 2021
Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari

Figure 1 for Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
Figure 2 for Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
Figure 3 for Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
Figure 4 for Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
Viaarxiv icon

An Improved Model for Voicing Silent Speech

Add code
Bookmark button
Alert button
Jun 21, 2021
David Gaddy, Dan Klein

Figure 1 for An Improved Model for Voicing Silent Speech
Figure 2 for An Improved Model for Voicing Silent Speech
Figure 3 for An Improved Model for Voicing Silent Speech
Figure 4 for An Improved Model for Voicing Silent Speech
Viaarxiv icon

Speech Recognition with Augmented Synthesized Speech

Sep 25, 2019
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro Moreno, Yonghui Wu, Zelin Wu

Figure 1 for Speech Recognition with Augmented Synthesized Speech
Figure 2 for Speech Recognition with Augmented Synthesized Speech
Figure 3 for Speech Recognition with Augmented Synthesized Speech
Figure 4 for Speech Recognition with Augmented Synthesized Speech
Viaarxiv icon

WARP-Q: Quality Prediction For Generative Neural Speech Codecs

Add code
Bookmark button
Alert button
Feb 20, 2021
Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines

Figure 1 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Figure 2 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Figure 3 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Figure 4 for WARP-Q: Quality Prediction For Generative Neural Speech Codecs
Viaarxiv icon

Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding

Jun 16, 2021
Si-Ioi Ng, Cymie Wing-Yee Ng, Jingyu Li, Tan Lee

Figure 1 for Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding
Figure 2 for Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding
Figure 3 for Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding
Figure 4 for Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding
Viaarxiv icon

Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech

Sep 01, 2021
Tomer Wullach, Amir Adler, Einat Minkov

Figure 1 for Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Figure 2 for Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Figure 3 for Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Figure 4 for Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Viaarxiv icon

FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs

Add code
Bookmark button
Alert button
Nov 20, 2022
Hossein Katebi, Navidreza Asadi, Maziar Goudarzi

Figure 1 for FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs
Figure 2 for FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs
Figure 3 for FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs
Figure 4 for FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs
Viaarxiv icon

Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages

Add code
Bookmark button
Alert button
Dec 17, 2021
Thomas Mandl, Sandip Modha, Gautam Kishore Shahi, Hiren Madhu, Shrey Satapara, Prasenjit Majumder, Johannes Schaefer, Tharindu Ranasinghe, Marcos Zampieri, Durgesh Nandini, Amit Kumar Jaiswal

Figure 1 for Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages
Figure 2 for Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages
Figure 3 for Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages
Figure 4 for Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages
Viaarxiv icon

Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations

Oct 28, 2021
Hyeong-Seok Choi, Juheon Lee, Wansoo Kim, Jie Hwan Lee, Hoon Heo, Kyogu Lee

Figure 1 for Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Figure 2 for Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Figure 3 for Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Figure 4 for Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Viaarxiv icon

A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data

Jun 22, 2022
Raviraj Joshi, Anupam Singh

Figure 1 for A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Figure 2 for A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Figure 3 for A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Viaarxiv icon