"speech": models, code, and papers

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Oct 31, 2022
Liyong Guo, Xiaoyu Yang, Quandong Wang, Yuxiang Kong, Zengwei Yao, Fan Cui, Fangjun Kuang, Wei Kang, Long Lin, Mingshuang Luo, Piotr Zelasko, Daniel Povey

FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis

Sep 27, 2021
Manh Luong, Viet Anh Tran

Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

Aug 10, 2022
Sania Gul, Muhammad Salman Khan, Syed Waqar Shah

Dürfen Maschinen denken (können)? Warum Künstliche Intelligenz eine Ethik braucht. (Are Machines Allowed to (be able to) Think? Why Artificial Intelligence Needs Ethics)

Aug 15, 2022
Karsten Wendland

The NTNU System for Formosa Speech Recognition Challenge 2020

Apr 14, 2021
Fu-An Chao, Tien-Hong Lo, Shi-Yan Weng, Shih-Hsuan Chiu, Yao-Ting Sung, Berlin Chen

HiFi-WaveGAN: Generative Adversarial Network with Auxiliary Spectrogram-Phase Loss for High-Fidelity Singing Voice Generation

Oct 26, 2022
Chunhui Wang, Chang Zeng, Xing He

CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations

Oct 05, 2022
Vasista Sai Lodagala, Sreyan Ghosh, S. Umesh

ML-Based Analysis to Identify Speech Features Relevant in Predicting Alzheimer's Disease

Oct 25, 2021
Yash Kumar, Piyush Maheshwari, Shreyansh Joshi, Veeky Baths

Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021

Jun 22, 2021
Xingshan Zeng, Liangyou Li, Qun Liu

Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios

Dec 23, 2021
Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, Guanglu Wan
