Alert button

"speech": models, code, and papers
Alert button

A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS

Add code
Bookmark button
Alert button
Sep 22, 2022
Haohan Guo, Fenglong Xie, Frank K. Soong, Xixin Wu, Helen Meng

Figure 1 for A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
Figure 2 for A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
Figure 3 for A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
Figure 4 for A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
Viaarxiv icon

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task

Add code
Bookmark button
Alert button
Aug 14, 2021
Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Pino, Holger Schwenk, Naman Goyal

Figure 1 for FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Figure 2 for FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Figure 3 for FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Figure 4 for FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Viaarxiv icon

Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids

Jun 06, 2022
Leandro A. Passos, João Paulo Papa, Ahsan Adeel

Figure 1 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Figure 2 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Figure 3 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Figure 4 for Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Future Audio-Visual Hearing Aids
Viaarxiv icon

Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems

Add code
Bookmark button
Alert button
Jun 18, 2022
Danwei Cai, Zexin Cai, Ming Li

Figure 1 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Figure 2 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Figure 3 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Figure 4 for Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Viaarxiv icon

Hate speech detection using static BERT embeddings

Jun 29, 2021
Gaurav Rajput, Narinder Singh punn, Sanjay Kumar Sonbhadra, Sonali Agarwal

Figure 1 for Hate speech detection using static BERT embeddings
Figure 2 for Hate speech detection using static BERT embeddings
Figure 3 for Hate speech detection using static BERT embeddings
Figure 4 for Hate speech detection using static BERT embeddings
Viaarxiv icon

BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement

Nov 17, 2021
Sunwoo Kim, Minje Kim

Figure 1 for BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Figure 2 for BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Figure 3 for BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Figure 4 for BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement
Viaarxiv icon

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

Mar 01, 2022
Yufeng Yang, Peidong Wang, DeLiang Wang

Figure 1 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 2 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 3 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 4 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Viaarxiv icon

Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation

Mar 25, 2022
Xue Yang, Changchun Bao

Figure 1 for Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Figure 2 for Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Figure 3 for Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Figure 4 for Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Viaarxiv icon

Cross-lingual Capsule Network for Hate Speech Detection in Social Media

Add code
Bookmark button
Alert button
Aug 06, 2021
Aiqi Jiang, Arkaitz Zubiaga

Figure 1 for Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Figure 2 for Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Figure 3 for Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Figure 4 for Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Viaarxiv icon

fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit

Add code
Bookmark button
Alert button
Sep 14, 2021
Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Pino

Figure 1 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Figure 2 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Figure 3 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Figure 4 for fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Viaarxiv icon