"speech": models, code, and papers

Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound

Aug 14, 2020
Jianbo Jiao, Yifan Cai, Mohammad Alsharid, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble

Via arXiv

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

Apr 09, 2021
W. Ronny Huang, Tara N. Sainath, Cal Peyser, Shankar Kumar, David Rybach, Trevor Strohman

Via arXiv

Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers

Oct 22, 2020
Zeqian Li, Jacob Whitehill

Via arXiv

Hindi-English Code-Switching Speech Corpus

Sep 24, 2018
Ganji Sreeram, Kunal Dhawan, Rohit Sinha

Via arXiv

Memory-Efficient Training of RNN-Transducer with Sampled Softmax

Mar 31, 2022
Jaesong Lee, Lukas Lee, Shinji Watanabe

Via arXiv

Towards countering hate speech and personal attack in social media

Dec 05, 2019
Polychronis Charitidis, Stavros Doropoulos, Stavros Vologiannidis, Ioannis Papastergiou, Sophia Karakeva

Via arXiv

HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Jun 10, 2020
Jiaqi Su, Zeyu Jin, Adam Finkelstein

Via arXiv

DNNAbacus: Toward Accurate Computational Cost Prediction for Deep Neural Networks

May 24, 2022
Lu Bai, Weixing Ji, Qinyuan Li, Xilai Yao, Wei Xin, Wanyi Zhu

Via arXiv

FastSpeech: Fast, Robust and Controllable Text to Speech

May 29, 2019
Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

Via arXiv

SpliceOut: A Simple and Efficient Audio Augmentation Method

Sep 30, 2021
Arjit Jain, Pranay Reddy Samala, Deepak Mittal, Preethi Jyoti, Maneesh Singh

Via arXiv