Alert button

"speech": models, code, and papers
Alert button

Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit

Aug 13, 2020
Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao

Figure 1 for Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Figure 2 for Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Figure 3 for Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Figure 4 for Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Viaarxiv icon

CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution

Add code
Bookmark button
Alert button
Jul 04, 2022
Taeho Kim, Yongin Kwon, Jemin Lee, Taeho Kim, Sangtae Ha

Figure 1 for CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
Figure 2 for CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
Figure 3 for CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
Figure 4 for CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
Viaarxiv icon

Thutmose Tagger: Single-pass neural model for Inverse Text Normalization

Add code
Bookmark button
Alert button
Jul 29, 2022
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg

Figure 1 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Figure 2 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Figure 3 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Figure 4 for Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Viaarxiv icon

Multi-Task Learning with Sentiment, Emotion, and Target Detection to Recognize Hate Speech and Offensive Language

Add code
Bookmark button
Alert button
Sep 21, 2021
Flor Miriam Plaza-del-Arco, Sercan Halat, Sebastian Padó, Roman Klinger

Figure 1 for Multi-Task Learning with Sentiment, Emotion, and Target Detection to Recognize Hate Speech and Offensive Language
Figure 2 for Multi-Task Learning with Sentiment, Emotion, and Target Detection to Recognize Hate Speech and Offensive Language
Figure 3 for Multi-Task Learning with Sentiment, Emotion, and Target Detection to Recognize Hate Speech and Offensive Language
Figure 4 for Multi-Task Learning with Sentiment, Emotion, and Target Detection to Recognize Hate Speech and Offensive Language
Viaarxiv icon

Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

May 14, 2021
Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo

Figure 1 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Figure 2 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Figure 3 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Figure 4 for Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
Viaarxiv icon

Complex Spectral Mapping With Attention Based Convolution Recrrent Neural Network for Speech Enhancement

Add code
Bookmark button
Alert button
Apr 12, 2021
Liming Zhou, Yongyu Gao, Ziluo Wang, Jiwei Li, Wenbin Zhang

Figure 1 for Complex Spectral Mapping With Attention Based Convolution Recrrent Neural Network for Speech Enhancement
Figure 2 for Complex Spectral Mapping With Attention Based Convolution Recrrent Neural Network for Speech Enhancement
Figure 3 for Complex Spectral Mapping With Attention Based Convolution Recrrent Neural Network for Speech Enhancement
Figure 4 for Complex Spectral Mapping With Attention Based Convolution Recrrent Neural Network for Speech Enhancement
Viaarxiv icon

Data augmentation for learning predictive models on EEG: a systematic comparison

Add code
Bookmark button
Alert button
Jun 29, 2022
Cédric Rommel, Joseph Paillard, Thomas Moreau, Alexandre Gramfort

Figure 1 for Data augmentation for learning predictive models on EEG: a systematic comparison
Figure 2 for Data augmentation for learning predictive models on EEG: a systematic comparison
Figure 3 for Data augmentation for learning predictive models on EEG: a systematic comparison
Figure 4 for Data augmentation for learning predictive models on EEG: a systematic comparison
Viaarxiv icon

Hidden bawls, whispers, and yelps: can text be made to sound more than just its words?

Feb 22, 2022
Caluã de Lacerda Pataca, Paula Dornhofer Paro Costa

Figure 1 for Hidden bawls, whispers, and yelps: can text be made to sound more than just its words?
Figure 2 for Hidden bawls, whispers, and yelps: can text be made to sound more than just its words?
Figure 3 for Hidden bawls, whispers, and yelps: can text be made to sound more than just its words?
Figure 4 for Hidden bawls, whispers, and yelps: can text be made to sound more than just its words?
Viaarxiv icon

Cyclic Defense GAN Against Speech Adversarial Attacks

Add code
Bookmark button
Alert button
Mar 26, 2021
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Figure 1 for Cyclic Defense GAN Against Speech Adversarial Attacks
Figure 2 for Cyclic Defense GAN Against Speech Adversarial Attacks
Viaarxiv icon

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Add code
Bookmark button
Alert button
Sep 09, 2020
Quan Wang, Ignacio Lopez Moreno, Mert Saglam, Kevin Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein

Figure 1 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Figure 2 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Figure 3 for VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Viaarxiv icon