Alert button

"speech": models, code, and papers
Alert button

Principal components variable importance reconstruction (PC-VIR): Exploring predictive importance in multicollinear acoustic speech data

Feb 09, 2021
Christopher Carignan, Ander Egurtzegi

Figure 1 for Principal components variable importance reconstruction (PC-VIR): Exploring predictive importance in multicollinear acoustic speech data
Figure 2 for Principal components variable importance reconstruction (PC-VIR): Exploring predictive importance in multicollinear acoustic speech data
Figure 3 for Principal components variable importance reconstruction (PC-VIR): Exploring predictive importance in multicollinear acoustic speech data
Viaarxiv icon

Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset

Oct 22, 2020
Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li

Figure 1 for Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Figure 2 for Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Figure 3 for Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Figure 4 for Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Viaarxiv icon

Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech

Apr 14, 2021
Yixuan Zhou, Changhe Song, Jingbei Li, Zhiyong Wu, Helen Meng

Figure 1 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Figure 2 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Figure 3 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Figure 4 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Viaarxiv icon

Meta Learning for End-to-End Low-Resource Speech Recognition

Oct 26, 2019
Jui-Yang Hsu, Yuan-Jui Chen, Hung-yi Lee

Figure 1 for Meta Learning for End-to-End Low-Resource Speech Recognition
Figure 2 for Meta Learning for End-to-End Low-Resource Speech Recognition
Figure 3 for Meta Learning for End-to-End Low-Resource Speech Recognition
Figure 4 for Meta Learning for End-to-End Low-Resource Speech Recognition
Viaarxiv icon

Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset

Nov 01, 2021
Soham Tiwari, Kshitiz Lakhotia, Manjunath Mulimani

Figure 1 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Figure 2 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Figure 3 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Figure 4 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Viaarxiv icon

Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis

Mar 27, 2019
Noé Tits, Fengna Wang, Kevin El Haddad, Vincent Pagel, Thierry Dutoit

Figure 1 for Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis
Figure 2 for Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis
Figure 3 for Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis
Figure 4 for Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis
Viaarxiv icon

T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events

Feb 07, 2022
Shu Wang, Yuhuang Hu, Shih-Chii Liu

Figure 1 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 2 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 3 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 4 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Viaarxiv icon

Parts of Speech Tagging in NLP: Runtime Optimization with Quantum Formulation and ZX Calculus

Jul 19, 2020
Arit Kumar Bishwas, Ashish Mani, Vasile Palade

Figure 1 for Parts of Speech Tagging in NLP: Runtime Optimization with Quantum Formulation and ZX Calculus
Figure 2 for Parts of Speech Tagging in NLP: Runtime Optimization with Quantum Formulation and ZX Calculus
Viaarxiv icon

"I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition

May 15, 2020
Mostafa M. Mohamed, Björn W. Schuller

Figure 1 for "I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition
Figure 2 for "I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition
Figure 3 for "I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition
Figure 4 for "I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition
Viaarxiv icon

AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition

Nov 27, 2019
Yi-Chen Chen, Zhaojun Yang, Ching-Feng Yeh, Mahaveer Jain, Michael L. Seltzer

Figure 1 for AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition
Figure 2 for AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition
Figure 3 for AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition
Figure 4 for AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition
Viaarxiv icon