Alert button

"speech": models, code, and papers
Alert button

Speech prosody and remote experiments: a technical report

Jun 21, 2021
Giuseppe Magistro

Figure 1 for Speech prosody and remote experiments: a technical report
Figure 2 for Speech prosody and remote experiments: a technical report
Figure 3 for Speech prosody and remote experiments: a technical report
Figure 4 for Speech prosody and remote experiments: a technical report
Viaarxiv icon

Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition

Oct 29, 2021
Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover

Figure 1 for Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition
Figure 2 for Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition
Figure 3 for Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition
Figure 4 for Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition
Viaarxiv icon

WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis

Add code
Bookmark button
Alert button
Jun 20, 2022
Yi Wang, Yi Si

Figure 1 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Figure 2 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Figure 3 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Figure 4 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Viaarxiv icon

Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios

Sep 13, 2021
Raghavendra Pappagari, Piotr Żelasko, Agnieszka Mikołajczyk, Piotr Pęzik, Najim Dehak

Figure 1 for Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios
Figure 2 for Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios
Figure 3 for Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios
Figure 4 for Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios
Viaarxiv icon

Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder

Jul 09, 2022
Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang

Figure 1 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Figure 2 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Figure 3 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Figure 4 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Viaarxiv icon

Semantic Data Augmentation for End-to-End Mandarin Speech Recognition

Apr 26, 2021
Jianwei Sun, Zhiyuan Tang, Hengxin Yin, Wei Wang, Xi Zhao, Shuaijiang Zhao, Xiaoning Lei, Wei Zou, Xiangang Li

Figure 1 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Figure 2 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Figure 3 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Figure 4 for Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Viaarxiv icon

Correlation based Multi-phasal models for improved imagined speech EEG recognition

Nov 04, 2020
Rini A Sharon, Hema A Murthy

Figure 1 for Correlation based Multi-phasal models for improved imagined speech EEG recognition
Figure 2 for Correlation based Multi-phasal models for improved imagined speech EEG recognition
Figure 3 for Correlation based Multi-phasal models for improved imagined speech EEG recognition
Figure 4 for Correlation based Multi-phasal models for improved imagined speech EEG recognition
Viaarxiv icon

BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model

Add code
Bookmark button
Alert button
Jul 04, 2022
Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber

Figure 1 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Figure 2 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Figure 3 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Figure 4 for BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model
Viaarxiv icon

Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR

Feb 25, 2022
Nina Markl, Stephen Joseph McNulty

Viaarxiv icon

Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs

Add code
Bookmark button
Alert button
Jun 29, 2022
Bo-Kyeong Kim, Shinkook Choi, Hancheol Park

Figure 1 for Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs
Figure 2 for Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs
Figure 3 for Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs
Figure 4 for Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs
Viaarxiv icon