Alert button

"speech": models, code, and papers
Alert button

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

Mar 20, 2022
Yufeng Yang, Peidong Wang, DeLiang Wang

Figure 1 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 2 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 3 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Figure 4 for A Conformer Based Acoustic Model for Robust Automatic Speech Recognition
Viaarxiv icon

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Add code
Bookmark button
Alert button
Nov 17, 2022
Hyeong-Seok Choi, Jinhyeok Yang, Juheon Lee, Hyeongju Kim

Figure 1 for NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Figure 2 for NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Figure 3 for NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Figure 4 for NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Viaarxiv icon

Mel Spectrogram Inversion with Stable Pitch

Add code
Bookmark button
Alert button
Aug 26, 2022
Bruno Di Giorgi, Mark Levy, Richard Sharp

Figure 1 for Mel Spectrogram Inversion with Stable Pitch
Figure 2 for Mel Spectrogram Inversion with Stable Pitch
Figure 3 for Mel Spectrogram Inversion with Stable Pitch
Figure 4 for Mel Spectrogram Inversion with Stable Pitch
Viaarxiv icon

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition

Oct 09, 2021
Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe

Figure 1 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Figure 2 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Figure 3 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Figure 4 for An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Viaarxiv icon

A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus

Apr 03, 2022
Seth Kulick, Neville Ryant, Beatrice Santorini, Joel Wallenberg

Figure 1 for A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus
Figure 2 for A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus
Figure 3 for A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus
Figure 4 for A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus
Viaarxiv icon

NWPU-ASLP System for the VoicePrivacy 2022 Challenge

Add code
Bookmark button
Alert button
Sep 24, 2022
Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie

Figure 1 for NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Figure 2 for NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Figure 3 for NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Figure 4 for NWPU-ASLP System for the VoicePrivacy 2022 Challenge
Viaarxiv icon

FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Add code
Bookmark button
Alert button
Dec 10, 2021
Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura

Figure 1 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Figure 2 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Figure 3 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Figure 4 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Viaarxiv icon

Detecting Dementia from Speech and Transcripts using Transformers

Oct 27, 2021
Loukas Ilias, Dimitris Askounis, John Psarras

Figure 1 for Detecting Dementia from Speech and Transcripts using Transformers
Figure 2 for Detecting Dementia from Speech and Transcripts using Transformers
Figure 3 for Detecting Dementia from Speech and Transcripts using Transformers
Figure 4 for Detecting Dementia from Speech and Transcripts using Transformers
Viaarxiv icon

SpeechNet: A Universal Modularized Model for Speech Processing Tasks

Add code
Bookmark button
Alert button
May 31, 2021
Yi-Chen Chen, Po-Han Chi, Shu-wen Yang, Kai-Wei Chang, Jheng-hao Lin, Sung-Feng Huang, Da-Rong Liu, Chi-Liang Liu, Cheng-Kuang Lee, Hung-yi Lee

Figure 1 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Figure 2 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Figure 3 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Figure 4 for SpeechNet: A Universal Modularized Model for Speech Processing Tasks
Viaarxiv icon

Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Add code
Bookmark button
Alert button
Jun 06, 2021
Dongchan Min, Dong Bok Lee, Eunho Yang, Sung Ju Hwang

Figure 1 for Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Figure 2 for Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Figure 3 for Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Figure 4 for Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Viaarxiv icon