Alert button

"speech": models, code, and papers
Alert button

GNN-SL: Sequence Labeling Based on Nearest Examples via GNN

Add code
Bookmark button
Alert button
Dec 05, 2022
Shuhe Wang, Yuxian Meng, Rongbin Ouyang, Jiwei Li, Tianwei Zhang, Lingjuan Lyu, Guoyin Wang

Figure 1 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Figure 2 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Figure 3 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Figure 4 for GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Viaarxiv icon

All-neural beamformer for continuous speech separation

Oct 13, 2021
Zhuohuang Zhang, Takuya Yoshioka, Naoyuki Kanda, Zhuo Chen, Xiaofei Wang, Dongmei Wang, Sefik Emre Eskimez

Figure 1 for All-neural beamformer for continuous speech separation
Figure 2 for All-neural beamformer for continuous speech separation
Figure 3 for All-neural beamformer for continuous speech separation
Figure 4 for All-neural beamformer for continuous speech separation
Viaarxiv icon

Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Aug 27, 2021
Yuzi Yan, Wei-Qiang Zhang, Michael T. Johnson

Figure 1 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 2 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 3 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 4 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Viaarxiv icon

Speech Analysis for Automatic Mania Assessment in Bipolar Disorder

Feb 05, 2022
Pınar Baki, Heysem Kaya, Elvan Çiftçi, Hüseyin Güleç, Albert Ali Salah

Viaarxiv icon

Don't speak too fast: The impact of data bias on self-supervised speech models

Oct 15, 2021
Yen Meng, Yi-Hui Chou, Andy T. Liu, Hung-yi Lee

Figure 1 for Don't speak too fast: The impact of data bias on self-supervised speech models
Figure 2 for Don't speak too fast: The impact of data bias on self-supervised speech models
Figure 3 for Don't speak too fast: The impact of data bias on self-supervised speech models
Figure 4 for Don't speak too fast: The impact of data bias on self-supervised speech models
Viaarxiv icon

Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models

Nov 09, 2022
Travis M. Bartley, Fei Jia, Krishna C. Puvvada, Samuel Kriman, Boris Ginsburg

Figure 1 for Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models
Figure 2 for Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models
Figure 3 for Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models
Figure 4 for Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models
Viaarxiv icon

A comparison of several AI techniques for authorship attribution on Romanian texts

Add code
Bookmark button
Alert button
Nov 09, 2022
Sanda Maria Avram, Mihai Oltean

Figure 1 for A comparison of several AI techniques for authorship attribution on Romanian texts
Figure 2 for A comparison of several AI techniques for authorship attribution on Romanian texts
Figure 3 for A comparison of several AI techniques for authorship attribution on Romanian texts
Figure 4 for A comparison of several AI techniques for authorship attribution on Romanian texts
Viaarxiv icon

PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Add code
Bookmark button
Alert button
Sep 30, 2021
Yi Ren, Jinglin Liu, Zhou Zhao

Figure 1 for PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Figure 2 for PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Figure 3 for PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Figure 4 for PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Viaarxiv icon

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition

Jan 25, 2022
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

Figure 1 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 2 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 3 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 4 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Viaarxiv icon

AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style

Add code
Bookmark button
Alert button
Jul 06, 2021
Yuzi Yan, Xu Tan, Bohan Li, Guangyan Zhang, Tao Qin, Sheng Zhao, Yuan Shen, Wei-Qiang Zhang, Tie-Yan Liu

Figure 1 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 2 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 3 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 4 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Viaarxiv icon