Alert button

"speech": models, code, and papers
Alert button

Deep Learning For Prominence Detection In Children's Read Speech

Oct 27, 2021
Mithilesh Vaidya, Kamini Sabu, Preeti Rao

Figure 1 for Deep Learning For Prominence Detection In Children's Read Speech
Figure 2 for Deep Learning For Prominence Detection In Children's Read Speech
Figure 3 for Deep Learning For Prominence Detection In Children's Read Speech
Figure 4 for Deep Learning For Prominence Detection In Children's Read Speech
Viaarxiv icon

A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection

May 11, 2022
Otavio Braga, Olivier Siohan

Figure 1 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Figure 2 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Figure 3 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Figure 4 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Viaarxiv icon

Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts

Add code
Bookmark button
Alert button
Sep 25, 2021
Raluca Alexandra Fetic, Mikkel Jordahn, Lucas Chaves Lima, Rasmus Arpe Fogh Egebæk, Martin Carsten Nielsen, Benjamin Biering, Lars Kai Hansen

Figure 1 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Figure 2 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Figure 3 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Figure 4 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Viaarxiv icon

Proficiency assessment of L2 spoken English using wav2vec 2.0

Oct 24, 2022
Stefano Bannò, Marco Matassoni

Figure 1 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Figure 2 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Figure 3 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Figure 4 for Proficiency assessment of L2 spoken English using wav2vec 2.0
Viaarxiv icon

PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations

Add code
Bookmark button
Alert button
Mar 31, 2022
Lodagala V S V Durga Prasad, Sreyan Ghosh, S. Umesh

Figure 1 for PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
Figure 2 for PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
Figure 3 for PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
Viaarxiv icon

Silent Speech Interfaces for Speech Restoration: A Review

Sep 04, 2020
Jose A. Gonzalez-Lopez, Alejandro Gomez-Alanis, Juan M. Martín-Doñas, José L. Pérez-Córdoba, Angel M. Gomez

Figure 1 for Silent Speech Interfaces for Speech Restoration: A Review
Figure 2 for Silent Speech Interfaces for Speech Restoration: A Review
Figure 3 for Silent Speech Interfaces for Speech Restoration: A Review
Figure 4 for Silent Speech Interfaces for Speech Restoration: A Review
Viaarxiv icon

Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective

Jun 29, 2022
Qingcheng Zeng, Dading Chong, Peilin Zhou, Jie Yang

Figure 1 for Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Figure 2 for Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Figure 3 for Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Figure 4 for Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Viaarxiv icon

MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment

Apr 02, 2021
Meng Yu, Chunlei Zhang, Yong Xu, Shixiong Zhang, Dong Yu

Figure 1 for MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Figure 2 for MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Figure 3 for MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Figure 4 for MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
Viaarxiv icon

Towards Multi-Scale Style Control for Expressive Speech Synthesis

Add code
Bookmark button
Alert button
Apr 08, 2021
Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen Meng

Figure 1 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Figure 2 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Figure 3 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Figure 4 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Viaarxiv icon

Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling

Add code
Bookmark button
Alert button
Oct 27, 2022
Peijie Jiang, Dingkun Long, Yanzhao Zhang, Pengjun Xie, Meishan Zhang, Min Zhang

Figure 1 for Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling
Figure 2 for Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling
Figure 3 for Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling
Figure 4 for Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling
Viaarxiv icon