"speech": models, code, and papers

Visual Speech Recognition for Multiple Languages in the Wild

Feb 26, 2022
Pingchuan Ma, Stavros Petridis, Maja Pantic

Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters

Jun 19, 2021
Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh

Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement

Mar 30, 2022
Guochen Yu, Andong Li, Wenzhe Liu, Chengshi Zheng, Yutian Wang, Hui Wang

Wide band sub-band speech coding using nonlinear prediction

Mar 24, 2022
Marcos Faundez-Zanuy

Phoneme Segmentation Using Self-Supervised Speech Models

Nov 02, 2022
Luke Strgar, David Harwath

Handling Bias in Toxic Speech Detection: A Survey

Feb 02, 2022
Tanmay Garg, Sarah Masud, Tharun Suresh, Tanmoy Chakraborty

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

Oct 27, 2022
Piyush Behre, Sharman Tan, Amy Shah, Harini Kesavamoorthy, Shuangyu Chang, Fei Zuo, Chris Basoglu, Sayan Pathak

SAN: a robust end-to-end ASR model architecture

Oct 27, 2022
Zeping Min, Qian Ge, Guanhua Huang

Text Independent Speaker Identification System for Access Control

Sep 26, 2022
Oluyemi E. Adetoyi

Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking

Apr 19, 2022
Jinghui Xu, Jiangshan Zhang, Jifeng Zhu, Yong Yang
