Alert button

"speech": models, code, and papers
Alert button

Accented Speech Recognition With Accent-specific Codebooks

Add code
Bookmark button
Alert button
Oct 27, 2023
Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni

Figure 1 for Accented Speech Recognition With Accent-specific Codebooks
Figure 2 for Accented Speech Recognition With Accent-specific Codebooks
Figure 3 for Accented Speech Recognition With Accent-specific Codebooks
Figure 4 for Accented Speech Recognition With Accent-specific Codebooks
Viaarxiv icon

SpokesBiz -- an Open Corpus of Conversational Polish

Dec 19, 2023
Piotr Pęzik, Sylwia Karasińska, Anna Cichosz, Łukasz Jałowiecki, Konrad Kaczyński, Małgorzata Krawentek, Karolina Walkusz, Paweł Wilk, Mariusz Kleć, Krzysztof Szklanny, Szymon Marszałkowski

Viaarxiv icon

DCHT: Deep Complex Hybrid Transformer for Speech Enhancement

Oct 30, 2023
Jialu Li, Junhui Li, Pu Wang, Youshan Zhang

Figure 1 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Figure 2 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Figure 3 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Figure 4 for DCHT: Deep Complex Hybrid Transformer for Speech Enhancement
Viaarxiv icon

Enhancing Spoofing Speech Detection Using Rhythm Information

Add code
Bookmark button
Alert button
Oct 18, 2023
Jingze Lu, Yuxiang Zhang, Wenchao Wang, Zengqiang Shang, Pengyuan Zhang

Figure 1 for Enhancing Spoofing Speech Detection Using Rhythm Information
Figure 2 for Enhancing Spoofing Speech Detection Using Rhythm Information
Figure 3 for Enhancing Spoofing Speech Detection Using Rhythm Information
Figure 4 for Enhancing Spoofing Speech Detection Using Rhythm Information
Viaarxiv icon

Detecting Speech Abnormalities with a Perceiver-based Sequence Classifier that Leverages a Universal Speech Model

Add code
Bookmark button
Alert button
Oct 16, 2023
Hagen Soltau, Izhak Shafran, Alex Ottenwess, Joseph R. JR Duffy, Rene L. Utianski, Leland R. Barnard, John L. Stricker, Daniela Wiepert, David T. Jones, Hugo Botha

Viaarxiv icon

Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning

Dec 02, 2023
Raviraj Joshi, Nikesh Garera

Figure 1 for Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning
Figure 2 for Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning
Figure 3 for Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning
Figure 4 for Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning
Viaarxiv icon

Automatic Textual Normalization for Hate Speech Detection

Nov 15, 2023
Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Nguyet Thi Nguyen, Khanh Thanh-Duy Ho, Kiet Van Nguyen

Viaarxiv icon

Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition

Add code
Bookmark button
Alert button
Nov 06, 2023
Rabindra Nath Nandi, Mehadi Hasan Menon, Tareq Al Muntasir, Sagor Sarker, Quazi Sarwar Muhtaseem, Md. Tariqul Islam, Shammur Absar Chowdhury, Firoj Alam

Viaarxiv icon

Cross-Linguistic Offensive Language Detection: BERT-Based Analysis of Bengali, Assamese, & Bodo Conversational Hateful Content from Social Media

Dec 16, 2023
Jhuma Kabir Mim, Mourad Oussalah, Akash Singhal

Viaarxiv icon

UniX-Encoder: A Universal $X$-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing

Oct 25, 2023
Zili Huang, Yiwen Shao, Shi-Xiong Zhang, Dong Yu

Viaarxiv icon