"speech": models, code, and papers

Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse

Apr 03, 2024
Vibhor Agarwal, Aravindh Raman, Nishanth Sastry, Ahmed M. Abdelmoniem, Gareth Tyson, Ignacio Castro

Phonetic Segmentation of the UCLA Phonetics Lab Archive

Mar 28, 2024
Eleanor Chodroff, Blaž Pažon, Annie Baker, Steven Moran

Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers

Mar 12, 2024
Changsheng Quan, Xiaofei Li

Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator

Mar 25, 2024
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka

PhonologyBench: Evaluating Phonological Skills of Large Language Models

Apr 05, 2024
Ashima Suvarna, Harshita Khandelwal, Nanyun Peng

Speech Robust Bench: A Robustness Benchmark For Speech Recognition

Mar 08, 2024
Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Nicolas Kourtellis

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Mar 21, 2024
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

The evaluation of a code-switched Sepedi-English automatic speech recognition system

Mar 11, 2024
Amanda Phaladi, Thipe Modipa

Inappropriate Pause Detection In Dysarthric Speech Using Large-Scale Speech Recognition

Feb 29, 2024
Jeehyun Lee, Yerin Choi, Tae-Jin Song, Myoung-Wan Koo

BanglaAutoKG: Automatic Bangla Knowledge Graph Construction with Semantic Neural Graph Filtering

Apr 05, 2024
Azmine Toushik Wasi, Taki Hasan Rafi, Raima Islam, Dong-Kyu Chae
