Alert button

"speech": models, code, and papers
Alert button

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Add code
Bookmark button
Alert button
Oct 07, 2023
Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi

Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

GRASS: Unified Generation Model for Speech-to-Semantic Tasks

Add code
Bookmark button
Alert button
Sep 11, 2023
Aobo Xia, Shuyu Lei, Yushu Yang, Xiang Guo, Hua Chai

Figure 1 for GRASS: Unified Generation Model for Speech-to-Semantic Tasks
Figure 2 for GRASS: Unified Generation Model for Speech-to-Semantic Tasks
Figure 3 for GRASS: Unified Generation Model for Speech-to-Semantic Tasks
Viaarxiv icon

Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents

Sep 17, 2023
Carson Yu Liu, Gelareh Mohammadi, Yang Song, Wafa Johal

Figure 1 for Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Figure 2 for Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Figure 3 for Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Figure 4 for Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Viaarxiv icon

Study of speaker localization under dynamic and reverberant environments

Nov 28, 2023
Daniel A. Mitchell, Boaz Rafaely

Viaarxiv icon

Sparsely Shared LoRA on Whisper for Child Speech Recognition

Add code
Bookmark button
Alert button
Sep 21, 2023
Wei Liu, Ying Qin, Zhiyuan Peng, Tan Lee

Viaarxiv icon

Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting

Add code
Bookmark button
Alert button
Oct 08, 2023
Hemant Yadav, Erica Cooper, Junichi Yamagishi, Sunayana Sitaram, Rajiv Ratn Shah

Viaarxiv icon

Correction Focused Language Model Training for Speech Recognition

Oct 17, 2023
Yingyi Ma, Zhe Liu, Ozlem Kalinli

Viaarxiv icon

Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR

Nov 30, 2023
Jintao Jiang, Yingbo Gao, Zoltan Tuske

Figure 1 for Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
Figure 2 for Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
Figure 3 for Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
Figure 4 for Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
Viaarxiv icon

Instructing Hierarchical Tasks to Robots by Verbal Commands

Add code
Bookmark button
Alert button
Nov 30, 2023
P. Telkes, A. Angleraud, R. Pieters

Viaarxiv icon

Self Generated Wargame AI: Double Layer Agent Task Planning Based on Large Language Model

Dec 02, 2023
Y. Sun, C. Yu, J. Zhao, W. Wang, X. Zhou

Viaarxiv icon