Alert button

"speech": models, code, and papers
Alert button

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach

Oct 06, 2023
Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li

Viaarxiv icon

Evaluating Self-Supervised Speech Representations for Indigenous American Languages

Oct 05, 2023
Chih-Chen Chen, William Chen, Rodolfo Zevallos, John Ortega

Viaarxiv icon

NewsGPT: ChatGPT Integration for Robot-Reporter

Nov 11, 2023
Abdelhadi Hireche, Abdelkader Nasreddine Belkacem, Sadia Jamil, Chao Chen

Figure 1 for NewsGPT: ChatGPT Integration for Robot-Reporter
Figure 2 for NewsGPT: ChatGPT Integration for Robot-Reporter
Figure 3 for NewsGPT: ChatGPT Integration for Robot-Reporter
Figure 4 for NewsGPT: ChatGPT Integration for Robot-Reporter
Viaarxiv icon

Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder

Sep 19, 2023
Mostafa Sadeghi, Romain Serizel

Viaarxiv icon

Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation

Sep 11, 2023
Anna Deichler, Shivam Mehta, Simon Alexanderson, Jonas Beskow

Figure 1 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Figure 2 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Figure 3 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Figure 4 for Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Viaarxiv icon

Speak While You Think: Streaming Speech Synthesis During Text Generation

Sep 20, 2023
Avihu Dekel, Slava Shechtman, Raul Fernandez, David Haws, Zvi Kons, Ron Hoory

Figure 1 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 2 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 3 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 4 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Viaarxiv icon

RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function

Sep 15, 2023
Pengyu Wang, Xiaofei Li

Viaarxiv icon

Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement

Sep 19, 2023
Jiahui Pan, Shulin He, Hui Zhang, Xueliang Zhang

Figure 1 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Figure 2 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Figure 3 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Figure 4 for Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement
Viaarxiv icon

Single and Few-step Diffusion for Generative Speech Enhancement

Sep 18, 2023
Bunlong Lay, Jean-Marie Lemercier, Julius Richter, Timo Gerkmann

Viaarxiv icon

On the effect of curriculum learning with developmental data for grammar acquisition

Nov 03, 2023
Mattia Opper, J. Morrison, N. Siddharth

Viaarxiv icon