"speech": models, code, and papers

Audio-driven Neural Gesture Reenactment with Video Motion Graphs

Jul 23, 2022
Yang Zhou, Jimei Yang, Dingzeyu Li, Jun Saito, Deepali Aneja, Evangelos Kalogerakis

Via arXiv

Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts

Sep 25, 2021
Raluca Alexandra Fetic, Mikkel Jordahn, Lucas Chaves Lima, Rasmus Arpe Fogh Egebæk, Martin Carsten Nielsen, Benjamin Biering, Lars Kai Hansen

Via arXiv

From Nano to Macro: Overview of the IEEE Bio Image and Signal Processing Technical Committee

Oct 31, 2022
Selin Aviyente, Alejandro Frangi, Erik Meijering, Arrate Muñoz-Barrutia, Michael Liebling, Dimitri Van De Ville, Jean-Christophe Olivo-Marin, Jelena Kovačević, Michael Unser

Via arXiv

Fast and parallel decoding for transducer

Oct 31, 2022
Wei Kang, Liyong Guo, Fangjun Kuang, Long Lin, Mingshuang Luo, Zengwei Yao, Xiaoyu Yang, Piotr Żelasko, Daniel Povey

Via arXiv

Magnitude or Phase? A Two Stage Algorithm for Dereverberation

Oct 31, 2022
Ayal Schwartz, Sharon Gannot, Shlomo E. Chazan

Via arXiv

UTTS: Unsupervised TTS with Conditional Disentangled Sequential Variational Auto-encoder

Jun 07, 2022
Jiachen Lian, Chunlei Zhang, Gopala Krishna Anumanchipalli, Dong Yu

Via arXiv

Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition

Jan 11, 2022
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Naoyuki Kamo, Takafumi Moriya

Via arXiv

Hierarchical Diffusion Models for Singing Voice Neural Vocoder

Oct 18, 2022
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

Via arXiv

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR

Jul 03, 2022
Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma

Via arXiv

Separating Content from Speaker Identity in Speech for the Assessment of Cognitive Impairments

Mar 21, 2022
Dongseok Heo, Cheul Young Park, Jaemin Cheun, Myung Jin Ko

Via arXiv