Alert button

"speech": models, code, and papers
Alert button

A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation

Feb 03, 2022
Linjuan Cheng, Chengshi Zheng, Andong Li, Renhua Peng, Xiaodong Li

Figure 1 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 2 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 3 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 4 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Viaarxiv icon

Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets

Jun 25, 2021
Amir Ivry, Baruch Berdugo, Israel Cohen

Figure 1 for Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets
Figure 2 for Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets
Figure 3 for Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets
Figure 4 for Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets
Viaarxiv icon

Defining maximum acceptable latency of AI-enhanced CAI tools

Jan 08, 2022
Claudio Fantinuoli, Maddalena Montecchio

Figure 1 for Defining maximum acceptable latency of AI-enhanced CAI tools
Figure 2 for Defining maximum acceptable latency of AI-enhanced CAI tools
Figure 3 for Defining maximum acceptable latency of AI-enhanced CAI tools
Figure 4 for Defining maximum acceptable latency of AI-enhanced CAI tools
Viaarxiv icon

Unspeech: Unsupervised Speech Context Embeddings

Aug 23, 2018
Benjamin Milde, Chris Biemann

Figure 1 for Unspeech: Unsupervised Speech Context Embeddings
Figure 2 for Unspeech: Unsupervised Speech Context Embeddings
Figure 3 for Unspeech: Unsupervised Speech Context Embeddings
Figure 4 for Unspeech: Unsupervised Speech Context Embeddings
Viaarxiv icon

Oracle Linguistic Graphs Complement a Pretrained Transformer Language Model: A Cross-formalism Comparison

Add code
Bookmark button
Alert button
Dec 15, 2021
Jakob Prange, Nathan Schneider, Lingpeng Kong

Figure 1 for Oracle Linguistic Graphs Complement a Pretrained Transformer Language Model: A Cross-formalism Comparison
Figure 2 for Oracle Linguistic Graphs Complement a Pretrained Transformer Language Model: A Cross-formalism Comparison
Figure 3 for Oracle Linguistic Graphs Complement a Pretrained Transformer Language Model: A Cross-formalism Comparison
Figure 4 for Oracle Linguistic Graphs Complement a Pretrained Transformer Language Model: A Cross-formalism Comparison
Viaarxiv icon

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

Add code
Bookmark button
Alert button
Jul 24, 2019
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, RJ Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran

Figure 1 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 2 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 3 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 4 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Viaarxiv icon

Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks

Sep 27, 2020
Gašper Beguš

Figure 1 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Figure 2 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Figure 3 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Figure 4 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Viaarxiv icon

Personalized One-Shot Lipreading for an ALS Patient

Nov 02, 2021
Bipasha Sen, Aditya Agarwal, Rudrabha Mukhopadhyay, Vinay Namboodiri, C V Jawahar

Figure 1 for Personalized One-Shot Lipreading for an ALS Patient
Figure 2 for Personalized One-Shot Lipreading for an ALS Patient
Figure 3 for Personalized One-Shot Lipreading for an ALS Patient
Figure 4 for Personalized One-Shot Lipreading for an ALS Patient
Viaarxiv icon

Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach

Nov 08, 2018
Ran Wang, Yao Wang, Adeen Flinker

Figure 1 for Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach
Figure 2 for Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach
Figure 3 for Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach
Figure 4 for Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach
Viaarxiv icon

Author Profiling for Hate Speech Detection

Add code
Bookmark button
Alert button
Feb 14, 2019
Pushkar Mishra, Marco Del Tredici, Helen Yannakoudakis, Ekaterina Shutova

Figure 1 for Author Profiling for Hate Speech Detection
Figure 2 for Author Profiling for Hate Speech Detection
Figure 3 for Author Profiling for Hate Speech Detection
Figure 4 for Author Profiling for Hate Speech Detection
Viaarxiv icon