Alert button

"speech": models, code, and papers
Alert button

Look Who's Talking: Active Speaker Detection in the Wild

Add code
Bookmark button
Alert button
Aug 17, 2021
You Jin Kim, Hee-Soo Heo, Soyeon Choe, Soo-Whan Chung, Yoohwan Kwon, Bong-Jin Lee, Youngki Kwon, Joon Son Chung

Figure 1 for Look Who's Talking: Active Speaker Detection in the Wild
Figure 2 for Look Who's Talking: Active Speaker Detection in the Wild
Figure 3 for Look Who's Talking: Active Speaker Detection in the Wild
Figure 4 for Look Who's Talking: Active Speaker Detection in the Wild
Viaarxiv icon

Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement

Dec 25, 2018
Suman Samui, Indrajit Chakrabarti, Soumya K. Ghosh

Figure 1 for Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement
Figure 2 for Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement
Figure 3 for Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement
Figure 4 for Tensor-Train Long Short-Term Memory for Monaural Speech Enhancement
Viaarxiv icon

NewsPod: Automatic and Interactive News Podcasts

Add code
Bookmark button
Alert button
Feb 15, 2022
Philippe Laban, Elicia Ye, Srujay Korlakunta, John Canny, Marti A. Hearst

Figure 1 for NewsPod: Automatic and Interactive News Podcasts
Figure 2 for NewsPod: Automatic and Interactive News Podcasts
Figure 3 for NewsPod: Automatic and Interactive News Podcasts
Figure 4 for NewsPod: Automatic and Interactive News Podcasts
Viaarxiv icon

Sonority Measurement Using System, Source, and Suprasegmental Information

Add code
Bookmark button
Alert button
Jul 01, 2021
Bidisha Sharma, S. R. Mahadeva Prasanna

Figure 1 for Sonority Measurement Using System, Source, and Suprasegmental Information
Figure 2 for Sonority Measurement Using System, Source, and Suprasegmental Information
Figure 3 for Sonority Measurement Using System, Source, and Suprasegmental Information
Figure 4 for Sonority Measurement Using System, Source, and Suprasegmental Information
Viaarxiv icon

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

Add code
Bookmark button
Alert button
Nov 02, 2020
Aditi Chaudhary, Antonios Anastasopoulos, Zaid Sheikh, Graham Neubig

Figure 1 for Reducing Confusion in Active Learning for Part-Of-Speech Tagging
Figure 2 for Reducing Confusion in Active Learning for Part-Of-Speech Tagging
Figure 3 for Reducing Confusion in Active Learning for Part-Of-Speech Tagging
Figure 4 for Reducing Confusion in Active Learning for Part-Of-Speech Tagging
Viaarxiv icon

SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems

Add code
Bookmark button
Alert button
Mar 19, 2021
Yuxuan Chen, Jiangshan Zhang, Xuejing Yuan, Shengzhi Zhang, Kai Chen, Xiaofeng Wang, Shanqing Guo

Figure 1 for SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems
Figure 2 for SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems
Figure 3 for SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems
Figure 4 for SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems
Viaarxiv icon

WHAM!: Extending Speech Separation to Noisy Environments

Add code
Bookmark button
Alert button
Jul 02, 2019
Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow, Ethan Manilow, Jonathan Le Roux

Figure 1 for WHAM!: Extending Speech Separation to Noisy Environments
Figure 2 for WHAM!: Extending Speech Separation to Noisy Environments
Figure 3 for WHAM!: Extending Speech Separation to Noisy Environments
Figure 4 for WHAM!: Extending Speech Separation to Noisy Environments
Viaarxiv icon

A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation

Feb 03, 2022
Linjuan Cheng, Chengshi Zheng, Andong Li, Renhua Peng, Xiaodong Li

Figure 1 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 2 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 3 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Figure 4 for A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation
Viaarxiv icon

The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction

Add code
Bookmark button
Alert button
Oct 12, 2021
Yashish M. Siriwardena, Guilhem Marion, Shihab Shamma

Figure 1 for The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction
Figure 2 for The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction
Figure 3 for The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction
Figure 4 for The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction
Viaarxiv icon

Defining maximum acceptable latency of AI-enhanced CAI tools

Jan 08, 2022
Claudio Fantinuoli, Maddalena Montecchio

Figure 1 for Defining maximum acceptable latency of AI-enhanced CAI tools
Figure 2 for Defining maximum acceptable latency of AI-enhanced CAI tools
Figure 3 for Defining maximum acceptable latency of AI-enhanced CAI tools
Figure 4 for Defining maximum acceptable latency of AI-enhanced CAI tools
Viaarxiv icon