Alert button

"speech": models, code, and papers
Alert button

Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design

Feb 11, 2023
Lyle Regenwetter, Akash Srivastava, Dan Gutfreund, Faez Ahmed

Figure 1 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Figure 2 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Figure 3 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Figure 4 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Viaarxiv icon

MTTM: Metamorphic Testing for Textual Content Moderation Software

Add code
Bookmark button
Alert button
Feb 11, 2023
Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael Lyu

Figure 1 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 2 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 3 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 4 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Viaarxiv icon

Self-critical Sequence Training for Automatic Speech Recognition

Apr 13, 2022
Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng

Figure 1 for Self-critical Sequence Training for Automatic Speech Recognition
Figure 2 for Self-critical Sequence Training for Automatic Speech Recognition
Figure 3 for Self-critical Sequence Training for Automatic Speech Recognition
Figure 4 for Self-critical Sequence Training for Automatic Speech Recognition
Viaarxiv icon

Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues

Add code
Bookmark button
Alert button
Oct 26, 2022
Camille Noufi, Jonathan Berger, Michael Frank, Karen Parker, Daniel L. Bowling

Figure 1 for Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues
Figure 2 for Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues
Figure 3 for Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues
Viaarxiv icon

Non-Parametric Domain Adaptation for End-to-End Speech Translation

Add code
Bookmark button
Alert button
May 23, 2022
Yichao Du, Weizhi Wang, Zhirui Zhang, Boxing Chen, Tong Xu, Jun Xie, Enhong Chen

Figure 1 for Non-Parametric Domain Adaptation for End-to-End Speech Translation
Figure 2 for Non-Parametric Domain Adaptation for End-to-End Speech Translation
Figure 3 for Non-Parametric Domain Adaptation for End-to-End Speech Translation
Figure 4 for Non-Parametric Domain Adaptation for End-to-End Speech Translation
Viaarxiv icon

An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models

Jun 20, 2022
Rahil Parikh, Gaspar Rochette, Carol Espy-Wilson, Shihab Shamma

Figure 1 for An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Figure 2 for An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Figure 3 for An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Figure 4 for An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Viaarxiv icon

SLICER: Learning universal audio representations using low-resource self-supervised pre-training

Add code
Bookmark button
Alert button
Nov 02, 2022
Ashish Seth, Sreyan Ghosh, S. Umesh, Dinesh Manocha

Figure 1 for SLICER: Learning universal audio representations using low-resource self-supervised pre-training
Figure 2 for SLICER: Learning universal audio representations using low-resource self-supervised pre-training
Figure 3 for SLICER: Learning universal audio representations using low-resource self-supervised pre-training
Figure 4 for SLICER: Learning universal audio representations using low-resource self-supervised pre-training
Viaarxiv icon

MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient

Add code
Bookmark button
Alert button
Mar 16, 2022
Andong Li, Chengshi Zheng, Ziyang Zhang, Xiaodong Li

Figure 1 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Figure 2 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Figure 3 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Figure 4 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Viaarxiv icon

Autodecompose: A generative self-supervised model for semantic decomposition

Add code
Bookmark button
Alert button
Feb 13, 2023
Mohammad Reza Bonyadi

Figure 1 for Autodecompose: A generative self-supervised model for semantic decomposition
Figure 2 for Autodecompose: A generative self-supervised model for semantic decomposition
Figure 3 for Autodecompose: A generative self-supervised model for semantic decomposition
Figure 4 for Autodecompose: A generative self-supervised model for semantic decomposition
Viaarxiv icon

Enhancing Speech Recognition Decoding via Layer Aggregation

Add code
Bookmark button
Alert button
Apr 05, 2022
Tomer Wullach, Shlomo E. Chazan

Figure 1 for Enhancing Speech Recognition Decoding via Layer Aggregation
Figure 2 for Enhancing Speech Recognition Decoding via Layer Aggregation
Figure 3 for Enhancing Speech Recognition Decoding via Layer Aggregation
Figure 4 for Enhancing Speech Recognition Decoding via Layer Aggregation
Viaarxiv icon