"speech": models, code, and papers

ExploreADV: Towards exploratory attack for Neural Networks

Jan 01, 2023
Tianzuo Luo, Yuyi Zhong, Siaucheng Khoo

AVATAR submission to the Ego4D AV Transcription Challenge

Nov 18, 2022
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid

CycleGAN-Based Unpaired Speech Dereverberation

Mar 29, 2022
Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey

Facial Landmark Predictions with Applications to Metaverse

Sep 29, 2022
Qiao Han, Jun Zhao, Kwok-Yan Lam

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention

May 03, 2022
Xinmeng Xu, Rongzhi Gu, Yuexian Zou

ConvNext Based Neural Network for Anti-Spoofing

Sep 15, 2022
Qiaowei Ma, Jinghui Zhong, Yitao Yang, Weiheng Liu, Ying Gao, Wing W. Y. Ng

Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis

Apr 03, 2022
Yixuan Zhou, Changhe Song, Xiang Li, Luwen Zhang, Zhiyong Wu, Yanyao Bian, Dan Su, Helen Meng

Comparative Study of Speech Analysis Methods to Predict Parkinson's Disease

Nov 15, 2021
Adedolapo Aishat Toye, Suryaprakash Kompalli

A CTC Triggered Siamese Network with Spatial-Temporal Dropout for Speech Recognition

Jun 22, 2022
Yingying Gao, Junlan Feng, Tianrui Wang, Chao Deng, Shilei Zhang

Training Integer-Only Deep Recurrent Neural Networks

Dec 22, 2022
Vahid Partovi Nia, Eyyüb Sari, Vanessa Courville, Masoud Asgharian
