Alert button

"speech": models, code, and papers
Alert button

How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition

Add code
Bookmark button
Alert button
Apr 17, 2020
George Sterpu, Christian Saam, Naomi Harte

Figure 1 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Figure 2 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Figure 3 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Figure 4 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Viaarxiv icon

Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors

Add code
Bookmark button
Alert button
Apr 05, 2022
Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

Figure 1 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 2 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 3 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 4 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Viaarxiv icon

A scalable noisy speech dataset and online subjective test framework

Sep 17, 2019
Chandan K. A. Reddy, Ebrahim Beyrami, Jamie Pool, Ross Cutler, Sriram Srinivasan, Johannes Gehrke

Figure 1 for A scalable noisy speech dataset and online subjective test framework
Figure 2 for A scalable noisy speech dataset and online subjective test framework
Figure 3 for A scalable noisy speech dataset and online subjective test framework
Viaarxiv icon

End-to-End Multi-Channel Speech Separation

May 28, 2019
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu

Figure 1 for End-to-End Multi-Channel Speech Separation
Figure 2 for End-to-End Multi-Channel Speech Separation
Figure 3 for End-to-End Multi-Channel Speech Separation
Figure 4 for End-to-End Multi-Channel Speech Separation
Viaarxiv icon

Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes

Sep 06, 2021
Owen Casey, Rushit Dave, Naeem Seliya, Evelyn R Sowells Boone

Figure 1 for Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes
Figure 2 for Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes
Figure 3 for Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes
Viaarxiv icon

Federated Learning in ASR: Not as Easy as You Think

Add code
Bookmark button
Alert button
Sep 30, 2021
Wentao Yu, Jan Freiwald, Sören Tewes, Fabien Huennemeyer, Dorothea Kolossa

Figure 1 for Federated Learning in ASR: Not as Easy as You Think
Figure 2 for Federated Learning in ASR: Not as Easy as You Think
Figure 3 for Federated Learning in ASR: Not as Easy as You Think
Figure 4 for Federated Learning in ASR: Not as Easy as You Think
Viaarxiv icon

Lattention: Lattice-attention in ASR rescoring

Nov 19, 2021
Prabhat Pandey, Sergio Duarte Torres, Ali Orkan Bayer, Ankur Gandhe, Volker Leutnant

Figure 1 for Lattention: Lattice-attention in ASR rescoring
Figure 2 for Lattention: Lattice-attention in ASR rescoring
Figure 3 for Lattention: Lattice-attention in ASR rescoring
Figure 4 for Lattention: Lattice-attention in ASR rescoring
Viaarxiv icon

Speech-driven facial animation using polynomial fusion of features

Dec 12, 2019
Triantafyllos Kefalas, Konstantinos Vougioukas, Yannis Panagakis, Stavros Petridis, Jean Kossaifi, Maja Pantic

Figure 1 for Speech-driven facial animation using polynomial fusion of features
Figure 2 for Speech-driven facial animation using polynomial fusion of features
Viaarxiv icon

Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System

Oct 03, 2019
Kai Fan, Jiayi Wang, Bo Li, Boxing Chen, Niyu Ge

Figure 1 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Figure 2 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Figure 3 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Figure 4 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Viaarxiv icon

Speech Enhancement with Zero-Shot Model Selection

Dec 17, 2020
Ryandhimas E. Zezario, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao

Figure 1 for Speech Enhancement with Zero-Shot Model Selection
Figure 2 for Speech Enhancement with Zero-Shot Model Selection
Figure 3 for Speech Enhancement with Zero-Shot Model Selection
Figure 4 for Speech Enhancement with Zero-Shot Model Selection
Viaarxiv icon