Alert button

"speech": models, code, and papers
Alert button

Improved Speech Reconstruction from Silent Video

Aug 29, 2017
Ariel Ephrat, Tavi Halperin, Shmuel Peleg

Figure 1 for Improved Speech Reconstruction from Silent Video
Figure 2 for Improved Speech Reconstruction from Silent Video
Figure 3 for Improved Speech Reconstruction from Silent Video
Figure 4 for Improved Speech Reconstruction from Silent Video
Viaarxiv icon

Analyzing ASR pretraining for low-resource speech-to-text translation

Oct 23, 2019
Mihaela C. Stoian, Sameer Bansal, Sharon Goldwater

Figure 1 for Analyzing ASR pretraining for low-resource speech-to-text translation
Figure 2 for Analyzing ASR pretraining for low-resource speech-to-text translation
Figure 3 for Analyzing ASR pretraining for low-resource speech-to-text translation
Figure 4 for Analyzing ASR pretraining for low-resource speech-to-text translation
Viaarxiv icon

NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy

Jan 31, 2022
Yash Mehta, Colin White, Arber Zela, Arjun Krishnakumar, Guri Zabergja, Shakiba Moradian, Mahmoud Safari, Kaicheng Yu, Frank Hutter

Figure 1 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Figure 2 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Figure 3 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Figure 4 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Viaarxiv icon

How Hateful are Movies? A Study and Prediction on Movie Subtitles

Aug 19, 2021
Niklas von Boguszewski, Sana Moin, Anirban Bhowmick, Seid Muhie Yimam, Chris Biemann

Figure 1 for How Hateful are Movies? A Study and Prediction on Movie Subtitles
Figure 2 for How Hateful are Movies? A Study and Prediction on Movie Subtitles
Figure 3 for How Hateful are Movies? A Study and Prediction on Movie Subtitles
Figure 4 for How Hateful are Movies? A Study and Prediction on Movie Subtitles
Viaarxiv icon

Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition

May 19, 2020
Yan Gao, Titouan Parcollet, Nicholas Lane

Figure 1 for Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition
Figure 2 for Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition
Figure 3 for Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition
Viaarxiv icon

Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language

Mar 04, 2022
Nikolay Babakov, Varvara Logacheva, Alexander Panchenko

Figure 1 for Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language
Figure 2 for Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language
Figure 3 for Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language
Figure 4 for Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language
Viaarxiv icon

TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding

Mar 17, 2022
Ruiteng Zhang, Jianguo Wei, Xugang Lu, Wenhuan Lu, Di Jin, Junhai Xu, Lin Zhang, Yantao Ji, Jianwu Dang

Figure 1 for TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding
Figure 2 for TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding
Figure 3 for TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding
Figure 4 for TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding
Viaarxiv icon

Common Phone: A Multilingual Dataset for Robust Acoustic Modelling

Jan 31, 2022
Philipp Klumpp, Tomás Arias-Vergara, Paula Andrea Pérez-Toro, Elmar Nöth, Juan Rafael Orozco-Arroyave

Figure 1 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Figure 2 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Figure 3 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Figure 4 for Common Phone: A Multilingual Dataset for Robust Acoustic Modelling
Viaarxiv icon

The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics

Oct 11, 2021
Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, Sriram Ganapathy

Figure 1 for The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics
Figure 2 for The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics
Figure 3 for The Second DiCOVA Challenge: Dataset and performance analysis for COVID-19 diagnosis using acoustics
Viaarxiv icon

Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features

Sep 23, 2019
Jennifer Williams, Joanna Rownicka

Figure 1 for Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features
Figure 2 for Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features
Figure 3 for Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features
Figure 4 for Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features
Viaarxiv icon