"speech": models, code, and papers

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Mar 31, 2019
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara

Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement

Sep 21, 2020
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee

Countering hate on social media: Large scale classification of hate and counter speech

Jun 05, 2020
Joshua Garland, Keyan Ghazi-Zahedi, Jean-Gabriel Young, Laurent Hébert-Dufresne, Mirta Galesic

WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement

Apr 12, 2020
Tsun-An Hsieh, Hsin-Min Wang, Xugang Lu, Yu Tsao

Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data

Oct 14, 2021
Haitong Zhang, Yue Lin

PriMock57: A Dataset Of Primary Care Mock Consultations

Apr 01, 2022
Alex Papadopoulos Korfiatis, Francesco Moramarco, Radmila Sarac, Aleksandar Savkov

Investigating Brain Connectivity with Graph Neural Networks and GNNExplainer

Jun 04, 2022
Maksim Zhdanov, Saskia Steinmann, Nico Hoffmann

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Mar 28, 2022
Puyuan Peng, David Harwath

Data Augmenting Contrastive Learning of Speech Representations in the Time Domain

Jul 02, 2020
Eugene Kharitonov, Morgane Rivière, Gabriel Synnaeve, Lior Wolf, Pierre-Emmanuel Mazaré, Matthijs Douze, Emmanuel Dupoux
