"speech": models, code, and papers

Learning to Recognise Words using Visually Grounded Speech

May 31, 2020
Sebastiaan Scholten, Danny Merkx, Odette Scharenborg

Via arXiv

Improving End-to-End Models for Set Prediction in Spoken Language Understanding

Jan 28, 2022
Hong-Kwang J. Kuo, Zoltan Tuske, Samuel Thomas, Brian Kingsbury, George Saon

Via arXiv

Speaker-aware speech-transformer

Jan 02, 2020
Zhiyun Fan, Jie Li, Shiyu Zhou, Bo Xu

Via arXiv

Dialog+ in Broadcasting: First Field Tests Using Deep-Learning-Based Dialogue Enhancement

Dec 17, 2021
Matteo Torcoli, Christian Simon, Jouni Paulus, Davide Straninger, Alfred Riedel, Volker Koch, Stefan Wits, Daniela Rieger, Harald Fuchs, Christian Uhle, Stefan Meltzer, Adrian Murtaza

Via arXiv

The Interspeech Zero Resource Speech Challenge 2021: Spoken language modelling

Apr 29, 2021
Ewan Dunbar, Mathieu Bernard, Nicolas Hamilakis, Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Eugene Kharitonov, Emmanuel Dupoux

Via arXiv

Is the Language Familiarity Effect gradual? A computational modelling approach

Jun 27, 2022
Maureen de Seyssel, Guillaume Wisniewski, Emmanuel Dupoux

Via arXiv

MAC-DO: Charge Based Multi-Bit Analog In-Memory Accelerator Compatible with DRAM Using Output Stationary Mapping

Jul 16, 2022
Minki Jeong, Wanyeong Jung

Via arXiv

BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection

May 26, 2020
Jihyung Moon, Won Ik Cho, Junbum Lee

Via arXiv

Extended U-Net for Speaker Verification in Noisy Environments

Jun 27, 2022
Ju-ho Kim, Jungwoo Heo, Hye-jin Shim, Ha-Jin Yu

Via arXiv

Voice Conversion for Whispered Speech Synthesis

Dec 11, 2019
Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, Alexis Moinet

Via arXiv