Alert button

"speech": models, code, and papers
Alert button

Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments

Add code
Bookmark button
Alert button
Feb 21, 2022
Mario Esparza

Figure 1 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Figure 2 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Figure 3 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Figure 4 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Viaarxiv icon

Consistent Transcription and Translation of Speech

Add code
Bookmark button
Alert button
Aug 28, 2020
Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhyakumar Nallasamy, Matthias Paulik

Figure 1 for Consistent Transcription and Translation of Speech
Figure 2 for Consistent Transcription and Translation of Speech
Figure 3 for Consistent Transcription and Translation of Speech
Figure 4 for Consistent Transcription and Translation of Speech
Viaarxiv icon

End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands

Add code
Bookmark button
Alert button
Sep 22, 2020
Mohsen Jafarzadeh, Yonas Tadesse

Figure 1 for End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands
Figure 2 for End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands
Figure 3 for End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands
Figure 4 for End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands
Viaarxiv icon

STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning

Nov 23, 2020
Prakamya Mishra

Figure 1 for STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Figure 2 for STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Figure 3 for STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Figure 4 for STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Viaarxiv icon

Deep Variational Generative Models for Audio-visual Speech Separation

Aug 17, 2020
Viet-Nhat Nguyen, Mostafa Sadeghi, Elisa Ricci, Xavier Alameda-Pineda

Figure 1 for Deep Variational Generative Models for Audio-visual Speech Separation
Figure 2 for Deep Variational Generative Models for Audio-visual Speech Separation
Figure 3 for Deep Variational Generative Models for Audio-visual Speech Separation
Viaarxiv icon

Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jul 02, 2021
Timo Lohrenz, Patrick Schwarz, Zhengyang Li, Tim Fingscheidt

Figure 1 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Figure 2 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Figure 3 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Figure 4 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Viaarxiv icon

Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model

Add code
Bookmark button
Alert button
Apr 07, 2022
Nick J. C. Wang, Lu Wang, Yandan Sun, Haimei Kang, Dejun Zhang

Figure 1 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 2 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 3 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 4 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Viaarxiv icon

Unsupervised Word Segmentation using K Nearest Neighbors

Add code
Bookmark button
Alert button
Apr 27, 2022
Tzeviya Sylvia Fuchs, Yedid Hoshen, Joseph Keshet

Figure 1 for Unsupervised Word Segmentation using K Nearest Neighbors
Figure 2 for Unsupervised Word Segmentation using K Nearest Neighbors
Figure 3 for Unsupervised Word Segmentation using K Nearest Neighbors
Figure 4 for Unsupervised Word Segmentation using K Nearest Neighbors
Viaarxiv icon

A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts

Add code
Bookmark button
Alert button
Apr 05, 2021
Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Figure 1 for A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts
Figure 2 for A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts
Figure 3 for A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts
Figure 4 for A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts
Viaarxiv icon

WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis

Add code
Bookmark button
Alert button
Jun 20, 2022
Yi Wang, Yi Si

Figure 1 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Figure 2 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Figure 3 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Figure 4 for WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
Viaarxiv icon