Alert button

"speech": models, code, and papers
Alert button

Decoding Imagined Speech and Computer Control using Brain Waves

Nov 08, 2019
Abhiram Singh, Ashwin Gumaste

Figure 1 for Decoding Imagined Speech and Computer Control using Brain Waves
Figure 2 for Decoding Imagined Speech and Computer Control using Brain Waves
Figure 3 for Decoding Imagined Speech and Computer Control using Brain Waves
Figure 4 for Decoding Imagined Speech and Computer Control using Brain Waves
Viaarxiv icon

Optimizing Tandem Speaker Verification and Anti-Spoofing Systems

Jan 24, 2022
Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi

Figure 1 for Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Figure 2 for Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Figure 3 for Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Figure 4 for Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Viaarxiv icon

An evaluation of word-level confidence estimation for end-to-end automatic speech recognition

Jan 14, 2021
Dan Oneata, Alexandru Caranica, Adriana Stan, Horia Cucu

Figure 1 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 2 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 3 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Figure 4 for An evaluation of word-level confidence estimation for end-to-end automatic speech recognition
Viaarxiv icon

GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters

Add code
Bookmark button
Alert button
May 14, 2022
Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki

Figure 1 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Figure 2 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Figure 3 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Figure 4 for GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters
Viaarxiv icon

Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques

Add code
Bookmark button
Alert button
Apr 02, 2021
Kang-wook Kim, Seung-won Park, Myun-chul Joe

Figure 1 for Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Figure 2 for Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Figure 3 for Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Figure 4 for Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Viaarxiv icon

Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures

Add code
Bookmark button
Alert button
Feb 02, 2021
Karn Watcharasupat, Anh H. T. Nguyen, Ching-Hui Ooi, Andy W. H. Khong

Figure 1 for Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures
Figure 2 for Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures
Figure 3 for Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures
Viaarxiv icon

Exploring Transformers for Large-Scale Speech Recognition

May 19, 2020
Liang Lu, Changliang Liu, Jinyu Li, Yifan Gong

Figure 1 for Exploring Transformers for Large-Scale Speech Recognition
Figure 2 for Exploring Transformers for Large-Scale Speech Recognition
Figure 3 for Exploring Transformers for Large-Scale Speech Recognition
Figure 4 for Exploring Transformers for Large-Scale Speech Recognition
Viaarxiv icon

Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features

Add code
Bookmark button
Alert button
Nov 21, 2019
Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto

Figure 1 for Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features
Figure 2 for Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features
Figure 3 for Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features
Figure 4 for Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features
Viaarxiv icon

Convolutional Speech Recognition with Pitch and Voice Quality Features

Add code
Bookmark button
Alert button
Sep 02, 2020
Guillermo Cámbara, Jordi Luque, Mireia Farrús

Figure 1 for Convolutional Speech Recognition with Pitch and Voice Quality Features
Figure 2 for Convolutional Speech Recognition with Pitch and Voice Quality Features
Viaarxiv icon

The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach

Oct 14, 2019
Noé Tits, Kevin El Haddad, Thierry Dutoit

Figure 1 for The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach
Figure 2 for The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach
Figure 3 for The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach
Figure 4 for The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach
Viaarxiv icon