Alert button

"speech": models, code, and papers
Alert button

Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging

Jun 10, 2020
Yeqi Bai, Tao Ma, Lipo Wang, Zhenjie Zhang

Figure 1 for Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging
Figure 2 for Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging
Figure 3 for Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging
Figure 4 for Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging
Viaarxiv icon

Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR

Jan 26, 2022
Yufei Liu, Rao Ma, Haihua Xu, Yi He, Zejun Ma, Weibin Zhang

Figure 1 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 2 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 3 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 4 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Viaarxiv icon

Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks

Sep 27, 2020
Gašper Beguš

Figure 1 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Figure 2 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Figure 3 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Figure 4 for Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks
Viaarxiv icon

Factorized Neural Transducer for Efficient Language Model Adaptation

Oct 18, 2021
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li

Figure 1 for Factorized Neural Transducer for Efficient Language Model Adaptation
Figure 2 for Factorized Neural Transducer for Efficient Language Model Adaptation
Figure 3 for Factorized Neural Transducer for Efficient Language Model Adaptation
Figure 4 for Factorized Neural Transducer for Efficient Language Model Adaptation
Viaarxiv icon

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

Nov 21, 2019
David Harwath, Wei-Ning Hsu, James Glass

Figure 1 for Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
Figure 2 for Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
Figure 3 for Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
Figure 4 for Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
Viaarxiv icon

Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging

Aug 07, 2019
Binh Nguyen, Vu Bao Hung Nguyen, Hien Nguyen, Pham Ngoc Phuong, The-Loc Nguyen, Quoc Truong Do, Luong Chi Mai

Figure 1 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Figure 2 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Figure 3 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Figure 4 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Viaarxiv icon

Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory

Oct 17, 2019
Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad

Figure 1 for Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory
Figure 2 for Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory
Figure 3 for Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory
Figure 4 for Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory
Viaarxiv icon

Speech vocoding for laboratory phonology

Sep 15, 2016
Milos Cernak, Stefan Benus, Alexandros Lazaridis

Figure 1 for Speech vocoding for laboratory phonology
Figure 2 for Speech vocoding for laboratory phonology
Figure 3 for Speech vocoding for laboratory phonology
Figure 4 for Speech vocoding for laboratory phonology
Viaarxiv icon

The Dawn of Quantum Natural Language Processing

Oct 13, 2021
Riccardo Di Sipio, Jia-Hong Huang, Samuel Yen-Chi Chen, Stefano Mangini, Marcel Worring

Figure 1 for The Dawn of Quantum Natural Language Processing
Figure 2 for The Dawn of Quantum Natural Language Processing
Figure 3 for The Dawn of Quantum Natural Language Processing
Figure 4 for The Dawn of Quantum Natural Language Processing
Viaarxiv icon

From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning

Jun 03, 2019
Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

Figure 1 for From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning
Figure 2 for From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning
Figure 3 for From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning
Figure 4 for From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning
Viaarxiv icon