Alert button

"speech": models, code, and papers
Alert button

The Use of Voice Source Features for Sung Speech Recognition

Add code
Bookmark button
Alert button
Feb 20, 2021
Gerardo Roa Dabike, Jon Barker

Figure 1 for The Use of Voice Source Features for Sung Speech Recognition
Figure 2 for The Use of Voice Source Features for Sung Speech Recognition
Figure 3 for The Use of Voice Source Features for Sung Speech Recognition
Figure 4 for The Use of Voice Source Features for Sung Speech Recognition
Viaarxiv icon

Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection

Add code
Bookmark button
Alert button
Jan 30, 2022
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu

Figure 1 for Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection
Figure 2 for Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection
Figure 3 for Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection
Figure 4 for Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection
Viaarxiv icon

What Makes Convolutional Models Great on Long Sequence Modeling?

Add code
Bookmark button
Alert button
Oct 17, 2022
Yuhong Li, Tianle Cai, Yi Zhang, Deming Chen, Debadeepta Dey

Figure 1 for What Makes Convolutional Models Great on Long Sequence Modeling?
Figure 2 for What Makes Convolutional Models Great on Long Sequence Modeling?
Figure 3 for What Makes Convolutional Models Great on Long Sequence Modeling?
Figure 4 for What Makes Convolutional Models Great on Long Sequence Modeling?
Viaarxiv icon

Joint Encoder-Decoder Self-Supervised Pre-training for ASR

Jun 09, 2022
Arunkumar A, Umesh S

Figure 1 for Joint Encoder-Decoder Self-Supervised Pre-training for ASR
Figure 2 for Joint Encoder-Decoder Self-Supervised Pre-training for ASR
Figure 3 for Joint Encoder-Decoder Self-Supervised Pre-training for ASR
Figure 4 for Joint Encoder-Decoder Self-Supervised Pre-training for ASR
Viaarxiv icon

Speech-to-speech Translation between Untranscribed Unknown Languages

Oct 02, 2019
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

Figure 1 for Speech-to-speech Translation between Untranscribed Unknown Languages
Figure 2 for Speech-to-speech Translation between Untranscribed Unknown Languages
Figure 3 for Speech-to-speech Translation between Untranscribed Unknown Languages
Figure 4 for Speech-to-speech Translation between Untranscribed Unknown Languages
Viaarxiv icon

The Volctrans Neural Speech Translation System for IWSLT 2021

Add code
Bookmark button
Alert button
May 16, 2021
Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li

Figure 1 for The Volctrans Neural Speech Translation System for IWSLT 2021
Figure 2 for The Volctrans Neural Speech Translation System for IWSLT 2021
Figure 3 for The Volctrans Neural Speech Translation System for IWSLT 2021
Figure 4 for The Volctrans Neural Speech Translation System for IWSLT 2021
Viaarxiv icon

Practical Speech Re-use Prevention in Voice-driven Services

Jan 12, 2021
Yangyong Zhang, Maliheh Shirvanian, Sunpreet S. Arora, Jianwei Huang, Guofei Gu

Figure 1 for Practical Speech Re-use Prevention in Voice-driven Services
Figure 2 for Practical Speech Re-use Prevention in Voice-driven Services
Figure 3 for Practical Speech Re-use Prevention in Voice-driven Services
Figure 4 for Practical Speech Re-use Prevention in Voice-driven Services
Viaarxiv icon

Impact and dynamics of hate and counter speech online

Add code
Bookmark button
Alert button
Sep 18, 2020
Joshua Garland, Keyan Ghazi-Zahedi, Jean-Gabriel Young, Laurent Hébert-Dufresne, Mirta Galesic

Figure 1 for Impact and dynamics of hate and counter speech online
Figure 2 for Impact and dynamics of hate and counter speech online
Figure 3 for Impact and dynamics of hate and counter speech online
Figure 4 for Impact and dynamics of hate and counter speech online
Viaarxiv icon

Attention-based Residual Speech Portrait Model for Speech to Face Generation

Jul 09, 2020
Jianrong Wang, Xiaosheng Hu, Li Liu, Wei Liu, Mei Yu, Tianyi Xu

Figure 1 for Attention-based Residual Speech Portrait Model for Speech to Face Generation
Figure 2 for Attention-based Residual Speech Portrait Model for Speech to Face Generation
Figure 3 for Attention-based Residual Speech Portrait Model for Speech to Face Generation
Figure 4 for Attention-based Residual Speech Portrait Model for Speech to Face Generation
Viaarxiv icon

Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

Add code
Bookmark button
Alert button
Sep 18, 2022
Badr M. Abdullah, Bernd Möbius, Dietrich Klakow

Figure 1 for Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Figure 2 for Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Figure 3 for Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Figure 4 for Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Viaarxiv icon