"speech": models, code, and papers

An Empirical Study of End-to-end Simultaneous Speech Translation Decoding Strategies

Mar 04, 2021
Ha Nguyen, Yannick Estève, Laurent Besacier

Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines

Oct 19, 2020
David Wan, Zhengping Jiang, Chris Kedzie, Elsbeth Turcan, Peter Bell, Kathleen McKeown

Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection

Apr 02, 2020
Tharindu Fernando, Sridha Sridharan, Mitchell McLaren, Darshana Priyasad, Simon Denman, Clinton Fookes

Analysis of French Phonetic Idiosyncrasies for Accent Recognition

Oct 18, 2021
Pierre Berjon, Avishek Nag, Soumyabrata Dev

On the Use of External Data for Spoken Named Entity Recognition

Dec 14, 2021
Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu J. Han

FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning

Sep 23, 2020
Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad

Self-Supervised Audio-and-Text Pre-training with Extremely Low-Resource Parallel Data

Apr 10, 2022
Yu Kang, Tianqiao Liu, Hang Li, Yang Hao, Wenbiao Ding

Hierarchical prosody modeling and control in non-autoregressive parallel neural TTS

Oct 06, 2021
Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition

Jun 09, 2021
Shigeki Karita, Yotaro Kubo, Michiel Adriaan Unico Bacchiani, Llion Jones

On Investigation of Unsupervised Speech Factorization Based on Normalization Flow

Oct 29, 2019
Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang
