Alert button

"speech": models, code, and papers
Alert button

DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon

Add code
Bookmark button
Alert button
Jun 22, 2022
Robin Algayres, Tristan Ricoul, Julien Karadayi, Hugo Laurençon, Salah Zaiem, Abdelrahman Mohamed, Benoît Sagot, Emmanuel Dupoux

Figure 1 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 2 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 3 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 4 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Viaarxiv icon

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

Nov 06, 2022
Jihwan Lee, Jae-Sung Bae, Seongkyu Mun, Heejin Choi, Joun Yeop Lee, Hoon-Young Cho, Chanwoo Kim

Figure 1 for An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space
Figure 2 for An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space
Figure 3 for An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space
Figure 4 for An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space
Viaarxiv icon

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

Add code
Bookmark button
Alert button
Jun 07, 2022
Santiago Cuervo, Adrian Łańcucki, Ricard Marxer, Paweł Rychlikowski, Jan Chorowski

Figure 1 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 2 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 3 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 4 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Viaarxiv icon

BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model

Add code
Bookmark button
Alert button
Oct 29, 2022
Yosuke Higuchi, Brian Yan, Siddhant Arora, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe

Figure 1 for BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Figure 2 for BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Figure 3 for BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Figure 4 for BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Viaarxiv icon

Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports

Add code
Bookmark button
Alert button
Mar 09, 2023
Hyunseung Chung, Jiho Kim, Joon-myoung Kwon, Ki-Hyun Jeon, Min Sung Lee, Edward Choi

Figure 1 for Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports
Figure 2 for Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports
Figure 3 for Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports
Figure 4 for Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports
Viaarxiv icon

Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data

Mar 29, 2022
Gašper Beguš, Alan Zhou

Figure 1 for Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data
Figure 2 for Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data
Figure 3 for Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data
Viaarxiv icon

Improving the transferability of speech separation by meta-learning

Add code
Bookmark button
Alert button
Mar 11, 2022
Kuan-Po Huang, Yuan-Kuei Wu, Hung-yi Lee

Figure 1 for Improving the transferability of speech separation by meta-learning
Figure 2 for Improving the transferability of speech separation by meta-learning
Figure 3 for Improving the transferability of speech separation by meta-learning
Viaarxiv icon

Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation

Mar 01, 2023
Rehan Ahmad, Md Asif Jalal, Muhammad Umar Farooq, Anna Ollerenshaw, Thomas Hain

Figure 1 for Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation
Figure 2 for Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation
Figure 3 for Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation
Figure 4 for Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation
Viaarxiv icon

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

Mar 01, 2023
Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong

Figure 1 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Figure 2 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Figure 3 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Figure 4 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Viaarxiv icon

Hey Dona! Can you help me with student course registration?

Mar 21, 2023
Vishesh Kalvakurthi, Aparna S. Varde, John Jenq

Figure 1 for Hey Dona! Can you help me with student course registration?
Figure 2 for Hey Dona! Can you help me with student course registration?
Figure 3 for Hey Dona! Can you help me with student course registration?
Figure 4 for Hey Dona! Can you help me with student course registration?
Viaarxiv icon