Alert button

"speech": models, code, and papers
Alert button

Efficient neural speech synthesis for low-resource languages through multilingual modeling

Aug 20, 2020
Marcel de Korte, Jaebok Kim, Esther Klabbers

Figure 1 for Efficient neural speech synthesis for low-resource languages through multilingual modeling
Figure 2 for Efficient neural speech synthesis for low-resource languages through multilingual modeling
Figure 3 for Efficient neural speech synthesis for low-resource languages through multilingual modeling
Viaarxiv icon

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion

Add code
Bookmark button
Alert button
Oct 08, 2020
Hieu-Thi Luong, Junichi Yamagishi

Figure 1 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Figure 2 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Figure 3 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Figure 4 for Latent linguistic embedding for cross-lingual text-to-speech and voice conversion
Viaarxiv icon

Quadrupedal Robotic Guide Dog with Vocal Human-Robot Interaction

Nov 05, 2021
Kavan Mehrizi, Zhongyu Li, Koushil Sreenath

Figure 1 for Quadrupedal Robotic Guide Dog with Vocal Human-Robot Interaction
Figure 2 for Quadrupedal Robotic Guide Dog with Vocal Human-Robot Interaction
Figure 3 for Quadrupedal Robotic Guide Dog with Vocal Human-Robot Interaction
Viaarxiv icon

Nonlinear ISA with Auxiliary Variables for Learning Speech Representations

Jul 25, 2020
Amrith Setlur, Barnabas Poczos, Alan W Black

Figure 1 for Nonlinear ISA with Auxiliary Variables for Learning Speech Representations
Figure 2 for Nonlinear ISA with Auxiliary Variables for Learning Speech Representations
Figure 3 for Nonlinear ISA with Auxiliary Variables for Learning Speech Representations
Viaarxiv icon

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Add code
Bookmark button
Alert button
Jun 20, 2020
Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli

Figure 1 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Figure 2 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Figure 3 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Figure 4 for wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Viaarxiv icon

Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding

Add code
Bookmark button
Alert button
Oct 25, 2020
Seongbin Kim, Gyuwan Kim, Seongjin Shin, Sangmin Lee

Figure 1 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Figure 2 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Figure 3 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Figure 4 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Viaarxiv icon

Twitter Dataset on the Russo-Ukrainian War

Add code
Bookmark button
Alert button
Apr 07, 2022
Alexander Shevtsov, Christos Tzagkarakis, Despoina Antonakaki, Polyvios Pratikakis, Sotiris Ioannidis

Figure 1 for Twitter Dataset on the Russo-Ukrainian War
Figure 2 for Twitter Dataset on the Russo-Ukrainian War
Figure 3 for Twitter Dataset on the Russo-Ukrainian War
Figure 4 for Twitter Dataset on the Russo-Ukrainian War
Viaarxiv icon

CRAB: Class Representation Attentive BERT for Hate Speech Identification in Social Media

Oct 25, 2020
Sayyed M. Zahiri, Ali Ahmadvand

Figure 1 for CRAB: Class Representation Attentive BERT for Hate Speech Identification in Social Media
Figure 2 for CRAB: Class Representation Attentive BERT for Hate Speech Identification in Social Media
Viaarxiv icon

Deep F-measure Maximization for End-to-End Speech Understanding

Aug 08, 2020
Leda Sarı, Mark Hasegawa-Johnson

Figure 1 for Deep F-measure Maximization for End-to-End Speech Understanding
Figure 2 for Deep F-measure Maximization for End-to-End Speech Understanding
Figure 3 for Deep F-measure Maximization for End-to-End Speech Understanding
Figure 4 for Deep F-measure Maximization for End-to-End Speech Understanding
Viaarxiv icon

Unified Modeling of Multi-Domain Multi-Device ASR Systems

May 13, 2022
Soumyajit Mitra, Swayambhu Nath Ray, Bharat Padi, Arunasish Sen, Raghavendra Bilgi, Harish Arsikere, Shalini Ghosh, Ajay Srinivasamurthy, Sri Garimella

Figure 1 for Unified Modeling of Multi-Domain Multi-Device ASR Systems
Figure 2 for Unified Modeling of Multi-Domain Multi-Device ASR Systems
Figure 3 for Unified Modeling of Multi-Domain Multi-Device ASR Systems
Figure 4 for Unified Modeling of Multi-Domain Multi-Device ASR Systems
Viaarxiv icon