Alert button
Picture for Ron Hoory

Ron Hoory

Alert button

Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations

Add code
Bookmark button
Alert button
Mar 17, 2024
Claudio Pinhanez, Raul Fernandez, Marcelo Grave, Julio Nogima, Ron Hoory

Figure 1 for Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
Figure 2 for Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
Figure 3 for Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
Figure 4 for Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
Viaarxiv icon

Speak While You Think: Streaming Speech Synthesis During Text Generation

Add code
Bookmark button
Alert button
Sep 20, 2023
Avihu Dekel, Slava Shechtman, Raul Fernandez, David Haws, Zvi Kons, Ron Hoory

Figure 1 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 2 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 3 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Figure 4 for Speak While You Think: Streaming Speech Synthesis During Text Generation
Viaarxiv icon

Towards a Common Speech Analysis Engine

Add code
Bookmark button
Alert button
Mar 01, 2022
Hagai Aronowitz, Itai Gat, Edmilson Morais, Weizhong Zhu, Ron Hoory

Figure 1 for Towards a Common Speech Analysis Engine
Figure 2 for Towards a Common Speech Analysis Engine
Figure 3 for Towards a Common Speech Analysis Engine
Figure 4 for Towards a Common Speech Analysis Engine
Viaarxiv icon

A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets

Add code
Bookmark button
Alert button
Feb 21, 2022
Zvi Kons, Aharon Satt, Hong-Kwang Kuo, Samuel Thomas, Boaz Carmeli, Ron Hoory, Brian Kingsbury

Figure 1 for A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Figure 2 for A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Figure 3 for A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets
Viaarxiv icon

Speech Emotion Recognition using Self-Supervised Features

Add code
Bookmark button
Alert button
Feb 07, 2022
Edmilson Morais, Ron Hoory, Weizhong Zhu, Itai Gat, Matheus Damasceno, Hagai Aronowitz

Figure 1 for Speech Emotion Recognition using Self-Supervised Features
Figure 2 for Speech Emotion Recognition using Self-Supervised Features
Figure 3 for Speech Emotion Recognition using Self-Supervised Features
Figure 4 for Speech Emotion Recognition using Self-Supervised Features
Viaarxiv icon

Speaker Normalization for Self-supervised Speech Emotion Recognition

Add code
Bookmark button
Alert button
Feb 02, 2022
Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory

Figure 1 for Speaker Normalization for Self-supervised Speech Emotion Recognition
Figure 2 for Speaker Normalization for Self-supervised Speech Emotion Recognition
Figure 3 for Speaker Normalization for Self-supervised Speech Emotion Recognition
Viaarxiv icon

RNN Transducer Models For Spoken Language Understanding

Add code
Bookmark button
Alert button
Apr 08, 2021
Samuel Thomas, Hong-Kwang J. Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory

Figure 1 for RNN Transducer Models For Spoken Language Understanding
Figure 2 for RNN Transducer Models For Spoken Language Understanding
Figure 3 for RNN Transducer Models For Spoken Language Understanding
Figure 4 for RNN Transducer Models For Spoken Language Understanding
Viaarxiv icon

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

Add code
Bookmark button
Alert button
Oct 08, 2020
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny

Figure 1 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 2 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 3 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 4 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Viaarxiv icon

End-to-End Spoken Language Understanding Without Full Transcripts

Add code
Bookmark button
Alert button
Sep 30, 2020
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

Figure 1 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 2 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 3 for End-to-End Spoken Language Understanding Without Full Transcripts
Figure 4 for End-to-End Spoken Language Understanding Without Full Transcripts
Viaarxiv icon

Siamese x-vector reconstruction for domain adapted speaker recognition

Add code
Bookmark button
Alert button
Jul 28, 2020
Shai Rozenberg, Hagai Aronowitz, Ron Hoory

Figure 1 for Siamese x-vector reconstruction for domain adapted speaker recognition
Figure 2 for Siamese x-vector reconstruction for domain adapted speaker recognition
Figure 3 for Siamese x-vector reconstruction for domain adapted speaker recognition
Figure 4 for Siamese x-vector reconstruction for domain adapted speaker recognition
Viaarxiv icon