Alert button

"speech": models, code, and papers
Alert button

Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems

Add code
Bookmark button
Alert button
May 21, 2019
Ohsung Kwon, Eunwoo Song, Jae-Min Kim, Hong-Goo Kang

Figure 1 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 2 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 3 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 4 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Viaarxiv icon

SSAST: Self-Supervised Audio Spectrogram Transformer

Add code
Bookmark button
Alert button
Oct 19, 2021
Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James Glass

Figure 1 for SSAST: Self-Supervised Audio Spectrogram Transformer
Figure 2 for SSAST: Self-Supervised Audio Spectrogram Transformer
Figure 3 for SSAST: Self-Supervised Audio Spectrogram Transformer
Figure 4 for SSAST: Self-Supervised Audio Spectrogram Transformer
Viaarxiv icon

Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE

Feb 25, 2022
Tianrui Wang, Weibin Zhu, Yingying Gao, Yanan Chen, Junlan Feng, Shilei Zhang

Figure 1 for Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE
Figure 2 for Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE
Figure 3 for Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE
Figure 4 for Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE
Viaarxiv icon

NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition

Add code
Bookmark button
Alert button
Aug 16, 2021
Seyed Omid Sadjadi

Figure 1 for NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition
Figure 2 for NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition
Figure 3 for NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition
Figure 4 for NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition
Viaarxiv icon

Fair Hate Speech Detection through Evaluation of Social Group Counterfactuals

Oct 24, 2020
Aida Mostafazadeh Davani, Ali Omrani, Brendan Kennedy, Mohammad Atari, Xiang Ren, Morteza Dehghani

Figure 1 for Fair Hate Speech Detection through Evaluation of Social Group Counterfactuals
Figure 2 for Fair Hate Speech Detection through Evaluation of Social Group Counterfactuals
Figure 3 for Fair Hate Speech Detection through Evaluation of Social Group Counterfactuals
Viaarxiv icon

Hidden-Markov-Model Based Speech Enhancement

Jul 04, 2017
Daniel Dzibela, Armin Sehr

Figure 1 for Hidden-Markov-Model Based Speech Enhancement
Figure 2 for Hidden-Markov-Model Based Speech Enhancement
Figure 3 for Hidden-Markov-Model Based Speech Enhancement
Figure 4 for Hidden-Markov-Model Based Speech Enhancement
Viaarxiv icon

Cycle-consistency training for end-to-end speech recognition

Nov 02, 2018
Takaaki Hori, Ramon Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux

Figure 1 for Cycle-consistency training for end-to-end speech recognition
Figure 2 for Cycle-consistency training for end-to-end speech recognition
Figure 3 for Cycle-consistency training for end-to-end speech recognition
Figure 4 for Cycle-consistency training for end-to-end speech recognition
Viaarxiv icon

CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement

Add code
Bookmark button
Alert button
Sep 23, 2019
Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Amir Hussain

Figure 1 for CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement
Figure 2 for CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement
Figure 3 for CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement
Figure 4 for CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement
Viaarxiv icon

Speech-Gesture Mapping and Engagement Evaluation in Human Robot Interaction

Dec 09, 2018
Bishal Ghosh, Abhinav Dhall, Ekta Singla

Figure 1 for Speech-Gesture Mapping and Engagement Evaluation in Human Robot Interaction
Figure 2 for Speech-Gesture Mapping and Engagement Evaluation in Human Robot Interaction
Figure 3 for Speech-Gesture Mapping and Engagement Evaluation in Human Robot Interaction
Figure 4 for Speech-Gesture Mapping and Engagement Evaluation in Human Robot Interaction
Viaarxiv icon

Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation

Add code
Bookmark button
Alert button
Apr 15, 2019
Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel

Viaarxiv icon