Alert button

"speech": models, code, and papers
Alert button

Hierarchical Summarization for Longform Spoken Dialog

Aug 21, 2021
Daniel Li, Thomas Chen, Albert Tung, Lydia Chilton

Figure 1 for Hierarchical Summarization for Longform Spoken Dialog
Figure 2 for Hierarchical Summarization for Longform Spoken Dialog
Figure 3 for Hierarchical Summarization for Longform Spoken Dialog
Figure 4 for Hierarchical Summarization for Longform Spoken Dialog
Viaarxiv icon

HEAR 2021: Holistic Evaluation of Audio Representations

Add code
Bookmark button
Alert button
Mar 06, 2022
Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk

Figure 1 for HEAR 2021: Holistic Evaluation of Audio Representations
Figure 2 for HEAR 2021: Holistic Evaluation of Audio Representations
Figure 3 for HEAR 2021: Holistic Evaluation of Audio Representations
Figure 4 for HEAR 2021: Holistic Evaluation of Audio Representations
Viaarxiv icon

Not always about you: Prioritizing community needs when developing endangered language technology

Apr 12, 2022
Zoey Liu, Crystal Richardson, Richard Hatcher Jr, Emily Prud'hommeaux

Figure 1 for Not always about you: Prioritizing community needs when developing endangered language technology
Figure 2 for Not always about you: Prioritizing community needs when developing endangered language technology
Viaarxiv icon

Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection

Add code
Bookmark button
Alert button
Mar 23, 2021
Prashant Kapil, Asif Ekbal

Figure 1 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Figure 2 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Figure 3 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Figure 4 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Viaarxiv icon

Visually grounded cross-lingual keyword spotting in speech

Jun 13, 2018
Herman Kamper, Michael Roth

Figure 1 for Visually grounded cross-lingual keyword spotting in speech
Figure 2 for Visually grounded cross-lingual keyword spotting in speech
Figure 3 for Visually grounded cross-lingual keyword spotting in speech
Figure 4 for Visually grounded cross-lingual keyword spotting in speech
Viaarxiv icon

Detecting Abusive Albanian

Add code
Bookmark button
Alert button
Jul 30, 2021
Erida Nurce, Jorgel Keci, Leon Derczynski

Figure 1 for Detecting Abusive Albanian
Figure 2 for Detecting Abusive Albanian
Figure 3 for Detecting Abusive Albanian
Figure 4 for Detecting Abusive Albanian
Viaarxiv icon

STFT spectral loss for training a neural speech waveform model

Add code
Bookmark button
Alert button
Oct 30, 2018
Shinji Takaki, Toru Nakashika, Xin Wang, Junichi Yamagishi

Figure 1 for STFT spectral loss for training a neural speech waveform model
Figure 2 for STFT spectral loss for training a neural speech waveform model
Figure 3 for STFT spectral loss for training a neural speech waveform model
Figure 4 for STFT spectral loss for training a neural speech waveform model
Viaarxiv icon

Understanding the visual speech signal

Oct 03, 2017
Helen L Bear

Figure 1 for Understanding the visual speech signal
Figure 2 for Understanding the visual speech signal
Figure 3 for Understanding the visual speech signal
Figure 4 for Understanding the visual speech signal
Viaarxiv icon

Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers

Jan 21, 2021
Yawen Xue, Shota Horiguchi, Yusuke Fujita, Yuki Takashima, Shinji Watanabe, Paola Garcia, Kenji Nagamatsu

Figure 1 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Figure 2 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Figure 3 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Figure 4 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Viaarxiv icon

Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals

Mar 01, 2021
Nils Poschadel, Robert Hupke, Stephan Preihs, Jürgen Peissig

Figure 1 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Figure 2 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Figure 3 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Figure 4 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Viaarxiv icon