"speech": models, code, and papers

Hierarchical Summarization for Longform Spoken Dialog

Aug 21, 2021
Daniel Li, Thomas Chen, Albert Tung, Lydia Chilton

Via arXiv

Advanced Rich Transcription System for Estonian Speech

Jan 11, 2019
Tanel Alumäe, Ottokar Tilk, Asadullah

Via arXiv

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Feb 24, 2022
Quan Wang, Yang Yu, Jason Pelecanos, Yiling Huang, Ignacio Lopez Moreno

Via arXiv

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Jul 08, 2021
Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli

Via arXiv

A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture

Jan 06, 2022
Mohsen Jafarzadeh, Stephen Brooks, Shimeng Yu, Balakrishnan Prabhakaran, Yonas Tadesse

Via arXiv

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Oct 30, 2019
Dongwei Jiang, Xiaoning Lei, Wubo Li, Ne Luo, Yuxuan Hu, Wei Zou, Xiangang Li

Via arXiv

Does Audio Deepfake Detection Generalize?

Mar 31, 2022
Nicolas M. Müller, Pavel Czempin, Franziska Dieckmann, Adam Froghyar, Konstantin Böttinger

Via arXiv

Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings

Feb 13, 2022
Nadee Seneviratne, Carol Espy-Wilson

Via arXiv

Detecting Abusive Albanian

Jul 30, 2021
Erida Nurce, Jorgel Keci, Leon Derczynski

Via arXiv

Large-Scale Visual Speech Recognition

Oct 01, 2018
Brendan Shillingford, Yannis Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew Senior, Nando de Freitas

Via arXiv