Alert button

"speech": models, code, and papers
Alert button

A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture

Jan 06, 2022
Mohsen Jafarzadeh, Stephen Brooks, Shimeng Yu, Balakrishnan Prabhakaran, Yonas Tadesse

Figure 1 for A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture
Figure 2 for A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture
Figure 3 for A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture
Figure 4 for A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture
Viaarxiv icon

Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection

Add code
Bookmark button
Alert button
Mar 23, 2021
Prashant Kapil, Asif Ekbal

Figure 1 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Figure 2 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Figure 3 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Figure 4 for Leveraging Multi-domain, Heterogeneous Data using Deep Multitask Learning for Hate Speech Detection
Viaarxiv icon

A Context-Aware Feature Fusion Framework for Punctuation Restoration

Add code
Bookmark button
Alert button
Mar 23, 2022
Yangjun Wu, Kebin Fang, Yao Zhao

Figure 1 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Figure 2 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Figure 3 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Figure 4 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Viaarxiv icon

Enrollment-less training for personalized voice activity detection

Jun 23, 2021
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura

Figure 1 for Enrollment-less training for personalized voice activity detection
Figure 2 for Enrollment-less training for personalized voice activity detection
Figure 3 for Enrollment-less training for personalized voice activity detection
Viaarxiv icon

Attentive Modality Hopping Mechanism for Speech Emotion Recognition

Add code
Bookmark button
Alert button
Nov 29, 2019
Seunghyun Yoon, Subhadeep Dey, Hwanhee Lee, Kyomin Jung

Figure 1 for Attentive Modality Hopping Mechanism for Speech Emotion Recognition
Figure 2 for Attentive Modality Hopping Mechanism for Speech Emotion Recognition
Figure 3 for Attentive Modality Hopping Mechanism for Speech Emotion Recognition
Figure 4 for Attentive Modality Hopping Mechanism for Speech Emotion Recognition
Viaarxiv icon

Adversarial Example Detection by Classification for Deep Speech Recognition

Oct 22, 2019
Saeid Samizade, Zheng-Hua Tan, Chao Shen, Xiaohong Guan

Figure 1 for Adversarial Example Detection by Classification for Deep Speech Recognition
Figure 2 for Adversarial Example Detection by Classification for Deep Speech Recognition
Figure 3 for Adversarial Example Detection by Classification for Deep Speech Recognition
Figure 4 for Adversarial Example Detection by Classification for Deep Speech Recognition
Viaarxiv icon

Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings

Feb 13, 2022
Nadee Seneviratne, Carol Espy-Wilson

Figure 1 for Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings
Figure 2 for Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings
Figure 3 for Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings
Figure 4 for Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings
Viaarxiv icon

Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals

Mar 01, 2021
Nils Poschadel, Robert Hupke, Stephan Preihs, Jürgen Peissig

Figure 1 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Figure 2 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Figure 3 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Figure 4 for Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals
Viaarxiv icon

Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers

Jan 21, 2021
Yawen Xue, Shota Horiguchi, Yusuke Fujita, Yuki Takashima, Shinji Watanabe, Paola Garcia, Kenji Nagamatsu

Figure 1 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Figure 2 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Figure 3 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Figure 4 for Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Viaarxiv icon

Does Audio Deepfake Detection Generalize?

Mar 31, 2022
Nicolas M. Müller, Pavel Czempin, Franziska Dieckmann, Adam Froghyar, Konstantin Böttinger

Figure 1 for Does Audio Deepfake Detection Generalize?
Figure 2 for Does Audio Deepfake Detection Generalize?
Figure 3 for Does Audio Deepfake Detection Generalize?
Viaarxiv icon