"speech": models, code, and papers

Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations

May 18, 2023
Weiwei Lin, Chenhang He, Man-Wai Mak, Youzhi Tu

Mitigating Negative Transfer with Task Awareness for Sexism, Hate Speech, and Toxic Language Detection

Jul 07, 2023
Angel Felipe Magnossão de Paula, Paolo Rosso, Damiano Spina

DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection

Sep 07, 2023
Cunhang Fan, Hongyu Zhang, Wei Huang, Jun Xue, Jianhua Tao, Jiangyan Yi, Zhao Lv, Xiaopei Wu

Towards robust paralinguistic assessment for real-world mobile health (mHealth) monitoring: an initial study of reverberation effects on speech

May 21, 2023
Judith Dineley, Ewan Carr, Faith Matcham, Johnny Downs, Richard Dobson, Thomas F Quatieri, Nicholas Cummins

Unsupervised Speech Representation Pooling Using Vector Quantization

Apr 08, 2023
Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park

The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023

Aug 17, 2023
Ming Cheng, Weiqing Wang, Xiaoyi Qin, Yuke Lin, Ning Jiang, Guoqing Zhao, Ming Li

ReZero: Region-customizable Sound Extraction

Aug 31, 2023
Rongzhi Gu, Yi Luo

On Monotonic Aggregation for Open-domain QA

Aug 08, 2023
Sang-eun Han, Yeonseok Jeong, Seung-won Hwang, Kyungjae Lee

Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN

Jul 24, 2023
Muhammad Danyal Khan, Raheem Ali, Arshad Aziz

VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer

Aug 09, 2023
Liyang Chen, Zhiyong Wu, Runnan Li, Weihong Bao, Jun Ling, Xu Tan, Sheng Zhao
