"speech": models, code, and papers

A robust audio deepfake detection system via multi-view feature

Mar 04, 2024
Yujie Yang, Haochen Qin, Hang Zhou, Chengcheng Wang, Tianyu Guo, Kai Han, Yunhe Wang

Via arXiv

Emotional Voice Messages (EMOVOME) database: emotion recognition in spontaneous voice messages

Feb 27, 2024
Lucía Gómez Zaragozá, Rocío del Amor, Elena Parra Vargas, Valery Naranjo, Mariano Alcañiz Raya, Javier Marín-Morales

Via arXiv

ROME: Memorization Insights from Text, Probability and Hidden State in Large Language Models

Mar 04, 2024
Bo Li, Qinghua Zhao, Lijie Wen

Via arXiv

DIVERSE: Deciphering Internet Views on the U.S. Military Through Video Comment Stance Analysis, A Novel Benchmark Dataset for Stance Classification

Mar 05, 2024
Iain J. Cruickshank, Lynnette Hui Xian Ng

Via arXiv

Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer

Mar 04, 2024
Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li

Via arXiv

AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation

Feb 25, 2024
Yasheng Sun, Wenqing Chu, Hang Zhou, Kaisiyuan Wang, Hideki Koike

Via arXiv

FingerNet: EEG Decoding of A Fine Motor Imagery with Finger-tapping Task Based on A Deep Neural Network

Mar 06, 2024
Young-Min Go, Seong-Hyun Yu, Hyeong-Yeong Park, Minji Lee, Ji-Hoon Jeong

Via arXiv

CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using Cochlear Cepstrum-based Masking for Speech Emotion Recognition

Feb 10, 2024
Ioannis Ziogas, Hessa Alfalahi, Ahsan H. Khandoker, Leontios J. Hadjileontiadis

Via arXiv

ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker Verification

Feb 23, 2024
Vishwanath Pratap Singh, Md Sahidullah, Tomi Kinnunen

Via arXiv

MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles

Feb 03, 2024
Amrita Ganguly, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Md Nishat Raihan, Dhiman Goswami, Marcos Zampieri

Via arXiv