"speech": models, code, and papers

Collaborative Learning with Artificial Intelligence Speakers (CLAIS): Pre-Service Elementary Science Teachers' Responses to the Prototype

Dec 20, 2023
Gyeong-Geon Lee, Seonyeong Mun, Myeong-Kyeong Shin, Xiaoming Zhai

TIA: A Teaching Intonation Assessment Dataset in Real Teaching Situations

Dec 14, 2023
Shuhua Liu, Chunyu Zhang, Binshuai Li, Niantong Qin, Huanting Cheng, Huayu Zhang

Augmenty: A Python Library for Structured Text Augmentation

Dec 09, 2023
Kenneth Enevoldsen

SPRING-INX: A Multilingual Indian Language Speech Corpus by SPRING Lab, IIT Madras

Oct 23, 2023
Nithya R, Malavika S, Jordan F, Arjun Gangwar, Metilda N J, S Umesh, Rithik Sarab, Akhilesh Kumar Dubey, Govind Divakaran, Samudra Vijaya K, Suryakanth V Gangashetty

Efficient Monotonic Multihead Attention

Dec 07, 2023
Xutai Ma, Anna Sun, Siqi Ouyang, Hirofumi Inaguma, Paden Tomasello

Audio Deepfake Detection with Self-Supervised WavLM and Multi-Fusion Attentive Classifier

Dec 13, 2023
Yinlin Guo, Haofan Huang, Xi Chen, He Zhao, Yuehai Wang

Label Smoothing for Enhanced Text Sentiment Classification

Dec 11, 2023
Yijie Gao, Shijing Si

End-to-end Joint Rich and Normalized ASR with a limited amount of rich training data

Nov 29, 2023
Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

Understanding Probe Behaviors through Variational Bounds of Mutual Information

Dec 15, 2023
Kwanghee Choi, Jee-weon Jung, Shinji Watanabe

LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild

Nov 21, 2023
David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos
