"speech": models, code, and papers
Speech language models lack important brain-relevant semantics

Nov 08, 2023
Subba Reddy Oota, Emin Çelik, Fatma Deniz, Mariya Toneva

Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions

Dec 27, 2023
Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan

Measuring Entrainment in Spontaneous Code-switched Speech

Nov 13, 2023
Debasmita Bhattacharya, Siying Ding, Alayna Nguyen, Julia Hirschberg

Some clues to build a sound analysis relevant to hearing

Jan 04, 2024
Laurent Millot

Unveiling Comparative Sentiments in Vietnamese Product Reviews: A Sequential Classification Framework

Jan 02, 2024
Ha Le, Bao Tran, Phuong Le, Tan Nguyen, Dac Nguyen, Ngoan Pham, Dang Huynh

Towards Online Sign Language Recognition and Translation

Jan 10, 2024
Ronglai Zuo, Fangyun Wei, Brian Mak

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Dec 15, 2023
Bingshen Mu, Pengcheng Guo, Dake Guo, Pan Zhou, Wei Chen, Lei Xie

App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review

Nov 20, 2023
Tanmay Kulkarni, Yuvraj Pardeshi, Yash Shah, Vaishnvi Sakat, Sapana Bhirud

Creating New Voices using Normalizing Flows

Dec 22, 2023
Piotr Bilinski, Thomas Merritt, Abdelhamid Ezzerg, Kamil Pokora, Sebastian Cygert, Kayoko Yanagisawa, Roberto Barra-Chicote, Daniel Korzekwa

Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues

Jan 05, 2024
David Gimeno-Gómez, Ana-Maria Bucur, Adrian Cosma, Carlos-David Martínez-Hinarejos, Paolo Rosso
