Alert button

"speech recognition": models, code, and papers
Alert button

Non-verbal information in spontaneous speech -- towards a new framework of analysis

Mar 13, 2024
Tirza Biron, Moshe Barboy, Eran Ben-Artzy, Alona Golubchik, Yanir Marmor, Smadar Szekely, Yaron Winter, David Harel

Figure 1 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Figure 2 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Figure 3 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Figure 4 for Non-verbal information in spontaneous speech -- towards a new framework of analysis
Viaarxiv icon

The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data

Mar 21, 2024
Alice Baird, Rachel Manzelli, Panagiotis Tzirakis, Chris Gagne, Haoqi Li, Sadie Allen, Sander Dieleman, Brian Kulis, Shrikanth S. Narayanan, Alan Cowen

Viaarxiv icon

Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks

Mar 15, 2024
Peter Leer, Jesper Jensen, Laurel Carney, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw

Viaarxiv icon

Energy-Based Models with Applications to Speech and Language Processing

Mar 16, 2024
Zhijian Ou

Viaarxiv icon

A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

Mar 03, 2024
Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar

Figure 1 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 2 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 3 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 4 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Viaarxiv icon

Persian Speech Emotion Recognition by Fine-Tuning Transformers

Feb 11, 2024
Minoo Shayaninasab, Bagher Babaali

Viaarxiv icon

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

Feb 08, 2024
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, Chao-Han Huck Yang

Viaarxiv icon

Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR

Mar 11, 2024
Yufeng Yang, Ashutosh Pandey, DeLiang Wang

Viaarxiv icon

SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations

Mar 10, 2024
Amit Meghanani, Thomas Hain

Viaarxiv icon

What has LeBenchmark Learnt about French Syntax?

Mar 04, 2024
Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux, Maximin Coavoux

Figure 1 for What has LeBenchmark Learnt about French Syntax?
Figure 2 for What has LeBenchmark Learnt about French Syntax?
Figure 3 for What has LeBenchmark Learnt about French Syntax?
Figure 4 for What has LeBenchmark Learnt about French Syntax?
Viaarxiv icon