"speech recognition": models, code, and papers

Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks

Mar 15, 2024
Peter Leer, Jesper Jensen, Laurel Carney, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw

Fusion approaches for emotion recognition from speech using acoustic and text-based features

Mar 27, 2024
Leonardo Pepino, Pablo Riera, Luciana Ferrer, Agustin Gravano

The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data

Mar 21, 2024
Alice Baird, Rachel Manzelli, Panagiotis Tzirakis, Chris Gagne, Haoqi Li, Sadie Allen, Sander Dieleman, Brian Kulis, Shrikanth S. Narayanan, Alan Cowen

Energy-Based Models with Applications to Speech and Language Processing

Mar 16, 2024
Zhijian Ou

What has LeBenchmark Learnt about French Syntax?

Mar 04, 2024
Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux, Maximin Coavoux

Towards Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR

Mar 11, 2024
Yufeng Yang, Ashutosh Pandey, DeLiang Wang

SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations

Mar 10, 2024
Amit Meghanani, Thomas Hain

AIx Speed: Playback Speed Optimization Using Listening Comprehension of Speech Recognition Models

Mar 05, 2024
Kazuki Kawamura, Jun Rekimoto

SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition

Jan 18, 2024
Hao Wang, Shuhei Kurita, Shuichiro Shimizu, Daisuke Kawahara

Two-pass Endpoint Detection for Speech Recognition

Jan 17, 2024
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow
