Alert button

"speech recognition": models, code, and papers
Alert button

A Morphology-Based Investigation of Positional Encodings

Apr 06, 2024
Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya

Viaarxiv icon

PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations

Mar 27, 2024
Ehsan Latif, Ramviyas Parasuraman, Xiaoming Zhai

Viaarxiv icon

Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems

Feb 29, 2024
Quentin Raymondaud, Mickael Rouvier, Richard Dufour

Viaarxiv icon

An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement

Feb 27, 2024
Tzu-Ting Yang, Hsin-Wei Wang, Yi-Cheng Wang, Chi-Han Lin, Berlin Chen

Viaarxiv icon

A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition

Add code
Bookmark button
Alert button
Mar 02, 2024
Tyler Benster, Guy Wilson, Reshef Elisha, Francis R Willett, Shaul Druckmann

Figure 1 for A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition
Figure 2 for A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition
Figure 3 for A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition
Figure 4 for A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition
Viaarxiv icon

Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism

Add code
Bookmark button
Alert button
Mar 07, 2024
Xiaoyu Tang, Yixin Lin, Ting Dang, Yuanfang Zhang, Jintao Cheng

Figure 1 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Figure 2 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Figure 3 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Figure 4 for Speech Emotion Recognition Via CNN-Transforemr and Multidimensional Attention Mechanism
Viaarxiv icon

Extracting Biomedical Entities from Noisy Audio Transcripts

Mar 26, 2024
Nima Ebadi, Kellen Morgan, Adrian Tan, Billy Linares, Sheri Osborn, Emma Majors, Jeremy Davis, Anthony Rios

Viaarxiv icon

Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation

Mar 19, 2024
Yuto Ishikawa, Kohei Konaka, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari

Figure 1 for Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation
Figure 2 for Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation
Figure 3 for Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation
Viaarxiv icon

Deep functional multiple index models with an application to SER

Mar 26, 2024
Matthieu Saumard, Abir El Haj, Thibault Napoleon

Viaarxiv icon

Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR

Mar 16, 2024
Savitha Murthy, Dinkar Sitaram

Figure 1 for Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR
Figure 2 for Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR
Figure 3 for Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR
Figure 4 for Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR
Viaarxiv icon