Picture for Santosh Kesiraju

Santosh Kesiraju

Robustness assessment of large audio language models in multiple-choice evaluation

Add code
Oct 06, 2025
Viaarxiv icon

DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition

Add code
Aug 12, 2025
Figure 1 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Figure 2 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Figure 3 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Figure 4 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Viaarxiv icon

Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs

Add code
Jun 10, 2025
Figure 1 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Figure 2 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Figure 3 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Figure 4 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Viaarxiv icon

Factors affecting the in-context learning abilities of LLMs for dialogue state tracking

Add code
Jun 10, 2025
Viaarxiv icon

IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation

Add code
Jun 05, 2025
Viaarxiv icon

Aligning Pre-trained Models for Spoken Language Translation

Add code
Nov 27, 2024
Viaarxiv icon

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models

Add code
Oct 22, 2024
Figure 1 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Figure 2 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Figure 3 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Figure 4 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Viaarxiv icon

Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets

Add code
Mar 12, 2024
Figure 1 for Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets
Figure 2 for Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets
Figure 3 for Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets
Figure 4 for Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets
Viaarxiv icon

Strategies for improving low resource speech to text translation relying on pre-trained ASR models

Add code
May 31, 2023
Figure 1 for Strategies for improving low resource speech to text translation relying on pre-trained ASR models
Figure 2 for Strategies for improving low resource speech to text translation relying on pre-trained ASR models
Figure 3 for Strategies for improving low resource speech to text translation relying on pre-trained ASR models
Figure 4 for Strategies for improving low resource speech to text translation relying on pre-trained ASR models
Viaarxiv icon

Detecting English Speech in the Air Traffic Control Voice Communication

Add code
Apr 06, 2021
Figure 1 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 2 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 3 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 4 for Detecting English Speech in the Air Traffic Control Voice Communication
Viaarxiv icon