Santosh Kesiraju

Factors affecting the in-context learning abilities of LLMs for dialogue state tracking

Jun 10, 2025
Via arXiv

Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs

Jun 10, 2025
Via arXiv

IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation

Jun 05, 2025
Via arXiv

Aligning Pre-trained Models for Spoken Language Translation

Nov 27, 2024
Via arXiv

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models

Oct 22, 2024
Via arXiv

Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets

Mar 12, 2024
Via arXiv

Strategies for improving low resource speech to text translation relying on pre-trained ASR models

May 31, 2023
Via arXiv

Detecting English Speech in the Air Traffic Control Voice Communication

Apr 06, 2021
Via arXiv

Rethinking the objectives of extractive question answering

Aug 28, 2020
Via arXiv

Bayesian multilingual topic model for zero-shot cross-lingual topic identification

Jul 02, 2020
Via arXiv