Picture for Shashi Kumar

Shashi Kumar

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Add code
Mar 27, 2026
Viaarxiv icon

Nonparametric Variational Differential Privacy via Embedding Parameter Clipping

Add code
Mar 10, 2026
Viaarxiv icon

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection

Add code
Jan 28, 2026
Viaarxiv icon

Text-only adaptation in LLM-based ASR through text denoising

Add code
Jan 28, 2026
Viaarxiv icon

TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation

Add code
Aug 27, 2025
Viaarxiv icon

Unifying Streaming and Non-streaming Zipformer-based ASR

Add code
Jun 17, 2025
Figure 1 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 2 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 3 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 4 for Unifying Streaming and Non-streaming Zipformer-based ASR
Viaarxiv icon

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering

Add code
Jun 05, 2025
Figure 1 for Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Figure 2 for Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Figure 3 for Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Viaarxiv icon

A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport

Add code
Feb 03, 2025
Figure 1 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport
Figure 2 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport
Figure 3 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport
Figure 4 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport
Viaarxiv icon

Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward

Add code
Nov 06, 2024
Figure 1 for Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward
Figure 2 for Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward
Figure 3 for Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward
Figure 4 for Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward
Viaarxiv icon

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR

Add code
Jul 05, 2024
Figure 1 for TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR
Figure 2 for TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR
Figure 3 for TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR
Figure 4 for TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR
Viaarxiv icon