Alert button
Picture for Aparna Khare

Aparna Khare

Alert button

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2024
Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh

Figure 1 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 2 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 3 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 4 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Viaarxiv icon

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Add code
Bookmark button
Alert button
Jan 26, 2024
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

Viaarxiv icon

Two-pass Endpoint Detection for Speech Recognition

Add code
Bookmark button
Alert button
Jan 17, 2024
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

Viaarxiv icon

Cross-utterance ASR Rescoring with Graph-based Label Propagation

Add code
Bookmark button
Alert button
Mar 27, 2023
Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran

Figure 1 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Figure 2 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Figure 3 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Figure 4 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Viaarxiv icon

ASR-Aware End-to-end Neural Diarization

Add code
Bookmark button
Alert button
Feb 02, 2022
Aparna Khare, Eunjung Han, Yuguang Yang, Andreas Stolcke

Figure 1 for ASR-Aware End-to-end Neural Diarization
Figure 2 for ASR-Aware End-to-end Neural Diarization
Viaarxiv icon

Audiovisual Highlight Detection in Videos

Add code
Bookmark button
Alert button
Feb 11, 2021
Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram

Figure 1 for Audiovisual Highlight Detection in Videos
Figure 2 for Audiovisual Highlight Detection in Videos
Figure 3 for Audiovisual Highlight Detection in Videos
Figure 4 for Audiovisual Highlight Detection in Videos
Viaarxiv icon

Self-Supervised learning with cross-modal transformers for emotion recognition

Add code
Bookmark button
Alert button
Nov 20, 2020
Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

Figure 1 for Self-Supervised learning with cross-modal transformers for emotion recognition
Figure 2 for Self-Supervised learning with cross-modal transformers for emotion recognition
Figure 3 for Self-Supervised learning with cross-modal transformers for emotion recognition
Viaarxiv icon

Multi-modal embeddings using multi-task learning for emotion recognition

Add code
Bookmark button
Alert button
Sep 10, 2020
Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

Figure 1 for Multi-modal embeddings using multi-task learning for emotion recognition
Figure 2 for Multi-modal embeddings using multi-task learning for emotion recognition
Figure 3 for Multi-modal embeddings using multi-task learning for emotion recognition
Viaarxiv icon

Multiresolution and Multimodal Speech Recognition with Transformers

Add code
Bookmark button
Alert button
Apr 29, 2020
Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare, Shiva Sundaram

Figure 1 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 2 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 3 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 4 for Multiresolution and Multimodal Speech Recognition with Transformers
Viaarxiv icon

Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning

Add code
Bookmark button
Alert button
Feb 01, 2020
Sanna Wager, Aparna Khare, Minhua Wu, Kenichi Kumatani, Shiva Sundaram

Figure 1 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Figure 2 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Figure 3 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Figure 4 for Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning
Viaarxiv icon