Pranay Dighe

Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features

Oct 23, 2023
Gautam Krishna, Sameer Dharur, Oggi Rudovic, Pranay Dighe, Saurabh Adya, Ahmed Hussen Abdelaziz, Ahmed H Tewfik

Leveraging Large Language Models for Exploiting ASR Uncertainty

Sep 12, 2023
Pranay Dighe, Yi Su, Shangshang Zheng, Yunshu Liu, Vineet Garg, Xiaochuan Niu, Ahmed Tewfik

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Mar 30, 2022
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed H. Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

Streaming on-device detection of device directed speech from voice and touch-based invocation

Oct 09, 2021
Ognjen Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar

Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

May 14, 2021
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir

Knowledge Transfer for Efficient On-device False Trigger Mitigation

Oct 20, 2020
Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik

Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation

Aug 18, 2020
Rishika Agarwal, Xiaochuan Niu, Pranay Dighe, Srikanth Vishnubhotla, Sameer Badaskar, Devang Naik

Lattice-based Improvements for Voice Triggering Using Graph Neural Networks

Jan 25, 2020
Pranay Dighe, Saurabh Adya, Nuoyu Li, Srikanth Vishnubhotla, Devang Naik, Adithya Sagar, Ying Ma, Stephen Pulman, Jason Williams

Information Theoretic Analysis of DNN-HMM Acoustic Modeling

Nov 08, 2017
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard

Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models

Oct 18, 2016
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard
