Alert button
Picture for Vineet Garg

Vineet Garg

Alert button

Streaming Anchor Loss: Augmenting Supervision with Temporal Significance

Oct 09, 2023
Utkarsh, Sarawgi, John Berkowitz, Vineet Garg, Arnav Kundu, Minsik Cho, Sai Srujana Buddi, Saurabh Adya, Ahmed Tewfik

Figure 1 for Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
Figure 2 for Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
Figure 3 for Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
Figure 4 for Streaming Anchor Loss: Augmenting Supervision with Temporal Significance
Viaarxiv icon

Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study

Sep 27, 2023
Avamarie Brueggeman, Takuya Higuchi, Masood Delfarah, Stephen Shum, Vineet Garg

Figure 1 for Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study
Figure 2 for Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study
Figure 3 for Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study
Figure 4 for Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study
Viaarxiv icon

Leveraging Large Language Models for Exploiting ASR Uncertainty

Sep 12, 2023
Pranay Dighe, Yi Su, Shangshang Zheng, Yunshu Liu, Vineet Garg, Xiaochuan Niu, Ahmed Tewfik

Figure 1 for Leveraging Large Language Models for Exploiting ASR Uncertainty
Figure 2 for Leveraging Large Language Models for Exploiting ASR Uncertainty
Figure 3 for Leveraging Large Language Models for Exploiting ASR Uncertainty
Figure 4 for Leveraging Large Language Models for Exploiting ASR Uncertainty
Viaarxiv icon

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Mar 30, 2022
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed H. Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

Figure 1 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 2 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 3 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 4 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Viaarxiv icon

Streaming on-device detection of device directed speech from voice and touch-based invocation

Oct 09, 2021
Ognjen Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar

Figure 1 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 2 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 3 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 4 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Viaarxiv icon

Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

May 14, 2021
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir

Figure 1 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 2 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 3 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 4 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Viaarxiv icon

Progressive Voice Trigger Detection: Accuracy vs Latency

Oct 29, 2020
Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg

Figure 1 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 2 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 3 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 4 for Progressive Voice Trigger Detection: Accuracy vs Latency
Viaarxiv icon

Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering

Aug 05, 2020
Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir

Figure 1 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 2 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 3 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 4 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Viaarxiv icon