Alert button
Picture for Siddharth Sigtia

Siddharth Sigtia

Alert button

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Add code
Bookmark button
Alert button
Mar 26, 2024
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Figure 1 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Figure 2 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Figure 3 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Figure 4 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Viaarxiv icon

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

Add code
Bookmark button
Alert button
Dec 06, 2023
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Viaarxiv icon

Improving Voice Trigger Detection with Metric Learning

Add code
Bookmark button
Alert button
Apr 05, 2022
Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

Figure 1 for Improving Voice Trigger Detection with Metric Learning
Figure 2 for Improving Voice Trigger Detection with Metric Learning
Figure 3 for Improving Voice Trigger Detection with Metric Learning
Figure 4 for Improving Voice Trigger Detection with Metric Learning
Viaarxiv icon

Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

Add code
Bookmark button
Alert button
May 14, 2021
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir

Figure 1 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 2 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 3 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 4 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Viaarxiv icon

Progressive Voice Trigger Detection: Accuracy vs Latency

Add code
Bookmark button
Alert button
Oct 29, 2020
Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg

Figure 1 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 2 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 3 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 4 for Progressive Voice Trigger Detection: Accuracy vs Latency
Viaarxiv icon

Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering

Add code
Bookmark button
Alert button
Aug 05, 2020
Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir

Figure 1 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 2 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 3 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 4 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Viaarxiv icon

Multi-task Learning for Speaker Verification and Voice Trigger Detection

Add code
Bookmark button
Alert button
Jan 26, 2020
Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle

Figure 1 for Multi-task Learning for Speaker Verification and Voice Trigger Detection
Figure 2 for Multi-task Learning for Speaker Verification and Voice Trigger Detection
Figure 3 for Multi-task Learning for Speaker Verification and Voice Trigger Detection
Viaarxiv icon

Multi-task Learning for Voice Trigger Detection

Add code
Bookmark button
Alert button
Jan 26, 2020
Siddharth Sigtia, Pascal Clark, Rob Haynes, Hywel Richards, John Bridle

Figure 1 for Multi-task Learning for Voice Trigger Detection
Figure 2 for Multi-task Learning for Voice Trigger Detection
Figure 3 for Multi-task Learning for Voice Trigger Detection
Viaarxiv icon

Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging

Add code
Bookmark button
Alert button
Nov 29, 2016
Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley

Figure 1 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Figure 2 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Figure 3 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Figure 4 for Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Viaarxiv icon