Erik Marchi

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Mar 26, 2024
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Via arXiv

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

Dec 06, 2023
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Via arXiv

Improving Voice Trigger Detection with Metric Learning

Apr 05, 2022
Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

Via arXiv

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Mar 30, 2022
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed H. Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

Via arXiv

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Feb 08, 2022
Vin Sachidananda, Shao-Yen Tseng, Erik Marchi, Sachin Kajarekar, Panayiotis Georgiou

Via arXiv

Whispered and Lombard Neural Speech Synthesis

Jan 13, 2021
Qiong Hu, Tobias Bleisch, Petko Petkov, Tuomo Raitio, Erik Marchi, Varun Lakshminarasimhan

Via arXiv

Progressive Voice Trigger Detection: Accuracy vs Latency

Oct 29, 2020
Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg

Via arXiv

Knowledge Transfer for Efficient On-device False Trigger Mitigation

Oct 20, 2020
Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik

Via arXiv

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement

May 06, 2020
Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz

Via arXiv