Alert button
Picture for Sachin Kajarekar

Sachin Kajarekar

Alert button

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Feb 08, 2022
Vin Sachidananda, Shao-Yen Tseng, Erik Marchi, Sachin Kajarekar, Panayiotis Georgiou

Figure 1 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 2 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 3 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 4 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Viaarxiv icon

Streaming on-device detection of device directed speech from voice and touch-based invocation

Oct 09, 2021
Ognjen Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar

Figure 1 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 2 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 3 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 4 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Viaarxiv icon

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

Jun 18, 2021
Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

Figure 1 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 2 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 3 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 4 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Viaarxiv icon

SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

Feb 24, 2021
Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham

Figure 1 for SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter
Figure 2 for SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter
Figure 3 for SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter
Figure 4 for SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter
Viaarxiv icon

Knowledge Transfer for Efficient On-device False Trigger Mitigation

Oct 20, 2020
Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik

Figure 1 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Figure 2 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Figure 3 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Figure 4 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Viaarxiv icon

Audiovisual Speech Synthesis using Tacotron2

Aug 03, 2020
Ahmed Hussen Abdelaziz, Anushree Prasanna Kumar, Chloe Seivwright, Gabriele Fanelli, Justin Binder, Yannis Stylianou, Sachin Kajarekar

Figure 1 for Audiovisual Speech Synthesis using Tacotron2
Figure 2 for Audiovisual Speech Synthesis using Tacotron2
Figure 3 for Audiovisual Speech Synthesis using Tacotron2
Figure 4 for Audiovisual Speech Synthesis using Tacotron2
Viaarxiv icon

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement

May 06, 2020
Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz

Figure 1 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Figure 2 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Figure 3 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Figure 4 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Viaarxiv icon

Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions

Jan 31, 2020
Vasudha Kowtha, Vikramjit Mitra, Chris Bartels, Erik Marchi, Sue Booker, William Caruso, Sachin Kajarekar, Devang Naik

Figure 1 for Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions
Figure 2 for Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions
Figure 3 for Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions
Figure 4 for Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions
Viaarxiv icon

Multi-task Learning for Speaker Verification and Voice Trigger Detection

Jan 26, 2020
Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle

Figure 1 for Multi-task Learning for Speaker Verification and Voice Trigger Detection
Figure 2 for Multi-task Learning for Speaker Verification and Voice Trigger Detection
Figure 3 for Multi-task Learning for Speaker Verification and Voice Trigger Detection
Viaarxiv icon