Alert button
Picture for Panayiotis Georgiou

Panayiotis Georgiou

Alert button

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Mar 21, 2024
Dominik Wager, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Viaarxiv icon

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

Dec 06, 2023
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Viaarxiv icon

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Feb 08, 2022
Vin Sachidananda, Shao-Yen Tseng, Erik Marchi, Sachin Kajarekar, Panayiotis Georgiou

Figure 1 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 2 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 3 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 4 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Viaarxiv icon

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

Jun 18, 2021
Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

Figure 1 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 2 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 3 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 4 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Viaarxiv icon

Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks

Apr 01, 2021
Haoqi Li, Brian Baucom, Shrikanth Narayanan, Panayiotis Georgiou

Figure 1 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Figure 2 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Figure 3 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Figure 4 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Viaarxiv icon

"Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies

Feb 22, 2021
Nikolaos Flemotomos, Victor R. Martinez, Zhuohao Chen, Karan Singla, Victor Ardulov, Raghuveer Peri, Derek D. Caperton, James Gibson, Michael J. Tanana, Panayiotis Georgiou, Jake Van Epps, Sarah P. Lord, Tad Hirsch, Zac E. Imel, David C. Atkins, Shrikanth Narayanan

Figure 1 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Figure 2 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Figure 3 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Figure 4 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Viaarxiv icon

Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords

Feb 19, 2021
Prashanth Gurunath Shivakumar, Panayiotis Georgiou, Shrikanth Narayanan

Figure 1 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 2 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 3 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 4 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Viaarxiv icon

Speaker Diarization with Lexical Information

Apr 13, 2020
Tae Jin Park, Kyu J. Han, Jing Huang, Xiaodong He, Bowen Zhou, Panayiotis Georgiou, Shrikanth Narayanan

Figure 1 for Speaker Diarization with Lexical Information
Figure 2 for Speaker Diarization with Lexical Information
Figure 3 for Speaker Diarization with Lexical Information
Figure 4 for Speaker Diarization with Lexical Information
Viaarxiv icon

An analysis of observation length requirements for machine understanding of human behaviors in spoken language

Nov 29, 2019
Sandeep Nallan Chakravarthula, Brian Baucom, Shrikanth Narayanan, Panayiotis Georgiou

Figure 1 for An analysis of observation length requirements for machine understanding of human behaviors in spoken language
Figure 2 for An analysis of observation length requirements for machine understanding of human behaviors in spoken language
Figure 3 for An analysis of observation length requirements for machine understanding of human behaviors in spoken language
Figure 4 for An analysis of observation length requirements for machine understanding of human behaviors in spoken language
Viaarxiv icon