Alert button
Picture for Panayiotis Georgiou

Panayiotis Georgiou

Alert button

A Multimodal Approach to Device-Directed Speech Detection with Large Language Models

Add code
Bookmark button
Alert button
Mar 26, 2024
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Figure 1 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Figure 2 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Figure 3 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Figure 4 for A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Viaarxiv icon

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

Add code
Bookmark button
Alert button
Dec 06, 2023
Dominik Wagner, Alexander Churchill, Siddharth Sigtia, Panayiotis Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi

Viaarxiv icon

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Add code
Bookmark button
Alert button
Feb 08, 2022
Vin Sachidananda, Shao-Yen Tseng, Erik Marchi, Sachin Kajarekar, Panayiotis Georgiou

Figure 1 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 2 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 3 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Figure 4 for CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations
Viaarxiv icon

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

Add code
Bookmark button
Alert button
Jun 18, 2021
Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

Figure 1 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 2 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 3 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 4 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Viaarxiv icon

Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks

Add code
Bookmark button
Alert button
Apr 01, 2021
Haoqi Li, Brian Baucom, Shrikanth Narayanan, Panayiotis Georgiou

Figure 1 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Figure 2 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Figure 3 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Figure 4 for Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks
Viaarxiv icon

"Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies

Add code
Bookmark button
Alert button
Feb 22, 2021
Nikolaos Flemotomos, Victor R. Martinez, Zhuohao Chen, Karan Singla, Victor Ardulov, Raghuveer Peri, Derek D. Caperton, James Gibson, Michael J. Tanana, Panayiotis Georgiou, Jake Van Epps, Sarah P. Lord, Tad Hirsch, Zac E. Imel, David C. Atkins, Shrikanth Narayanan

Figure 1 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Figure 2 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Figure 3 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Figure 4 for "Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies
Viaarxiv icon

Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords

Add code
Bookmark button
Alert button
Feb 19, 2021
Prashanth Gurunath Shivakumar, Panayiotis Georgiou, Shrikanth Narayanan

Figure 1 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 2 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 3 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 4 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Viaarxiv icon

Speaker Diarization with Lexical Information

Add code
Bookmark button
Alert button
Apr 13, 2020
Tae Jin Park, Kyu J. Han, Jing Huang, Xiaodong He, Bowen Zhou, Panayiotis Georgiou, Shrikanth Narayanan

Figure 1 for Speaker Diarization with Lexical Information
Figure 2 for Speaker Diarization with Lexical Information
Figure 3 for Speaker Diarization with Lexical Information
Figure 4 for Speaker Diarization with Lexical Information
Viaarxiv icon