Picture for Tatsuya Kawahara

Tatsuya Kawahara

Investigation of Adapter for Automatic Speech Recognition in Noisy Environment

Add code
Feb 29, 2024
Viaarxiv icon

Evaluation of a semi-autonomous attentive listening system with takeover prompting

Add code
Feb 21, 2024
Figure 1 for Evaluation of a semi-autonomous attentive listening system with takeover prompting
Figure 2 for Evaluation of a semi-autonomous attentive listening system with takeover prompting
Figure 3 for Evaluation of a semi-autonomous attentive listening system with takeover prompting
Figure 4 for Evaluation of a semi-autonomous attentive listening system with takeover prompting
Viaarxiv icon

Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue

Add code
Feb 20, 2024
Figure 1 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Figure 2 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Figure 3 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Figure 4 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Viaarxiv icon

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

Add code
Jan 25, 2024
Viaarxiv icon

An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems

Add code
Jan 23, 2024
Viaarxiv icon

Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks

Add code
Jan 11, 2024
Figure 1 for Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks
Figure 2 for Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks
Figure 3 for Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks
Figure 4 for Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks
Viaarxiv icon

Real-time and Continuous Turn-taking Prediction Using Voice Activity Projection

Add code
Jan 10, 2024
Viaarxiv icon

Zero- and Few-shot Sound Event Localization and Detection

Add code
Sep 17, 2023
Figure 1 for Zero- and Few-shot Sound Event Localization and Detection
Figure 2 for Zero- and Few-shot Sound Event Localization and Detection
Figure 3 for Zero- and Few-shot Sound Event Localization and Detection
Figure 4 for Zero- and Few-shot Sound Event Localization and Detection
Viaarxiv icon

Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors

Add code
Aug 21, 2023
Figure 1 for Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors
Figure 2 for Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors
Figure 3 for Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors
Figure 4 for Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors
Viaarxiv icon

Reasoning before Responding: Integrating Commonsense-based Causality Explanation for Empathetic Response Generation

Add code
Jul 28, 2023
Viaarxiv icon