Picture for Tatsuya Kawahara

Tatsuya Kawahara

Analysis and Detection of Differences in Spoken User Behaviors between Autonomous and Wizard-of-Oz Systems

Add code
Oct 04, 2024
Viaarxiv icon

Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study

Add code
Sep 25, 2024
Figure 1 for Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study
Figure 2 for Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study
Figure 3 for Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study
Figure 4 for Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study
Viaarxiv icon

Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition

Add code
Sep 01, 2024
Figure 1 for Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
Figure 2 for Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
Figure 3 for Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
Figure 4 for Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
Viaarxiv icon

Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction

Add code
Aug 29, 2024
Figure 1 for Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
Figure 2 for Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
Figure 3 for Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
Figure 4 for Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
Viaarxiv icon

StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement

Add code
Aug 05, 2024
Figure 1 for StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement
Figure 2 for StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement
Figure 3 for StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement
Figure 4 for StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement
Viaarxiv icon

Multilingual Turn-taking Prediction Using Voice Activity Projection

Add code
Mar 14, 2024
Figure 1 for Multilingual Turn-taking Prediction Using Voice Activity Projection
Figure 2 for Multilingual Turn-taking Prediction Using Voice Activity Projection
Figure 3 for Multilingual Turn-taking Prediction Using Voice Activity Projection
Figure 4 for Multilingual Turn-taking Prediction Using Voice Activity Projection
Viaarxiv icon

Investigation of Adapter for Automatic Speech Recognition in Noisy Environment

Add code
Feb 29, 2024
Viaarxiv icon

Evaluation of a semi-autonomous attentive listening system with takeover prompting

Add code
Feb 21, 2024
Viaarxiv icon

Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue

Add code
Feb 20, 2024
Figure 1 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Figure 2 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Figure 3 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Figure 4 for Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue
Viaarxiv icon

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

Add code
Jan 25, 2024
Viaarxiv icon