Tatsuya Kawahara

Robotic Backchanneling in Online Conversation Facilitation: A Cross-Generational Study

Sep 25, 2024
Via arXiv

Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition

Sep 01, 2024
Via arXiv

Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction

Aug 29, 2024
Via arXiv

StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement

Aug 05, 2024
Via arXiv

Multilingual Turn-taking Prediction Using Voice Activity Projection

Mar 14, 2024
Via arXiv

Investigation of Adapter for Automatic Speech Recognition in Noisy Environment

Feb 29, 2024
Via arXiv

Evaluation of a semi-autonomous attentive listening system with takeover prompting

Feb 21, 2024
Via arXiv

Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue

Feb 20, 2024
Via arXiv

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

Jan 25, 2024
Via arXiv

An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems

Jan 23, 2024
Via arXiv