Picture for Tatsuya Kawahara

Tatsuya Kawahara

Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study

Add code
Dec 16, 2025
Figure 1 for Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study
Figure 2 for Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study
Figure 3 for Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study
Figure 4 for Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study
Viaarxiv icon

Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning

Add code
Nov 07, 2025
Viaarxiv icon

SONAR: Self-Distilled Continual Pre-training for Domain Adaptive Audio Representation

Add code
Sep 19, 2025
Viaarxiv icon

Triadic Multi-party Voice Activity Projection for Turn-taking in Spoken Dialogue Systems

Add code
Jul 10, 2025
Viaarxiv icon

Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement

Add code
May 20, 2025
Figure 1 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Figure 2 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Figure 3 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Figure 4 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Viaarxiv icon

Bridging Speech Emotion Recognition and Personality: Dataset and Temporal Interaction Condition Network

Add code
May 20, 2025
Viaarxiv icon

Does the Appearance of Autonomous Conversational Robots Affect User Spoken Behaviors in Real-World Conference Interactions?

Add code
Mar 17, 2025
Viaarxiv icon

A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field Experiment

Add code
Mar 08, 2025
Viaarxiv icon

Why Do We Laugh? Annotation and Taxonomy Generation for Laughable Contexts in Spontaneous Text Conversation

Add code
Jan 28, 2025
Viaarxiv icon

An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue

Add code
Jan 28, 2025
Figure 1 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Figure 2 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Figure 3 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Figure 4 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Viaarxiv icon