Picture for Tatsuya Kawahara

Tatsuya Kawahara

Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study

Add code
Dec 16, 2025
Viaarxiv icon

Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning

Add code
Nov 07, 2025
Viaarxiv icon

SONAR: Self-Distilled Continual Pre-training for Domain Adaptive Audio Representation

Add code
Sep 19, 2025
Viaarxiv icon

Triadic Multi-party Voice Activity Projection for Turn-taking in Spoken Dialogue Systems

Add code
Jul 10, 2025
Viaarxiv icon

Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement

Add code
May 20, 2025
Figure 1 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Figure 2 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Figure 3 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Figure 4 for Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement
Viaarxiv icon

Bridging Speech Emotion Recognition and Personality: Dataset and Temporal Interaction Condition Network

Add code
May 20, 2025
Viaarxiv icon

Does the Appearance of Autonomous Conversational Robots Affect User Spoken Behaviors in Real-World Conference Interactions?

Add code
Mar 17, 2025
Viaarxiv icon

A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field Experiment

Add code
Mar 08, 2025
Viaarxiv icon

An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue

Add code
Jan 28, 2025
Figure 1 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Figure 2 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Figure 3 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Figure 4 for An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue
Viaarxiv icon

Why Do We Laugh? Annotation and Taxonomy Generation for Laughable Contexts in Spontaneous Text Conversation

Add code
Jan 28, 2025
Viaarxiv icon