Picture for Yuki Saito

Yuki Saito

Reference-Free Image Quality Assessment for Virtual Try-On via Human Feedback

Add code
Mar 13, 2026
Viaarxiv icon

Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Add code
Mar 03, 2026
Viaarxiv icon

Geneses: Unified Generative Speech Enhancement and Separation

Add code
Jan 26, 2026
Viaarxiv icon

Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement

Add code
Oct 02, 2025
Figure 1 for Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Figure 2 for Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Figure 3 for Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Figure 4 for Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Viaarxiv icon

Static Word Embeddings for Sentence Semantic Representation

Add code
Jun 05, 2025
Figure 1 for Static Word Embeddings for Sentence Semantic Representation
Figure 2 for Static Word Embeddings for Sentence Semantic Representation
Figure 3 for Static Word Embeddings for Sentence Semantic Representation
Figure 4 for Static Word Embeddings for Sentence Semantic Representation
Viaarxiv icon

Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis

Add code
May 18, 2025
Viaarxiv icon

Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features

Add code
Dec 26, 2024
Viaarxiv icon

An Environment-Adaptive Position/Force Control Based on Physical Property Estimation

Add code
Dec 19, 2024
Figure 1 for An Environment-Adaptive Position/Force Control Based on Physical Property Estimation
Figure 2 for An Environment-Adaptive Position/Force Control Based on Physical Property Estimation
Figure 3 for An Environment-Adaptive Position/Force Control Based on Physical Property Estimation
Figure 4 for An Environment-Adaptive Position/Force Control Based on Physical Property Estimation
Viaarxiv icon

An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation

Add code
Oct 31, 2024
Figure 1 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation
Figure 2 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation
Figure 3 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation
Figure 4 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation
Viaarxiv icon

Construction and Analysis of Impression Caption Dataset for Environmental Sounds

Add code
Oct 20, 2024
Viaarxiv icon