Picture for Hiroshi Saruwatari

Hiroshi Saruwatari

Do learned speech symbols follow Zipf's law?

Add code
Sep 18, 2023
Figure 1 for Do learned speech symbols follow Zipf's law?
Figure 2 for Do learned speech symbols follow Zipf's law?
Figure 3 for Do learned speech symbols follow Zipf's law?
Figure 4 for Do learned speech symbols follow Zipf's law?
Viaarxiv icon

Diversity-based core-set selection for text-to-speech with linguistic and acoustic features

Add code
Sep 15, 2023
Figure 1 for Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
Figure 2 for Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
Figure 3 for Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
Figure 4 for Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
Viaarxiv icon

Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects

Add code
Sep 11, 2023
Figure 1 for Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects
Figure 2 for Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects
Figure 3 for Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects
Figure 4 for Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects
Viaarxiv icon

Perceptual Quality Enhancement of Sound Field Synthesis Based on Combination of Pressure and Amplitude Matching

Add code
Jul 26, 2023
Viaarxiv icon

NoisyILRMA: Diffuse-Noise-Aware Independent Low-Rank Matrix Analysis for Fast Blind Source Extraction

Add code
Jun 22, 2023
Figure 1 for NoisyILRMA: Diffuse-Noise-Aware Independent Low-Rank Matrix Analysis for Fast Blind Source Extraction
Figure 2 for NoisyILRMA: Diffuse-Noise-Aware Independent Low-Rank Matrix Analysis for Fast Blind Source Extraction
Figure 3 for NoisyILRMA: Diffuse-Noise-Aware Independent Low-Rank Matrix Analysis for Fast Blind Source Extraction
Viaarxiv icon

Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

Add code
Jun 19, 2023
Figure 1 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 2 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 3 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 4 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Viaarxiv icon

Multichannel Active Noise Control with Exterior Radiation Suppression Based on Riemannian Optimization

Add code
Jun 15, 2023
Viaarxiv icon

How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics

Add code
Jun 01, 2023
Figure 1 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 2 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 3 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 4 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Viaarxiv icon

Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus

Add code
May 26, 2023
Figure 1 for Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus
Figure 2 for Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus
Figure 3 for Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus
Figure 4 for Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus
Viaarxiv icon

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center

Add code
May 23, 2023
Viaarxiv icon