Picture for Tomohiko Nakamura

Tomohiko Nakamura

How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics

Add code
Jun 01, 2023
Figure 1 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 2 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 3 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 4 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Viaarxiv icon

jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus

Add code
Dec 09, 2022
Figure 1 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Figure 2 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Figure 3 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Figure 4 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Viaarxiv icon

Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders

Add code
Sep 27, 2022
Figure 1 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Figure 2 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Figure 3 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Figure 4 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Viaarxiv icon

Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning

Add code
Jul 22, 2022
Figure 1 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning
Figure 2 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning
Figure 3 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning
Viaarxiv icon

Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation

Add code
Jul 22, 2022
Figure 1 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation
Figure 2 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation
Figure 3 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation
Viaarxiv icon

SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling

Add code
Mar 24, 2022
Figure 1 for SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Figure 2 for SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Figure 3 for SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Figure 4 for SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
Viaarxiv icon

Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds

Add code
Feb 01, 2022
Figure 1 for Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds
Figure 2 for Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds
Figure 3 for Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds
Viaarxiv icon

Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis

Add code
Sep 10, 2021
Figure 1 for Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis
Figure 2 for Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis
Figure 3 for Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis
Figure 4 for Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis
Viaarxiv icon

Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models

Add code
Sep 02, 2021
Figure 1 for Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
Figure 2 for Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
Figure 3 for Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
Figure 4 for Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
Viaarxiv icon

Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization

Add code
Sep 01, 2021
Figure 1 for Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization
Figure 2 for Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization
Figure 3 for Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization
Figure 4 for Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization
Viaarxiv icon