Picture for Tomohiko Nakamura

Tomohiko Nakamura

Neural Blind Source Separation and Diarization for Distant Speech Recognition

Add code
Jun 12, 2024
Figure 1 for Neural Blind Source Separation and Diarization for Distant Speech Recognition
Figure 2 for Neural Blind Source Separation and Diarization for Distant Speech Recognition
Figure 3 for Neural Blind Source Separation and Diarization for Distant Speech Recognition
Figure 4 for Neural Blind Source Separation and Diarization for Distant Speech Recognition
Viaarxiv icon

Self-Supervised Speech Representations are More Phonetic than Semantic

Add code
Jun 12, 2024
Viaarxiv icon

Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation

Add code
Mar 19, 2024
Figure 1 for Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation
Figure 2 for Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation
Figure 3 for Real-time Speech Extraction Using Spatially Regularized Independent Low-rank Matrix Analysis and Rank-constrained Spatial Covariance Matrix Estimation
Viaarxiv icon

Sampling-Frequency-Independent Universal Sound Separation

Add code
Sep 22, 2023
Viaarxiv icon

Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

Add code
Jun 19, 2023
Figure 1 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 2 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 3 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Figure 4 for Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
Viaarxiv icon

How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics

Add code
Jun 01, 2023
Figure 1 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 2 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 3 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Figure 4 for How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics
Viaarxiv icon

jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus

Add code
Dec 09, 2022
Figure 1 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Figure 2 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Figure 3 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Figure 4 for jaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
Viaarxiv icon

Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders

Add code
Sep 27, 2022
Figure 1 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Figure 2 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Figure 3 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Figure 4 for Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Viaarxiv icon

Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning

Add code
Jul 22, 2022
Figure 1 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning
Figure 2 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning
Figure 3 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning
Viaarxiv icon

Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation

Add code
Jul 22, 2022
Figure 1 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation
Figure 2 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation
Figure 3 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation
Viaarxiv icon