Picture for Naoya Takahashi

Naoya Takahashi

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

Add code
Jun 04, 2022
Figure 1 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 2 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 3 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Figure 4 for STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
Viaarxiv icon

Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training

Add code
Oct 14, 2021
Figure 1 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Figure 2 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Figure 3 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Figure 4 for Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Viaarxiv icon

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection

Add code
Oct 13, 2021
Figure 1 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Figure 2 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Figure 3 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Figure 4 for Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Viaarxiv icon

Amicable examples for informed source separation

Add code
Oct 11, 2021
Figure 1 for Amicable examples for informed source separation
Figure 2 for Amicable examples for informed source separation
Figure 3 for Amicable examples for informed source separation
Figure 4 for Amicable examples for informed source separation
Viaarxiv icon

Source Mixing and Separation Robust Audio Steganography

Add code
Oct 11, 2021
Figure 1 for Source Mixing and Separation Robust Audio Steganography
Figure 2 for Source Mixing and Separation Robust Audio Steganography
Figure 3 for Source Mixing and Separation Robust Audio Steganography
Figure 4 for Source Mixing and Separation Robust Audio Steganography
Viaarxiv icon

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection

Add code
Jun 21, 2021
Figure 1 for Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Figure 2 for Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Figure 3 for Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Figure 4 for Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Viaarxiv icon

End-to-end lyrics Recognition with Voice to Singing Style Transfer

Add code
Feb 17, 2021
Figure 1 for End-to-end lyrics Recognition with Voice to Singing Style Transfer
Figure 2 for End-to-end lyrics Recognition with Voice to Singing Style Transfer
Figure 3 for End-to-end lyrics Recognition with Voice to Singing Style Transfer
Figure 4 for End-to-end lyrics Recognition with Voice to Singing Style Transfer
Viaarxiv icon

Hierarchical disentangled representation learning for singing voice conversion

Add code
Jan 18, 2021
Figure 1 for Hierarchical disentangled representation learning for singing voice conversion
Figure 2 for Hierarchical disentangled representation learning for singing voice conversion
Figure 3 for Hierarchical disentangled representation learning for singing voice conversion
Figure 4 for Hierarchical disentangled representation learning for singing voice conversion
Viaarxiv icon

Densely connected multidilated convolutional networks for dense prediction tasks

Add code
Nov 21, 2020
Figure 1 for Densely connected multidilated convolutional networks for dense prediction tasks
Figure 2 for Densely connected multidilated convolutional networks for dense prediction tasks
Figure 3 for Densely connected multidilated convolutional networks for dense prediction tasks
Figure 4 for Densely connected multidilated convolutional networks for dense prediction tasks
Viaarxiv icon

D3Net: Densely connected multidilated DenseNet for music source separation

Add code
Oct 15, 2020
Figure 1 for D3Net: Densely connected multidilated DenseNet for music source separation
Figure 2 for D3Net: Densely connected multidilated DenseNet for music source separation
Figure 3 for D3Net: Densely connected multidilated DenseNet for music source separation
Figure 4 for D3Net: Densely connected multidilated DenseNet for music source separation
Viaarxiv icon