Naoya Takahashi

Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training

Oct 14, 2021
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection

Oct 13, 2021
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji

Amicable examples for informed source separation

Oct 11, 2021
Naoya Takahashi, Yuki Mitsufuji

Source Mixing and Separation Robust Audio Steganography

Oct 11, 2021
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection

Jun 21, 2021
Kazuki Shimada, Naoya Takahashi, Yuichiro Koyama, Shusuke Takahashi, Emiru Tsunoo, Masafumi Takahashi, Yuki Mitsufuji

End-to-end lyrics Recognition with Voice to Singing Style Transfer

Feb 17, 2021
Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi

Hierarchical disentangled representation learning for singing voice conversion

Jan 18, 2021
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

Densely connected multidilated convolutional networks for dense prediction tasks

Nov 21, 2020
Naoya Takahashi, Yuki Mitsufuji

D3Net: Densely connected multidilated DenseNet for music source separation

Oct 15, 2020
Naoya Takahashi, Yuki Mitsufuji

Adversarial attacks on audio source separation

Oct 09, 2020
Naoya Takahashi, Shota Inoue, Yuki Mitsufuji
