Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer


Aug 26, 2022
Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi

* accepted to INTERSPEECH 2022 

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events


Jun 04, 2022
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen


Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training


Oct 14, 2021
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji

* 5 pages, 3 figures, submitted to IEEE ICASSP 2022 

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection


Oct 13, 2021
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji


Amicable examples for informed source separation


Oct 11, 2021
Naoya Takahashi, Yuki Mitsufuji


Source Mixing and Separation Robust Audio Steganography


Oct 11, 2021
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji


Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection


Jun 21, 2021
Kazuki Shimada, Naoya Takahashi, Yuichiro Koyama, Shusuke Takahashi, Emiru Tsunoo, Masafumi Takahashi, Yuki Mitsufuji

* 5 pages, 3 figures, submitted to DCASE2021 task3 

End-to-end lyrics Recognition with Voice to Singing Style Transfer


Feb 17, 2021
Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi

* accepted at ICASSP 2021 

Hierarchical disentangled representation learning for singing voice conversion


Jan 18, 2021
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji


Densely connected multidilated convolutional networks for dense prediction tasks


Nov 21, 2020
Naoya Takahashi, Yuki Mitsufuji

* arXiv admin note: text overlap with arXiv:2010.01733 
