Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Simon Dixon

Zero-shot Singing Technique Conversion


Nov 16, 2021
Brendan O'Connor, Simon Dixon, George Fazekas

* In Proceedings of the 15th International Symposium on Computer Music Multidisciplinary Research (CMMR 2021), Tokyo, Japan, November 15-16, 2021 

  Access Paper or Ask Questions

An Exploratory Study on Perceptual Spaces of the Singing Voice


Nov 16, 2021
Brendan O'Connor, Simon Dixon, George Fazekas

* In Proceedings of the 2020 Joint Conference on AI Music Creativity (CSMC-MuMe 2020), Stockholm, Sweden, October 15-19, 2020 

  Access Paper or Ask Questions

MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription


Aug 05, 2021
Emir Demirel, Sven AhlbÀck, Simon Dixon


  Access Paper or Ask Questions

Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes


Jul 28, 2021
Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven AhlbÀck

* 4 figures, 4 tables and 7 pages. Accepted for publication at ISMIR Conference 2021 

  Access Paper or Ask Questions

Computational Pronunciation Analysis in Sung Utterances


Jun 21, 2021
Emir Demirel, Sven Ahlback, Simon Dixon


  Access Paper or Ask Questions

Low Resource Audio-to-Lyrics Alignment From Polyphonic Music Recordings


Feb 18, 2021
Emir Demirel, Sven AhlbÀck, Simon Dixon


  Access Paper or Ask Questions

Structure-Aware Audio-to-Score Alignment using Progressively Dilated Convolutional Neural Networks


Feb 14, 2021
Ruchit Agrawal, Daniel Wolff, Simon Dixon

* ICASSP 2021 camera-ready version. Copyrights belong to IEEE 

  Access Paper or Ask Questions

Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation


Jan 03, 2021
Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven AhlbÀck, Patrik Ohlsson

* 5 pages, 2 figures and 1 table. Accepted for publication in IEEE Signal Processing Letters 

  Access Paper or Ask Questions

Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment


Nov 15, 2020
Ruchit Agrawal, Simon Dixon

* Accepted at EUSIPCO 2020 

  Access Paper or Ask Questions

A Hybrid Approach to Audio-to-Score Alignment


Jul 28, 2020
Ruchit Agrawal, Simon Dixon

* ML4MD at ICML 2019 

  Access Paper or Ask Questions

Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention


Jul 24, 2020
Emir Demirel, Sven Ahlback, Simon Dixon


  Access Paper or Ask Questions

Reliable Local Explanations for Machine Listening


May 15, 2020
Saumitra Mishra, Emmanouil Benetos, Bob L. Sturm, Simon Dixon

* 8 pages plus references. Accepted at the IJCNN 2020 Special Session on Explainable Computational/Artificial Intelligence. Camera-ready version 

  Access Paper or Ask Questions

Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling


Nov 14, 2019
Daniel Stoller, Mi Tian, Sebastian Ewert, Simon Dixon

* Code available at https://github.com/f90/Seq-U-Net 

  Access Paper or Ask Questions

Training Generative Adversarial Networks from Incomplete Observations using Factorised Discriminators


May 29, 2019
Daniel Stoller, Sebastian Ewert, Simon Dixon

* 10 pages plus 14 pages appendix. Under review. Implementation available at https://github.com/f90/FactorGAN 

  Access Paper or Ask Questions

GAN-based Generation and Automatic Selection of Explanations for Neural Networks


Apr 27, 2019
Saumitra Mishra, Daniel Stoller, Emmanouil Benetos, Bob L. Sturm, Simon Dixon

* SafeML Workshop at the International Conference on Learning Representations (ICLR) 2019 
* 8 pages plus references and appendix. Accepted at the ICLR 2019 Workshop "Safe Machine Learning: Specification, Robustness and Assurance". Camera-ready version. v2: Corrected page header 

  Access Paper or Ask Questions

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation


Jun 08, 2018
Daniel Stoller, Sebastian Ewert, Simon Dixon

* 19th International Society for Music Information Retrieval Conference (ISMIR 2018) 
* 7 pages (1 for references), 4 figures, 3 tables. Appearing in the proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR 2018) (camera-ready version). Implementation available at https://github.com/f90/Wave-U-Net 

  Access Paper or Ask Questions

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction


Apr 06, 2018
Daniel Stoller, Sebastian Ewert, Simon Dixon

* 5 pages, 2 figures, 1 table. Final version of manuscript accepted for 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Implementation available at https://github.com/f90/AdversarialAudioSeparation 

  Access Paper or Ask Questions

Jointly Detecting and Separating Singing Voice: A Multi-Task Approach


Apr 05, 2018
Daniel Stoller, Sebastian Ewert, Simon Dixon

* 10 pages, 2 figures, accepted for the 14th International Conference on Latent Variable Analysis and Signal Separation 

  Access Paper or Ask Questions

Note Value Recognition for Piano Transcription Using Markov Random Fields


Jul 07, 2017
Eita Nakamura, Kazuyoshi Yoshii, Simon Dixon

* 13 pages, 16 figures, version accepted to IEEE/ACM TASLP, minor revision 

  Access Paper or Ask Questions

An End-to-End Neural Network for Polyphonic Piano Music Transcription


Feb 11, 2016
Siddharth Sigtia, Emmanouil Benetos, Simon Dixon


  Access Paper or Ask Questions

Identifying Cover Songs Using Information-Theoretic Measures of Similarity


May 17, 2015
Peter Foster, Simon Dixon, Anssi Klapuri

* IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23 no. 6, pp. 993-1005, 2015 
* 13 pages, 5 figures, 4 tables. v3: Accepted version 

  Access Paper or Ask Questions

A Hybrid Recurrent Neural Network For Music Transcription


Nov 06, 2014
Siddharth Sigtia, Emmanouil Benetos, Nicolas Boulanger-Lewandowski, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon


  Access Paper or Ask Questions

Sequential Complexity as a Descriptor for Musical Similarity


Sep 28, 2014
Peter Foster, Matthias Mauch, Simon Dixon

* IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22 no. 12, pp. 1965-1977, 2014 
* 13 pages, 9 figures, 8 tables. Accepted version 

  Access Paper or Ask Questions