Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds

Nov 02, 2020
Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Daniel P. W. Ellis, John R. Hershey


  Access Paper or Ask Questions

Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking

May 02, 2020
Eduardo Fonseca, Shawn Hershey, Manoj Plakal, Daniel P. W. Ellis, Aren Jansen, R. Channing Moore, Xavier Serra


  Access Paper or Ask Questions

Improving Universal Sound Separation Using Sound Classification

Nov 18, 2019
Efthymios Tzinis, Scott Wisdom, John R. Hershey, Aren Jansen, Daniel P. W. Ellis


  Access Paper or Ask Questions

Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision

Nov 14, 2019
Aren Jansen, Daniel P. W. Ellis, Shawn Hershey, R. Channing Moore, Manoj Plakal, Ashok C. Popat, Rif A. Saurous

* This extended version of a ICASSP 2020 submission under same title has an added figure and additional discussion for easier consumption 

  Access Paper or Ask Questions

Audio tagging with noisy labels and minimal supervision

Jul 14, 2019
Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Serra

* submitted to DCASE2019 Workshop 

  Access Paper or Ask Questions

Learning Sound Event Classifiers from Web Audio with Noisy Labels

Jan 04, 2019
Eduardo Fonseca, Manoj Plakal, Daniel P. W. Ellis, Frederic Font, Xavier Favory, Xavier Serra


  Access Paper or Ask Questions

General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline

Oct 07, 2018
Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Favory, Jordi Pons, Xavier Serra

* Camera ready for DCASE Workshop 2018 

  Access Paper or Ask Questions

Unsupervised Learning of Semantic Audio Representations

Nov 06, 2017
Aren Jansen, Manoj Plakal, Ratheet Pandya, Daniel P. W. Ellis, Shawn Hershey, Jiayang Liu, R. Channing Moore, Rif A. Saurous

* Submitted to ICASSP 2018 

  Access Paper or Ask Questions

CNN Architectures for Large-Scale Audio Classification

Jan 10, 2017
Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

* Accepted for publication at ICASSP 2017 Changes: Added definitions of mAP, AUC, and d-prime. Updated mAP/AUC/d-prime numbers for Audio Set based on changes of latest Audio Set revision. Changed wording to fit 4 page limit with new additions 

  Access Paper or Ask Questions

Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems

Sep 20, 2016
Colin Raffel, Daniel P. W. Ellis


  Access Paper or Ask Questions