Aren Jansen

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

Oct 09, 2021
Joel Shor, Aren Jansen, Wei Han, Daniel Park, Yu Zhang

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

Oct 01, 2021
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 

Attention Bottlenecks for Multimodal Fusion

Jun 30, 2021
Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun

Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation

Jun 01, 2021
Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey

* 5 pages, 1 figure. Submitted to WASPAA 2021

The Benefit Of Temporally-Strong Labels In Audio Event Classification

May 14, 2021
Shawn Hershey, Daniel P W Ellis, Eduardo Fonseca, Aren Jansen, Caroline Liu, R Channing Moore, Manoj Plakal

* Accepted for publication at ICASSP 2021 

Self-Supervised Learning from Automatically Separated Sound Scenes

May 05, 2021
Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra

Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds

Nov 02, 2020
Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Daniel P. W. Ellis, John R. Hershey

Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking

May 02, 2020
Eduardo Fonseca, Shawn Hershey, Manoj Plakal, Daniel P. W. Ellis, Aren Jansen, R. Channing Moore, Xavier Serra

Towards Learning a Universal Non-Semantic Representation of Speech

Mar 02, 2020
Joel Shor, Aren Jansen, Ronnie Maor, Oran Lang, Omry Tuval, Felix de Chaumont Quitry, Marco Tagliasacchi, Ira Shavitt, Dotan Emanuel, Yinnon Haviv

Improving Universal Sound Separation Using Sound Classification

Nov 18, 2019
Efthymios Tzinis, Scott Wisdom, John R. Hershey, Aren Jansen, Daniel P. W. Ellis

Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision

Nov 14, 2019
Aren Jansen, Daniel P. W. Ellis, Shawn Hershey, R. Channing Moore, Manoj Plakal, Ashok C. Popat, Rif A. Saurous

* This extended version of an ICASSP 2020 submission under the same title has an added figure and additional discussion for easier consumption

Unsupervised Learning of Semantic Audio Representations

Nov 06, 2017
Aren Jansen, Manoj Plakal, Ratheet Pandya, Daniel P. W. Ellis, Shawn Hershey, Jiayang Liu, R. Channing Moore, Rif A. Saurous

* Submitted to ICASSP 2018 

A segmental framework for fully-unsupervised large-vocabulary speech recognition

Sep 16, 2017
Herman Kamper, Aren Jansen, Sharon Goldwater

* Comput. Speech Lang. 46 (2017) 154-174 
* 15 pages, 6 figures, 8 tables 

CNN Architectures for Large-Scale Audio Classification

Jan 10, 2017
Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

* Accepted for publication at ICASSP 2017. Changes: Added definitions of mAP, AUC, and d-prime. Updated mAP/AUC/d-prime numbers for Audio Set based on changes in the latest Audio Set revision. Changed wording to fit the 4-page limit with the new additions.

Scalable Out-of-Sample Extension of Graph Embeddings Using Deep Neural Networks

Jun 14, 2016
Aren Jansen, Gregory Sell, Vince Lyzinski

* 10 pages, 2 figures, 1 table; this paper is under consideration for publication in Pattern Recognition Letters

Unsupervised word segmentation and lexicon discovery using acoustic word embeddings

Mar 09, 2016
Herman Kamper, Aren Jansen, Sharon Goldwater

* IEEE/ACM Trans. Audio, Speech, Language Process. 24 (2016) 669-679 
* 11 pages, 8 figures; Accepted to the IEEE/ACM Transactions on Audio, Speech, and Language Processing 
