Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Text-Driven Separation of Arbitrary Sounds

Kevin Kilgour , Beat Gfeller , Qingqing Huang , Aren Jansen , Scott Wisdom , Marco Tagliasacchi

* Submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

Joel Shor , Aren Jansen , Wei Han , Daniel Park , Yu Zhang

   Access Paper or Ask Questions

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

Yu Zhang , Daniel S. Park , Wei Han , James Qin , Anmol Gulati , Joel Shor , Aren Jansen , Yuanzhong Xu , Yanping Huang , Shibo Wang , Zongwei Zhou , Bo Li , Min Ma , William Chan , Jiahui Yu , Yongqiang Wang , Liangliang Cao , Khe Chai Sim , Bhuvana Ramabhadran , Tara N. Sainath , Françoise Beaufays , Zhifeng Chen , Quoc V. Le , Chung-Cheng Chiu , Ruoming Pang , Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 

   Access Paper or Ask Questions

Attention Bottlenecks for Multimodal Fusion

Arsha Nagrani , Shan Yang , Anurag Arnab , Aren Jansen , Cordelia Schmid , Chen Sun

   Access Paper or Ask Questions

Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation

Scott Wisdom , Aren Jansen , Ron J. Weiss , Hakan Erdogan , John R. Hershey

* 5 pages, 1 figure. submitted to WASPAA 2021 

   Access Paper or Ask Questions

The Benefit Of Temporally-Strong Labels In Audio Event Classification

Shawn Hershey , Daniel P W Ellis , Eduardo Fonseca , Aren Jansen , Caroline Liu , R Channing Moore , Manoj Plakal

* Accepted for publication at ICASSP 2021 

   Access Paper or Ask Questions

Self-Supervised Learning from Automatically Separated Sound Scenes

Eduardo Fonseca , Aren Jansen , Daniel P. W. Ellis , Scott Wisdom , Marco Tagliasacchi , John R. Hershey , Manoj Plakal , Shawn Hershey , R. Channing Moore , Xavier Serra

   Access Paper or Ask Questions

Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds

Efthymios Tzinis , Scott Wisdom , Aren Jansen , Shawn Hershey , Tal Remez , Daniel P. W. Ellis , John R. Hershey

   Access Paper or Ask Questions

Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking

Eduardo Fonseca , Shawn Hershey , Manoj Plakal , Daniel P. W. Ellis , Aren Jansen , R. Channing Moore , Xavier Serra

   Access Paper or Ask Questions

Towards Learning a Universal Non-Semantic Representation of Speech

Joel Shor , Aren Jansen , Ronnie Maor , Oran Lang , Omry Tuval , Felix de Chaumont Quitry , Marco Tagliasacchi , Ira Shavitt , Dotan Emanuel , Yinnon Haviv

   Access Paper or Ask Questions