Multi-scale Feature Learning Dynamics: Insights for Double Descent


Dec 06, 2021
Mohammad Pezeshki , Amartya Mitra , Yoshua Bengio , Guillaume Lajoie



Simple data balancing achieves competitive worst-group-accuracy


Oct 27, 2021
Badr Youbi Idrissi , Martin Arjovsky , Mohammad Pezeshki , David Lopez-Paz



Gradient Starvation: A Learning Proclivity in Neural Networks


Nov 23, 2020
Mohammad Pezeshki , Sékou-Oumar Kaba , Yoshua Bengio , Aaron Courville , Doina Precup , Guillaume Lajoie



On the Learning Dynamics of Deep Neural Networks


Sep 18, 2018
Remi Tachet des Combes , Mohammad Pezeshki , Samira Shabanian , Aaron Courville , Yoshua Bengio

* 19 pages, 7 figures 


Negative Momentum for Improved Game Dynamics


Jul 12, 2018
Gauthier Gidel , Reyhane Askari Hemmat , Mohammad Pezeshki , Gabriel Huang , Remi Lepriol , Simon Lacoste-Julien , Ioannis Mitliagkas



Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations


Sep 22, 2017
David Krueger , Tegan Maharaj , János Kramár , Mohammad Pezeshki , Nicolas Ballas , Nan Rosemary Ke , Anirudh Goyal , Yoshua Bengio , Aaron Courville , Chris Pal

* David Krueger and Tegan Maharaj contributed equally to this work 


Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks


Jan 10, 2017
Ying Zhang , Mohammad Pezeshki , Philemon Brakel , Saizheng Zhang , Cesar Laurent , Yoshua Bengio , Aaron Courville



Deconstructing the Ladder Network Architecture


May 24, 2016
Mohammad Pezeshki , Linxi Fan , Philemon Brakel , Aaron Courville , Yoshua Bengio

* Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016 


Theano: A Python framework for fast computation of mathematical expressions


May 09, 2016
The Theano Development Team , Rami Al-Rfou , Guillaume Alain , Amjad Almahairi , Christof Angermueller , Dzmitry Bahdanau , Nicolas Ballas , Frédéric Bastien , Justin Bayer , Anatoly Belikov , Alexander Belopolsky , Yoshua Bengio , Arnaud Bergeron , James Bergstra , Valentin Bisson , Josh Bleecher Snyder , Nicolas Bouchard , Nicolas Boulanger-Lewandowski , Xavier Bouthillier , Alexandre de Brébisson , Olivier Breuleux , Pierre-Luc Carrier , Kyunghyun Cho , Jan Chorowski , Paul Christiano , Tim Cooijmans , Marc-Alexandre Côté , Myriam Côté , Aaron Courville , Yann N. Dauphin , Olivier Delalleau , Julien Demouth , Guillaume Desjardins , Sander Dieleman , Laurent Dinh , Mélanie Ducoffe , Vincent Dumoulin , Samira Ebrahimi Kahou , Dumitru Erhan , Ziye Fan , Orhan Firat , Mathieu Germain , Xavier Glorot , Ian Goodfellow , Matt Graham , Caglar Gulcehre , Philippe Hamel , Iban Harlouchet , Jean-Philippe Heng , Balázs Hidasi , Sina Honari , Arjun Jain , Sébastien Jean , Kai Jia , Mikhail Korobov , Vivek Kulkarni , Alex Lamb , Pascal Lamblin , Eric Larsen , César Laurent , Sean Lee , Simon Lefrancois , Simon Lemieux , Nicholas Léonard , Zhouhan Lin , Jesse A. Livezey , Cory Lorenz , Jeremiah Lowin , Qianli Ma , Pierre-Antoine Manzagol , Olivier Mastropietro , Robert T. McGibbon , Roland Memisevic , Bart van Merriënboer , Vincent Michalski , Mehdi Mirza , Alberto Orlandi , Christopher Pal , Razvan Pascanu , Mohammad Pezeshki , Colin Raffel , Daniel Renshaw , Matthew Rocklin , Adriana Romero , Markus Roth , Peter Sadowski , John Salvatier , François Savard , Jan Schlüter , John Schulman , Gabriel Schwartz , Iulian Vlad Serban , Dmitriy Serdyuk , Samira Shabanian , Étienne Simon , Sigurd Spieckermann , S. Ramana Subramanyam , Jakub Sygnowski , Jérémie Tanguay , Gijs van Tulder , Joseph Turian , Sebastian Urban , Pascal Vincent , Francesco Visin , Harm de Vries , David Warde-Farley , Dustin J. Webb , Matthew Willson , Kelvin Xu , Lijun Xue , Li Yao , Saizheng Zhang , Ying Zhang

* 19 pages, 5 figures 


Sequence Modeling using Gated Recurrent Neural Networks


Jan 01, 2015
Mohammad Pezeshki

