Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition



Dmitriy Serdyuk, Otavio Braga, Olivier Siohan



Audio-Visual Speech Recognition is Worth $32\times 32\times 8$ Voxels



Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

* 7 pages, 2 figures, 4 tables. Draft of a paper accepted to the ASRU workshop


Accounting for Variance in Machine Learning Benchmarks



Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

* Submitted to MLSys 2021


Unsupervised adversarial domain adaptation for acoustic scene classification



Shayan Gharib, Konstantinos Drossos, Emre Çakir, Dmitriy Serdyuk, Tuomas Virtanen



Twin Regularization for online speech recognition



Mirco Ravanelli, Dmitriy Serdyuk, Yoshua Bengio

* Accepted at INTERSPEECH 2018


Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations



Alex Lamb, Jonathan Binas, Anirudh Goyal, Dmitriy Serdyuk, Sandeep Subramanian, Ioannis Mitliagkas, Yoshua Bengio

* Under review at ICML 2018


Deep Complex Networks



Chiheb Trabelsi, Olexa Bilaniuk, Ying Zhang, Dmitriy Serdyuk, Sandeep Subramanian, João Felipe Santos, Soroush Mehri, Negar Rostamzadeh, Yoshua Bengio, Christopher J Pal



Twin Networks: Matching the Future for Sequence Generation



Dmitriy Serdyuk, Nan Rosemary Ke, Alessandro Sordoni, Adam Trischler, Chris Pal, Yoshua Bengio

* 12 pages, 3 figures, published at ICLR 2018 


Towards end-to-end spoken language understanding



Dmitriy Serdyuk, Yongqiang Wang, Christian Fuegen, Anuj Kumar, Baiyang Liu, Yoshua Bengio

* Submitted to ICASSP 2018


Invariant Representations for Noisy Speech Recognition



Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

* 5 pages, 1 figure, 1 table, NIPS workshop on end-to-end speech recognition 

