Alert button
Picture for Dmitriy Serdyuk

Dmitriy Serdyuk

Alert button

On Robustness to Missing Video for Audiovisual Speech Recognition

Add code
Bookmark button
Alert button
Dec 19, 2023
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan

Figure 1 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 2 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 3 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 4 for On Robustness to Missing Video for Audiovisual Speech Recognition
Viaarxiv icon

Audio-visual fine-tuning of audio-only ASR models

Add code
Bookmark button
Alert button
Dec 14, 2023
Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan

Viaarxiv icon

Conformers are All You Need for Visual Speech Recogntion

Add code
Bookmark button
Alert button
Feb 17, 2023
Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Shah, Olivier Siohan

Figure 1 for Conformers are All You Need for Visual Speech Recogntion
Figure 2 for Conformers are All You Need for Visual Speech Recogntion
Figure 3 for Conformers are All You Need for Visual Speech Recogntion
Figure 4 for Conformers are All You Need for Visual Speech Recogntion
Viaarxiv icon

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Jan 25, 2022
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

Figure 1 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 2 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 3 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 4 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Viaarxiv icon

Audio-Visual Speech Recognition is Worth 32$\times$32$\times$8 Voxels

Add code
Bookmark button
Alert button
Sep 20, 2021
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

Figure 1 for Audio-Visual Speech Recognition is Worth 32$\times$32$\times$8 Voxels
Figure 2 for Audio-Visual Speech Recognition is Worth 32$\times$32$\times$8 Voxels
Figure 3 for Audio-Visual Speech Recognition is Worth 32$\times$32$\times$8 Voxels
Figure 4 for Audio-Visual Speech Recognition is Worth 32$\times$32$\times$8 Voxels
Viaarxiv icon

Accounting for Variance in Machine Learning Benchmarks

Add code
Bookmark button
Alert button
Mar 01, 2021
Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

Figure 1 for Accounting for Variance in Machine Learning Benchmarks
Figure 2 for Accounting for Variance in Machine Learning Benchmarks
Figure 3 for Accounting for Variance in Machine Learning Benchmarks
Figure 4 for Accounting for Variance in Machine Learning Benchmarks
Viaarxiv icon

Unsupervised adversarial domain adaptation for acoustic scene classification

Add code
Bookmark button
Alert button
Aug 17, 2018
Shayan Gharib, Konstantinos Drossos, Emre Çakir, Dmitriy Serdyuk, Tuomas Virtanen

Figure 1 for Unsupervised adversarial domain adaptation for acoustic scene classification
Figure 2 for Unsupervised adversarial domain adaptation for acoustic scene classification
Figure 3 for Unsupervised adversarial domain adaptation for acoustic scene classification
Viaarxiv icon

Twin Regularization for online speech recognition

Add code
Bookmark button
Alert button
Jun 12, 2018
Mirco Ravanelli, Dmitriy Serdyuk, Yoshua Bengio

Figure 1 for Twin Regularization for online speech recognition
Figure 2 for Twin Regularization for online speech recognition
Figure 3 for Twin Regularization for online speech recognition
Figure 4 for Twin Regularization for online speech recognition
Viaarxiv icon

Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations

Add code
Bookmark button
Alert button
Apr 07, 2018
Alex Lamb, Jonathan Binas, Anirudh Goyal, Dmitriy Serdyuk, Sandeep Subramanian, Ioannis Mitliagkas, Yoshua Bengio

Figure 1 for Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations
Figure 2 for Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations
Figure 3 for Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations
Figure 4 for Fortified Networks: Improving the Robustness of Deep Networks by Modeling the Manifold of Hidden Representations
Viaarxiv icon

Deep Complex Networks

Add code
Bookmark button
Alert button
Feb 25, 2018
Chiheb Trabelsi, Olexa Bilaniuk, Ying Zhang, Dmitriy Serdyuk, Sandeep Subramanian, João Felipe Santos, Soroush Mehri, Negar Rostamzadeh, Yoshua Bengio, Christopher J Pal

Figure 1 for Deep Complex Networks
Figure 2 for Deep Complex Networks
Figure 3 for Deep Complex Networks
Figure 4 for Deep Complex Networks
Viaarxiv icon