Alert button
Picture for Olivier Siohan

Olivier Siohan

Alert button

On Robustness to Missing Video for Audiovisual Speech Recognition

Add code
Bookmark button
Alert button
Dec 19, 2023
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan

Figure 1 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 2 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 3 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 4 for On Robustness to Missing Video for Audiovisual Speech Recognition
Viaarxiv icon

Revisiting the Entropy Semiring for Neural Speech Recognition

Add code
Bookmark button
Alert button
Dec 19, 2023
Oscar Chang, Dongseong Hwang, Olivier Siohan

Figure 1 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 2 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 3 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 4 for Revisiting the Entropy Semiring for Neural Speech Recognition
Viaarxiv icon

Audio-visual fine-tuning of audio-only ASR models

Add code
Bookmark button
Alert button
Dec 14, 2023
Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan

Viaarxiv icon

Cascaded encoders for fine-tuning ASR models on overlapped speech

Add code
Bookmark button
Alert button
Jun 28, 2023
Richard Rose, Oscar Chang, Olivier Siohan

Figure 1 for Cascaded encoders for fine-tuning ASR models on overlapped speech
Figure 2 for Cascaded encoders for fine-tuning ASR models on overlapped speech
Figure 3 for Cascaded encoders for fine-tuning ASR models on overlapped speech
Figure 4 for Cascaded encoders for fine-tuning ASR models on overlapped speech
Viaarxiv icon

Conformers are All You Need for Visual Speech Recogntion

Add code
Bookmark button
Alert button
Feb 17, 2023
Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Shah, Olivier Siohan

Figure 1 for Conformers are All You Need for Visual Speech Recogntion
Figure 2 for Conformers are All You Need for Visual Speech Recogntion
Figure 3 for Conformers are All You Need for Visual Speech Recogntion
Figure 4 for Conformers are All You Need for Visual Speech Recogntion
Viaarxiv icon

End-to-End Multi-Person Audio/Visual Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 11, 2022
Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao

Figure 1 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 2 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 3 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 4 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Viaarxiv icon

A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection

Add code
Bookmark button
Alert button
May 11, 2022
Otavio Braga, Olivier Siohan

Figure 1 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Figure 2 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Figure 3 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Figure 4 for A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection
Viaarxiv icon

Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection

Add code
Bookmark button
Alert button
May 10, 2022
Otavio Braga, Olivier Siohan

Figure 1 for Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection
Figure 2 for Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection
Figure 3 for Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection
Viaarxiv icon

End-to-end multi-talker audio-visual ASR using an active speaker attention module

Add code
Bookmark button
Alert button
Apr 01, 2022
Richard Rose, Olivier Siohan

Figure 1 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Figure 2 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Figure 3 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Figure 4 for End-to-end multi-talker audio-visual ASR using an active speaker attention module
Viaarxiv icon

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Jan 25, 2022
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

Figure 1 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 2 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 3 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Figure 4 for Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
Viaarxiv icon