Alert button
Picture for Juan Zuluaga-Gomez

Juan Zuluaga-Gomez

Alert button

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Add code
Bookmark button
Alert button
Nov 01, 2023
Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico

Viaarxiv icon

Implementing contextual biasing in GPU decoder for online ASR

Add code
Bookmark button
Alert button
Jun 23, 2023
Iuliia Nigmatulina, Srikanth Madikeri, Esaú Villatoro-Tello, Petr Motliček, Juan Zuluaga-Gomez, Karthik Pandia, Aravind Ganapathiraju

Figure 1 for Implementing contextual biasing in GPU decoder for online ASR
Figure 2 for Implementing contextual biasing in GPU decoder for online ASR
Figure 3 for Implementing contextual biasing in GPU decoder for online ASR
Viaarxiv icon

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice

Add code
Bookmark button
Alert button
May 29, 2023
Juan Zuluaga-Gomez, Sara Ahmed, Danielius Visockas, Cem Subakan

Figure 1 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Figure 2 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Figure 3 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Figure 4 for CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice
Viaarxiv icon

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition

Add code
Bookmark button
Alert button
May 29, 2023
Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet, Petr Motlicek

Figure 1 for HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition
Figure 2 for HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition
Figure 3 for HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition
Figure 4 for HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition
Viaarxiv icon

Breast Cancer Diagnosis Using Machine Learning Techniques

Add code
Bookmark button
Alert button
May 04, 2023
Juan Zuluaga-Gomez

Figure 1 for Breast Cancer Diagnosis Using Machine Learning Techniques
Figure 2 for Breast Cancer Diagnosis Using Machine Learning Techniques
Figure 3 for Breast Cancer Diagnosis Using Machine Learning Techniques
Figure 4 for Breast Cancer Diagnosis Using Machine Learning Techniques
Viaarxiv icon

Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding

Add code
Bookmark button
Alert button
May 02, 2023
Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Driss Khalil, Srikanth Madikeri, Allan Tart, Igor Szoke, Vincent Lenders, Mickael Rigault, Khalid Choukri

Figure 1 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Figure 2 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Figure 3 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Figure 4 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Viaarxiv icon

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers

Add code
Bookmark button
Alert button
Apr 16, 2023
Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek, Matthias Kleinert

Figure 1 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Figure 2 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Figure 3 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Figure 4 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Viaarxiv icon

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

Add code
Bookmark button
Alert button
Dec 16, 2022
Esaú Villatoro-Tello, Srikanth Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Petr Motlicek, Alexei V. Ivanov, Aravind Ganapathiraju

Figure 1 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 2 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 3 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 4 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Viaarxiv icon

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator

Add code
Bookmark button
Alert button
Dec 14, 2022
Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Saeed Sarfjoo, Iuliia Nigmatulina, Karel Vesely

Figure 1 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Figure 2 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Figure 3 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Figure 4 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Viaarxiv icon

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Add code
Bookmark button
Alert button
Nov 08, 2022
Juan Zuluaga-Gomez, Karel Veselý, Igor Szöke, Petr Motlicek, Martin Kocour, Mickael Rigault, Khalid Choukri, Amrutha Prasad, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Claudia Cevenini, Pavel Kolčárek, Allan Tart, Jan Černocký

Figure 1 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Figure 2 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Figure 3 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Figure 4 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Viaarxiv icon