Picture for Iuliia Nigmatulina

Iuliia Nigmatulina

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR

Add code
Jul 05, 2024
Viaarxiv icon

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models

Add code
Jul 05, 2024
Viaarxiv icon

Implementing contextual biasing in GPU decoder for online ASR

Add code
Jun 23, 2023
Figure 1 for Implementing contextual biasing in GPU decoder for online ASR
Figure 2 for Implementing contextual biasing in GPU decoder for online ASR
Figure 3 for Implementing contextual biasing in GPU decoder for online ASR
Viaarxiv icon

Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding

Add code
May 02, 2023
Figure 1 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Figure 2 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Figure 3 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Figure 4 for Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding
Viaarxiv icon

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers

Add code
Apr 16, 2023
Figure 1 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Figure 2 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Figure 3 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Figure 4 for A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Viaarxiv icon

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

Add code
Dec 16, 2022
Figure 1 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 2 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 3 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Figure 4 for Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks
Viaarxiv icon

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator

Add code
Dec 14, 2022
Figure 1 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Figure 2 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Figure 3 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Figure 4 for Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator
Viaarxiv icon

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Add code
Nov 08, 2022
Figure 1 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Figure 2 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Figure 3 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Figure 4 for ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Viaarxiv icon

How Does Pre-trained Wav2Vec2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications

Add code
Mar 31, 2022
Figure 1 for How Does Pre-trained Wav2Vec2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications
Figure 2 for How Does Pre-trained Wav2Vec2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications
Figure 3 for How Does Pre-trained Wav2Vec2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications
Viaarxiv icon

A two-step approach to leverage contextual data: speech recognition in air-traffic communications

Add code
Feb 08, 2022
Figure 1 for A two-step approach to leverage contextual data: speech recognition in air-traffic communications
Figure 2 for A two-step approach to leverage contextual data: speech recognition in air-traffic communications
Figure 3 for A two-step approach to leverage contextual data: speech recognition in air-traffic communications
Figure 4 for A two-step approach to leverage contextual data: speech recognition in air-traffic communications
Viaarxiv icon