Alert button
Picture for Vahid Noroozi

Vahid Noroozi

Alert button

Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jan 11, 2024
Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

Viaarxiv icon

Stateful FastConformer with Cache-based Inference for Streaming Automatic Speech Recognition

Add code
Bookmark button
Alert button
Dec 27, 2023
Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

Viaarxiv icon

Investigating End-to-End ASR Architectures for Long Form Audio Transcription

Add code
Bookmark button
Alert button
Sep 20, 2023
Nithin Rao Koluguri, Samuel Kriman, Georgy Zelenfroind, Somshubra Majumdar, Dima Rekesh, Vahid Noroozi, Jagadeesh Balam, Boris Ginsburg

Figure 1 for Investigating End-to-End ASR Architectures for Long Form Audio Transcription
Figure 2 for Investigating End-to-End ASR Architectures for Long Form Audio Transcription
Figure 3 for Investigating End-to-End ASR Architectures for Long Form Audio Transcription
Figure 4 for Investigating End-to-End ASR Architectures for Long Form Audio Transcription
Viaarxiv icon

Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

Add code
Bookmark button
Alert button
May 19, 2023
Dima Rekesh, Samuel Kriman, Somshubra Majumdar, Vahid Noroozi, He Huang, Oleksii Hrinchuk, Ankur Kumar, Boris Ginsburg

Figure 1 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Figure 2 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Figure 3 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Figure 4 for Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
Viaarxiv icon

SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services

Add code
Bookmark button
Alert button
May 17, 2021
Yang Zhang, Vahid Noroozi, Evelina Bakhturina, Boris Ginsburg

Figure 1 for SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services
Figure 2 for SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services
Figure 3 for SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services
Figure 4 for SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services
Viaarxiv icon

SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition

Add code
Bookmark button
Alert button
Apr 06, 2021
Patrick K. O'Neill, Vitaly Lavrukhin, Somshubra Majumdar, Vahid Noroozi, Yuekai Zhang, Oleksii Kuchaiev, Jagadeesh Balam, Yuliya Dovzhenko, Keenan Freyberg, Michael D. Shulman, Boris Ginsburg, Shinji Watanabe, Georg Kucsko

Figure 1 for SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Figure 2 for SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Figure 3 for SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Figure 4 for SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Viaarxiv icon

Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 05, 2021
Somshubra Majumdar, Jagadeesh Balam, Oleksii Hrinchuk, Vitaly Lavrukhin, Vahid Noroozi, Boris Ginsburg

Figure 1 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Figure 2 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Figure 3 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Figure 4 for Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition
Viaarxiv icon

I-ODA, Real-World Multi-modal Longitudinal Data for OphthalmicApplications

Add code
Bookmark button
Alert button
Mar 30, 2021
Nooshin Mojab, Vahid Noroozi, Abdullah Aleem, Manoj P. Nallabothula, Joseph Baker, Dimitri T. Azar, Mark Rosenblatt, RV Paul Chan, Darvin Yi, Philip S. Yu, Joelle A. Hallak

Figure 1 for I-ODA, Real-World Multi-modal Longitudinal Data for OphthalmicApplications
Figure 2 for I-ODA, Real-World Multi-modal Longitudinal Data for OphthalmicApplications
Figure 3 for I-ODA, Real-World Multi-modal Longitudinal Data for OphthalmicApplications
Figure 4 for I-ODA, Real-World Multi-modal Longitudinal Data for OphthalmicApplications
Viaarxiv icon