Vahid Noroozi

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

Jul 29, 2024
Via arXiv

Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

Jun 18, 2024
Via arXiv

Nemotron-4 340B Technical Report

Jun 17, 2024
Via arXiv

Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

Jan 11, 2024
Via arXiv

Investigating End-to-End ASR Architectures for Long Form Audio Transcription

Sep 20, 2023
Via arXiv

Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

May 19, 2023
Via arXiv

SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services

May 17, 2021
Via arXiv

SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition

Apr 06, 2021
Via arXiv

Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition

Apr 05, 2021
Via arXiv

I-ODA, Real-World Multi-modal Longitudinal Data for Ophthalmic Applications

Mar 30, 2021
Via arXiv