Nauman Dawalatabad

LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection

Dec 19, 2025
Via arXiv

Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer

Jun 26, 2024
Via arXiv

Improved Cross-Lingual Transfer Learning For Automatic Speech Translation

Jun 01, 2023
Via arXiv

On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration

Nov 14, 2022
Via arXiv

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition

Oct 01, 2022
Via arXiv

Two-Pass End-to-End ASR Model Compression

Jan 08, 2022
Via arXiv

SpeechBrain: A General-Purpose Speech Toolkit

Jun 08, 2021
Via arXiv

ECAPA-TDNN Embeddings for Speaker Diarization

Apr 03, 2021
Via arXiv

Front-end Diarization for Percussion Separation in Taniavartanam of Carnatic Music Concerts

Mar 04, 2021
Via arXiv