Alert button
Picture for Leda Sarı

Leda Sarı

Alert button

Towards Selection of Text-to-speech Data to Augment ASR Training

May 30, 2023
Shuo Liu, Leda Sarı, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli

Figure 1 for Towards Selection of Text-to-speech Data to Augment ASR Training
Figure 2 for Towards Selection of Text-to-speech Data to Augment ASR Training
Figure 3 for Towards Selection of Text-to-speech Data to Augment ASR Training
Figure 4 for Towards Selection of Text-to-speech Data to Augment ASR Training
Viaarxiv icon

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition

Mar 01, 2023
Philipp Klumpp, Pooja Chitkara, Leda Sarı, Prashant Serai, Jilong Wu, Irina-Elena Veliche, Rongqing Huang, Qing He

Figure 1 for Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
Figure 2 for Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
Figure 3 for Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
Viaarxiv icon

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

Nov 18, 2021
Chunxi Liu, Michael Picheny, Leda Sarı, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf

Figure 1 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Figure 2 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Viaarxiv icon

A Multi-View Approach To Audio-Visual Speaker Verification

Feb 11, 2021
Leda Sarı, Kritika Singh, Jiatong Zhou, Lorenzo Torresani, Nayan Singhal, Yatharth Saraf

Figure 1 for A Multi-View Approach To Audio-Visual Speaker Verification
Figure 2 for A Multi-View Approach To Audio-Visual Speaker Verification
Figure 3 for A Multi-View Approach To Audio-Visual Speaker Verification
Figure 4 for A Multi-View Approach To Audio-Visual Speaker Verification
Viaarxiv icon

Deep F-measure Maximization for End-to-End Speech Understanding

Aug 08, 2020
Leda Sarı, Mark Hasegawa-Johnson

Figure 1 for Deep F-measure Maximization for End-to-End Speech Understanding
Figure 2 for Deep F-measure Maximization for End-to-End Speech Understanding
Figure 3 for Deep F-measure Maximization for End-to-End Speech Understanding
Figure 4 for Deep F-measure Maximization for End-to-End Speech Understanding
Viaarxiv icon

Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR

Feb 14, 2020
Leda Sarı, Niko Moritz, Takaaki Hori, Jonathan Le Roux

Figure 1 for Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR
Figure 2 for Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR
Figure 3 for Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR
Figure 4 for Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR
Viaarxiv icon