Leda Sarı

Towards Selection of Text-to-speech Data to Augment ASR Training

May 30, 2023
Shuo Liu, Leda Sarı, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition

Mar 01, 2023
Philipp Klumpp, Pooja Chitkara, Leda Sarı, Prashant Serai, Jilong Wu, Irina-Elena Veliche, Rongqing Huang, Qing He

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

Nov 18, 2021
Chunxi Liu, Michael Picheny, Leda Sarı, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf

A Multi-View Approach To Audio-Visual Speaker Verification

Feb 11, 2021
Leda Sarı, Kritika Singh, Jiatong Zhou, Lorenzo Torresani, Nayan Singhal, Yatharth Saraf

Deep F-measure Maximization for End-to-End Speech Understanding

Aug 08, 2020
Leda Sarı, Mark Hasegawa-Johnson

Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR

Feb 14, 2020
Leda Sarı, Niko Moritz, Takaaki Hori, Jonathan Le Roux
