Alert button
Picture for Frank Seide

Frank Seide

Alert button

Effective internal language model training and fusion for factorized transducer model

Add code
Bookmark button
Alert button
Apr 02, 2024
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

Viaarxiv icon

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition

Add code
Bookmark button
Alert button
Jan 18, 2024
Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide

Viaarxiv icon

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Add code
Bookmark button
Alert button
Sep 20, 2023
Tiantian Feng, Ju Lin, Yiteng Huang, Weipeng He, Kaustubh Kalgaonkar, Niko Moritz, Li Wan, Xin Lei, Ming Sun, Frank Seide

Figure 1 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 2 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 3 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 4 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Viaarxiv icon

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

Add code
Bookmark button
Alert button
Aug 25, 2023
Mei-Yuh Hwang, Yangyang Shi, Ankit Ramchandani, Guan Pang, Praveen Krishnan, Lucas Kabela, Frank Seide, Samyak Datta, Jun Liu

Figure 1 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 2 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 3 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 4 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Viaarxiv icon

Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers

Add code
Bookmark button
Alert button
Nov 02, 2022
Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer

Figure 1 for Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers
Figure 2 for Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers
Figure 3 for Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers
Figure 4 for Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers
Viaarxiv icon

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 19, 2022
Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen

Figure 1 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Figure 2 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Figure 3 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Figure 4 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Viaarxiv icon

Federated Domain Adaptation for ASR with Full Self-Supervision

Add code
Bookmark button
Alert button
Apr 05, 2022
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide

Figure 1 for Federated Domain Adaptation for ASR with Full Self-Supervision
Figure 2 for Federated Domain Adaptation for ASR with Full Self-Supervision
Figure 3 for Federated Domain Adaptation for ASR with Full Self-Supervision
Figure 4 for Federated Domain Adaptation for ASR with Full Self-Supervision
Viaarxiv icon

Achieving Human Parity on Automatic Chinese to English News Translation

Add code
Bookmark button
Alert button
Jun 29, 2018
Hany Hassan, Anthony Aue, Chang Chen, Vishal Chowdhary, Jonathan Clark, Christian Federmann, Xuedong Huang, Marcin Junczys-Dowmunt, William Lewis, Mu Li, Shujie Liu, Tie-Yan Liu, Renqian Luo, Arul Menezes, Tao Qin, Frank Seide, Xu Tan, Fei Tian, Lijun Wu, Shuangzhi Wu, Yingce Xia, Dongdong Zhang, Zhirui Zhang, Ming Zhou

Figure 1 for Achieving Human Parity on Automatic Chinese to English News Translation
Figure 2 for Achieving Human Parity on Automatic Chinese to English News Translation
Figure 3 for Achieving Human Parity on Automatic Chinese to English News Translation
Figure 4 for Achieving Human Parity on Automatic Chinese to English News Translation
Viaarxiv icon

Marian: Fast Neural Machine Translation in C++

Add code
Bookmark button
Alert button
Apr 04, 2018
Marcin Junczys-Dowmunt, Roman Grundkiewicz, Tomasz Dwojak, Hieu Hoang, Kenneth Heafield, Tom Neckermann, Frank Seide, Ulrich Germann, Alham Fikri Aji, Nikolay Bogoychev, André F. T. Martins, Alexandra Birch

Figure 1 for Marian: Fast Neural Machine Translation in C++
Figure 2 for Marian: Fast Neural Machine Translation in C++
Figure 3 for Marian: Fast Neural Machine Translation in C++
Figure 4 for Marian: Fast Neural Machine Translation in C++
Viaarxiv icon

Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks

Add code
Bookmark button
Alert button
Mar 08, 2013
Dong Yu, Michael L. Seltzer, Jinyu Li, Jui-Ting Huang, Frank Seide

Figure 1 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Figure 2 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Figure 3 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Figure 4 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Viaarxiv icon