Alert button
Picture for Yu Wu

Yu Wu

Alert button

LongFNT: Long-form Speech Recognition with Factorized Neural Transducer

Add code
Bookmark button
Alert button
Nov 17, 2022
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

Figure 1 for LongFNT: Long-form Speech Recognition with Factorized Neural Transducer
Figure 2 for LongFNT: Long-form Speech Recognition with Factorized Neural Transducer
Figure 3 for LongFNT: Long-form Speech Recognition with Factorized Neural Transducer
Figure 4 for LongFNT: Long-form Speech Recognition with Factorized Neural Transducer
Viaarxiv icon

Speech separation with large-scale self-supervised learning

Add code
Bookmark button
Alert button
Nov 09, 2022
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez

Figure 1 for Speech separation with large-scale self-supervised learning
Figure 2 for Speech separation with large-scale self-supervised learning
Figure 3 for Speech separation with large-scale self-supervised learning
Figure 4 for Speech separation with large-scale self-supervised learning
Viaarxiv icon

LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

Add code
Bookmark button
Alert button
Nov 05, 2022
Peidong Wang, Eric Sun, Jian Xue, Yu Wu, Long Zhou, Yashesh Gaur, Shujie Liu, Jinyu Li

Figure 1 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Figure 2 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Figure 3 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Figure 4 for LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Viaarxiv icon

Two-Stream Network for Sign Language Recognition and Translation

Add code
Bookmark button
Alert button
Nov 02, 2022
Yutong Chen, Ronglai Zuo, Fangyun Wei, Yu Wu, Shujie Liu, Brian Mak

Figure 1 for Two-Stream Network for Sign Language Recognition and Translation
Figure 2 for Two-Stream Network for Sign Language Recognition and Translation
Figure 3 for Two-Stream Network for Sign Language Recognition and Translation
Figure 4 for Two-Stream Network for Sign Language Recognition and Translation
Viaarxiv icon

Real-time Speech Interruption Analysis: From Cloud to Client Deployment

Add code
Bookmark button
Alert button
Oct 24, 2022
Quchen Fu, Szu-Wei Fu, Yaran Fan, Yu Wu, Zhuo Chen, Jayant Gupchup, Ross Cutler

Figure 1 for Real-time Speech Interruption Analysis: From Cloud to Client Deployment
Figure 2 for Real-time Speech Interruption Analysis: From Cloud to Client Deployment
Figure 3 for Real-time Speech Interruption Analysis: From Cloud to Client Deployment
Figure 4 for Real-time Speech Interruption Analysis: From Cloud to Client Deployment
Viaarxiv icon

Foundation Transformers

Add code
Bookmark button
Alert button
Oct 19, 2022
Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei

Figure 1 for Foundation Transformers
Figure 2 for Foundation Transformers
Figure 3 for Foundation Transformers
Figure 4 for Foundation Transformers
Viaarxiv icon

STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions

Add code
Bookmark button
Alert button
Oct 16, 2022
Jinshan Zeng, Ruiying Xu, Yu Wu, Hongwei Li, Jiaxing Lu

Figure 1 for STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions
Figure 2 for STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions
Figure 3 for STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions
Figure 4 for STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions
Viaarxiv icon

Vision+X: A Survey on Multimodal Learning in the Light of Data

Add code
Bookmark button
Alert button
Oct 05, 2022
Ye Zhu, Yu Wu, Nicu Sebe, Yan Yan

Figure 1 for Vision+X: A Survey on Multimodal Learning in the Light of Data
Figure 2 for Vision+X: A Survey on Multimodal Learning in the Light of Data
Figure 3 for Vision+X: A Survey on Multimodal Learning in the Light of Data
Figure 4 for Vision+X: A Survey on Multimodal Learning in the Light of Data
Viaarxiv icon

SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

Add code
Bookmark button
Alert button
Sep 30, 2022
Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Lirong Dai, Jinyu Li, Furu Wei

Figure 1 for SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data
Figure 2 for SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data
Figure 3 for SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data
Figure 4 for SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data
Viaarxiv icon