Alert button
Picture for Jinyu Li

Jinyu Li

Alert button

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

Add code
Bookmark button
Alert button
Jun 28, 2023
Yuang Li, Yu Wu, Jinyu Li, Shujie Liu

Figure 1 for Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Figure 2 for Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Figure 3 for Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Figure 4 for Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Viaarxiv icon

Accurate and Structured Pruning for Efficient Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 31, 2023
Huiqiang Jiang, Li Lyna Zhang, Yuang Li, Yu Wu, Shijie Cao, Ting Cao, Yuqing Yang, Jinyu Li, Mao Yang, Lili Qiu

Figure 1 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Figure 2 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Figure 3 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Figure 4 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Viaarxiv icon

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

Add code
Bookmark button
Alert button
May 25, 2023
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Figure 1 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 2 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 3 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 4 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Viaarxiv icon

PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds

Add code
Bookmark button
Alert button
May 08, 2023
Jinyu Li, Chenxu Luo, Xiaodong Yang

Figure 1 for PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds
Figure 2 for PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds
Figure 3 for PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds
Figure 4 for PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds
Viaarxiv icon

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

Add code
Bookmark button
Alert button
Mar 07, 2023
Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Figure 1 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Figure 2 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Figure 3 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Figure 4 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Viaarxiv icon

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

Add code
Bookmark button
Alert button
Mar 01, 2023
Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong

Figure 1 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Figure 2 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Figure 3 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Figure 4 for Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Viaarxiv icon

Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation

Add code
Bookmark button
Alert button
Feb 22, 2023
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Sheng Zhao

Figure 1 for Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation
Figure 2 for Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation
Figure 3 for Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation
Viaarxiv icon

Speaker Change Detection for Transformer Transducer ASR

Add code
Bookmark button
Alert button
Feb 16, 2023
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li

Figure 1 for Speaker Change Detection for Transformer Transducer ASR
Figure 2 for Speaker Change Detection for Transformer Transducer ASR
Figure 3 for Speaker Change Detection for Transformer Transducer ASR
Figure 4 for Speaker Change Detection for Transformer Transducer ASR
Viaarxiv icon

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

Add code
Bookmark button
Alert button
Jan 05, 2023
Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Figure 1 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Figure 2 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Figure 3 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Figure 4 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Viaarxiv icon

Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models

Add code
Bookmark button
Alert button
Dec 05, 2022
Rui Zhao, Jian Xue, Partha Parthasarathy, Veljko Miljanic, Jinyu Li

Figure 1 for Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Figure 2 for Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Figure 3 for Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Figure 4 for Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
Viaarxiv icon