Jian Xue

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation

Oct 23, 2023
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach

Oct 06, 2023
Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li

DiariST: Streaming Speech Translation with Speaker Diarization

Sep 14, 2023
Mu Yang, Naoyuki Kanda, Xiaofei Wang, Junkun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka

FoodSAM: Any Food Segmentation

Aug 11, 2023
Xing Lan, Jiayi Lyu, Hanyu Jiang, Kun Dong, Zehai Niu, Yi Zhang, Jian Xue

Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

Jul 30, 2023
Eric Sun, Jinyu Li, Jian Xue, Yifan Gong

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments

Jul 07, 2023
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Jinyu Li, Yashesh Gaur

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

Mar 01, 2023
Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong

Markerless Body Motion Capturing for 3D Character Animation based on Multi-view Cameras

Dec 12, 2022
Jinbao Wang, Ke Lu, Jian Xue

Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models

Dec 05, 2022
Rui Zhao, Jian Xue, Partha Parthasarathy, Veljko Miljanic, Jinyu Li

Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition

Nov 07, 2022
Yashesh Gaur, Nick Kibre, Jian Xue, Kangyuan Shu, Yuhui Wang, Issac Alphanso, Jinyu Li, Yifan Gong
