Alert button
Picture for Ye Jia

Ye Jia

Alert button

Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey

Add code
Bookmark button
Alert button
Dec 22, 2023
Haotian Zhang, Semujju Stuart Dereck, Zhicheng Wang, Xianwei Lv, Kang Xu, Liang Wu, Ye Jia, Jing Wu, Zhuo Long, Wensheng Liang, X. G. Ma, Ruiyan Zhuang

Viaarxiv icon

Speech Aware Dialog System Technology Challenge (DSTC11)

Add code
Bookmark button
Alert button
Dec 16, 2022
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Abhinav Rastogi, Jeffrey Zhao, Ye Jia, Wei Han, Yuan Cao, Aramys Miranda

Figure 1 for Speech Aware Dialog System Technology Challenge (DSTC11)
Figure 2 for Speech Aware Dialog System Technology Challenge (DSTC11)
Figure 3 for Speech Aware Dialog System Technology Challenge (DSTC11)
Figure 4 for Speech Aware Dialog System Technology Challenge (DSTC11)
Viaarxiv icon

Textless Direct Speech-to-Speech Translation with Discrete Speech Representation

Add code
Bookmark button
Alert button
Oct 31, 2022
Xinjian Li, Ye Jia, Chung-Cheng Chiu

Figure 1 for Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Figure 2 for Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Figure 3 for Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Figure 4 for Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Viaarxiv icon

Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks

Add code
Bookmark button
Alert button
Aug 28, 2022
Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark

Figure 1 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 2 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 3 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 4 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Viaarxiv icon

XTREME-S: Evaluating Cross-lingual Speech Representations

Add code
Bookmark button
Alert button
Apr 13, 2022
Alexis Conneau, Ankur Bapna, Yu Zhang, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan Van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson

Figure 1 for XTREME-S: Evaluating Cross-lingual Speech Representations
Figure 2 for XTREME-S: Evaluating Cross-lingual Speech Representations
Figure 3 for XTREME-S: Evaluating Cross-lingual Speech Representations
Figure 4 for XTREME-S: Evaluating Cross-lingual Speech Representations
Viaarxiv icon

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation

Add code
Bookmark button
Alert button
Mar 24, 2022
Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobuyuki Morioka

Figure 1 for Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Figure 2 for Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Figure 3 for Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Figure 4 for Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Viaarxiv icon

mSLAM: Massively multilingual joint pre-training for speech and text

Add code
Bookmark button
Alert button
Feb 03, 2022
Ankur Bapna, Colin Cherry, Yu Zhang, Ye Jia, Melvin Johnson, Yong Cheng, Simran Khanuja, Jason Riesa, Alexis Conneau

Figure 1 for mSLAM: Massively multilingual joint pre-training for speech and text
Figure 2 for mSLAM: Massively multilingual joint pre-training for speech and text
Figure 3 for mSLAM: Massively multilingual joint pre-training for speech and text
Figure 4 for mSLAM: Massively multilingual joint pre-training for speech and text
Viaarxiv icon

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

Add code
Bookmark button
Alert button
Jan 16, 2022
Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen

Figure 1 for CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Figure 2 for CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Figure 3 for CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Figure 4 for CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Viaarxiv icon

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech

Add code
Bookmark button
Alert button
Nov 19, 2021
Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford, Miaosen Wang, Ye Jia, Tal Remez

Figure 1 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Figure 2 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Figure 3 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Figure 4 for More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Viaarxiv icon