Cheng-I Jeff Lai

Audio-Visual Neural Syntax Acquisition

Oct 11, 2023
Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David Cox, David Harwath, Yang Zhang, Karen Livescu, James Glass

Via arXiv

Instruction-Following Speech Recognition

Sep 18, 2023
Cheng-I Jeff Lai, Zhiyun Lu, Liangliang Cao, Ruoming Pang

Via arXiv

Simple and Effective Unsupervised Speech Synthesis

Apr 20, 2022
Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James Glass

Via arXiv

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities

Mar 14, 2022
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

Via arXiv

SSAST: Self-Supervised Audio Spectrogram Transformer

Oct 19, 2021
Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James Glass

Via arXiv

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Oct 04, 2021
Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David Cox, James Glass

Via arXiv

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition

Jun 10, 2021
Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David Cox, James Glass

Via arXiv

Cross-Modal Discrete Representation Learning

Jun 10, 2021
Alexander H. Liu, SouYoung Jin, Cheng-I Jeff Lai, Andrew Rouditchenko, Aude Oliva, James Glass

Via arXiv

SUPERB: Speech processing Universal PERformance Benchmark

May 03, 2021
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

Via arXiv